Data Anonymization

Overview

Personally identifiable information (PII) includes any data that can identify, contact, or locate an individual—such as Social Security Numbers, email addresses, credit card numbers, CVV codes, passport numbers, and home addresses.

The Platform detects sensitive data using regex patterns you define, anonymizes it before sending it to the LLM, and restores the original values after the LLM responds. Conversation history and debug logs always show the original data.

LLM-layer anonymization works alongside globally declared PII. Global settings take precedence.

Method	Description
Redaction	Replaces the value with a placeholder that hides it entirely.
Replacement	Substitutes the value with a predefined string.
Mask with Character	Conceals part of the value while preserving a recognizable format—for example, showing only the last four digits of a card number.

Method

Description

Redaction

Replaces the value with a placeholder that hides it entirely.

Replacement

Substitutes the value with a predefined string.

Mask with Character

Conceals part of the value while preserving a recognizable format—for example, showing only the last four digits of a card number.

Configure Data Anonymization

Steps:

Go to Generative AI Tools > Safeguards > Data Anonymization.

Click Get Started or + New Field.

Enter the Information Type and Regex Pattern, then select the Display Type.

Define patterns as substrings, not exact values. The Platform scans the entire request payload—not just user input. Use a regex that matches the target value within a broader context. For example, use (?<!\d)890839(?!\d) instead of ^890839$ to redact 890839.

Click Save.

Prompt Guidance for Smaller Models

For smaller LLMs, add explicit instructions in the system prompt to prevent the model from modifying redacted values. If the model alters a redacted value, the Platform can’t restore the original, which breaks end-to-end functionality.

The Platform wraps redacted values with ##. Use this marker in your prompt instructions.

Example 1 — Masked value:

Sensitive data like a credit card number may appear as ##************5678##.
MANDATORILY DO NOT reject, modify, or alter any such values in your responses.

Example 2 — Redacted label:

Sensitive data like "EmailAddress" may appear as ##EmailAddress##.
MANDATORILY DO NOT reject, modify, or alter any such values in your responses.

Overview

Anonymization Methods

Supported Features

Configure Data Anonymization

Prompt Guidance for Smaller Models

​Overview

​Anonymization Methods

​Supported Features

​Configure Data Anonymization

​Prompt Guidance for Smaller Models

Overview

Anonymization Methods

Supported Features

Configure Data Anonymization

Prompt Guidance for Smaller Models