Text pattern markup uses regular expressions to find and annotate sensitive data such as Social Security numbers, phone numbers, email addresses, and credit card numbers. You can enable multiple pattern types at once and configure region-specific formats.
| Pattern | Description | Example Matches |
|---|---|---|
| SSN | Social Security Number | 123-45-6789 |
| EIN | Employer Identification Number | 12-3456789 |
| Phone | Telephone numbers (region-specific formats) | (555) 123-4567 |
| Email addresses | user@example.com | |
| Date | Date formats (region-specific) | 01/15/2025, 15-Jan-2025 |
| Address | Street addresses | 123 Main Street |
| Credit Card | Credit card numbers | 4111-1111-1111-1111 |
| Brackets | Text enclosed in brackets | [redacted text], (sensitive) |
| Custom | User-defined regular expression | Any regex pattern |
Pattern matching adapts to regional formats. Select the region that matches the document's locale to ensure accurate detection.
| Region | Affected Patterns |
|---|---|
| United States | SSN, phone, date, address formats typical of US documents. |
| United Kingdom | UK phone numbers, date formats (DD/MM/YYYY), National Insurance numbers. |
| Canada | Canadian phone formats, SIN patterns, postal codes. |
| Australia | Australian phone formats, TFN patterns, date formats. |
| European Union | EU date formats, continental phone number patterns. |
Tip
If your document contains data from multiple regions, run the pattern search once per region on the same document. Annotations accumulate and do not overwrite each other.
| Option | Description |
|---|---|
| Case Sensitive | When enabled, pattern matching distinguishes between uppercase and lowercase letters. Primarily affects the Custom regex pattern. |
| Whole Word | When enabled, matches must occur at word boundaries. Prevents partial matches within longer strings. |
| Direct Redact | When enabled, matched content is immediately redacted without first creating markup annotations. Use with caution — this is not reversible. |
Warning
The Direct Redact option permanently removes matched content in a single step, bypassing the preview stage. Always test your patterns with Direct Redact disabled first to verify correct matches.
Enable the Custom pattern type and enter a standard regular expression in the custom regex field. The engine uses ECMAScript regex syntax. Some useful examples:
Case\s*No\.\s*\d+ — Matches case numbers like "Case No. 12345"\b[A-Z]{2}\d{6}\b — Matches codes like "AB123456"CONFIDENTIAL|SECRET|TOP SECRET — Matches classification markings