PII Detection & Anonymization

Regex-first protection for GDPR compliance. 317 deterministic pattern recognizers for structured data, NLP for names and locations—transparent, auditable results on ISO 27001-certified servers in Germany.

contract_review.txt
Dear Dr. Sarah Mitchell, Regarding your account inquiry from January 15, 2026, we have verified your identity using the email test.demo@example.com and phone +49 170 123 4567. Your registered address at Friedrichstr. 43, 10117 Berlin has been confirmed. Please transfer the outstanding balance to DE89 3704 0044 0532 0130 00. Best regards, Thomas Weber
ISO 27001:2022
GDPR Compliant
Hosted in Germany
Built on Microsoft Presidio

Image Redaction — Your Killer Feature

Automatically detect and redact PII in scanned documents, photos, and screenshots. OCR-powered analysis with Tesseract supports 38 languages.

  • Detect faces, names, dates, and addresses in images
  • 38 OCR languages with automatic orientation correction
  • Bounding-box redaction with entity-type color coding
  • Batch process entire folders of scanned documents
passport_scan.png
2.4 MB — PNG
PII Detected12 entities
PERSONDATEADDRESSPHONE
Redaction complete

Regex-First Detection

317 deterministic pattern recognizers deliver reproducible results for structured data like IDs, tax numbers, and credit cards. NLP models supplement for names and locations — all running on our own servers in Germany, never sending data to third parties. Fully auditable for regulatory compliance.

Learn about our technology

Servers in Germany, ISO 27001 Certified

All processing happens in Hetzner's ISO 27001-certified data centers in Germany. Your data stays in the EU with no surprise jurisdiction issues.

View security details

Token-Based Pricing You Can Understand

Pay for what you use with our transparent token system. Free tier includes 200 tokens (~15-18 pages/month). No hidden fees, no surprises.

See pricing
320+
Entity Types
48
Languages Supported
7
Anonymization Methods
99.9%
Uptime SLA

Solutions for Every Workflow

Protect sensitive data wherever you work — AI chats, APIs, documents, or across languages

AI Chatbot Protection

Anonymize PII before it reaches ChatGPT, Claude, Gemini, and other AI platforms. Real-time interception with reversible encryption.

Protect AI conversations

PII Redaction API

RESTful API with JavaScript and Python SDKs. Detect and anonymize 320+ entity types programmatically.

Explore the API

Reversible Encryption

AES-256-GCM encryption that preserves data utility. Decrypt anonymized data anytime with your personal key.

Learn about encryption

48-Language Detection

Full PII detection across 48 languages and 70+ countries. RTL support for Arabic, Hebrew, Persian, and Urdu.

View supported languages

How It Works

Four simple steps to protect sensitive data in your documents

1

Upload or Paste

Input your text via our web interface, API, or Office Add-in

2

Analyze

Our detection engine scans for 320+ PII entity types across 48 languages using regex patterns and NLP

3

Review

Human-in-the-loop review: see detections with confidence scores, override false positives, and approve before anonymization

4

Anonymize

Apply your chosen anonymization method and download results

Frequently Asked Questions

What is PII detection and anonymization?

PII (Personally Identifiable Information) detection scans text for sensitive data like names, emails, phone numbers, tax IDs, and passport numbers. Anonymization then replaces, masks, redacts, hashes, or encrypts these entities so the data can be shared or processed safely — without exposing personal information.

How does cloak.business protect data sent to AI chatbots?

Our Chrome Extension intercepts messages before they reach ChatGPT, Claude, Gemini, and other AI platforms. It detects PII in real time and replaces sensitive values with anonymized tokens. When the AI responds, the extension automatically decrypts the values back to their originals — so you get useful AI answers without ever exposing personal data.

Is cloak.business GDPR compliant?

Yes. All processing happens on ISO 27001:2022-certified servers in Germany. Data never leaves the EU. Our regex-first detection is fully deterministic and auditable, which satisfies GDPR's transparency and accountability requirements. We also support HIPAA, PCI-DSS, and other compliance frameworks.

What languages does cloak.business support?

We support 48 languages including English, German, Spanish, French, Italian, Portuguese, Japanese, Chinese, Korean, Arabic, Hindi, and more. Our 317 regex-based recognizers cover country-specific entities like tax IDs, national IDs, and phone formats for 70+ countries. RTL languages (Arabic, Hebrew, Persian, Urdu) are fully supported.

Can I reverse the anonymization?

Yes — our Encrypt method uses AES-256-GCM encryption with your personal key. You can decrypt anonymized data back to its original form at any time. This is ideal for AI workflows where you need to anonymize before sending and restore originals after receiving a response. Other methods (Replace, Mask, Redact, Hash) are one-way.

How do I integrate cloak.business into my application?

Use our RESTful API with official SDKs for JavaScript (npm: @cloak-business/sdk) and Python (PyPI: cloak-business). Three endpoints cover the full workflow: analyze (detect PII), anonymize (protect data), and deanonymize (restore encrypted values). A free tier with 200 tokens is available to get started.

What entity types can cloak.business detect?

Over 320 entity types across 70+ countries. This includes names, emails, phone numbers, addresses, credit card numbers, IBANs, SSNs, passport numbers, tax IDs, driver's licenses, national IDs, IP addresses, URLs, and more. We use 317 regex-based recognizers for structured data plus NLP models for names and locations.

Is there a free plan?

Yes. Our free tier includes 200 tokens per billing cycle (roughly 15–18 pages of text). No credit card required. You get access to all features including the API, Chrome Extension, and all 7 anonymization methods. Paid plans start at affordable rates for higher volumes.

Ready to Protect Your Data?

Start with our free tier—200 tokens per cycle, no credit card required.