cloak.business Entity & Preset Inventory
Last Updated: 2026-02-12
Analyzer Version: 6.9.0
Image Redactor Version: 5.2.0
Total Recognizers: 317 PatternRecognizers + NLP models
Total Presets: 157+
Executive Summary
cloak.business provides enterprise-grade PII detection with:
- 317 custom regex PatternRecognizers covering personal data, technical secrets, and industry-specific identifiers
- NLP detection via spaCy (25 languages), Stanza NER (7 languages), and XLM-RoBERTa transformers (16 languages) for names, locations, and organizations
- 390+ entity types detected across 75+ countries
- 157+ presets for 100+ countries, regions, and industry verticals
- Support for 48 languages with country-specific patterns
- Global coverage: Europe, Americas, Asia-Pacific, Africa, Middle East
- Image anonymization with 37 OCR languages (see IMAGE_ANONYMIZATION.md)
Table of Contents
- Entity Categories
- Presets by Region
- Industry-Specific Presets
- Technical/Developer Presets
- Complete Entity Reference
- Complete Preset Reference
Entity Categories
Personal Identification (78 entities)
| Category | Entities | Countries |
|---|---|---|
| National ID Numbers | 45+ | All EU, US, APAC |
| Passport Numbers | 30+ | Global |
| Driver Licenses | 35+ | Global |
| Tax IDs | 25+ | Global |
| Social Security | 15+ | US, EU, APAC |
Examples by Country
| Country | Entity Types |
|---|---|
| Germany | DE_ID_CARD, DE_PASSPORT, DE_DRIVER_LICENSE, DE_TAX_ID, DE_SVN, DE_STEUERNUMMER |
| USA | US_SSN, US_PASSPORT, US_DRIVER_LICENSE, US_ITIN, US_EIN |
| UK | UK_NINO, UK_PASSPORT, UK_UTR, UK_NHS |
| France | FR_CNI, FR_PASSPORT, FR_NIR, FR_TAX_ID |
Financial Identifiers (35 entities)
| Entity | Description | Example Pattern |
|---|---|---|
| CREDIT_CARD | Credit/debit card numbers | Luhn-validated 13-19 digits |
| IBAN_CODE | International Bank Account Number | Country-specific formats |
| SWIFT_CODE | Bank identifier codes | 8-11 alphanumeric |
| CUSIP | US/Canada securities | 9 alphanumeric |
| ISIN | International securities | 12 alphanumeric |
| LEI | Legal Entity Identifier | 20 alphanumeric |
| SEDOL | UK securities | 7 alphanumeric |
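The CREDIT_CARD row above mentions Luhn validation. As a minimal sketch of that checksum (not the product's actual recognizer code; the function name `luhn_valid` is hypothetical), the check can be written as:

```python
def luhn_valid(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn checksum."""
    # Keep digits only, so spaces or hyphens between groups are tolerated.
    digits = [int(ch) for ch in number if ch.isdigit()]
    # The table above lists 13-19 digits as the accepted length range.
    if not 13 <= len(digits) <= 19:
        return False
    total = 0
    # Walk from the rightmost digit and double every second digit.
    for position, digit in enumerate(reversed(digits)):
        if position % 2 == 1:
            digit *= 2
            if digit > 9:
                digit -= 9
        total += digit
    return total % 10 == 0
```

A checksum pass only shows that a digit string is well formed; it does not confirm that the card exists.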
Country-Specific Banking
| Country | Entities |
|---|---|
| Germany | DE_IBAN, DE_BIC |
| Austria | AT_IBAN |
| Switzerland | CH_IBAN |
| France | FR_IBAN |
| UK | UK_SORT_CODE |
| USA | US_BANK_NUMBER, US_ROUTING_NUMBER |
Healthcare Identifiers (25 entities)
| Entity | Description | Region |
|---|---|---|
| NPI | National Provider Identifier | US |
| MBI | Medicare Beneficiary Identifier | US |
| DEA_NUMBER | Drug Enforcement Administration registration number | US |
| ICD10_CODE | Diagnosis codes (ICD-10) | International |
| ICD11_CODE | Diagnosis codes (ICD-11) | International |
| CPT_CODE | Procedure codes | US |
| HCPCS_CODE | Healthcare supply codes | US |
| NDC_CODE | National Drug Code | US |
| LOINC_CODE | Lab observation codes | International |
| SNOMED_CODE | Clinical terminology | International |
| MRN | Medical Record Number | Universal |
| RX_NUMBER | Prescription numbers | US |
| ACCESSION_NUMBER | Lab/radiology specimens | Universal |
| ENCOUNTER_NUMBER | Visit/admission IDs | Universal |
Country-Specific Healthcare
| Country | Entities |
|---|---|
| Germany | DE_KVNR (health insurance), DE_RVNR (pension) |
| UK | UK_NHS |
| Japan | JP_HEALTH_INSURANCE |
| Italy | IT_HEALTH_CARD |
Technical Secrets (48 entities)
AI Provider API Keys
| Entity | Provider | Pattern |
|---|---|---|
| OPENAI_API_KEY | OpenAI | sk-...T3BlbkFJ... |
| ANTHROPIC_API_KEY | Anthropic Claude | sk-ant-api03-... |
| GOOGLE_AI_KEY | Google AI | AIza... |
| AZURE_OPENAI_KEY | Azure OpenAI | 32 hex characters |
| MISTRAL_API_KEY | Mistral AI | 32 alphanumeric |
| HUGGINGFACE_TOKEN | HuggingFace | hf_... |
| COHERE_API_KEY | Cohere | 40 alphanumeric |
| REPLICATE_API_TOKEN | Replicate | r8_... |
Cloud Provider Credentials
| Entity | Provider | Pattern |
|---|---|---|
| AWS_ACCESS_KEY | AWS | AKIA... (20 chars) |
| AWS_SECRET_KEY | AWS | 40 char base64 |
| AWS_SESSION_TOKEN | AWS | Long base64 |
| AZURE_CLIENT_SECRET | Azure | 34-40 alphanumeric |
| AZURE_STORAGE_KEY | Azure | 88 char base64 |
| GCP_API_KEY | Google Cloud | AIza... |
| DIGITALOCEAN_TOKEN | DigitalOcean | dop_v1_... |
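To illustrate how a fixed-prefix, fixed-length pattern such as the AWS_ACCESS_KEY row might be expressed, here is a small regex sketch. The table only states the `AKIA` prefix and the 20-character length; the character class (uppercase letters and digits) and the names `AWS_ACCESS_KEY_RE` and `find_aws_access_keys` are assumptions made for this example, not the shipped pattern.

```python
import re
from typing import List, Tuple

# Assumption: 16 uppercase letters or digits follow the "AKIA" prefix (20 chars total).
AWS_ACCESS_KEY_RE = re.compile(r"\bAKIA[0-9A-Z]{16}\b")

def find_aws_access_keys(text: str) -> List[Tuple[int, int, str]]:
    """Return (start, end, match) for each candidate access key ID in `text`."""
    return [(m.start(), m.end(), m.group()) for m in AWS_ACCESS_KEY_RE.finditer(text)]

# Placeholder key in the style used by public AWS examples.
sample = "aws_access_key_id = AKIAIOSFODNN7EXAMPLE"
hits = find_aws_access_keys(sample)
```

The word boundaries keep the pattern from matching inside a longer alphanumeric run, which is one simple way to cut false positives.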
Version Control Tokens
| Entity | Platform | Pattern |
|---|---|---|
| GITHUB_TOKEN | GitHub PAT | ghp_... |
| GITHUB_FINE_GRAINED_TOKEN | GitHub | github_pat_... |
| GITLAB_TOKEN | GitLab | glpat-... |
| BITBUCKET_TOKEN | Bitbucket | Context-based |
| NPM_TOKEN | npm | npm_... |
| PYPI_TOKEN | PyPI | pypi-... |
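Most of the tokens in this table are recognizable by a literal prefix. The following sketch shows a prefix lookup using only the prefixes listed above; the function name `classify_token` is hypothetical, and the token body is deliberately not validated because the table does not specify lengths or alphabets.

```python
from typing import Optional

# Prefixes copied from the table above; token bodies are not validated here.
TOKEN_PREFIXES = {
    "github_pat_": "GITHUB_FINE_GRAINED_TOKEN",
    "ghp_": "GITHUB_TOKEN",
    "glpat-": "GITLAB_TOKEN",
    "npm_": "NPM_TOKEN",
    "pypi-": "PYPI_TOKEN",
}

def classify_token(candidate: str) -> Optional[str]:
    """Return the entity type whose prefix matches `candidate`, else None."""
    # Try longer prefixes first so the most specific prefix wins.
    for prefix in sorted(TOKEN_PREFIXES, key=len, reverse=True):
        if candidate.startswith(prefix):
            return TOKEN_PREFIXES[prefix]
    return None
```

BITBUCKET_TOKEN is left out because the table marks it as context-based rather than prefix-based.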
Authentication & Secrets
| Entity | Description |
|---|---|
| JWT_TOKEN | JSON Web Tokens |
| BEARER_TOKEN | Bearer authentication |
| BASIC_AUTH | Basic auth headers |
| SSH_PRIVATE_KEY | SSH private keys |
| SSH_PUBLIC_KEY | SSH public keys |
| PGP_PRIVATE_KEY | PGP/GPG keys |
| X509_CERTIFICATE | SSL/TLS certificates |
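For JWT_TOKEN, a structural check is often more reliable than a bare regex: a compact JWT has three dot-separated base64url segments, and the first decodes to a JSON header containing `alg`. The sketch below shows that idea with the standard library only. The function name `decode_jwt_header` is hypothetical, and this is not a description of how the product's recognizer works.

```python
import base64
import json
from typing import Optional

def decode_jwt_header(token: str) -> Optional[dict]:
    """Return the decoded header if `token` looks like a compact JWT, else None."""
    parts = token.split(".")
    if len(parts) != 3:
        return None
    try:
        # base64url segments omit padding; restore it before decoding.
        padded = parts[0] + "=" * (-len(parts[0]) % 4)
        header = json.loads(base64.urlsafe_b64decode(padded))
    except ValueError:
        # Covers invalid base64, invalid UTF-8, and invalid JSON.
        return None
    # A JWT header is a JSON object that carries an "alg" member.
    if isinstance(header, dict) and "alg" in header:
        return header
    return None
```

The signature segment is not verified here; the goal is only to decide whether a string is shaped like a token worth redacting.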
Database Connection Strings
| Entity | Database |
|---|---|
| MONGODB_URI | MongoDB |
| POSTGRESQL_URI | PostgreSQL |
| MYSQL_URI | MySQL/MariaDB |
| REDIS_URI | Redis |
| Entity | Platform |
|---|---|
| SLACK_TOKEN | Slack |
| SLACK_WEBHOOK | Slack webhooks |
| STRIPE_KEY | Stripe payments |
| SENDGRID_KEY | SendGrid email |
| MAILGUN_KEY | Mailgun email |
| TWILIO_CREDENTIALS | Twilio |
| DISCORD_TOKEN | Discord |
| TELEGRAM_TOKEN | Telegram |
Infrastructure Identifiers (24 entities)
Network Identifiers
| Entity | Description | Example |
|---|---|---|
| MAC_ADDRESS | Hardware address | AA:BB:CC:DD:EE:FF |
| UUID | Universal unique ID | xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx |
| IPV6_ADDRESS | IPv6 addresses | 2001:db8::1 |
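The three example formats above can be validated with the Python standard library. This is an illustrative sketch; the helper names are hypothetical, and the MAC pattern only covers the colon-separated form shown in the table.

```python
import ipaddress
import re
import uuid

# Colon-separated form only, as in the table example AA:BB:CC:DD:EE:FF.
MAC_RE = re.compile(r"(?:[0-9A-Fa-f]{2}:){5}[0-9A-Fa-f]{2}")

def is_mac_address(value: str) -> bool:
    return MAC_RE.fullmatch(value) is not None

def is_uuid(value: str) -> bool:
    try:
        # Compare with the canonical form so only the 8-4-4-4-12 layout passes.
        return str(uuid.UUID(value)) == value.lower()
    except ValueError:
        return False

def is_ipv6_address(value: str) -> bool:
    try:
        ipaddress.IPv6Address(value)
        return True
    except ValueError:
        return False
```

Using `ipaddress` rather than a hand-written regex handles compressed forms such as `2001:db8::1` without extra cases.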
Container & Orchestration
| Entity | Description |
|---|---|
| DOCKER_CONTAINER_ID | Docker container IDs |
| DOCKER_IMAGE_DIGEST | Docker image digests |
| K8S_RESOURCE_NAME | Kubernetes pod/resource names |
| K8S_SECRET_DATA | Kubernetes secret data |
Cloud Resource IDs
| Entity | Description |
|---|---|
| AWS_ARN | Amazon Resource Names |
| AWS_ACCOUNT_ID | AWS account IDs |
| AZURE_RESOURCE_ID | Azure resource paths |
| AZURE_SUBSCRIPTION_ID | Azure subscription GUIDs |
| GCP_PROJECT_ID | GCP project identifiers |
Observability
| Entity | Description |
|---|---|
| TRACE_ID | Distributed tracing IDs |
| REQUEST_ID | Request correlation IDs |
Build & CI/CD
| Entity | Description |
|---|---|
| GIT_COMMIT_HASH | Git SHA-1 hashes |
| GIT_REF | Git branch/tag references |
| SEMVER | Semantic versions |
| BUILD_NUMBER | CI/CD build numbers |
| NPM_PACKAGE_REF | npm package@version |
Industry-Specific Identifiers (40 entities)
Legal/Court
| Entity | Jurisdiction |
|---|---|
| US_COURT_CASE | US federal/state courts |
| EU_COURT_CASE | ECJ/ECHR |
| UK_COURT_CASE | UK courts |
| DE_AKTENZEICHEN | German courts |
Insurance
| Entity | Description |
|---|---|
| INSURANCE_POLICY | Policy numbers |
| INSURANCE_CLAIM | Claim IDs |
| HEALTH_MEMBER_ID | Member IDs |
| INSURANCE_GROUP | Group numbers |
Automotive
| Entity | Description |
|---|---|
| VIN | Vehicle Identification Number |
| VEHICLE_TITLE | Title numbers |
| EU_VEHICLE_REG | EU registration docs |
| DE_KFZ | German license plates |
| AT_KFZ | Austrian license plates |
| CH_KFZ | Swiss license plates |
Telecommunications
| Entity | Description |
|---|---|
| IMEI | Mobile equipment ID |
| IMSI | Mobile subscriber ID |
| ICCID | SIM card numbers |
| MSISDN | Mobile phone numbers |
Real Estate
| Entity | Description |
|---|---|
| PARCEL_NUMBER | Property parcel/APN |
| MLS_NUMBER | MLS listing numbers |
| DE_GRUNDBUCH | German land registry |
Logistics
| Entity | Description |
|---|---|
| TRACKING_NUMBER | UPS, FedEx, DHL, USPS |
| BILL_OF_LADING | Shipping BOL |
| CONTAINER_NUMBER | ISO container IDs |
HR/Employment
| Entity | Description |
|---|---|
| EMPLOYEE_ID | Employee identifiers |
| DE_PERSONALNUMMER | German personnel numbers |
| BADGE_NUMBER | Access card numbers |
| STUDENT_ID | Student identifiers |
| TRANSCRIPT_NUMBER | Academic records |
Location & Address (25 entities)
Postal Codes by Country
| Entity | Country | Format |
|---|---|---|
| DE_PLZ | Germany | 5 digits |
| AT_PLZ | Austria | 4 digits |
| CH_PLZ | Switzerland | 4 digits |
| FR_CODE_POSTAL | France | 5 digits |
| UK_POSTCODE | UK | Alphanumeric |
| US_ZIP_CODE | USA | 5 or 9 digits |
| CA_POSTAL_CODE | Canada | A1A 1A1 |
| AU_POSTCODE | Australia | 4 digits |
| JP_POSTAL_CODE | Japan | 7 digits |
| IN_PIN_CODE | India | 6 digits |
| BR_CEP | Brazil | 8 digits |
| IT_CAP | Italy | 5 digits |
| ES_CP | Spain | 5 digits |
| NL_POSTCODE | Netherlands | 4 digits + 2 letters |
| BE_POSTCODE | Belgium | 4 digits |
| PL_KOD_POCZTOWY | Poland | XX-XXX |
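The Format column above translates fairly directly into regular expressions. The sketch below covers a few rows as format-only checks. The optional hyphen in the nine-digit US form and the optional space in the Canadian and Dutch forms are assumptions, and `matches_postal_format` is a hypothetical helper name.

```python
import re

# Format-only patterns derived from the table above (subset of rows).
POSTAL_PATTERNS = {
    "DE_PLZ": re.compile(r"[0-9]{5}"),
    "AT_PLZ": re.compile(r"[0-9]{4}"),
    "US_ZIP_CODE": re.compile(r"[0-9]{5}(?:-?[0-9]{4})?"),
    "CA_POSTAL_CODE": re.compile(r"[A-Za-z][0-9][A-Za-z] ?[0-9][A-Za-z][0-9]"),
    "NL_POSTCODE": re.compile(r"[0-9]{4} ?[A-Za-z]{2}"),
    "PL_KOD_POCZTOWY": re.compile(r"[0-9]{2}-[0-9]{3}"),
}

def matches_postal_format(entity: str, value: str) -> bool:
    """Return True if `value` has the shape listed for `entity`."""
    pattern = POSTAL_PATTERNS.get(entity)
    return pattern is not None and pattern.fullmatch(value.strip()) is not None
```

Several countries share a shape (the table lists five digits for Germany, France, Italy, and Spain), so a format match alone cannot tell which country a code belongs to. Selecting a country-specific preset or entity list narrows that ambiguity.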
Other Location
| Entity | Description |
|---|---|
| GEO_COORDINATES | GPS lat/long |
| PHONE_NUMBER | Phone numbers |
| EMAIL_ADDRESS | Email addresses |
Presets by Region
Western Europe (15 presets)
| Preset | Entities | Languages |
|---|---|---|
| Deutschland - Vollständig | 41 | German |
| Deutschland - Finanzwesen | 20 | German |
| Österreich - Vollständig | 26 | German |
| Schweiz - Vollständig | 24 | German, French, Italian |
| France - Complet | 27 | French |
| France - Services Financiers | 20 | French |
| Belgique - Complet | 20 | French, Dutch |
| Nederland - Volledig | 20 | Dutch |
| Luxembourg - Complet | 14 | French, German |
| Liechtenstein - Vollständig | 17 | German |
| Monaco - Complet | 13 | French |
DACH Region (6 presets)
| Preset | Entities | Focus |
|---|---|---|
| DACH Region - Vollständig | 55 | Complete DE/AT/CH/LI |
| DACH - Finanzwesen | 27 | Financial sector |
| DACH - Gesundheitswesen | 22 | Healthcare |
| DACH - Unternehmensdaten | 22 | Business data |
| DACH - Fahrzeugdaten | 21 | Vehicle data |
United Kingdom & Ireland (4 presets)
| Preset | Entities | Focus |
|---|---|---|
| United Kingdom - Complete | 26 | Full UK coverage |
| United Kingdom - Healthcare | 15 | NHS data |
| Ireland - Complete | 15 | Full Ireland |
Southern Europe (8 presets)
| Preset | Entities | Languages |
|---|---|---|
| Italia - Completo | 22 | Italian |
| Italia - Servizi Finanziari | 17 | Italian |
| España - Completo | 21 | Spanish |
| España - Servicios Financieros | 17 | Spanish |
| Portugal - Completo | 16 | Portuguese |
| Catalunya - Complet | 16 | Catalan |
| Andorra - Complet | 13 | Catalan |
| San Marino - Completo | 13 | Italian |
Nordic (6 presets)
| Preset | Entities | Languages |
|---|---|---|
| Nordic - Complete | 30 | All Nordic |
| Sverige - Komplett | 15 | Swedish |
| Norge - Komplett | 15 | Norwegian |
| Danmark - Komplet | 14 | Danish |
| Suomi - Täydellinen | 14 | Finnish |
| Ísland - Heildar | 14 | Icelandic |
Eastern Europe (8 presets)
| Preset | Entities | Languages |
|---|---|---|
| Central Europe - Complete | 30 | PL, CZ, SK, HU |
| Polska - Kompletny | 21 | Polish |
| Polska - Usługi Finansowe | 17 | Polish |
| Česká republika - Kompletní | 15 | Czech |
| Slovensko - Kompletné | 15 | Slovak |
| Magyarország - Teljes | 16 | Hungarian |
| România - Complet | 16 | Romanian |
| България - Пълен | 15 | Bulgarian |
Baltic States (4 presets)
| Preset | Entities | Languages |
|---|---|---|
| Baltic - Complete | 23 | All Baltic |
| Lietuva - Pilnas | 15 | Lithuanian |
| Latvija - Pilnīgs | 15 | Latvian |
| Eesti - Täielik | 15 | Estonian |
Balkans (9 presets)
| Preset | Entities | Languages |
|---|---|---|
| Balkans - Complete | 52 | All Balkans |
| Hrvatska - Potpuno | 15 | Croatian |
| Slovenija - Popolno | 16 | Slovenian |
| Србија - Потпуно | 15 | Serbian |
| Bosna i Hercegovina - Potpuno | 14 | Bosnian |
| Македонија - Комплетно | 14 | Macedonian |
| Shqipëri - I plotë | 14 | Albanian |
| Crna Gora - Potpuno | 14 | Montenegrin |
| Kosova - E plotë | 13 | Albanian |
CIS/Eastern (4 presets)
| Preset | Entities | Languages |
|---|---|---|
| Россия - Полный | 16 | Russian |
| Україна - Повний | 15 | Ukrainian |
| Беларусь - Поўны | 14 | Belarusian |
| Moldova - Complet | 14 | Romanian |
Americas (10 presets)
| Preset | Entities | Languages |
|---|---|---|
| United States - Complete | 31 | English |
| United States - Healthcare | 24 | English |
| United States - Financial | 23 | English |
| Canada - Complete | 20 | English, French |
| México - Completo | 14 | Spanish |
| Brasil - Completo | 19 | Portuguese |
| Brasil - Serviços Financeiros | 15 | Portuguese |
| Argentina - Completo | 14 | Spanish |
| Chile - Completo | 13 | Spanish |
| Colombia - Completo | 14 | Spanish |
Asia Pacific (8 presets)
| Preset | Entities | Languages |
|---|---|---|
| Asia Pacific - Complete | 42 | Multiple |
| 日本 - 完全 | 16 | Japanese |
| 中国 - 完整 | 13 | Chinese |
| 한국 - 완전 | 14 | Korean |
| भारत - पूर्ण | 18 | Hindi/English |
| Singapore - Complete | 14 | English |
| Australia - Complete | 21 | English |
| New Zealand - Complete | 15 | English |
Middle East (5 presets)
| Preset | Entities | Languages |
|---|---|---|
| Middle East - Complete | 21 | Multiple |
| الإمارات - كامل | 13 | Arabic |
| Türkiye - Tam | 15 | Turkish |
| Ελλάδα - Πλήρες | 16 | Greek |
| Cyprus - Complete | 15 | Greek, English |
Multi-Region (5 presets)
| Preset | Entities | Coverage |
|---|---|---|
| European Union - Complete | 132 | All 27 EU states |
| Anglophone - Complete | 48 | UK, US, CA, AU, NZ, IE, SG |
| Francophone - Complet | 26 | FR, BE, LU, CH, MC |
| Hispanoamérica - Completo | 27 | ES + Latin America |
| Lusófono - Completo | 20 | PT + Brazil |
Industry-Specific Presets
Healthcare (3 presets)
| Preset | Entities | Focus |
|---|---|---|
| US Healthcare - HIPAA Compliant | 22 | HIPAA compliance |
| Healthcare Extended EHR | 21 | EHR/EMR systems |
| United Kingdom - Healthcare | 15 | NHS data |
Financial Services (4 presets)
| Preset | Entities | Focus |
|---|---|---|
| Financial Securities and Trading | 17 | Securities (CUSIP, ISIN, LEI) |
| Deutschland - Finanzwesen | 20 | German financial |
| España - Servicios Financieros | 17 | Spanish financial |
| United States - Financial | 23 | US financial |
Legal (1 preset)
| Preset | Entities | Focus |
|---|---|---|
| Legal and Court Documents | 12 | Court case numbers |
Insurance (1 preset)
| Preset | Entities | Focus |
|---|---|---|
| Insurance Industry | 14 | Policy/claim IDs |
Automotive (1 preset)
| Preset | Entities | Focus |
|---|---|---|
| Automotive Industry | 10 | VIN, registrations |
Telecommunications (1 preset)
| Preset | Entities | Focus |
|---|---|---|
| Telecommunications | 10 | IMEI, IMSI, SIM |
Real Estate (1 preset)
| Preset | Entities | Focus |
|---|---|---|
| Real Estate Property | 10 | Parcel, MLS, Grundbuch |
Logistics (1 preset)
| Preset | Entities | Focus |
|---|---|---|
| Logistics and Shipping | 8 | Tracking, containers |
HR/Education (2 presets)
| Preset | Entities | Focus |
|---|---|---|
| HR and Employment | 12 | Employee IDs |
| Education FERPA | 9 | Student records |
Utilities (1 preset)
| Preset | Entities | Focus |
|---|---|---|
| Utilities and Energy | 8 | Meter numbers |
Immigration (1 preset)
| Preset | Entities | Focus |
|---|---|---|
| Immigration Documents | 11 | Visas, green cards |
Technical/Developer Presets
Complete Developer Coverage (3 presets)
| Preset | Entities | Focus |
|---|---|---|
| Developer Secrets - Complete | 44 | ALL secrets |
| AI and LLM Provider Keys | 8 | AI API keys only |
| Git Repository Secrets | 12 | Git tokens, SSH |
Cloud Provider (3 presets)
| Preset | Entities | Focus |
|---|---|---|
| Cloud AWS | 9 | AWS credentials |
| Cloud Azure | 7 | Azure credentials |
| Cloud GCP Google | 7 | GCP credentials |
Infrastructure (3 presets)
| Preset | Entities | Focus |
|---|---|---|
| DevOps Infrastructure | 15 | Containers, K8s |
| Network Infrastructure | 7 | MAC, IP, trace IDs |
| CI/CD Secrets | 11 | Build/deploy secrets |
Database & Licensing (2 presets)
| Preset | Entities | Focus |
|---|---|---|
| Database Connection URIs | 9 | DB connection strings |
| Software Licensing | 8 | License keys |
Cryptocurrency (1 preset)
| Preset | Entities | Focus |
|---|---|---|
| Cryptocurrency and Web3 | 10 | BTC, ETH addresses |
Auto-Detection (5 presets)
| Preset | Entities | Focus |
|---|---|---|
| Auto - General Detection | 11 | Language-agnostic |
| Auto - European Multi-Language | 13 | EU PII |
| Auto - Financial Universal | 11 | Financial data |
| Auto - Healthcare Universal | 9 | Healthcare data |
| Auto - Development & Testing | 33 | Dev/test secrets |
Complete Entity Reference
All 519 Entity Types (Alphabetical)
Phase 7-8 Additions:
Asian Markets (54): CN_BANK_CARD, CN_PASSPORT, CN_PHONE, CN_SOCIAL_CREDIT_CODE, CN_VEHICLE_PLATE,
ID_NIK, ID_NPWP, ID_PASSPORT, IN_BANK_ACCOUNT, IN_CIN, IN_DIN, IN_PASSPORT, IN_TAN, IN_UAN,
IN_VEHICLE_REG, JP_BANK_ACCOUNT, JP_PASSPORT, JP_PENSION_NUMBER, JP_VEHICLE_PLATE, JP_ZAIRYU_CARD,
KR_ALIEN_REG, KR_BANK_ACCOUNT, KR_PASSPORT, KR_VEHICLE_PLATE, MY_NRIC, MY_PASSPORT, MY_TIN,
PH_PASSPORT, PH_SSS, PH_TIN, PK_CNIC, PK_NTN, SG_FIN, TH_NATIONAL_ID, TH_PASSPORT, TH_TAX_ID,
VN_NATIONAL_ID, VN_PASSPORT, VN_TAX_CODE
Latin America (44): BO_CI, BO_NIT, BO_PASSPORT, CR_CEDULA, CR_CEDULA_JURIDICA, CR_DIMEX, CR_PASSPORT,
CU_CI, DO_CEDULA, DO_RNC, EC_CEDULA, EC_RUC, EC_PASSPORT, GT_DPI, GT_NIT, HN_DNI, HN_RTN,
JM_NIDS, JM_TRN, NI_CEDULA, NI_RUC, PA_CEDULA, PA_PASSPORT, PA_RUC, PE_CE, PE_DNI, PE_PASSPORT,
PE_RUC, PK_IBAN, PK_NIC_OLD, PK_PASSPORT, PK_STRN, PK_VEHICLE_REG, PY_CI, PY_PASSPORT, PY_RUC,
SV_DUI, SV_NIT, UY_CI, UY_PASSPORT, UY_RUT, VE_CEDULA, VE_PASSPORT, VE_RIF
Africa & Middle East (64): BH_CPR, BH_CR, DZ_NIN, DZ_PASSPORT, EG_NATIONAL_ID, EG_PASSPORT, EG_TAX_ID,
ET_ID, GH_GHANA_CARD, GH_PASSPORT, GH_TIN, IL_PASSPORT_EXT, IL_TEUDAT_ZEHUT_EXT, JO_NATIONAL_ID,
JO_PASSPORT, KE_KRA_PIN, KE_NATIONAL_ID, KE_NHIF, KE_PASSPORT, KW_CIVIL_ID, KW_CR, LB_NATIONAL_ID,
LB_PASSPORT, MA_CIN, MA_ICE, MA_PASSPORT, MA_TAX_ID, NG_BVN, NG_CAC_NUMBER, NG_NIN, NG_PASSPORT,
NG_PHONE, NG_TIN, OM_CR, OM_RESIDENT_CARD, QA_CR, QA_QID, SA_CR, SA_IQAMA, SA_VAT, TN_CIN,
TN_PASSPORT, TR_PASSPORT_EXT, TR_TC_KIMLIK_EXT, TR_VERGI_KIMLIK_EXT, TZ_NIDA, AE_EMIRATES_ID,
AE_TRADE_LICENSE, AE_TRN, UG_NIN, ZA_BANK_ACCOUNT, ZA_COMPANY_REG, ZA_DRIVER_LICENSE, ZA_ID_NUMBER,
ZA_PASSPORT, ZA_TAX_NUMBER
Original 357 Entities:
ACCESSION_NUMBER DE_AKTENZEICHEN INSURANCE_GROUP
AD_NRT DE_AUFENTHALTSTITEL INSURANCE_POLICY
AE_EMIRATES_ID DE_BETRIEBSNUMMER INTL_PHONE_E164
AL_ID_CARD DE_BIC IN_VOTER_ID
AL_NIPT DE_DRIVER_LICENSE IPV6_ADDRESS
ANSIBLE_VAULT DE_GRUNDBUCH IS_DRIVER_LICENSE
ANTHROPIC_API_KEY DE_HANDELSREGISTER ISIN
AR_CUIT DE_IBAN IS_KENNITALA
AR_DNI DE_ID_CARD IS_PASSPORT
AR_PASSPORT DE_KFZ IT_CAP
AT_DRIVER_LICENSE DE_KVNR IT_DRIVER_LICENSE
AT_FIRMENBUCH DE_MATRIKELNUMMER IT_FISCAL_CODE
AT_IBAN DE_PASSPORT IT_HEALTH_CARD
AT_KFZ DE_PERSONALNUMMER IT_VAT_CODE
AT_PLZ DE_PLZ JP_CORPORATE_NUMBER
AT_SVN DE_RVNR JP_DRIVER_LICENSE
AT_TAX_ID DE_STEUERNUMMER JP_HEALTH_INSURANCE
AT_UID DE_SVN JP_MY_NUMBER
AU_ABN DE_USTID JP_POSTAL_CODE
AU_ACN DE_ZAEHLERNUMMER JWT_TOKEN
AU_MEDICARE DIGITALOCEAN_TOKEN K8S_RESOURCE_NAME
AU_POSTCODE DISCORD_TOKEN K8S_SECRET_DATA
AU_TFN DISCORD_WEBHOOK KR_BRN
AWS_ACCESS_KEY DK_CPR KR_DRIVER_LICENSE
AWS_ACCOUNT_ID DK_DRIVER_LICENSE KR_POSTAL_CODE
AWS_ARN DOCKER_CONTAINER_ID KR_RRN
AWS_MWS_KEY DOCKER_IMAGE_DIGEST LEI
AWS_SECRET_KEY EE_DRIVER_LICENSE LI_AHV
AZURE_CLIENT_SECRET EE_ID_CARD LICENSE_KEY
AZURE_OPENAI_KEY EE_IK LI_IBAN
AZURE_RESOURCE_ID EMAIL_ADDRESS LI_ID_CARD
AZURE_STORAGE_KEY EMPLOYEE_ID LI_MWST
AZURE_SUBSCRIPTION_ID ENCOUNTER_NUMBER LI_PLZ
BADGE_NUMBER ENV_SECRET_ASSIGNMENT LOINC_CODE
BA_ID_CARD ES_CODIGO_POSTAL LT_ASMENS_KODAS
BA_JMBG ES_NIE LT_DRIVER_LICENSE
BASIC_AUTH ES_NIF LT_ID_CARD
BEARER_TOKEN ES_NSS LU_NATIONAL_ID
BE_DRIVER_LICENSE ETH_ADDRESS LU_PASSPORT
BE_NATIONAL_NUMBER EU_COURT_CASE LU_VAT
BE_POSTCODE EU_VEHICLE_REG LV_DRIVER_LICENSE
BE_VAT FACILITY_NPI LV_ID_CARD
BG_DRIVER_LICENSE FI_DRIVER_LICENSE LV_PERSONAS_KODS
BG_EGN FI_HETU MAC_ADDRESS
BG_ID_CARD FR_CNI MAILCHIMP_KEY
BG_PASSPORT FR_CODE_POSTAL MAILGUN_KEY
BILL_OF_LADING FR_SIREN MC_ID_CARD
BITBUCKET_TOKEN FR_SIRET MD_ID_CARD
BR_CEP FR_TAX_ID MD_IDNP
BR_CNPJ FR_TVA ME_ID_CARD
BR_CPF GCP_API_KEY ME_JMBG
BR_PIS GCP_PROJECT_ID MISTRAL_API_KEY
BR_RG GENERIC_API_KEY MK_EMBG
BTC_ADDRESS GEO_COORDINATES MK_ID_CARD
BUILD_NUMBER GIT_COMMIT_HASH MK_PASSPORT
BY_ID_CARD GITHUB_FINE_GRAINED MLS_NUMBER
BY_PASSPORT GITHUB_TOKEN MONGODB_URI
BY_TAX_ID GITLAB_TOKEN MRN
CA_BN GIT_REF MSISDN
CA_DRIVER_LICENSE GOOGLE_AI_KEY MT_ID_CARD
CA_HEALTH_CARD GR_AFM MT_VAT
CA_POSTAL_CODE GR_AMKA MX_CODIGO_POSTAL
CA_SIN GR_DRIVER_LICENSE MX_CURP
CH_AHV GR_ID_CARD MX_NSS
CH_DRIVER_LICENSE HCPCS_CODE MX_RFC
CH_IBAN HEALTH_MEMBER_ID MYSQL_URI
CH_KFZ HK_ID NDC_CODE
CH_MWST HR_DRIVER_LICENSE NL_BSN
CH_PLZ HR_ID_CARD NL_POSTCODE
CH_UID HR_OIB NL_VAT
CL_PASSPORT HU_DRIVER_LICENSE NO_D_NUMBER
CL_RUT HUGGINGFACE_TOKEN NO_DRIVER_LICENSE
CN_POSTAL_CODE HU_ID_CARD NO_FODSELSNUMMER
CN_RESIDENT_ID HU_SSN NPM_PACKAGE_REF
CN_USCC HU_TAX_NUMBER NPM_TOKEN
CO_CC ICCID NZ_DRIVER_LICENSE
COHERE_API_KEY ICD10_CODE NZ_IRD
CO_NIT ICD11_CODE NZ_NHI
CONTAINER_NUMBER IE_PPS OPENAI_API_KEY
CO_PASSPORT IE_VAT PARCEL_NUMBER
CPT_CODE IL_ID_NUMBER PASSWORD_EXPOSED
CREDIT_CARD IMEI PGP_PRIVATE_KEY
CUSIP IMSI PHONE_NUMBER
CY_ID_CARD IN_AADHAAR PL_DRIVER_LICENSE
CY_TAX_ID IN_DRIVER_LICENSE PL_ID_CARD
CY_VAT IN_GSTIN PL_KOD_POCZTOWY
CZ_DRIVER_LICENSE IN_IFSC PL_NIP
CZ_ID_CARD IN_PAN PL_PESEL
CZ_RODNE_CISLO IN_PIN_CODE PL_REGON
POSTGRESQL_URI SI_ID_CARD UK_VAT
PT_DRIVER_LICENSE SI_PASSPORT US_COURT_CASE
PT_NIF SI_TAX_NUMBER US_EIN
PT_NISS SK_DRIVER_LICENSE US_GREEN_CARD
PYPI_TOKEN SK_ID_CARD US_MBI
REDIS_URI SK_RODNE_CISLO US_NPI
REPLICATE_API_TOKEN SLACK_TOKEN US_ROUTING_NUMBER
REQUEST_ID SLACK_WEBHOOK US_STATE_LICENSE
RO_CNP SM_ID_CARD US_VIN
RO_CUI SNOMED_CODE US_VISA_NUMBER
RO_DRIVER_LICENSE SSH_PRIVATE_KEY US_ZIP_CODE
RO_ID_CARD SSH_PUBLIC_KEY UTILITY_ACCOUNT
RS_DRIVER_LICENSE STRIPE_KEY UUID
RS_ID_CARD STUDENT_ID VAULT_TOKEN
RS_JMBG TELEGRAM_TOKEN VEHICLE_TITLE
RS_PASSPORT TERRAFORM_SECRET VIN
RU_DRIVER_LICENSE TRACE_ID WINDOWS_PRODUCT_KEY
RU_INN TRACKING_NUMBER X509_CERTIFICATE
RU_PASSPORT_FOREIGN TRANSCRIPT_NUMBER XK_ID_CARD
RU_PASSPORT_INTERNAL TR_KIMLIK_NO
RU_SNILS TR_TAX_ID
RX_NUMBER TW_ID
SA_NATIONAL_ID TWILIO_CREDENTIALS
SA_PASSPORT UA_DRIVER_LICENSE
SCHENGEN_VISA UA_ID_CARD
SEDOL UA_IPN
SEMVER UK_COMPANY_NUMBER
SENDGRID_KEY UK_COURT_CASE
SE_PERSONNUMMER UK_NINO
SE_TAX_ID UK_POSTCODE
SG_POSTAL_CODE UK_UTR
SG_UEN
SI_DRIVER_LICENSE
SI_EMSO
Complete Preset Reference
All 157 Presets (Alphabetical)
Phase 8 Additions (32 new presets):
- Peru - Completo | Venezuela - Completo | Ecuador - Completo
- Uruguay - Completo | Paraguay - Completo | Bolivia - Completo
- Costa Rica - Completo | Panama - Completo | Honduras - Completo
- El Salvador - Completo | Guatemala - Completo | Nicaragua - Completo
- Dominican Republic - Completo | Cuba - Completo | Jamaica - Complete
- Latin America - Complete | South Africa - Complete | Nigeria - Complete
- Kenya - Complete | Egypt - Complete | Morocco - Complet
- Ghana - Complete | Saudi Arabia - Complete | UAE - Complete
- Qatar - Complete | Kuwait - Complete | Bahrain - Complete
- Oman - Complete | Gulf States - Complete | Africa - Major Economies
- Middle East - Complete | Pakistan - Extended
Original 125 Presets:
| # | Preset Name | Entities | Description |
|---|---|---|---|
| 1 | AI and LLM Provider Keys | 8 | AI/LLM API keys |
| 2 | AI Provider API Keys | 12 | Extended AI keys |
| 3 | Andorra - Complet | 13 | Andorran PII |
| 4 | Anglophone - Complete | 48 | English-speaking regions |
| 5 | Argentina - Completo | 14 | Argentine PII |
| 6 | Asia Pacific - Complete | 42 | APAC coverage |
| 7 | Australia - Complete | 21 | Australian PII |
| 8 | Auto - Development & Testing | 33 | Dev/test secrets |
| 9 | Auto - European Multi-Language | 13 | EU multi-language |
| 10 | Auto - Financial Universal | 11 | Universal financial |
| 11 | Auto - General Detection | 11 | Language-agnostic |
| 12 | Auto - Healthcare Universal | 9 | Universal healthcare |
| 13 | Automotive Industry | 10 | Vehicle data |
| 14 | Balkans - Complete | 52 | All Balkans |
| 15 | Baltic - Complete | 23 | Baltic states |
| 16 | Belgique - Complet | 20 | Belgian PII |
| 17 | Bosna i Hercegovina - Potpuno | 14 | Bosnian PII |
| 18 | Brasil - Completo | 19 | Brazilian PII |
| 19 | Brasil - Serviços Financeiros | 15 | Brazilian financial |
| 20 | Canada - Complete | 20 | Canadian PII |
| 21 | Catalunya - Complet | 16 | Catalan PII |
| 22 | Central Europe - Complete | 30 | Visegrad countries |
| 23 | Česká republika - Kompletní | 15 | Czech PII |
| 24 | Chile - Completo | 13 | Chilean PII |
| 25 | CI/CD Secrets | 11 | CI/CD secrets |
| 26 | Cloud AWS | 9 | AWS credentials |
| 27 | Cloud Azure | 7 | Azure credentials |
| 28 | Cloud GCP Google | 7 | GCP credentials |
| 29 | Cloud Provider Credentials | 10 | Multi-cloud |
| 30 | Colombia - Completo | 14 | Colombian PII |
| 31 | Communication Platform Tokens | 5 | Chat platform tokens |
| 32 | Crna Gora - Potpuno | 14 | Montenegrin PII |
| 33 | Cryptocurrency and Web3 | 10 | Crypto addresses |
| 34 | Cyprus - Complete | 15 | Cypriot PII |
| 35 | DACH - Fahrzeugdaten | 21 | DACH vehicles |
| 36 | DACH - Finanzwesen | 27 | DACH financial |
| 37 | DACH - Gesundheitswesen | 22 | DACH healthcare |
| 38 | DACH Region - Vollständig | 55 | Complete DACH |
| 39 | DACH - Unternehmensdaten | 22 | DACH business |
| 40 | Danmark - Komplet | 14 | Danish PII |
| 41 | Database Connection Strings | 5 | DB URIs |
| 42 | Database Connection URIs | 9 | Extended DB URIs |
| 43 | Deutschland - Finanzwesen | 20 | German financial |
| 44 | Deutschland - Vollständig | 41 | Complete German |
| 45 | Developer Secrets - Complete | 44 | All dev secrets |
| 46 | DevOps Infrastructure | 15 | DevOps patterns |
| 47 | Education FERPA | 9 | Student data |
| 48 | Eesti - Täielik | 15 | Estonian PII |
| 49 | España - Completo | 21 | Spanish PII |
| 50 | España - Servicios Financieros | 17 | Spanish financial |
| 51 | European Postal Codes | 12 | EU postal codes |
| 52 | European Union - Complete | 132 | All EU countries |
| 53 | European VAT Numbers | 8 | EU VAT |
| 54 | Financial Securities and Trading | 17 | Securities IDs |
| 55 | France - Complet | 27 | French PII |
| 56 | France - Services Financiers | 20 | French financial |
| 57 | Francophone - Complet | 26 | French-speaking |
| 58 | Git Repository Secrets | 12 | Git tokens |
| 59 | Global Address and Location | 21 | Global addresses |
| 60 | Healthcare Extended EHR | 21 | EHR data |
| 61 | Hispanoamérica - Completo | 27 | Spanish-speaking |
| 62 | HR and Employment | 12 | Employee data |
| 63 | Hrvatska - Potpuno | 15 | Croatian PII |
| 64 | Immigration Documents | 11 | Visa/immigration |
| 65 | Insurance Industry | 14 | Insurance data |
| 66 | Ireland - Complete | 15 | Irish PII |
| 67 | Ísland - Heildar | 14 | Icelandic PII |
| 68 | Italia - Completo | 22 | Italian PII |
| 69 | Italia - Servizi Finanziari | 17 | Italian financial |
| 70 | Kosova - E plotë | 13 | Kosovar PII |
| 71 | Latvija - Pilnīgs | 15 | Latvian PII |
| 72 | Legal and Court Documents | 12 | Court cases |
| 73 | Liechtenstein - Vollständig | 17 | Liechtenstein PII |
| 74 | Lietuva - Pilnas | 15 | Lithuanian PII |
| 75 | Logistics and Shipping | 8 | Shipping data |
| 76 | Lusófono - Completo | 20 | Portuguese-speaking |
| 77 | Luxembourg - Complet | 14 | Luxembourg PII |
| 78 | Magyarország - Teljes | 16 | Hungarian PII |
| 79 | Malta - Complete | 14 | Maltese PII |
| 80 | México - Completo | 14 | Mexican PII |
| 81 | Middle East - Complete | 21 | Middle East |
| 82 | Moldova - Complet | 14 | Moldovan PII |
| 83 | Monaco - Complet | 13 | Monaco PII |
| 84 | Nederland - Volledig | 20 | Dutch PII |
| 85 | Network Infrastructure | 7 | Network IDs |
| 86 | New Zealand - Complete | 15 | NZ PII |
| 87 | Nordic - Complete | 30 | Nordic countries |
| 88 | Norge - Komplett | 15 | Norwegian PII |
| 89 | Österreich - Vollständig | 26 | Austrian PII |
| 90 | Polska - Kompletny | 21 | Polish PII |
| 91 | Polska - Usługi Finansowe | 17 | Polish financial |
| 92 | Portugal - Completo | 16 | Portuguese PII |
| 93 | Real Estate Property | 10 | Property data |
| 94 | România - Complet | 16 | Romanian PII |
| 95 | San Marino - Completo | 13 | San Marino PII |
| 96 | Schweiz - Vollständig | 24 | Swiss PII |
| 97 | Shqipëri - I plotë | 14 | Albanian PII |
| 98 | Singapore - Complete | 14 | Singapore PII |
| 99 | Slovenija - Popolno | 16 | Slovenian PII |
| 100 | Slovensko - Kompletné | 15 | Slovak PII |
| 101 | Software Licensing | 8 | License keys |
| 102 | Suomi - Täydellinen | 14 | Finnish PII |
| 103 | Sverige - Komplett | 15 | Swedish PII |
| 104 | Telecommunications | 10 | Telecom IDs |
| 105 | Türkiye - Tam | 15 | Turkish PII |
| 106 | United Kingdom - Complete | 26 | UK PII |
| 107 | United Kingdom - Healthcare | 15 | UK NHS |
| 108 | United States - Complete | 31 | US PII |
| 109 | United States - Financial | 23 | US financial |
| 110 | United States - Healthcare | 24 | US healthcare |
| 111 | US Healthcare - HIPAA Compliant | 22 | HIPAA |
| 112 | Utilities and Energy | 8 | Utility data |
| 113 | Vatican City - Complete | 12 | Vatican PII |
| 114 | Ελλάδα - Πλήρες | 16 | Greek PII |
| 115 | Беларусь - Поўны | 14 | Belarusian PII |
| 116 | България - Пълен | 15 | Bulgarian PII |
| 117 | Македонија - Комплетно | 14 | Macedonian PII |
| 118 | Россия - Полный | 16 | Russian PII |
| 119 | Србија - Потпуно | 15 | Serbian PII |
| 120 | Україна - Повний | 15 | Ukrainian PII |
| 121 | الإمارات - كامل | 13 | UAE PII |
| 122 | भारत - पूर्ण | 18 | Indian PII |
| 123 | 中国 - 完整 | 13 | Chinese PII |
| 124 | 日本 - 完全 | 16 | Japanese PII |
| 125 | 한국 - 완전 | 14 | Korean PII |
API Usage
Analyze Text
curl -X POST https://cloak.business/api/analyze \
  -H "Content-Type: application/json" \
  -d '{
    "text": "My SSN is 123-45-6789",
    "language": "en",
    "entities": ["US_SSN", "CREDIT_CARD"]
  }'
Use Preset
curl -X POST https://cloak.business/api/analyze \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Contact: john@example.com, SSN: 123-45-6789",
    "preset": "United States - Complete"
  }'
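The two curl calls above differ only in whether the body carries an explicit `entities` list or a `preset` name. As a hedged sketch of the same requests from Python, using only the standard library, the helper below builds the request without sending it. The name `build_analyze_request` is hypothetical; authentication headers and the response schema are not covered in this section and are therefore left out.

```python
import json
import urllib.request
from typing import Optional, Sequence

ANALYZE_URL = "https://cloak.business/api/analyze"  # endpoint from the curl examples

def build_analyze_request(
    text: str,
    entities: Optional[Sequence[str]] = None,
    preset: Optional[str] = None,
    language: Optional[str] = None,
) -> urllib.request.Request:
    """Build (but do not send) a POST request mirroring the curl examples."""
    payload: dict = {"text": text}
    if entities is not None:
        payload["entities"] = list(entities)
    if preset is not None:
        payload["preset"] = preset
    if language is not None:
        payload["language"] = language
    return urllib.request.Request(
        ANALYZE_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

request = build_analyze_request(
    "Contact: john@example.com, SSN: 123-45-6789",
    preset="United States - Complete",
)
# Sending is left to the caller, for example urllib.request.urlopen(request).
```

Keeping request construction separate from sending makes the payload easy to inspect or log before any text leaves the machine.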
Analyze Image
Detect PII in an image and return bounding box locations.
curl -X POST https://cloak.business/api/presidio/image \
  -F "file=@document.png" \
  -F "mode=analyze" \
  -F "language=en" \
  -F 'entities=["PERSON","EMAIL_ADDRESS","PHONE_NUMBER"]' \
  -F "score_threshold=0.5"
Response:
{
  "entities": [
    {
      "entity_type": "PERSON",
      "start": 0, "end": 9,
      "score": 0.95,
      "left": 100, "top": 50, "width": 120, "height": 30
    }
  ],
  "processing_time": 2.345
}
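A client will usually want the detections as rectangles it can draw or crop. The sketch below parses the sample response shown above and converts each entry to `(left, top, right, bottom)`. It assumes the coordinates are pixel offsets from the top-left corner of the image, which the response format suggests but this section does not state; `boxes_above_threshold` is a hypothetical helper name.

```python
import json

sample_response = """
{
  "entities": [
    {"entity_type": "PERSON", "start": 0, "end": 9, "score": 0.95,
     "left": 100, "top": 50, "width": 120, "height": 30}
  ],
  "processing_time": 2.345
}
"""

def boxes_above_threshold(response_text: str, threshold: float = 0.5):
    """Yield (entity_type, (left, top, right, bottom)) for confident detections."""
    for entity in json.loads(response_text)["entities"]:
        if entity["score"] >= threshold:
            left, top = entity["left"], entity["top"]
            yield entity["entity_type"], (
                left, top, left + entity["width"], top + entity["height"]
            )

boxes = list(boxes_above_threshold(sample_response))
```

Filtering on `score` client-side complements the `score_threshold` form field, for example when one response feeds several consumers with different tolerances.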
Redact Image
Detect and redact PII, returning a new image with redaction rectangles.
curl -X POST https://cloak.business/api/presidio/image \
  -F "file=@document.png" \
  -F "mode=redact" \
  -F "language=en" \
  -F "fill_color=black" \
  --output redacted.png
Supported fill colors: black, white, red, green, blue, gray.
Health Check
# Text analyzer
curl https://cloak.business/health/analyzer
# {"status":"healthy","version":"6.8.0","custom_recognizers":317}
# Image redactor
curl https://cloak.business/health/image
# {"status":"healthy","version":"5.2.0","tesseract_languages":37,"custom_recognizers":317}
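For monitoring, the health payloads above can be checked programmatically. This sketch parses a body shaped like the sample analyzer output and compares it against expected values. The function name `is_healthy` is hypothetical, and status values other than "healthy" are not documented in this section.

```python
import json

def is_healthy(body: str, expected_recognizers: int = 317) -> bool:
    """Return True if the health payload reports a healthy service
    with the expected number of custom recognizers."""
    status = json.loads(body)
    return (
        status.get("status") == "healthy"
        and status.get("custom_recognizers") == expected_recognizers
    )

# Sample body copied from the analyzer health check output above.
sample = '{"status":"healthy","version":"6.8.0","custom_recognizers":317}'
ok = is_healthy(sample)
```

Checking the recognizer count as well as the status flag can catch a deployment that started but failed to load part of its pattern set.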
Image Processing Notes
- Supported formats: PNG, JPG, BMP, TIFF (max 10 MB, max 25 megapixels)
- OCR languages: 37 Tesseract language packs installed
- EXIF orientation is auto-corrected before OCR
- Bounding box padding: 4px default for clean redaction coverage
- Adjacent boxes merged for multi-word entities
For full image anonymization documentation, see IMAGE_ANONYMIZATION.md.
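The padding and merging notes above can be illustrated with a small geometric sketch. The 4px default comes from the list; the merge rule (vertical overlap plus a horizontal gap of at most `max_gap` pixels) and the names `pad_box` and `merge_adjacent` are assumptions for illustration, since the actual merge criteria are not specified here.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom) in pixels

def pad_box(box: Box, image_size: Tuple[int, int], padding: int = 4) -> Box:
    """Grow `box` by `padding` pixels on each side, clamped to the image."""
    left, top, right, bottom = box
    width, height = image_size
    return (max(0, left - padding), max(0, top - padding),
            min(width, right + padding), min(height, bottom + padding))

def merge_adjacent(boxes: List[Box], max_gap: int = 10) -> List[Box]:
    """Greedy single pass: merge boxes on the same text line within `max_gap`."""
    merged: List[Box] = []
    for box in sorted(boxes):  # ascending left edge
        for i, other in enumerate(merged):
            same_line = box[1] < other[3] and other[1] < box[3]  # vertical overlap
            close = box[0] - other[2] <= max_gap
            if same_line and close:
                merged[i] = (other[0], min(other[1], box[1]),
                             max(other[2], box[2]), max(other[3], box[3]))
                break
        else:
            merged.append(box)
    return merged

words = [(100, 50, 150, 80), (155, 52, 220, 80), (100, 200, 150, 230)]
lines = merge_adjacent(words)
padded = [pad_box(b, (1000, 800)) for b in lines]
```

Merging before padding keeps a multi-word entity under one rectangle instead of leaving thin unredacted gaps between words.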
Version History
| Version | Date | Changes |
|---|---|---|
| 6.8.1 | 2026-02-12 | Hash_type validation (400 for invalid), analyzer memory optimization (MAX_CACHED_LANGUAGES 5→3), MD5 removed from API contracts |
| 6.8.0 | 2026-02-11 | Backend-enforced request limits (entity filters, ad-hoc recognizers, regex patterns), /limits endpoint, WatchdogSec disabled |
| 6.7.0 | 2026-02-10 | 317 recognizers — 39 US & global entities (US IDs, healthcare codes, financial instruments, crypto, GPS coordinates) |
| 6.6.0 | 2026-02-10 | 279 recognizers — 68 technical secrets & infrastructure recognizers (API keys, cloud credentials, database URIs) |
| 6.4.3 | 2026-02-09 | 210 recognizers (246 patterns), 48 languages, 75+ countries, boundary assertion migration |
| 6.4.1 | 2026-02-08 | Image entities validation, desktop image redaction fix |
| 6.4.0 | 2026-02-08 | Desktop image mode, plan-based limits, image token cost rebalancing |
| 6.3.3 | 2026-02-07 | Image bounding box overlay accuracy fix |
| 6.3.2 | 2026-02-07 | Image Redactor v5.2.0 — OCR language hints, EXIF handling, bbox padding/merging |
| 6.3.0 | 2026-02-07 | Analyzer ReDoS protection, input size limits |
| 6.2.0 | 2026-02-06 | Added 186 new recognizers (Phases 1-5) |
| 6.1.0 | 2026-02-03 | Initial 171 recognizers |
Document maintained by cloak.business
Contact: support@cloak.business