cloak.business Entity & Preset Inventory

Last Updated: 2026-02-12 Analyzer Version: 6.9.0 Image Redactor Version: 5.2.0 Total Recognizers: 317 PatternRecognizers + NLP models Total Presets: 157+


Executive Summary#

cloak.business provides enterprise-grade PII detection with:

  • 317 custom regex PatternRecognizers covering personal data, technical secrets, and industry-specific identifiers
  • NLP detection via spaCy (25 languages), Stanza NER (7 languages), and XLM-RoBERTa transformers (16 languages) for names, locations, and organizations
  • 390+ entity types detected across 75+ countries
  • 157+ presets for 100+ countries, regions, and industry verticals
  • Support for 48 languages with country-specific patterns
  • Global coverage: Europe, Americas, Asia-Pacific, Africa, Middle East
  • Image anonymization with 37 OCR languages (see IMAGE_ANONYMIZATION.md)

Table of Contents#

  1. Entity Categories
  2. Presets by Region
  3. Industry-Specific Presets
  4. Technical/Developer Presets
  5. Complete Entity Reference
  6. Complete Preset Reference

Entity Categories#

Personal Identification (78 entities)#

CategoryEntitiesCountries
National ID Numbers45+All EU, US, APAC
Passport Numbers30+Global
Driver Licenses35+Global
Tax IDs25+Global
Social Security15+US, EU, APAC

Examples by Country

CountryEntity Types
GermanyDE_ID_CARD, DE_PASSPORT, DE_DRIVER_LICENSE, DE_TAX_ID, DE_SVN, DE_STEUERNUMMER
USAUS_SSN, US_PASSPORT, US_DRIVER_LICENSE, US_ITIN, US_EIN
UKUK_NINO, UK_PASSPORT, UK_UTR, UK_NHS
FranceFR_CNI, FR_PASSPORT, FR_NIR, FR_TAX_ID

Financial Identifiers (35 entities)#

EntityDescriptionExample Pattern
CREDIT_CARDCredit/debit card numbersLuhn-validated 13-19 digits
IBAN_CODEInternational Bank Account NumberCountry-specific formats
SWIFT_CODEBank identifier codes8-11 alphanumeric
CUSIPUS/Canada securities9 alphanumeric
ISINInternational securities12 alphanumeric
LEILegal Entity Identifier20 alphanumeric
SEDOLUK securities7 alphanumeric

Country-Specific Banking

CountryEntities
GermanyDE_IBAN, DE_BIC
AustriaAT_IBAN
SwitzerlandCH_IBAN
FranceFR_IBAN
UKUK_SORT_CODE
USAUS_BANK_NUMBER, US_ROUTING_NUMBER

Healthcare Identifiers (25 entities)#

EntityDescriptionRegion
NPINational Provider IdentifierUS
MBIMedicare Beneficiary IdentifierUS
DEA_NUMBERDrug Enforcement AdministrationUS
ICD10_CODEDiagnosis codesInternational
ICD11_CODEDiagnosis codes (new)International
CPT_CODEProcedure codesUS
HCPCS_CODEHealthcare supply codesUS
NDC_CODENational Drug CodeUS
LOINC_CODELab observation codesInternational
SNOMED_CODEClinical terminologyInternational
MRNMedical Record NumberUniversal
RX_NUMBERPrescription numbersUS
ACCESSION_NUMBERLab/radiology specimensUniversal
ENCOUNTER_NUMBERVisit/admission IDsUniversal

Country-Specific Healthcare

CountryEntities
GermanyDE_KVNR (health insurance), DE_RVNR (pension)
UKUK_NHS
JapanJP_HEALTH_INSURANCE
ItalyIT_HEALTH_CARD

Technical Secrets (48 entities)#

AI Provider API Keys

EntityProviderPattern
OPENAI_API_KEYOpenAIsk-...T3BlbkFJ...
ANTHROPIC_API_KEYAnthropic Claudesk-ant-api03-...
GOOGLE_AI_KEYGoogle AIAIza...
AZURE_OPENAI_KEYAzure OpenAI32 hex characters
MISTRAL_API_KEYMistral AI32 alphanumeric
HUGGINGFACE_TOKENHuggingFacehf_...
COHERE_API_KEYCohere40 alphanumeric
REPLICATE_API_TOKENReplicater8_...

Cloud Provider Credentials

EntityProviderPattern
AWS_ACCESS_KEYAWSAKIA... (20 chars)
AWS_SECRET_KEYAWS40 char base64
AWS_SESSION_TOKENAWSLong base64
AZURE_CLIENT_SECRETAzure34-40 alphanumeric
AZURE_STORAGE_KEYAzure88 char base64
GCP_API_KEYGoogle CloudAIza...
DIGITALOCEAN_TOKENDigitalOceandop_v1_...

Version Control Tokens

EntityPlatformPattern
GITHUB_TOKENGitHub PATghp_...
GITHUB_FINE_GRAINED_TOKENGitHubgithub_pat_...
GITLAB_TOKENGitLabglpat-...
BITBUCKET_TOKENBitbucketContext-based
NPM_TOKENnpmnpm_...
PYPI_TOKENPyPIpypi-...

Authentication & Secrets

EntityDescription
JWT_TOKENJSON Web Tokens
BEARER_TOKENBearer authentication
BASIC_AUTHBasic auth headers
SSH_PRIVATE_KEYSSH private keys
SSH_PUBLIC_KEYSSH public keys
PGP_PRIVATE_KEYPGP/GPG keys
X509_CERTIFICATESSL/TLS certificates

Database Connection Strings

EntityDatabase
MONGODB_URIMongoDB
POSTGRESQL_URIPostgreSQL
MYSQL_URIMySQL/MariaDB
REDIS_URIRedis

SaaS Platform Tokens

EntityPlatform
SLACK_TOKENSlack
SLACK_WEBHOOKSlack webhooks
STRIPE_KEYStripe payments
SENDGRID_KEYSendGrid email
MAILGUN_KEYMailgun email
TWILIO_CREDENTIALSTwilio
DISCORD_TOKENDiscord
TELEGRAM_TOKENTelegram

Infrastructure Identifiers (24 entities)#

Network Identifiers

EntityDescriptionExample
MAC_ADDRESSHardware addressAA:BB:CC:DD:EE:FF
UUIDUniversal unique IDxxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx
IPV6_ADDRESSIPv6 addresses2001:db8::1

Container & Orchestration

EntityDescription
DOCKER_CONTAINER_IDDocker container IDs
DOCKER_IMAGE_DIGESTDocker image digests
K8S_RESOURCE_NAMEKubernetes pod/resource names
K8S_SECRET_DATAKubernetes secret data

Cloud Resource IDs

EntityProvider
AWS_ARNAmazon Resource Names
AWS_ACCOUNT_IDAWS account IDs
AZURE_RESOURCE_IDAzure resource paths
AZURE_SUBSCRIPTION_IDAzure subscription GUIDs
GCP_PROJECT_IDGCP project identifiers

Observability

EntityDescription
TRACE_IDDistributed tracing IDs
REQUEST_IDRequest correlation IDs

Build & CI/CD

EntityDescription
GIT_COMMIT_HASHGit SHA-1 hashes
GIT_REFGit branch/tag references
SEMVERSemantic versions
BUILD_NUMBERCI/CD build numbers
NPM_PACKAGE_REFnpm package@version

Industry-Specific Identifiers (40 entities)#

Legal/Court

EntityJurisdiction
US_COURT_CASEUS federal/state courts
EU_COURT_CASEECJ/ECHR
UK_COURT_CASEUK courts
DE_AKTENZEICHENGerman courts

Insurance

EntityDescription
INSURANCE_POLICYPolicy numbers
INSURANCE_CLAIMClaim IDs
HEALTH_MEMBER_IDMember IDs
INSURANCE_GROUPGroup numbers

Automotive

EntityDescription
VINVehicle Identification Number
VEHICLE_TITLETitle numbers
EU_VEHICLE_REGEU registration docs
DE_KFZGerman license plates
AT_KFZAustrian license plates
CH_KFZSwiss license plates

Telecommunications

EntityDescription
IMEIMobile equipment ID
IMSIMobile subscriber ID
ICCIDSIM card numbers
MSISDNMobile phone numbers

Real Estate

EntityDescription
PARCEL_NUMBERProperty parcel/APN
MLS_NUMBERMLS listing numbers
DE_GRUNDBUCHGerman land registry

Logistics

EntityDescription
TRACKING_NUMBERUPS, FedEx, DHL, USPS
BILL_OF_LADINGShipping BOL
CONTAINER_NUMBERISO container IDs

HR/Employment

EntityDescription
EMPLOYEE_IDEmployee identifiers
DE_PERSONALNUMMERGerman personnel numbers
BADGE_NUMBERAccess card numbers
STUDENT_IDStudent identifiers
TRANSCRIPT_NUMBERAcademic records

Location & Address (25 entities)#

Postal Codes by Country

EntityCountryFormat
DE_PLZGermany5 digits
AT_PLZAustria4 digits
CH_PLZSwitzerland4 digits
FR_CODE_POSTALFrance5 digits
UK_POSTCODEUKAlphanumeric
US_ZIP_CODEUSA5 or 9 digits
CA_POSTAL_CODECanadaA1A 1A1
AU_POSTCODEAustralia4 digits
JP_POSTAL_CODEJapan7 digits
IN_PIN_CODEIndia6 digits
BR_CEPBrazil8 digits
IT_CAPItaly5 digits
ES_CPSpain5 digits
NL_POSTCODENetherlands4 digits + 2 letters
BE_POSTCODEBelgium4 digits
PL_KOD_POCZTOWYPolandXX-XXX

Other Location

EntityDescription
GEO_COORDINATESGPS lat/long
PHONE_NUMBERPhone numbers
EMAIL_ADDRESSEmail addresses

Presets by Region#

Western Europe (15 presets)#

PresetEntitiesLanguages
Deutschland - Vollständig41German
Deutschland - Finanzwesen20German
Österreich - Vollständig26German
Schweiz - Vollständig24German, French, Italian
France - Complet27French
France - Services Financiers20French
Belgique - Complet20French, Dutch
Nederland - Volledig20Dutch
Luxembourg - Complet14French, German
Liechtenstein - Vollständig17German
Monaco - Complet13French

DACH Region (6 presets)#

PresetEntitiesFocus
DACH Region - Vollständig55Complete DE/AT/CH/LI
DACH - Finanzwesen27Financial sector
DACH - Gesundheitswesen22Healthcare
DACH - Unternehmensdaten22Business data
DACH - Fahrzeugdaten21Vehicle data

United Kingdom & Ireland (4 presets)#

PresetEntitiesFocus
United Kingdom - Complete26Full UK coverage
United Kingdom - Healthcare15NHS data
Ireland - Complete15Full Ireland

Southern Europe (8 presets)#

PresetEntitiesLanguages
Italia - Completo22Italian
Italia - Servizi Finanziari17Italian
España - Completo21Spanish
España - Servicios Financieros17Spanish
Portugal - Completo16Portuguese
Catalunya - Complet16Catalan
Andorra - Complet13Catalan
San Marino - Completo13Italian

Nordic (6 presets)#

PresetEntitiesLanguages
Nordic - Complete30All Nordic
Sverige - Komplett15Swedish
Norge - Komplett15Norwegian
Danmark - Komplet14Danish
Suomi - Täydellinen14Finnish
Ísland - Heildar14Icelandic

Eastern Europe (8 presets)#

PresetEntitiesLanguages
Central Europe - Complete30PL, CZ, SK, HU
Polska - Kompletny21Polish
Polska - Usługi Finansowe17Polish
Česká republika - Kompletní15Czech
Slovensko - Kompletné15Slovak
Magyarország - Teljes16Hungarian
România - Complet16Romanian
България - Пълен15Bulgarian

Baltic States (4 presets)#

PresetEntitiesLanguages
Baltic - Complete23All Baltic
Lietuva - Pilnas15Lithuanian
Latvija - Pilnīgs15Latvian
Eesti - Täielik15Estonian

Balkans (9 presets)#

PresetEntitiesLanguages
Balkans - Complete52All Balkans
Hrvatska - Potpuno15Croatian
Slovenija - Popolno16Slovenian
Србија - Потпуно15Serbian
Bosna i Hercegovina - Potpuno14Bosnian
Македонија - Комплетно14Macedonian
Shqipëri - I plotë14Albanian
Crna Gora - Potpuno14Montenegrin
Kosova - E plotë13Albanian

CIS/Eastern (4 presets)#

PresetEntitiesLanguages
Россия - Полный16Russian
Україна - Повний15Ukrainian
Беларусь - Поўны14Belarusian
Moldova - Complet14Romanian

Americas (10 presets)#

PresetEntitiesLanguages
United States - Complete31English
United States - Healthcare24English
United States - Financial23English
Canada - Complete20English, French
México - Completo14Spanish
Brasil - Completo19Portuguese
Brasil - Serviços Financeiros15Portuguese
Argentina - Completo14Spanish
Chile - Completo13Spanish
Colombia - Completo14Spanish

Asia Pacific (8 presets)#

PresetEntitiesLanguages
Asia Pacific - Complete42Multiple
日本 - 完全16Japanese
中国 - 完整13Chinese
한국 - 완전14Korean
भारत - पूर्ण18Hindi/English
Singapore - Complete14English
Australia - Complete21English
New Zealand - Complete15English

Middle East (5 presets)#

PresetEntitiesLanguages
Middle East - Complete21Multiple
الإمارات - كامل13Arabic
Türkiye - Tam15Turkish
Ελλάδα - Πλήρες16Greek
Cyprus - Complete15Greek, English

Multi-Region (5 presets)#

PresetEntitiesCoverage
European Union - Complete132All 27 EU states
Anglophone - Complete48UK, US, CA, AU, NZ, IE, SG
Francophone - Complet26FR, BE, LU, CH, MC
Hispanoamérica - Completo27ES + Latin America
Lusófono - Completo20PT + Brazil

Industry-Specific Presets#

Healthcare (3 presets)#

PresetEntitiesFocus
US Healthcare - HIPAA Compliant22HIPAA compliance
Healthcare Extended EHR21EHR/EMR systems
United Kingdom - Healthcare15NHS data

Financial Services (4 presets)#

PresetEntitiesFocus
Financial Securities and Trading17Securities (CUSIP, ISIN, LEI)
Deutschland - Finanzwesen20German financial
España - Servicios Financieros17Spanish financial
United States - Financial23US financial
PresetEntitiesFocus
Legal and Court Documents12Court case numbers

Insurance (1 preset)#

PresetEntitiesFocus
Insurance Industry14Policy/claim IDs

Automotive (1 preset)#

PresetEntitiesFocus
Automotive Industry10VIN, registrations

Telecommunications (1 preset)#

PresetEntitiesFocus
Telecommunications10IMEI, IMSI, SIM

Real Estate (1 preset)#

PresetEntitiesFocus
Real Estate Property10Parcel, MLS, Grundbuch

Logistics (1 preset)#

PresetEntitiesFocus
Logistics and Shipping8Tracking, containers

HR/Education (2 presets)#

PresetEntitiesFocus
HR and Employment12Employee IDs
Education FERPA9Student records

Utilities (1 preset)#

PresetEntitiesFocus
Utilities and Energy8Meter numbers

Immigration (1 preset)#

PresetEntitiesFocus
Immigration Documents11Visas, green cards

Technical/Developer Presets#

Complete Developer Coverage (3 presets)#

PresetEntitiesFocus
Developer Secrets - Complete44ALL secrets
AI and LLM Provider Keys8AI API keys only
Git Repository Secrets12Git tokens, SSH

Cloud Provider (3 presets)#

PresetEntitiesFocus
Cloud AWS9AWS credentials
Cloud Azure7Azure credentials
Cloud GCP Google7GCP credentials

Infrastructure (3 presets)#

PresetEntitiesFocus
DevOps Infrastructure15Containers, K8s
Network Infrastructure7MAC, IP, trace IDs
CI/CD Secrets11Build/deploy secrets

Database & Licensing (2 presets)#

PresetEntitiesFocus
Database Connection URIs9DB connection strings
Software Licensing8License keys

Cryptocurrency (1 preset)#

PresetEntitiesFocus
Cryptocurrency and Web310BTC, ETH addresses

Auto-Detection (5 presets)#

PresetEntitiesFocus
Auto - General Detection11Language-agnostic
Auto - European Multi-Language13EU PII
Auto - Financial Universal11Financial data
Auto - Healthcare Universal9Healthcare data
Auto - Development & Testing33Dev/test secrets

Complete Entity Reference#

All 519 Entity Types (Alphabetical)#

Phase 7-8 Additions:

Asian Markets (54): CN_BANK_CARD, CN_PASSPORT, CN_PHONE, CN_SOCIAL_CREDIT_CODE, CN_VEHICLE_PLATE,
  ID_NIK, ID_NPWP, ID_PASSPORT, IN_BANK_ACCOUNT, IN_CIN, IN_DIN, IN_PASSPORT, IN_TAN, IN_UAN,
  IN_VEHICLE_REG, JP_BANK_ACCOUNT, JP_PASSPORT, JP_PENSION_NUMBER, JP_VEHICLE_PLATE, JP_ZAIRYU_CARD,
  KR_ALIEN_REG, KR_BANK_ACCOUNT, KR_PASSPORT, KR_VEHICLE_PLATE, MY_NRIC, MY_PASSPORT, MY_TIN,
  PH_PASSPORT, PH_SSS, PH_TIN, PK_CNIC, PK_NTN, SG_FIN, TH_NATIONAL_ID, TH_PASSPORT, TH_TAX_ID,
  VN_NATIONAL_ID, VN_PASSPORT, VN_TAX_CODE

Latin America (44): BO_CI, BO_NIT, BO_PASSPORT, CR_CEDULA, CR_CEDULA_JURIDICA, CR_DIMEX, CR_PASSPORT,
  CU_CI, DO_CEDULA, DO_RNC, EC_CEDULA, EC_RUC, EC_PASSPORT, GT_DPI, GT_NIT, HN_DNI, HN_RTN,
  JM_NIDS, JM_TRN, NI_CEDULA, NI_RUC, PA_CEDULA, PA_PASSPORT, PA_RUC, PE_CE, PE_DNI, PE_PASSPORT,
  PE_RUC, PK_IBAN, PK_NIC_OLD, PK_PASSPORT, PK_STRN, PK_VEHICLE_REG, PY_CI, PY_PASSPORT, PY_RUC,
  SV_DUI, SV_NIT, UY_CI, UY_PASSPORT, UY_RUT, VE_CEDULA, VE_PASSPORT, VE_RIF

Africa & Middle East (64): BH_CPR, BH_CR, DZ_NIN, DZ_PASSPORT, EG_NATIONAL_ID, EG_PASSPORT, EG_TAX_ID,
  ET_ID, GH_GHANA_CARD, GH_PASSPORT, GH_TIN, IL_PASSPORT_EXT, IL_TEUDAT_ZEHUT_EXT, JO_NATIONAL_ID,
  JO_PASSPORT, KE_KRA_PIN, KE_NATIONAL_ID, KE_NHIF, KE_PASSPORT, KW_CIVIL_ID, KW_CR, LB_NATIONAL_ID,
  LB_PASSPORT, MA_CIN, MA_ICE, MA_PASSPORT, MA_TAX_ID, NG_BVN, NG_CAC_NUMBER, NG_NIN, NG_PASSPORT,
  NG_PHONE, NG_TIN, OM_CR, OM_RESIDENT_CARD, QA_CR, QA_QID, SA_CR, SA_IQAMA, SA_VAT, TN_CIN,
  TN_PASSPORT, TR_PASSPORT_EXT, TR_TC_KIMLIK_EXT, TR_VERGI_KIMLIK_EXT, TZ_NIDA, AE_EMIRATES_ID,
  AE_TRADE_LICENSE, AE_TRN, UG_NIN, ZA_BANK_ACCOUNT, ZA_COMPANY_REG, ZA_DRIVER_LICENSE, ZA_ID_NUMBER,
  ZA_PASSPORT, ZA_TAX_NUMBER

Original 357 Entities:

ACCESSION_NUMBER        DE_AKTENZEICHEN         INSURANCE_GROUP
AD_NRT                  DE_AUFENTHALTSTITEL     INSURANCE_POLICY
AE_EMIRATES_ID          DE_BETRIEBSNUMMER       INTL_PHONE_E164
AL_ID_CARD              DE_BIC                  IN_VOTER_ID
AL_NIPT                 DE_DRIVER_LICENSE       IPV6_ADDRESS
ANSIBLE_VAULT           DE_GRUNDBUCH            IS_DRIVER_LICENSE
ANTHROPIC_API_KEY       DE_HANDELSREGISTER      ISIN
AR_CUIT                 DE_IBAN                 IS_KENNITALA
AR_DNI                  DE_ID_CARD              IS_PASSPORT
AR_PASSPORT             DE_KFZ                  IT_CAP
AT_DRIVER_LICENSE       DE_KVNR                 IT_DRIVER_LICENSE
AT_FIRMENBUCH           DE_MATRIKELNUMMER       IT_FISCAL_CODE
AT_IBAN                 DE_PASSPORT             IT_HEALTH_CARD
AT_KFZ                  DE_PERSONALNUMMER       IT_VAT_CODE
AT_PLZ                  DE_PLZ                  JP_CORPORATE_NUMBER
AT_SVN                  DE_RVNR                 JP_DRIVER_LICENSE
AT_TAX_ID               DE_STEUERNUMMER         JP_HEALTH_INSURANCE
AT_UID                  DE_SVN                  JP_MY_NUMBER
AU_ABN                  DE_USTID                JP_POSTAL_CODE
AU_ACN                  DE_ZAEHLERNUMMER        JWT_TOKEN
AU_MEDICARE             DIGITALOCEAN_TOKEN      K8S_RESOURCE_NAME
AU_POSTCODE             DISCORD_TOKEN           K8S_SECRET_DATA
AU_TFN                  DISCORD_WEBHOOK         KR_BRN
AWS_ACCESS_KEY          DK_CPR                  KR_DRIVER_LICENSE
AWS_ACCOUNT_ID          DK_DRIVER_LICENSE       KR_POSTAL_CODE
AWS_ARN                 DOCKER_CONTAINER_ID     KR_RRN
AWS_MWS_KEY             DOCKER_IMAGE_DIGEST     LEI
AWS_SECRET_KEY          EE_DRIVER_LICENSE       LI_AHV
AZURE_CLIENT_SECRET     EE_ID_CARD              LICENSE_KEY
AZURE_OPENAI_KEY        EE_IK                   LI_IBAN
AZURE_RESOURCE_ID       EMAIL_ADDRESS           LI_ID_CARD
AZURE_STORAGE_KEY       EMPLOYEE_ID             LI_MWST
AZURE_SUBSCRIPTION_ID   ENCOUNTER_NUMBER        LI_PLZ
BADGE_NUMBER            ENV_SECRET_ASSIGNMENT   LOINC_CODE
BA_ID_CARD              ES_CODIGO_POSTAL        LT_ASMENS_KODAS
BA_JMBG                 ES_NIE                  LT_DRIVER_LICENSE
BASIC_AUTH              ES_NIF                  LT_ID_CARD
BEARER_TOKEN            ES_NSS                  LU_NATIONAL_ID
BE_DRIVER_LICENSE       ETH_ADDRESS             LU_PASSPORT
BE_NATIONAL_NUMBER      EU_COURT_CASE           LU_VAT
BE_POSTCODE             EU_VEHICLE_REG          LV_DRIVER_LICENSE
BE_VAT                  FACILITY_NPI            LV_ID_CARD
BG_DRIVER_LICENSE       FI_DRIVER_LICENSE       LV_PERSONAS_KODS
BG_EGN                  FI_HETU                 MAC_ADDRESS
BG_ID_CARD              FR_CNI                  MAILCHIMP_KEY
BG_PASSPORT             FR_CODE_POSTAL          MAILGUN_KEY
BILL_OF_LADING          FR_SIREN                MC_ID_CARD
BITBUCKET_TOKEN         FR_SIRET                MD_ID_CARD
BR_CEP                  FR_TAX_ID               MD_IDNP
BR_CNPJ                 FR_TVA                  ME_ID_CARD
BR_CPF                  GCP_API_KEY             ME_JMBG
BR_PIS                  GCP_PROJECT_ID          MISTRAL_API_KEY
BR_RG                   GENERIC_API_KEY         MK_EMBG
BTC_ADDRESS             GEO_COORDINATES         MK_ID_CARD
BUILD_NUMBER            GIT_COMMIT_HASH         MK_PASSPORT
BY_ID_CARD              GITHUB_FINE_GRAINED     MLS_NUMBER
BY_PASSPORT             GITHUB_TOKEN            MONGODB_URI
BY_TAX_ID               GITLAB_TOKEN            MRN
CA_BN                   GIT_REF                 MSISDN
CA_DRIVER_LICENSE       GOOGLE_AI_KEY           MT_ID_CARD
CA_HEALTH_CARD          GR_AFM                  MT_VAT
CA_POSTAL_CODE          GR_AMKA                 MX_CODIGO_POSTAL
CA_SIN                  GR_DRIVER_LICENSE       MX_CURP
CH_AHV                  GR_ID_CARD              MX_NSS
CH_DRIVER_LICENSE       HCPCS_CODE              MX_RFC
CH_IBAN                 HEALTH_MEMBER_ID        MYSQL_URI
CH_KFZ                  HK_ID                   NDC_CODE
CH_MWST                 HR_DRIVER_LICENSE       NL_BSN
CH_PLZ                  HR_ID_CARD              NL_POSTCODE
CH_UID                  HR_OIB                  NL_VAT
CL_PASSPORT             HU_DRIVER_LICENSE       NO_D_NUMBER
CL_RUT                  HUGGINGFACE_TOKEN       NO_DRIVER_LICENSE
CN_POSTAL_CODE          HU_ID_CARD              NO_FODSELSNUMMER
CN_RESIDENT_ID          HU_SSN                  NPM_PACKAGE_REF
CN_USCC                 HU_TAX_NUMBER           NPM_TOKEN
CO_CC                   ICCID                   NZ_DRIVER_LICENSE
COHERE_API_KEY          ICD10_CODE              NZ_IRD
CO_NIT                  ICD11_CODE              NZ_NHI
CONTAINER_NUMBER        IE_PPS                  OPENAI_API_KEY
CO_PASSPORT             IE_VAT                  PARCEL_NUMBER
CPT_CODE                IL_ID_NUMBER            PASSWORD_EXPOSED
CREDIT_CARD             IMEI                    PGP_PRIVATE_KEY
CUSIP                   IMSI                    PHONE_NUMBER
CY_ID_CARD              IN_AADHAAR              PL_DRIVER_LICENSE
CY_TAX_ID               IN_DRIVER_LICENSE       PL_ID_CARD
CY_VAT                  IN_GSTIN                PL_KOD_POCZTOWY
CZ_DRIVER_LICENSE       IN_IFSC                 PL_NIP
CZ_ID_CARD              IN_PAN                  PL_PESEL
CZ_RODNE_CISLO          IN_PIN_CODE             PL_REGON
POSTGRESQL_URI          SI_ID_CARD              UK_VAT
PT_DRIVER_LICENSE       SI_PASSPORT             US_COURT_CASE
PT_NIF                  SI_TAX_NUMBER           US_EIN
PT_NISS                 SK_DRIVER_LICENSE       US_GREEN_CARD
PYPI_TOKEN              SK_ID_CARD              US_MBI
REDIS_URI               SK_RODNE_CISLO          US_NPI
REPLICATE_API_TOKEN     SLACK_TOKEN             US_ROUTING_NUMBER
REQUEST_ID              SLACK_WEBHOOK           US_STATE_LICENSE
RO_CNP                  SM_ID_CARD              US_VIN
RO_CUI                  SNOMED_CODE             US_VISA_NUMBER
RO_DRIVER_LICENSE       SSH_PRIVATE_KEY         US_ZIP_CODE
RO_ID_CARD              SSH_PUBLIC_KEY          UTILITY_ACCOUNT
RS_DRIVER_LICENSE       STRIPE_KEY              UUID
RS_ID_CARD              STUDENT_ID              VAULT_TOKEN
RS_JMBG                 TELEGRAM_TOKEN          VEHICLE_TITLE
RS_PASSPORT             TERRAFORM_SECRET        VIN
RU_DRIVER_LICENSE       TRACE_ID                WINDOWS_PRODUCT_KEY
RU_INN                  TRACKING_NUMBER         X509_CERTIFICATE
RU_PASSPORT_FOREIGN     TRANSCRIPT_NUMBER       XK_ID_CARD
RU_PASSPORT_INTERNAL    TR_KIMLIK_NO
RU_SNILS                TR_TAX_ID
RX_NUMBER               TW_ID
SA_NATIONAL_ID          TWILIO_CREDENTIALS
SA_PASSPORT             UA_DRIVER_LICENSE
SCHENGEN_VISA           UA_ID_CARD
SEDOL                   UA_IPN
SEMVER                  UK_COMPANY_NUMBER
SENDGRID_KEY            UK_COURT_CASE
SE_PERSONNUMMER         UK_NINO
SE_TAX_ID               UK_POSTCODE
SG_POSTAL_CODE          UK_UTR
SG_UEN
SI_DRIVER_LICENSE
SI_EMSO

Complete Preset Reference#

All 157 Presets (Alphabetical)#

Phase 8 Additions (32 new presets):

  • Peru - Completo | Venezuela - Completo | Ecuador - Completo
  • Uruguay - Completo | Paraguay - Completo | Bolivia - Completo
  • Costa Rica - Completo | Panama - Completo | Honduras - Completo
  • El Salvador - Completo | Guatemala - Completo | Nicaragua - Completo
  • Dominican Republic - Completo | Cuba - Completo | Jamaica - Complete
  • Latin America - Complete | South Africa - Complete | Nigeria - Complete
  • Kenya - Complete | Egypt - Complete | Morocco - Complet
  • Ghana - Complete | Saudi Arabia - Complete | UAE - Complete
  • Qatar - Complete | Kuwait - Complete | Bahrain - Complete
  • Oman - Complete | Gulf States - Complete | Africa - Major Economies
  • Middle East - Complete | Pakistan - Extended

Original 125 Presets:

#Preset NameEntitiesDescription
1AI and LLM Provider Keys8AI/LLM API keys
2AI Provider API Keys12Extended AI keys
3Andorra - Complet13Andorran PII
4Anglophone - Complete48English-speaking regions
5Argentina - Completo14Argentine PII
6Asia Pacific - Complete42APAC coverage
7Australia - Complete21Australian PII
8Auto - Development & Testing33Dev/test secrets
9Auto - European Multi-Language13EU multi-language
10Auto - Financial Universal11Universal financial
11Auto - General Detection11Language-agnostic
12Auto - Healthcare Universal9Universal healthcare
13Automotive Industry10Vehicle data
14Balkans - Complete52All Balkans
15Baltic - Complete23Baltic states
16Belgique - Complet20Belgian PII
17Bosna i Hercegovina - Potpuno14Bosnian PII
18Brasil - Completo19Brazilian PII
19Brasil - Serviços Financeiros15Brazilian financial
20Canada - Complete20Canadian PII
21Catalunya - Complet16Catalan PII
22Central Europe - Complete30Visegrad countries
23Česká republika - Kompletní15Czech PII
24Chile - Completo13Chilean PII
25CI/CD Secrets11CI/CD secrets
26Cloud AWS9AWS credentials
27Cloud Azure7Azure credentials
28Cloud GCP Google7GCP credentials
29Cloud Provider Credentials10Multi-cloud
30Colombia - Completo14Colombian PII
31Communication Platform Tokens5Chat platform tokens
32Crna Gora - Potpuno14Montenegrin PII
33Cryptocurrency and Web310Crypto addresses
34Cyprus - Complete15Cypriot PII
35DACH - Fahrzeugdaten21DACH vehicles
36DACH - Finanzwesen27DACH financial
37DACH - Gesundheitswesen22DACH healthcare
38DACH Region - Vollständig55Complete DACH
39DACH - Unternehmensdaten22DACH business
40Danmark - Komplet14Danish PII
41Database Connection Strings5DB URIs
42Database Connection URIs9Extended DB URIs
43Deutschland - Finanzwesen20German financial
44Deutschland - Vollständig41Complete German
45Developer Secrets - Complete44All dev secrets
46DevOps Infrastructure15DevOps patterns
47Education FERPA9Student data
48Eesti - Täielik15Estonian PII
49España - Completo21Spanish PII
50España - Servicios Financieros17Spanish financial
51European Postal Codes12EU postal codes
52European Union - Complete132All EU countries
53European VAT Numbers8EU VAT
54Financial Securities and Trading17Securities IDs
55France - Complet27French PII
56France - Services Financiers20French financial
57Francophone - Complet26French-speaking
58Git Repository Secrets12Git tokens
59Global Address and Location21Global addresses
60Healthcare Extended EHR21EHR data
61Hispanoamérica - Completo27Spanish-speaking
62HR and Employment12Employee data
63Hrvatska - Potpuno15Croatian PII
64Immigration Documents11Visa/immigration
65Insurance Industry14Insurance data
66Ireland - Complete15Irish PII
67Ísland - Heildar14Icelandic PII
68Italia - Completo22Italian PII
69Italia - Servizi Finanziari17Italian financial
70Kosova - E plotë13Kosovar PII
71Latvija - Pilnīgs15Latvian PII
72Legal and Court Documents12Court cases
73Liechtenstein - Vollständig17Liechtenstein PII
74Lietuva - Pilnas15Lithuanian PII
75Logistics and Shipping8Shipping data
76Lusófono - Completo20Portuguese-speaking
77Luxembourg - Complet14Luxembourg PII
78Magyarország - Teljes16Hungarian PII
79Malta - Complete14Maltese PII
80México - Completo14Mexican PII
81Middle East - Complete21Middle East
82Moldova - Complet14Moldovan PII
83Monaco - Complet13Monaco PII
84Nederland - Volledig20Dutch PII
85Network Infrastructure7Network IDs
86New Zealand - Complete15NZ PII
87Nordic - Complete30Nordic countries
88Norge - Komplett15Norwegian PII
89Österreich - Vollständig26Austrian PII
90Polska - Kompletny21Polish PII
91Polska - Usługi Finansowe17Polish financial
92Portugal - Completo16Portuguese PII
93Real Estate Property10Property data
94România - Complet16Romanian PII
95San Marino - Completo13San Marino PII
96Schweiz - Vollständig24Swiss PII
97Shqipëri - I plotë14Albanian PII
98Singapore - Complete14Singapore PII
99Slovenija - Popolno16Slovenian PII
100Slovensko - Kompletné15Slovak PII
101Software Licensing8License keys
102Suomi - Täydellinen14Finnish PII
103Sverige - Komplett15Swedish PII
104Telecommunications10Telecom IDs
105Türkiye - Tam15Turkish PII
106United Kingdom - Complete26UK PII
107United Kingdom - Healthcare15UK NHS
108United States - Complete31US PII
109United States - Financial23US financial
110United States - Healthcare24US healthcare
111US Healthcare - HIPAA Compliant22HIPAA
112Utilities and Energy8Utility data
113Vatican City - Complete12Vatican PII
114Ελλάδα - Πλήρες16Greek PII
115Беларусь - Поўны14Belarusian PII
116България - Пълен15Bulgarian PII
117Македонија - Комплетно14Macedonian PII
118Россия - Полный16Russian PII
119Србија - Потпуно15Serbian PII
120Україна - Повний15Ukrainian PII
121الإمارات - كامل13UAE PII
122भारत - पूर्ण18Indian PII
123中国 - 完整13Chinese PII
124日本 - 完全16Japanese PII
125한국 - 완전14Korean PII

API Usage#

Analyze Text#

curl -X POST https://cloak.business/api/analyze \
  -H "Content-Type: application/json" \
  -d '{
    "text": "My SSN is 123-45-6789",
    "language": "en",
    "entities": ["US_SSN", "CREDIT_CARD"]
  }'

Use Preset#

curl -X POST https://cloak.business/api/analyze \
  -H "Content-Type: application/json" \
  -d '{
    "text": "Contact: john@example.com, SSN: 123-45-6789",
    "preset": "United States - Complete"
  }'

Analyze Image#

Detect PII in an image and return bounding box locations.

curl -X POST https://cloak.business/api/presidio/image \
  -F "file=@document.png" \
  -F "mode=analyze" \
  -F "language=en" \
  -F 'entities=["PERSON","EMAIL_ADDRESS","PHONE_NUMBER"]' \
  -F "score_threshold=0.5"

Response:

{
  "entities": [
    {
      "entity_type": "PERSON",
      "start": 0, "end": 9,
      "score": 0.95,
      "left": 100, "top": 50, "width": 120, "height": 30
    }
  ],
  "processing_time": 2.345
}

Redact Image#

Detect and redact PII, returning a new image with redaction rectangles.

curl -X POST https://cloak.business/api/presidio/image \
  -F "file=@document.png" \
  -F "mode=redact" \
  -F "language=en" \
  -F "fill_color=black" \
  --output redacted.png

Supported fill colors: black, white, red, green, blue, gray.

Health Check#

# Text analyzer
curl https://cloak.business/health/analyzer
# {"status":"healthy","version":"6.8.0","custom_recognizers":317}

# Image redactor
curl https://cloak.business/health/image
# {"status":"healthy","version":"5.2.0","tesseract_languages":37,"custom_recognizers":317}

Image Processing Notes#

  • Supported formats: PNG, JPG, BMP, TIFF (max 10 MB, max 25 megapixels)
  • OCR languages: 37 Tesseract language packs installed
  • EXIF orientation is auto-corrected before OCR
  • Bounding box padding: 4px default for clean redaction coverage
  • Adjacent boxes merged for multi-word entities

For full image anonymization documentation, see IMAGE_ANONYMIZATION.md.


Version History#

VersionDateChanges
6.8.12026-02-12Hash_type validation (400 for invalid), analyzer memory optimization (MAX_CACHED_LANGUAGES 5→3), MD5 removed from API contracts
6.8.02026-02-11Backend-enforced request limits (entity filters, ad-hoc recognizers, regex patterns), /limits endpoint, WatchdogSec disabled
6.7.02026-02-10317 recognizers — 39 US & global entities (US IDs, healthcare codes, financial instruments, crypto, GPS coordinates)
6.6.02026-02-10279 recognizers — 68 technical secrets & infrastructure recognizers (API keys, cloud credentials, database URIs)
6.4.32026-02-09210 recognizers (246 patterns), 48 languages, 75+ countries, boundary assertion migration
6.4.12026-02-08Image entities validation, desktop image redaction fix
6.4.02026-02-08Desktop image mode, plan-based limits, image token cost rebalancing
6.3.32026-02-07Image bounding box overlay accuracy fix
6.3.22026-02-07Image Redactor v5.2.0 — OCR language hints, EXIF handling, bbox padding/merging
6.3.02026-02-07Analyzer ReDoS protection, input size limits
6.2.02026-02-06Added 186 new recognizers (Phases 1-5)
6.1.02026-02-03Initial 171 recognizers

  • Image Anonymization Guide — Full image processing documentation, OCR languages, known limitations, FAQ, and troubleshooting

Document maintained by cloak.business Contact: support@cloak.business