지원되는 48개 언어
전체 플랫폼에서 완전한 PII 탐지 및 익명화
spaCy NLP - Runs Locally (25개 언어)
EnglishGermanSpanishFrenchItalianPortugueseDutchPolishRussianJapaneseChineseKoreanRomanianGreekCroatianSlovenianMacedonianSwedishDanishNorwegianFinnishUkrainianLithuanianCatalanTurkish
Stanza NER - Runs Locally (7개 언어)
BulgarianHungarianHebrew (RTL)VietnameseAfrikaansArmenianBasque
XLM-RoBERTa Transformer - Runs Locally (16개 언어)
Arabic (RTL)HindiCzechSlovakIndonesianThaiPersian (RTL)SerbianLatvianEstonianMalayBengaliUrdu (RTL)SwahiliTagalogIcelandic
RTL 지원
아랍어히브리어페르시아어우르두어
고급 NLP로 구동
최대 언어 커버리지를 위한 세 가지 NLP 엔진이 함께 작동
- 메모리 효율성을 위한 지연 로드 모델 (최대 5개 캐시됨)
- 자동 언어 탐지
- 혼합 언어 문서 처리
- 언어별 개체 패턴
Country-Specific Formats
We detect PII in formats specific to each country and region.
European Formats
- German: Personalausweis, Steuer-ID, Reisepass
- French: NIR, Carte Nationale, Permis
- Italian: Codice Fiscale, Carta d'Identità
- Spanish: DNI, NIE, NIF
- Dutch: BSN, Rijbewijs
- Polish: PESEL, NIP, REGON
Asia-Pacific Formats
- Japan: My Number, Passport
- India: Aadhaar, PAN, GSTIN, Vehicle Registration
- Thailand: National ID, Tax ID, Passport
- Indonesia: NIK, NPWP, Passport
- Vietnam: CCCD, Tax Code, Passport
- Malaysia: MyKad, Tax ID, Passport
Americas, Africa & Middle East
- US: SSN, Driver's License, Passport
- UK: National Insurance, NHS Number
- Canada: SIN, Driver's License
- Australia: TFN, Medicare, ABN
- Kenya: National ID, KRA PIN, Passport
- South Africa: ID Number, Tax Number, Passport