System Architecture

Built on Microsoft Presidio, our architecture separates analysis, anonymization, and processing into dedicated services for maximum reliability.

Core Components

Analyzer Service

Detects PII using 317 custom regex recognizers (regex patterns) for structured data like IDs and tax numbers, plus spaCy/Stanza/XLM-RoBERTa NLP for names and locations. All models run on our own servers — no data is ever sent to third-party AI providers. Returns entity positions, types, and confidence scores.

Anonymizer Service

Applies your chosen transformation method. Supports Replace, Redact, Hash, Encrypt, and Mask.

Web Application

Next.js frontend with real-time processing. Responsive design for all devices.

REST API

RESTful endpoints for programmatic access. JWT authentication. Rate limiting.

Data Flow

Documents flow through our services in a secure, auditable pipeline.

  1. 1

    Client sends document via HTTPS

  2. 2

    Analyzer service detects PII entities

  3. 3

    Results returned with positions and scores

  4. 4

    Client selects anonymization method

  5. 5

    Anonymizer applies transformation

  6. 6

    Anonymized document returned to client

Build on Our Architecture

API access included in all plans. Full documentation available.