IDP

Intelligent Document Processing

Production AI Pipeline — Deep Technical Reference

A production IDP platform processing legal contracts with a 4-agent pipeline, 5-layer hallucination detection, 3-tier eval architecture, and full observability. 94% extraction accuracy. <1% hallucination rate. 75% auto-approval. $18K/month (down from $120K).

94%

Extraction Accuracy

<1%

Hallucination Rate

75%

Auto-Approved

85%

Cost Reduction

<3s

Query SLA

2M+

Documents Indexed

Unified Pipeline — Both Lanes

Both extraction and query lanes route through the same pipeline: Input → Guard → Cache → Agent Pipeline (Supervisor → Research → Analysis → Critic) → Hallucination Detection → Route → Output.

Input

User Request

React + FastAPI<50ms · $0

Document upload (PDF/DOCX) or natural language query from compliance analysts, legal counsel, or operations leadership.

Guard

PII Redaction

Regex + DynamoDB + KMS<1ms · $0

Detect SSN, email, phone, CC → tokenize. Reversible for authorized reviewers via KMS-encrypted DynamoDB mapping.

Guard

Injection Detection

Regex + Bedrock Guardrails<15ms · ~$0.001

Pattern match + ML scoring. >0.7 REJECT + security alert. 0.3-0.7 SANITIZE (XML wrap). <0.3 PASS.

Cache

Semantic Cache

ElastiCache Redis 7<5ms · $0 on hit

SHA-256 hash lookup. 35% hit rate. TTL 1hr. Saves ~$700/month in LLM costs. ROI: 14x.

Agent

Supervisor

LangGraph AgentState3ms · $0

Deterministic state router. Pure if/else. Routes: Research → Analysis → Critic → retry or done.

Agent

Research

Aurora PostgreSQL15ms · $0

Hybrid search: pgvector (cosine) + BM25 (keyword) + Reciprocal Rank Fusion. P@5 = 0.85.

Agent

Analysis

AWS Bedrock (tiered)~2.8s · $0.01-$0.50

ONLY agent that costs $. Tiered model routing: 60% Haiku ($0.01) / 30% Sonnet ($0.05) / 10% Opus ($0.50).

Agent

Critic

Python + fuzz8ms · $0

4-6 deterministic checks. LLM-free. Source verification, field validation, date consistency, denied topics.

Validate

Hallucination Detection

Custom scoring~5ms · $0

5-layer weighted composite. Source verification (0.25), cross-validation (0.25), historical (0.20), schema (0.15), LLM confidence (0.15).

Route

Confidence Routing

Logic<1ms · $0

≥0.85 auto-approve (75%). 0.60-0.85 needs review (20%). <0.60 reject (5%).

Output

Structured Result

FastAPI + DynamoDB<1ms · $0

JSON + per-field confidence + full audit trail. 7-year immutable retention.

Need this level of rigor in your AI system?

We build production AI systems with the same eval architecture, hallucination detection, and observability you see here.

Let's Talk sariph@exosolve.io