From 340,000 documents to 12-second answers.
Client context
A leading Swiss pharmaceutical company (client name protected under NDA) managing 14 active clinical trials across oncology and rare disease. Their regulatory affairs team was spending 60% of analyst time manually locating and cross-referencing documents across 6 disconnected systems.
Challenge
- ▪ 11-week regulatory submission cycle
- ▪ 340,000+ unstructured documents, many scanned PDFs
- ▪ Swissmedic audit flagged data lineage gaps
Solution
Multi-agent document intelligence platform with four specialised AI agents — OCR, text extraction, classification (~95% accuracy on held-out samples), and entity extraction — orchestrated via LangChain routing with confidence-based human-in-the-loop escalation. Knowledge graph (Neo4j) + vector search (pgvector) for natural-language regulatory queries. Full infrastructure on Azure Switzerland North via Terraform.
18 weeks — 2 engineers full-time, 1 part-time
See full storyStack
Service areas
Exhibit 1
Measured movement from the legacy search workflow to the governed production assistant. Values are rounded to avoid implying precision beyond the NDA-safe sample.
- Preparation cycle
- 11 wk → ~6–7 wk
- Two filing windows after rollout
- Median retrieval
- ~45 min → <12 sec
- Cross-system regulatory queries
Filing package preparation
about 40% shorter
Normalized index
Legacy process
11 weeks
100
Production workflow
~6–7 weeks
60
Cross-system document retrieval
same-session answerability
Normalized index
Manual search
~45 minutes
100
Audited query path
<12 seconds
8
Source Engagement run logs and two filing-window retrospectives; anonymised and normalised for publication.
Control Low-confidence classifications remained in the human review queue; the exhibit excludes exploratory prompt tests.