Evidence analysis toolkit. 1,300+ items processed: documents, emails, 850+ images. 9 forensic reports per run, 644 cross-evidence correlations. Built for a real employment law case.
The Story
A real employment law case with real stakes: 1,300+ evidence items across
documents, emails, and 850+ photographs. The pipeline has four stages:
ingest (SHA-256 content-addressed storage), analyze (GPT-4o Vision plus text analysis), correlate
(cross-evidence pattern matching), and report (9 forensic reports packaged per run).
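The ingest stage's content-addressed storage can be illustrated with a minimal sketch. It assumes a plain directory as the store and a two-level directory layout; the function names `ingest` and `verify` are hypothetical and are not taken from the toolkit itself.

```python
import hashlib
import shutil
from pathlib import Path


def ingest(source: Path, store: Path) -> str:
    """Copy `source` into `store` under its SHA-256 digest and return the digest."""
    digest = hashlib.sha256(source.read_bytes()).hexdigest()
    # Two-level fan-out (ab/cdef...) keeps directories small. This layout is an assumption.
    target = store / digest[:2] / digest[2:]
    if not target.exists():  # identical content is stored only once
        target.parent.mkdir(parents=True, exist_ok=True)
        shutil.copyfile(source, target)
    return digest


def verify(digest: str, store: Path) -> bool:
    """Re-hash the stored bytes and confirm they still match their address."""
    target = store / digest[:2] / digest[2:]
    return hashlib.sha256(target.read_bytes()).hexdigest() == digest
```

Because the address is derived from the bytes, two copies of the same email attachment resolve to one stored object, and any later modification of a stored file shows up as a failed `verify`.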
The toolkit went through 6 generations, each building on the last. It started with basic document analysis.
Later generations added image OCR, entity correlation, forensic integrity with chain of custody,
48 Pydantic models, and AI entity resolution (+114% yield).
The same pipeline works in compliance, insurance, HR, and investigations.
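One common way to make a chain of custody tamper-evident is a hash-chained log, where each entry records the hash of the entry before it. The sketch below shows that general technique only; whether the toolkit works this way is an assumption, and `CustodyEvent`, `append_event`, and `chain_is_intact` are hypothetical names.

```python
import hashlib
import json
from dataclasses import asdict, dataclass

GENESIS = "0" * 64  # placeholder "previous hash" for the first entry


@dataclass(frozen=True)
class CustodyEvent:
    evidence_sha256: str  # content address of the evidence item
    action: str           # e.g. "ingested", "analyzed", "exported"
    actor: str
    timestamp: str        # ISO 8601 string supplied by the caller
    prev_hash: str        # hash of the preceding log entry

    def entry_hash(self) -> str:
        # Canonical JSON (sorted keys) so the same entry always hashes the same way.
        payload = json.dumps(asdict(self), sort_keys=True).encode("utf-8")
        return hashlib.sha256(payload).hexdigest()


def append_event(log, evidence_sha256, action, actor, timestamp):
    """Return a new log with one more event linked to the current tail."""
    prev = log[-1].entry_hash() if log else GENESIS
    return log + [CustodyEvent(evidence_sha256, action, actor, timestamp, prev)]


def chain_is_intact(log) -> bool:
    """Check that every entry points at the hash of the entry before it."""
    expected = GENESIS
    for event in log:
        if event.prev_hash != expected:
            return False
        expected = event.entry_hash()
    return True
```

Editing or removing any entry that has a successor breaks the link that follows it. The final entry is only protected once its hash is recorded somewhere outside the log, for example in the packaged report.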
Stage 1: Ingest → Stage 2: Analyze → Stage 3: Correlate → Stage 4: Report
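To make the correlate stage concrete, here is a minimal sketch of cross-evidence matching on shared entities. It assumes the analyze stage has already produced a set of entity strings per evidence item; the function `correlate`, the item IDs, and the case-insensitive matching rule are illustrative assumptions, not the toolkit's actual logic.

```python
from collections import defaultdict
from itertools import combinations


def correlate(entities_by_item: dict[str, set[str]], min_shared: int = 1):
    """Link pairs of evidence items that mention the same entities.

    Returns {(item_a, item_b): shared_entities} with item_a < item_b.
    """
    # Inverted index: entity -> items mentioning it (case-insensitive match).
    items_by_entity = defaultdict(set)
    for item_id, entities in entities_by_item.items():
        for entity in entities:
            items_by_entity[entity.casefold()].add(item_id)

    # Every pair of items under the same entity shares that entity.
    shared = defaultdict(set)
    for entity, items in items_by_entity.items():
        for a, b in combinations(sorted(items), 2):
            shared[(a, b)].add(entity)

    return {pair: ents for pair, ents in shared.items() if len(ents) >= min_shared}
```

Called with `{"email_017": {"J. Smith"}, "photo_231": {"j. smith"}}`, this links the email to the photograph through the normalized name. Raising `min_shared` trades recall for fewer, stronger links.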
6 Generations of Capability
Each generation compounds on the last. These are not rewrites: each new capability adds to the existing pipeline.
Text analysis for legal evidence. Extracts key facts, identifies contradictions, builds timelines, and highlights patterns across large document sets.
NLP, Document Intelligence, Python
Image Evidence Analyzer
In Development
GPT-4o Vision forensic analysis. 850+ images processed with 0.89 average confidence. OCR text extraction, scene description, object detection, legal relevance scoring across 6 domains. Thread-safe parallel processing at ~$0.001/image.
GPT-4o Vision, OCR, Forensics, Python
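Thread-safe parallel processing of an image batch might look like the sketch below. It assumes a thread pool with lock-protected shared counters; `analyse_image` is a placeholder that returns a fixed confidence rather than calling any vision API, and `RunStats` and `analyse_all` are hypothetical names.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class RunStats:
    """Shared counters guarded by a lock so worker threads can update them safely."""

    def __init__(self):
        self._lock = threading.Lock()
        self.processed = 0
        self.confidence_sum = 0.0

    def record(self, confidence: float) -> None:
        with self._lock:
            self.processed += 1
            self.confidence_sum += confidence

    @property
    def average_confidence(self) -> float:
        with self._lock:
            return self.confidence_sum / self.processed if self.processed else 0.0


def analyse_image(path: str) -> dict:
    """Placeholder for the real vision call; returns a fixed confidence."""
    return {"path": path, "confidence": 0.5}


def analyse_all(paths, analyse=analyse_image, max_workers: int = 8):
    """Analyse images concurrently; results come back in input order."""
    stats = RunStats()

    def worker(path):
        result = analyse(path)
        stats.record(result["confidence"])
        return result

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(worker, paths))
    return results, stats
```

Threads suit this workload because each call spends most of its time waiting on the network. At the quoted ~$0.001 per image, a batch of 850 images comes to roughly $0.85.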
Legal AI System
In Development
Legal Document Intelligence and Decision Support System. Built with FastAPI, PydanticAI, Supabase, and MCP. Expert knowledge base with scoping, multi-agent analysis, and structured legal reasoning.
FastAPI, PydanticAI, MCP, Supabase
Approach
Domain-Transferable
Same evidence analysis patterns work in compliance, insurance, HR, investigations. The toolkit adapts to any document-heavy domain.
AI-Native
Built with PydanticAI and MCP from the ground up. Structured outputs, typed tool calls, reproducible analysis chains.
Battle-Tested
Used in a live case. Real document volumes, real evidence standards, real deadlines.
Domain-Transferable
Same pipeline works in compliance, insurance, HR investigations, regulatory cases.
Different source material, same method.