CarExtract - Medical Document Field Extraction Platform
CarExtract is a full-stack platform for extracting structured fields from medical and care documents — patient intake forms, clinical notes, prescriptions, referrals, and more — using vision-capable language models.
Developed as an open-source reference implementation under the Cloud2 Labs Innovation Hub, CarExtract demonstrates how user-defined field schemas, OpenAI-compatible provider routing, and inline ground truth editing can be packaged into a production-grade microservices architecture. Upload typed or handwritten care documents, define the fields you need to extract, connect any vision model via a single endpoint config, and measure extraction accuracy across providers — against ground truth you review and correct yourself.
It showcases a two-service architecture: a FastAPI backend handling field schema management, provider CRUD, dynamic prompt construction, batch async extraction, and accuracy evaluation — paired with a React + Vite + TypeScript + Tailwind frontend for document management, live analysis runs, result visualisation, and CSV export. All services are containerised via Docker Compose.
What It Demonstrates
CarExtract illustrates how to:
- Extract user-defined structured fields from medical document images (JPG, PNG, PDF) — patient names, dates of birth, phone numbers, addresses, diagnosis codes, medications, and more — using any vision LLM served over an OpenAI-compatible API
- Build a dynamic prompt engine that generates system and user prompts at runtime from a persisted field schema, with type-aware hints for strings, dates, phones, addresses, and numbers — no hard-coded templates
- Enable inline ground truth editing so users review and correct extracted values in the UI before running accuracy analysis — evaluation reflects real, human-verified labels rather than a static pre-stored file
- Route extraction requests to multiple providers simultaneously and evaluate per-field accuracy across all models in a single analysis run
- Apply type-aware field comparison: exact string matching, date normalisation, phone digit stripping, fuzzy address scoring, and numeric parsing
- Deploy a two-service full-stack application with Docker Compose, with nginx in production and Vite dev proxy in development
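The two-service deployment can be sketched as a minimal Docker Compose file. Service names, ports, and build paths here are illustrative assumptions, not the project's actual compose file:

```yaml
services:
  backend:
    build: ./backend              # FastAPI app (assumed path)
    ports:
      - "8000:8000"
    volumes:
      - ./config_data:/app/config_data   # persisted field schema
  frontend:
    build: ./frontend             # React + Vite app, served by nginx in production
    ports:
      - "8080:80"
    depends_on:
      - backend
```

In development, the Vite dev server proxies API calls to the backend instead of nginx handling them.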
Designed for AI engineers, healthtech teams, and document automation builders who need a practical reference for multi-model LLM evaluation over real-world medical document datasets.
Key Capabilities

Built for Care Documents
Optimised for patient intake forms, clinical notes, prescriptions, and referrals. Works on typed and handwritten documents alike. Adapts to any medical schema via user-defined fields — no re-engineering required when document types change.

Dynamic Field Schema
Users define extraction fields directly in the UI: key, display name, data type (string, date, phone, address, number), description, and optional example. Fields are stored in config_data/fields.json and injected into prompts at runtime. Add, edit, reorder, or delete fields without touching code. Field definitions are snapshotted with every analysis run for full reproducibility.
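As a sketch of what config_data/fields.json might contain (the exact key names are assumptions based on the field properties listed above):

```json
[
  {
    "key": "patient_name",
    "display_name": "Patient Name",
    "type": "string",
    "description": "Full legal name of the patient",
    "example": "Jane Doe"
  },
  {
    "key": "date_of_birth",
    "display_name": "Date of Birth",
    "type": "date",
    "description": "Patient's date of birth",
    "example": "1984-03-12"
  }
]
```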

Document Extract with Human-in-the-Loop Editing
A dedicated Document Extract page lets users select specific documents, choose a provider, and run on-demand extraction per file. Extracted field values are displayed inline as editable inputs, so clinicians can correct wrong or missing values before saving entries as verified ground truth. Supports rapid spot-checking and iterative ground truth labelling without leaving the interface.

Extraction Instructions
Per-run instructions can be appended to the system prompt to guide the model on document-specific context: date format conventions, abbreviation standards, handwriting characteristics, or specialty-specific field rules — without modifying the base field schema.
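A minimal sketch of how a runtime prompt engine like this could assemble the system prompt from persisted field definitions, with type-aware hints and optional per-run instructions appended. Function and key names are hypothetical, not the project's actual API:

```python
from typing import Optional

# Illustrative type-aware hints for the five supported data types.
TYPE_HINTS = {
    "string": "Return the value exactly as written.",
    "date": "Normalise to ISO 8601 (YYYY-MM-DD).",
    "phone": "Return digits only, no separators.",
    "address": "Return the full address on one line.",
    "number": "Return a bare number, no units.",
}

def build_system_prompt(fields: list, instructions: Optional[str] = None) -> str:
    """Fold field definitions (as loaded from fields.json) into a system prompt."""
    lines = [
        "Extract the following fields from the document image.",
        "Return a JSON object keyed by field key; use null when a field is absent.",
        "",
    ]
    for f in fields:
        hint = TYPE_HINTS.get(f["type"], "")
        example = f" Example: {f['example']}." if f.get("example") else ""
        lines.append(f"- {f['key']} ({f['type']}): {f['description']}. {hint}{example}")
    if instructions:
        # Per-run instructions are appended without touching the base schema.
        lines += ["", "Additional run instructions:", instructions]
    return "\n".join(lines)
```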

Provider-Agnostic Architecture
Connect any OpenAI-compatible endpoint — GPT-4o, Claude, Gemini, local Ollama, vLLM, OpenRouter, or custom inference servers. Configure base URL, model ID, API key, temperature, and max tokens per provider. Live connectivity tests confirm reachability before a run.
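A provider record and request body for an OpenAI-compatible /chat/completions endpoint might look like the sketch below. The dataclass fields mirror the per-provider options listed above; the payload shape follows the standard chat-completions format with an inline base64 image, though the project's actual internals are assumptions here:

```python
from dataclasses import dataclass

@dataclass
class Provider:
    """Hypothetical provider record: one row of the provider CRUD."""
    name: str
    base_url: str       # any OpenAI-compatible endpoint
    model: str
    api_key: str
    temperature: float = 0.0
    max_tokens: int = 1024

def chat_payload(p: Provider, system_prompt: str, image_b64: str) -> dict:
    """Build an OpenAI-compatible chat-completions body with an inline image."""
    return {
        "model": p.model,
        "temperature": p.temperature,
        "max_tokens": p.max_tokens,
        "messages": [
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": [
                {"type": "image_url",
                 "image_url": {"url": f"data:image/png;base64,{image_b64}"}},
            ]},
        ],
    }
```

Because every provider speaks the same wire format, swapping GPT-4o for a local Ollama model is a config change, not a code change.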

Multi-Provider Analysis Runs
Trigger a single analysis run across all selected providers simultaneously. Async extraction with bounded concurrency maximises throughput. Progress is polled live in the frontend — current model and document count visible in real time. Results redirect automatically to the visualisation dashboard on completion.
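Bounded-concurrency fan-out over the (provider, document) grid can be sketched with an asyncio semaphore. This is an illustrative pattern, not the project's actual scheduler; `extract_one` stands in for the real API call:

```python
import asyncio

async def run_analysis(providers, documents, extract_one, max_concurrency: int = 5):
    """Run extract_one for every (provider, document) pair, capping in-flight calls."""
    sem = asyncio.Semaphore(max_concurrency)

    async def bounded(provider, doc):
        async with sem:  # at most max_concurrency extractions run at once
            return (provider, doc, await extract_one(provider, doc))

    tasks = [bounded(p, d) for p in providers for d in documents]
    return await asyncio.gather(*tasks)
```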

Type-Aware Accuracy Evaluation
Each extracted field is evaluated against the human-edited ground truth with type-specific logic: date formats normalised, phone numbers stripped to digits, addresses fuzzy-matched, numbers parsed to floats. Status codes (TRUE_POSITIVE, TRUE_NEGATIVE, FALSE_POSITIVE, FALSE_NEGATIVE, INCORRECT, PARSE_ERROR) feed per-model accuracy metrics, per-field radar charts, latency percentiles, hallucination rate, and token cost telemetry.
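The type-specific comparators can be sketched as below. The accepted date formats and the fuzzy-match threshold are illustrative assumptions, not the project's actual values:

```python
import re
from datetime import datetime
from difflib import SequenceMatcher

def normalise_date(v: str):
    """Try common date formats; return ISO 8601 or None if unparseable."""
    for fmt in ("%Y-%m-%d", "%d/%m/%Y", "%m/%d/%Y", "%d %b %Y"):
        try:
            return datetime.strptime(v.strip(), fmt).date().isoformat()
        except ValueError:
            pass
    return None

def fields_match(extracted: str, truth: str, ftype: str) -> bool:
    """Type-aware comparison of an extracted value against ground truth."""
    if ftype == "date":
        a, b = normalise_date(extracted), normalise_date(truth)
        return a is not None and a == b
    if ftype == "phone":
        return re.sub(r"\D", "", extracted) == re.sub(r"\D", "", truth)
    if ftype == "address":
        # Fuzzy match; 0.85 is an assumed threshold.
        return SequenceMatcher(None, extracted.lower(), truth.lower()).ratio() >= 0.85
    if ftype == "number":
        try:
            return float(extracted) == float(truth)
        except ValueError:
            return False
    return extracted.strip() == truth.strip()  # exact match for plain strings
```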

Full Telemetry & CSV Export
Every completed analysis run surfaces per-model accuracy cards, latency (avg, P50, P95), hallucination rate, parse failure rate, and cost-per-extraction estimates. Per-document extraction results are viewable and comparable inline. A full audit CSV exports extracted values, ground truth values, and field-level status for every field, document, and provider — ready for offline analysis or clinical audit trails.
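The audit CSV described above amounts to one row per (provider, document, field). A minimal sketch with Python's csv module, using assumed column names:

```python
import csv
import io

def export_audit_csv(rows: list) -> str:
    """Serialise per-field audit rows to CSV text; column names are illustrative."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf,
        fieldnames=["provider", "document", "field", "extracted", "ground_truth", "status"],
    )
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```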

