Upload a medical PDF and our AI pipeline handles the rest — parsing documents, extracting clinical data, standardising results, and generating actionable health reports.
Every medical record passes through a multi-stage AI pipeline — each stage purpose-built to extract, structure, and enrich clinical data with maximum accuracy.
Medical records uploaded as PDFs
Document structure analysed page by page
Biomarkers, diagnoses, procedures, genetics
Standardised names, units, and clinical context
AI-generated clinical health reports
Upload any medical PDF — lab reports, clinical notes, imaging results, genetic tests, or discharge summaries. Our platform accepts records from any healthcare provider, in any format.
No standardised template required. Upload PDFs from hospitals, labs, clinics, or specialists worldwide.
Records are stored in encrypted cloud storage with signed URLs. Access is scoped to the patient and their clinician.
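As a rough sketch of how patient-scoped, time-limited access can work, the snippet below builds an HMAC-signed URL. The signing key, storage host, and parameter names are all hypothetical stand-ins, not the platform's actual API.

```python
import hashlib
import hmac
import time
from urllib.parse import urlencode

# Hypothetical signing key and storage host -- illustrative only.
SIGNING_KEY = b"replace-with-secret-key"
STORAGE_HOST = "https://storage.example.com"

def signed_url(object_path: str, patient_id: str, ttl_seconds: int = 300) -> str:
    """Build a time-limited URL scoped to one patient's record."""
    expires = int(time.time()) + ttl_seconds
    payload = f"{object_path}:{patient_id}:{expires}".encode()
    signature = hmac.new(SIGNING_KEY, payload, hashlib.sha256).hexdigest()
    query = urlencode({"patient": patient_id, "expires": expires, "sig": signature})
    return f"{STORAGE_HOST}/{object_path}?{query}"
```

The storage layer verifies the signature and expiry before serving the object, so a leaked link stops working after the TTL and cannot be reused for a different patient.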
Upload individual records or entire patient histories at once. Each document is processed independently through the pipeline.
Before extraction begins, each document is parsed and every page is intelligently classified. This pre-screening step identifies which pages contain clinical data and what type — so only relevant content reaches the extraction models.
Advanced OCR and vision-language models extract text, tables, and images from each page — even handling scanned documents, handwritten notes, and complex table layouts.
Each page is classified by data type — biomarker results, clinical findings, genetic data, or non-medical content. Pages without clinical relevance are filtered out early.
Pages with visual content (charts, scanned tables) are routed to specialised vision models, while text-heavy pages use faster text models — optimising both accuracy and speed.
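The classify-then-route step above can be sketched as follows. A production classifier would be a trained model; the keyword table and model names here are simplified, hypothetical placeholders.

```python
from dataclasses import dataclass

@dataclass
class Page:
    number: int
    text: str
    has_visual_content: bool  # charts, scanned tables, images

# Hypothetical category cues -- a real classifier would be a model, not keywords.
CATEGORY_KEYWORDS = {
    "biomarker": ("mg/dl", "mmol/l", "reference range"),
    "genetic": ("variant", "allele", "genotype"),
    "clinical": ("diagnosis", "assessment", "impression"),
}

def classify(page: Page) -> str:
    """Assign a data type to a page, or 'non_medical' if nothing matches."""
    text = page.text.lower()
    for category, keywords in CATEGORY_KEYWORDS.items():
        if any(k in text for k in keywords):
            return category
    return "non_medical"

def route(page: Page) -> str:
    """Drop irrelevant pages; send visual pages to a vision model, the rest to a text model."""
    if classify(page) == "non_medical":
        return "skip"
    return "vision_model" if page.has_visual_content else "text_model"
```

Filtering before extraction means the expensive models only ever see pages that carry clinical data.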
Four specialised AI processors run in parallel — each trained to extract a specific category of clinical data with high precision. Every data point is attributed to its source page and validated against clinical standards.
All four data categories are extracted in parallel across multiple pages, maximising throughput.
When test dates are missing from a page, the system propagates dates from adjacent pages and cross-references between data types to ensure temporal accuracy.
Duplicate entries that appear across multiple pages are automatically detected and merged, preserving the most complete version of each data point.
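A minimal sketch of that merge, assuming entries are dicts and that matching name, value, and date identifies a duplicate (the real matching logic may be richer):

```python
def merge_duplicates(entries: list[dict]) -> list[dict]:
    """Merge entries that repeat across pages, keeping the most complete copy.

    Duplicates (same name, value, and date) are collapsed into one entry
    that retains every non-empty field from each copy.
    """
    merged: dict[tuple, dict] = {}
    for entry in entries:
        key = (entry.get("name"), entry.get("value"), entry.get("date"))
        if key not in merged:
            merged[key] = dict(entry)
        else:
            # Fill gaps in the kept entry from the duplicate.
            for field, val in entry.items():
                if val is not None and merged[key].get(field) is None:
                    merged[key][field] = val
    return list(merged.values())
```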
Different labs use different names and units for the same test. The enrichment stage solves this by canonicalising biomarker names, standardising units, and adding clinical context — making data comparable across providers and over time.
Medical embedding models cluster similar biomarker names (e.g. "HbA1c", "Hemoglobin A1C", "Glycated Haemoglobin") under a single canonical name — enabling longitudinal tracking across providers.
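The clustering idea can be sketched with greedy cosine-similarity grouping over name embeddings. The `embed` function and the similarity threshold are assumptions; a production system would use a trained medical embedding model and a curated canonical vocabulary.

```python
import math

def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))

def canonicalise(names: list[str], embed, threshold: float = 0.9) -> dict[str, str]:
    """Greedily cluster names whose embeddings are close.

    The first name seen in each cluster becomes the canonical label;
    `embed` is a caller-supplied name -> vector function (hypothetical here).
    """
    clusters: list[tuple[str, list[float]]] = []
    mapping: dict[str, str] = {}
    for name in names:
        vec = embed(name)
        for canon_name, canon_vec in clusters:
            if cosine(vec, canon_vec) >= threshold:
                mapping[name] = canon_name
                break
        else:
            clusters.append((name, vec))
            mapping[name] = name
    return mapping
```

With variant spellings mapped to one canonical name, a patient's "Hemoglobin A1C" from one lab and "HbA1c" from another land on the same longitudinal series.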
Results are converted to standard units so values from different labs are directly comparable. Reference ranges are recalculated to match, preserving clinical meaning.
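The key detail is that the result and its reference range are scaled by the same factor, so "in range" means the same thing after conversion. A minimal sketch, with a deliberately tiny conversion table (the glucose factor is the standard mg/dL to mmol/L conversion; real coverage would span many analytes):

```python
# (from_unit, to_unit, analyte) -> multiplier. Illustrative subset only.
CONVERSIONS = {
    ("mg/dL", "mmol/L", "glucose"): 0.0555,
}

def standardise(value: float, low: float, high: float,
                from_unit: str, to_unit: str, analyte: str) -> tuple[float, float, float]:
    """Convert a result and its reference range with the same factor,
    preserving the clinical meaning of 'in range'."""
    factor = CONVERSIONS[(from_unit, to_unit, analyte)]
    return value * factor, low * factor, high * factor
```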
Each biomarker is assigned to health areas (cardiovascular, metabolic, thyroid, etc.) and clinical groups — enabling filtering, organisation, and targeted analysis by body system.
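The tagging step amounts to a lookup plus a filter. The taxonomy below is a hypothetical fragment for illustration; the real mapping would be clinically curated.

```python
# Hypothetical biomarker -> health-area taxonomy (illustrative fragment).
HEALTH_AREAS = {
    "HbA1c": ["metabolic"],
    "LDL Cholesterol": ["cardiovascular", "metabolic"],
    "TSH": ["thyroid"],
}

def by_area(results: list[dict], area: str) -> list[dict]:
    """Filter standardised results down to one body system."""
    return [r for r in results if area in HEALTH_AREAS.get(r["name"], [])]
```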
With structured, standardised data in place, AI generates comprehensive health reports tailored to each patient. Multiple report types are available — each designed for a different clinical lens and consultation need.
Every insight and recommendation is grounded in the patient's actual data and peer-reviewed clinical guidelines. The AI cites its sources and explains its reasoning.
From comprehensive clinical analysis to focused biomarker trend tracking — choose the report type that fits the consultation. Each offers a distinct clinical perspective.
Reports surface the insights that matter most — flagged values, emerging trends, risk factors, and prioritised recommendations — ready for clinical decision-making.