Upload a medical PDF and our AI pipeline handles the rest — reading the document, extracting biomarkers, diagnoses, procedures, and genetics, standardising results across labs, and generating an actionable health report.
Every medical record passes through a five-stage AI pipeline. Each stage is purpose-built — reading the document, finding the clinical data, standardising it across labs, and writing the final report.
Medical records uploaded as PDFs
Each page read and classified by content type
Biomarkers, diagnoses, procedures, genetics
Names and units harmonised across labs and providers
AI-generated clinical health reports
Upload any medical PDF — lab reports, clinical notes, imaging results, genetic tests, or discharge summaries. Our platform accepts records from any healthcare provider, in any format.
No standardised template required. Upload PDFs from hospitals, labs, clinics, or specialists worldwide.
Records are stored in encrypted cloud storage with signed URLs. Access is scoped to the patient and their clinician.
Upload individual records or entire patient histories at once. Each document is processed independently through the pipeline.
Before extraction begins, the document is read end-to-end and every page is classified by what it contains. This pre-screening identifies which pages hold clinical data and what type — so only relevant content reaches the extraction models.
Advanced OCR and visual language models read text, tables, and images on every page — even scanned documents, handwritten notes, and complex table layouts.
Each page is sorted by data type — biomarker results, clinical findings, genetic data, or non-medical content. Pages without clinical relevance are filtered out early.
Pages with visual content (charts, scanned tables) are sent to specialised vision models, while text-heavy pages use faster text models — optimising both accuracy and speed.
Four specialised AI processors run in parallel — each trained to extract a specific category of clinical data with high precision. Every data point is attributed to its source page and validated against clinical standards.
All four data categories are extracted simultaneously across multiple pages, with concurrent processing for maximum throughput.
When test dates are missing from a page, the system propagates dates from adjacent pages and cross-references between data types to ensure temporal accuracy.
Duplicate entries that appear across multiple pages are automatically detected and merged, preserving the most complete version of each data point.
Different labs use different names and units for the same test. This stage harmonises biomarker names, converts units to a single standard, and tags each result with clinical context — so values from any provider can be compared on a like-for-like basis, over time.
Medical embedding models cluster similar biomarker names (e.g. "HbA1c", "Hemoglobin A1C", "Glycated Haemoglobin") under a single canonical name — enabling longitudinal tracking across providers.
Results are converted to standard units so values from different labs are directly comparable. Reference ranges are recalculated to match, preserving clinical meaning.
Each biomarker is assigned to health areas (cardiovascular, metabolic, thyroid, etc.) and clinical groups — enabling filtering, organisation, and targeted analysis by body system.
With structured, standardised data in place, AI generates comprehensive health reports tailored to each patient. Multiple report types are available — each designed for a different clinical lens and consultation need.
Every insight and recommendation is grounded in the patient's actual data and peer-reviewed clinical guidelines. The AI cites its sources and explains its reasoning.
From comprehensive clinical analysis to focused biomarker trend tracking — choose the report type that fits the consultation. Each offers a distinct clinical perspective.
Reports surface the insights that matter most — flagged values, emerging trends, risk factors, and prioritised recommendations — ready for clinical decision-making.