This study develops and validates a privacy-preserving OCR-LLM pipeline that converts admission history of present illness (HPI) records into structured coronary syndrome subtypes (STEMI, NSTEMI, unstable angina, and chronic coronary syndrome). The system first extracts text from de-identified HPI images using locally deployed OCR, then applies large language models with a fixed diagnostic prompt to generate subtype classification and evidence. Performance is evaluated in an internal validation cohort and multiple external datasets covering heterogeneous EHR templates, emergency department cases, and an English dataset from MIMIC-IV. A clinician usability study assesses changes in diagnostic accuracy and time with and without tool assistance.
Study Type
OBSERVATIONAL
Enrollment
10
An automated clinical data management workflow integrating Optical Character Recognition (OCR), optimized prompt engineering, and large language models (LLMs). The system processes unstructured inpatient/ED records (primarily admission history of present illness and related narrative text) to extract prespecified key clinical indicators (e.g., left ventricular ejection fraction, coronary syndrome subtype, medications) and to classify cases into prespecified coronary artery disease categories (e.g., unstable angina, STEMI, NSTEMI, chronic coronary syndrome). The workflow outputs structured fields and a classification result with supporting evidence excerpts.
Standard manual process in which experienced clinicians review patient medical records and extract the same prespecified clinical indicators and coronary artery disease categories using routine clinical judgment and documentation review. This manual abstraction serves as the human benchmark for comparing diagnostic accuracy, completeness, and operational efficiency against the automated OCR-Prompt-LLM workflow.
Overall classification accuracy
Time Frame: Up to completion of dataset evaluation (internal + external cohorts) Description: Proportion of cases with correct subtype (STEMI/NSTEMI/UA/CCS) compared with expert-adjudicated gold standard.
Time frame: 1 month
This platform is for informational purposes only and does not constitute medical advice. Always consult a qualified healthcare professional.