RT Journal Article SR Electronic T1 Machine Learning Applied to Routine Blood Tests and Clinical Metadata to Identify and Classify Heart failure JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2021.07.26.21261115 DO 10.1101/2021.07.26.21261115 A1 James, Nick A1 Gerrish, Lianna A1 Rokotyan, Nikita A1 Gladding, Patrick A. YR 2021 UL http://medrxiv.org/content/early/2021/07/29/2021.07.26.21261115.abstract AB Introduction We applied machine learning (ML) to routine bloods, then to advanced haematology data from a full blood count (rawFBC) plus biochemistry, to build predictive models for heart failure, which were then used at population scale.Methods Routine blood results from 8,031 patients with heart failure, with equal number of controls, were used in ML training and testing datasets (Split 80:20). NT-proBNP was used for diagnostic comparison. rawFBC metadata was used in a dataset of 698 patients, 314 of whom had heart failure, to train and test ML models (Split 70:30) from rawFBC, rawFBC plus biochemistry and routine bloods. The rawFBC model was used to predict heart failure in a validation dataset of 69,492 FBCs (2.3% heart failure prevalence).Results Heart failure was predicted from rawFBC and biochemistry versus rawFBC AUROC 0.93 versus 0.91, 95% CI -0.023 to 0.048, P = 0.5, and predicted from routine bloods and NT-proBNP, AUROC 0.87 versus 0.81, 95% CI 0.004 to 0.097, P = 0.03. In the validation cohort heart failure was predicted from rawFBC with AUROC 0.83, 95% CI 0.83 to 0.84, P < 0.001, sensitivity 75%, specificity 76%, PPV 7%, NPV 99.2% (Figure 2). Elevated NT-proBNP (≥ 34 pmol/L) was predicted from rawFBC with AUROC 0.97, 95% CI 0.93 to 0.99, P < 0.0001. Common predictive features included markers of erythropoiesis (red cell distribution width, haemoglobin, haematocrit).Conclusion Heart failure can be predicted from routine bloods with accuracy equivalent to NT-proBNP. Predictive features included markers of erythropoiesis, with therapeutic monitoring implications.Competing Interest StatementDr. P. A. Gladding is founder of Theranostics Laboratory, a molecular diagnostics company, and HeartLab, an echocardiography artificial intelligence company.Funding StatementResearch was supported by Precision Driven Health (Orion Health), Theranostics Laboratory, as well as the University of Auckland.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Ethics approval was obtained locally from the Research and Knowledge Centre - the Institutional Review Board for Waitemata District Health Board (RM13732 for the ECLIPSE study for standard haematology and biochemistry data). Ethics approval was also obtained from the regional HDEC (Health and Disability Ethics Committee - New Zealand's National Ethics Committee) (20/CEN/162 for the PROGRAM study for haematology raw data). Informed consent was waived, as the research was observational and used secondary dataAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAnonymised data from this study is available on request from the corresponding author, pending approval from local research boards at their respective institutions Freely available data is provided access via URLs within the manuscript