PT - JOURNAL ARTICLE AU - Shinichi Goto AU - Andreas A. Werdich AU - Max Homilius AU - Jenine E. John AU - Li-Ming Gan AU - Calum A. MacRae AU - Marcelo F. DiCarli AU - Rahul C. Deo TI - Discovery of cardiac imaging biomarkers by training neural network models across diagnostic modalities AID - 10.1101/2021.02.07.21251025 DP - 2021 Jan 01 TA - medRxiv PG - 2021.02.07.21251025 4099 - http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025.short 4100 - http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025.full AB - Machines can be readily trained to automate medical image interpretation, with the primary goal of replicating human capabilities. Here, we propose an alternative role: using machine learning to discover pragmatic imaging-based biomarkers by interpreting one complex imaging modality via a second, more ubiquitous, lower-cost modality. We applied this strategy to train convolutional neural network models to estimate positron emission tomography (PET)-derived myocardial blood flow (MBF) at rest and with hyperemic stress, and their ratio, coronary flow reserve (CFR), using contemporaneous two-dimensional echocardiography videos as inputs. The resulting parameters, echoAI-restMBF, echoAI-stressMBF, and echoAI-CFR modestly approximated the original values. However, using echocardiograms of 5,393 (derivation) and 5,289 (external validation) patients, we show they sharply stratify individuals according to disease comorbidities and combined with baseline demographics, are strong predictors for heart failure hospitalization (C-statistic derivation: 0.79, 95% confidence interval 0.77-0.81; validation: 0.81, 0.79-0.82) and acute coronary syndrome (C-statistic derivation: 0.77, 0.73-0.80; validation: 0.75, 0.73-0.78). Using echocardiograms of 3,926 genotyped individuals, we estimate narrow-sense heritability of 9.2%, 20.4% and 6.5%, respectively for echoAI-restMBF, echoAI-stressMBF, and echoAI-CFR. MBF indices show inverse genetic correlation with impedance-derived body mass indices, such as fat-free body mass (e.g., ρ=−0.43, q=0.05 for echoAI-restMBF) and resolve conflicting historical data regarding body mass index and CFR. In terms of diseases, genetic association with ischemic heart disease is seen most prominently for echoAI-stressMBF (ρ=−0.37, q=2.4×10−03). We hypothesize that interpreting one imaging modality through another represents a type of “information bottleneck”, capturing latent features of the original physiologic measurements that have relevance across tissues. Thus, we propose a broader potential role for machine learning algorithms in developing scalable biomarkers that are anchored in known physiology, representative of latent biological factors, and are readily deployable in population health applications.Competing Interest StatementR.C.D is supported by grants from the National Institute of Health, the American Heart Association (One Brave Idea, Apple Heart and Movement Study) and GE Healthcare, has received consulting fees from Novartis and Pfizer, and is co-founder of Atman Health. C.A.M. is a consultant for Pfizer and co-founder of Atman Health. All other authors declare no competing interests.Funding StatementThis work was supported by One Brave Idea, co-founded by the American Heart Association and Verily with significant support from AstraZeneca and pillar support from Quest Diagnostics (to CAM and RCD), NIH/NHLBI HL140731 (to RCD) SENSHIN Medical Research Foundation (to SG), the Kanae foundation for the promotion of medical science (to SG), the Uehara Memorial Foundation (to SG), SG acknowledge support from Mower fellowship.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This study complies with all ethical regulations and guidelines. The study protocol was approved by local institutional review board (IRB) of Mass General Brigham (2019P002651). This study had minimal patient risk: it collected data retrospectively, there was no direct contact with patients, and data were collected after medical care was completed. Thus, and to recruit an unbiased and representative cohort of patients, data were collected under a waiver of informed consent, which was approved by the IRB.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe data that support the findings of this study are available on request from the corresponding author R.C.D. upon approval of the data sharing committees. The data are not publicly available due to the presence of information that could compromise research participant privacy. https://github.com/obi-ml-public/echoAI-PET-measurements http://onebraveideaml.org