Discovery of cardiac imaging biomarkers by training neural network models across diagnostic modalities ====================================================================================================== * Shinichi Goto * Andreas A. Werdich * Max Homilius * Jenine E. John * Li-Ming Gan * Calum A. MacRae * Marcelo F. DiCarli * Rahul C. Deo ## Abstract Machines can be readily trained to automate medical image interpretation, with the primary goal of replicating human capabilities. Here, we propose an alternative role: using machine learning to discover pragmatic imaging-based biomarkers by interpreting one complex imaging modality via a second, more ubiquitous, lower-cost modality. We applied this strategy to train convolutional neural network models to estimate positron emission tomography (PET)-derived myocardial blood flow (MBF) at rest and with hyperemic stress, and their ratio, coronary flow reserve (CFR), using contemporaneous two-dimensional echocardiography videos as inputs. The resulting parameters, echoAI-restMBF, echoAI-stressMBF, and echoAI-CFR modestly approximated the original values. However, using echocardiograms of 5,393 (derivation) and 5,289 (external validation) patients, we show they sharply stratify individuals according to disease comorbidities and combined with baseline demographics, are strong predictors for heart failure hospitalization (C-statistic derivation: 0.79, 95% confidence interval 0.77-0.81; validation: 0.81, 0.79-0.82) and acute coronary syndrome (C-statistic derivation: 0.77, 0.73-0.80; validation: 0.75, 0.73-0.78). Using echocardiograms of 3,926 genotyped individuals, we estimate narrow-sense heritability of 9.2%, 20.4% and 6.5%, respectively for echoAI-restMBF, echoAI-stressMBF, and echoAI-CFR. MBF indices show inverse genetic correlation with impedance-derived body mass indices, such as fat-free body mass (e.g., ρ=−0.43, q=0.05 for echoAI-restMBF) and resolve conflicting historical data regarding body mass index and CFR. In terms of diseases, genetic association with ischemic heart disease is seen most prominently for echoAI-stressMBF (ρ=−0.37, q=2.4×10−03). We hypothesize that interpreting one imaging modality through another represents a type of “information bottleneck”, capturing latent features of the original physiologic measurements that have relevance across tissues. Thus, we propose a broader potential role for machine learning algorithms in developing scalable biomarkers that are anchored in known physiology, representative of latent biological factors, and are readily deployable in population health applications. ## Introduction The automated interpretation of medical images, videos, and signals has been a popular application of machine learning algorithms for decades. Recent examples include detecting breast cancer in mammograms1, intracranial hemorrhage in computed tomography scans2, 3, and cardiac amyloidosis in electrocardiograms and echocardiograms4–6. These and other prevailing use cases primarily view medical data as a depiction of anatomical structures requiring segmentation or a collection of features enriched in specific disease states. However, with its high information content, medical imaging has the potential to provide a window into underlying latent disease processes, which may have prognostic utility or motivate more precise biologically grounded disease definitions. For example, coronary flow reserve (CFR) is an intriguing physiological parameter defined as the proportional increase in myocardial blood flow during maximal vasodilatory stress7–10. It is the ratio of the stress myocardial blood flow (stressMBF) to the rest myocardial blood flow (restMBF), and lower values reflect abnormalities in the coronary epicardial vessels and microvasculature. These values have prognostic utility in predicting cardiovascular outcomes11–14 and are abnormal in several cardiac and systemic disease states, including dilated cardiomyopathy15, heart failure with preserved ejection fraction16, osteoporosis17, systemic lupus erythematosus18, rheumatoid arthritis18, and chronic obstructive pulmonary disease19. Coronary blood flow indices are unfortunately challenging to measure without invasive coronary angiography or indirectly through magnetic resonance imaging or positron emission tomography (PET), all of which have high associated costs. CFR has a clear physiological basis, and its association with non-cardiac diseases implies that it reflects more global underlying biologic processes that may be captured by other potentially lower cost modalities. Beyond the pragmatism of developing a more scalable version of these original measurements, building a representation of one modality with another may provide a cleaner readout of the underlying signal free of technical artifacts, reminiscent of the concept of an information bottleneck in machine learning20–22. In this study, we exploited this strategy for biomarker discovery by training convolutional neural networks to estimate rest and stress MBF and CFR using only resting echocardiogram videos as inputs. We deployed the resulting models on thousands of echocardiograms, enabling development and external validation of prognostic models, and investigating their genetic basis. ## Results ### Convolutional neural network models using resting echocardiograms modestly estimate indices of PET-determined myocardial blood flow An overview of our strategy is shown in **Figure 1**. We selected a total of 6,147 echocardiogram studies that occurred within one year of 3,448 corresponding PET studies and trained three-dimensional convolutional neural networks using resting apical 4-chamber (A4c) videos to estimate: 1) myocardial blood flow (MBF) at rest; (2) after vasodilatory stress; and 3) coronary flow reserve (**Figures S1 and S2**). We refer to the resulting parameter estimates as echoAI-restMBF, echoAI-stressMBF, and echoAI-CFR. The output of echo-AI models only had a modest correlation with the corresponding PET measurement: the Spearman’s rank correlation coefficients were 0.35, 0.42, and 0.29 for rest MBF, stress MBF, and CFR, respectively. The corresponding median absolute errors were 0.16 ml/g/min, 0.5 ml/g/min, and 0.4 (**Figure S3**). ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F1) Figure 1. Graphical overview of the study Myocardial blood flow (MBF), as measured by PET, is a powerful predictor of cardiovascular outcomes, but has restricted availability due to equipment cost, tracer availability, technical expertise required, and radiation risk. In contrast, two-dimensional B-mode echocardiography measured at rest is a widely available, scalable technology. We hypothesized that many biological factors that drive abnormalities in MBF (red dotted arrows) would also be reflected within echocardiograms whereas technical factors unique to PET (black dotted arrow) would not. We trained three-dimensional convolutional neural network-derived models to estimate PET-derived indices of MBF, including CFR, using contemporaneously measured echocardiogram videos as inputs. We deployed the resulting models on tens of thousands of echocardiogram videos and found that echoAI derived MBF parameters and CFR sharply stratify patients by comorbidities, predict cardiovascular outcomes, and have genetic correlation with a breadth of clinical traits, shedding light on underlying biological mechanisms. ### Echo-AI stress MBF and echo-AI CFR stratify patients by age and cardiac and metabolic disease status Although the agreement with the original PET-derived estimates was modest, we nonetheless explored the utility of our echoAI-derived measures, which are scalable, and thus can be deployed on a large number of echocardiograms. We selected all patients at Brigham and Women’s Hospital (BWH) who had at least one echocardiogram within a 6-month period (June 2015-November 2015). We used the earliest echocardiogram for each patient, resulting in 5,393 echocardiograms, and we deployed all 3 models on all available A4c videos. Analysis of the baseline patient characteristics stratified by echoAI-restMBF, echoAI-stressMBF, and echoAI-CFR values (**Tables 1-3**) revealed a sharp separation of comorbidity status by quartiles of echoAI-stressMBF and echoAI-CFR. Patients with lower echoAI-stressMBF and echoAI-CFR were older, had worse cardiac disease status, including lower ejection fraction and higher proportions of heart failure, atrial fibrillation, and coronary artery disease (CAD; all p<0.001). Furthermore, strata defined by low echoAI-stressMBF and echoAI-CFR reflected key facets of metabolic disorders associated with microvascular dysfunction, including a greater proportion of patients with type 2 diabetes mellitus, hyperlipidemia, and renal dysfunction12 (**Tables 2 and 3**). View this table: [Table 1.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T1) Table 1. Baseline demographic for EchoAI-restMBF quartiles (BWH cohort) View this table: [Table 2.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T2) Table 2. Baseline demographic for EchoAI-stressMBF quartiles (BWH cohort) View this table: [Table 3.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T3) Table 3. Baseline demographic for EchoAI-CFR quartiles (BWH cohort) The echoAI-restMBF values similarly separated patients according to these baseline attributes but with less sharp differences in the prevalence of comorbidities and indices of diminished cardiac and renal performance (**Table 1**). We further validated these results using an external cohort consisting of 5,289 patients from Massachusetts General Hospital (MGH) who had at least one echocardiogram within Feb 2016-Nov 2016 and observed a similar stratification of comorbidities with echoAI measures (**Tables S1-S3**). ### EchoAI-CFR and echoAI-stressMBF predict cardiovascular outcomes, independent of known risk markers To assess the ability of echoAI measures to predict adverse cardiovascular outcomes, we performed a time-to-event analysis on the same 5,393 patients from BWH (**Figure S4, Tables S4-S9**) for the outcomes of heart failure (HF) hospitalization and acute coronary syndrome (ACS), adjudicated by a machine learning algorithm as described previously19. The median follow-up length for this cohort was 4.1 (interquartile range (IQR): 1.7-4.4) years for heart failure and 4.2 (2.1-4.4) years for ACS. In univariate analysis, the echoAI measures predicted HF hospitalization with AUC 0.60 (95% CI 0.57-0.63), 0.70 (0.68-0.73) and 0.71 (0.69-0.73) and ACS with AUC 0.58 (0.54-0.62), 0.67 (0.63-0.72), and 0.66 (0.63-0.70) for echoAI-restMBF, echoAI-stressMBF and echo-AI-CFR respectively (**Figure 2**, **Figure S5 and S6**). Importantly, the predictive accuracy of echoAI-stressMBF and echoAI-CFR was higher than that of known HF predictors. For example, for HF hospitalization, the AUC for age alone was 0.67, and that of HF history alone was 0.66. For ACS, these echoAI measures’ predictive accuracy was similar to that of known strong predictors such as age and CAD history. We further performed a multivariable Cox proportional hazard model analysis with 9 parameters (the 3 echoAI parameters and 6 baseline characteristics including sex, age, ejection fraction, heart rate, history of CAD, and history of heart failure), which revealed that echoAI-stressMBF and echoAI-CFR were associated with HF hospitalization (hazard ratio per standard deviation change: echoAI-stressMBF=0.78, 95% CI 0.68-0.90, echoAI-CFR=0.71 95% CI 0.63-0.81) and echoAI-stressMBF was associated with ACS (HR: echoAI-stressMBF=0.73, 95% CI 0.56-0.95), independent of other covariates (**Figure 3**). The ROC analysis of the output of this 9-parameter Cox proportional hazard model predicted HF hospitalization (C-statistic 0.79 95%CI 0.77-0.88) and ACS (C-statistic 0.77, 0.73-0.80) with high discriminative accuracy (**Figure S5B and S6B)** and excellent calibration (**Figure S5C and S6C)**. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F2) Figure 2. Cardiovascular event free survival for echo-AI measurements quartiles. Kaplan-Meier plots for HF hospitalization and ACS free survival for echoAI measurement quartiles. ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F3) Figure 3. Hazard ratios of echoAI measurements and various patient characteristics for the multivariable Cox proportional hazard models Forest plot showing the hazard ratio and 95% confidence intervals for each parameter in the Cox proportional hazard models for heart failure hospitalization and ACS. Hazard ratios are plotted as per standard deviation. Abbreviations, ACS: acute coronary syndrome, EF: ejection fraction, CAD: coronary artery disease, MBF: myocardial blood flow, CFR: coronary flow reserve, We validated this result using the MGH external validation cohort (**Figure S7, Tables S10-S15**). This cohort’s median follow-up time was 3.3 (IQR: 1.4-3.6) years and 3.3 (0.9-3.6) years for heart failure hospitalization and ACS, respectively. As with the BWH cohort, echoAI-stressMBF and echoAI-CFR accurately stratified patients by risk of HF hospitalization and ACS in this cohort (**Figure S8**). Multivariable Cox proportional hazard models, trained *de novo* on MGH data, yielded results consistent with findings from the BWH cohort: echoAI-stressMBF and echoAI-CFR both associated with HF hospitalization (HR per standard deviation change: 0.87 95%CI 0.77-0.98 and 0.76 95%CI 0.68-0.84, respectively) and echoAI-stressMBF associated with ACS (HR per standard deviation change: 0.82 95%CI 0.68-0.99) independent of other baseline characteristics (**Figures S9**). We further tested the predictive accuracy of the individual echoAI measurements (along with the 6 baseline features) and the combined 9-parameter model trained on BWH data. In the univariate analysis, predictive accuracy assessed by C-statistics was similar for all 9 parameters (**Figures S10A and S11A**). The 9-parameter model performed similarly to the derivation BWH cohort for heart failure hospitalization (**Figure S10B**, C-statistic 0.81 95% CI 0.79-0.82) and ACS (**Figure S11B**, C-statistic 0.75 95% CI 0.73-0.78). Furthermore, the models had excellent calibration on the MGH cohort without any adjustment (**Figures S10C and S11C**) ### Genetic correlation of echoAI flow parameters with conventional traits Given that our echoAI models have a modest input requirement - an echocardiogram with an apical 4-chamber view – they can enable investigations that have been historically challenging because of limited data availability. For example, there have been no systematic large-scale studies to understand the genetic determinants of myocardial blood flow, primarily because the imaging modalities used are invasive or require specialized expertise to acquire. We deployed the echoAI-restMBF, echoAI-stressMBF, and echoAI-CFR on echocardiograms of 3,926 participants from the Mass General Brigham Biobank, a repository of biomolecular samples combined with clinical data. We used the resulting model outputs as continuous phenotypes and performed a genomewide association study to identify genetic determinants. Although the top SNP for all traits fell below conventional genomewide significance thresholds, we used summary statistics to estimate narrow-sense trait heritability and explore genetic correlations with a diversity of phenotypes, some of which have previously been associated with MBF or CFR. We estimated heritability for echoAI-restMBF, echoAI-stressMBF and echoAI-CFR as 9.2%, 20.4%, and 6.5% respectively and found the MBF traits to be modestly correlated with one another (restMBF/stressMBF ρ=0.26). We next focused on understanding whether the genetic determinants of myocardial blood flow indices are shared with any other trait, using summary statistics from the UK Biobank as a source for comparison. Genetic correlations are useful for parameters where phenotypic correlations with the underlying trait of interest may not be feasible, either because the trait has not been measured in the population or it has been modified by treatment. The strongest genetic correlations were seen for echoAI-stressMBF, which also had the highest estimated heritability. echoAI-stressMBF had an inverse genetic correlation (**Figure 4**) with several traits reflective of impedance-estimated body mass, including whole-body fat-free mass (ρ=−0.32, q=5.1×10−05) and basal metabolic rate (ρ=−0.33, q=5.1 ×10−05). echoAI-stressMBF also had negative correlation with ischemic heart disease (ρ=−0.37, q=2.4 ×10−03) and C-reactive protein (ρ=−0.21, q=0.09). Finally, echoAI-stressMBF had a weak positive genetic correlation with heart rate (ρ=0.16, q=0.10). ![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F4) Figure 4. Genetic and phenotypic correlation of echoAI-CFR, echoAI-restMBF and echoAI-stressMBF with traits previously associated with indices of coronary blood flow. (A) Heatmap representing the genetic correlation coefficient values between echo-AI GWAS summary statistics and multiple traits in UK Biobank. Significant genetic correlations at FDR thresholds 0.01, 0.05, 0.1 are indicated by \***|, **, *. (B) Correlation of PET measures with anthropomorphic and physiologic traits. (C) Correlation of echo-AI estimates with anthropomorphic and physiologic traits. Significant phenotypic correlations with Bonferroni-corrected p-values < 0.05 are indicated by *. We observed similar genetic correlations with echoAI-restMBF, including an even stronger correlation with whole body fat-free mass (ρ=−0.43, q=0.05) and heart rate (ρ=−0.38, q=0.06). As expected, echoAI-restMBF had a weaker correlation with ischemic heart disease (ρ=−0.22, q=0.26). echoAI-restMBF also had a surprising inverse genetic correlation with standing height (ρ=−0.35, q=0.04). Compared with the MBF indices, echoAI-CFR had a weaker correlation with whole body fat-free mass (ρ=−0.15, q=0.27) but interestingly a stronger negative correlation with body mass index (ρ=−0.41, q=0.19) and waist circumference (ρ=−0.37, q=0.20). echoAI-CFR had a negative but non-significant correlation with ischemic heart disease (ρ=−0.33, q=0.30), a negative correlation with neutrophil count (ρ=−0.31, q=0.14), a negative correlation with C-reactive protein (ρ=−0.37, q=0.19), and positive correlation with forced vital capacity (ρ=0.34, q=0.19). Larger sample sizes will be needed to estimate these parameters more precisely. We confirmed several anthropomorphic and physiologic associations at the phenotypic level, using both PET-derived and echoAI indices (**Figure 4**). For height, Spearman correlations were −0.30 −0.26 and 0.0 for PET-restMBF, PET-stressMBF, and PET-CFR and for weight they were −0.14, −0.08, and 0.04, respectively. BMI had no apparent correlation with any of the PET blood flow indices. Resting heart rate was significantly correlated only with PET-restMBF (ρ=0.20) and PET-CFR (ρ=0.13). For the echoAI measures, the Spearman correlations with height were −0.30, −0.25, and −0.13 for echoAI-restMBF, echoAI-stressMBF, and echoAI-CFR, respectively. For weight, Spearman correlations were −0.24, −0.29, and −0.14; for BMI −0.12, −0.20, and −0.13; and for heart rate, −0.15 −0.22, and −0.08, respectively (**Figure 4**). ## Discussion The primary, altogether unexpected result of this work is the fact that relatively modest estimators of complex physiologic parameters can be 1) strongly associated with specific comorbidities associated with abnormal coronary flow; 2) powerful predictors of cardiovascular outcomes independent of known risk markers; and 3) have a defined genetic basis that, through genetic correlation analyses, can shed light on latent biological processes that underlie these metrics. By mapping PET-derived coronary blood flow indices to information derived from a single view of a 2D echocardiogram, we markedly increased the scale of downstream analyses, including longitudinal time-to-event analyses and genetic association analysis. From a workflow standpoint, this mapping of the output of a costly imaging modality to a more readily available one (i.e., echocardiography without Doppler) should also enable broader clinical deployment. The rationale for this work is a belief in common biological factors or pathways underlying a broader set of disease manifestations. In the case of CFR, the strong association of a coronary blood flow parameter with a diverse group of measurements and disease states, including indices of myocardial function23, systemic inflammatory markers19, pulmonary disease19, and bone disease 17 suggest the existence of common latent processes. A parsimonious explanation is that vascular beds across a wide range of tissues have a partially shared susceptibility to internal and external stress and that dysfunction in one bed is reflected in others24, 25. In addition to the pragmatic utility of developing more readily deployable biomarkers, we propose that our strategy of representing one imaging modality’s output via another may help estimate such a shared susceptibility. We describe our method as a type of “information bottleneck”, consistent with information-theoretic work by Tishby and colleagues, formulated two decades ago20, but enjoying recent attention as a possible explanation for the success of deep learning methods21, 22. Their work interprets supervised learning as a deliberate process of mapping a complex input signal into a compressed representation but still preserving relevant information about a target. Our application here goes one step further: by placing two modalities (PET and echocardiography) in series, we are driving the underlying physiologic process (i.e., abnormality of coronary blood flow under vasodilatory stress) through an information bottleneck to extract a specific underlying factor that may be partially obscured in the original measurement. Our genetic correlation work also sheds light on distinctions between stress myocardial blood flow and coronary flow reserve. Previous work has demonstrated independent contributions of these two parameters to cardiovascular outcomes12. We found that echoAI-stressMBF had the strongest genetic correlation with ischemic heart disease and a negative correlation with C-reactive protein, a marker of systemic inflammation, in support of prior phenotypic correlation19, 26. Among other correlations, the inverse association with rest and stress myocardial blood flow with impedance-estimated fat-free body mass sheds light on previous difficult to reconcile findings. Prior phenotypic association of body mass index with coronary flow reserve and hyperemic blood flow in small groups of patients has been interpreted as the deleterious effects of obesity on endothelial cell function26, although body mass index does not unambiguously distinguish between adiposity and muscle mass. Our observed genetic correlation of rest myocardial blood flow with fat-free body mass and standing height (which was further supported by phenotypic data) suggests a complementary hypothesis – there may be competition for blood flow between the heart and remainder of the body: those with greater amounts of metabolically active body mass may have reduced coronary blood flow, independent of any adverse effects of adipose tissue on coronary microcirculation. This observation is in keeping with the previously observed non-monotonic relationship between BMI and CFR27, where CFR increases with BMI in the non-obese range but decreases in the obese range. Our results would indicate potential utility in correcting MBF parameters for lean body mass and height to isolate adverse effects of adipose tissue. The primary limitations of this study are that the PET data were all derived from a single center. However, the utility of the model for predicting cardiovascular events is validated in an external cohort, which showed good discrimination and calibration, suggesting that the model is generalizable to a different institution. A second limitation is that the sample size for genetic association and correlation analysis, though large enough for multiple insights, is also relatively small to discover trait-associated loci. In summary, we describe the development of scalable imaging-based biomarkers with surprisingly good discriminative properties for cardiovascular outcomes, and more broadly, a generalizable strategy for the discovery of measures of latent biological factors that may extend across multiple tissues. ## Methods ### Selection of temporally coincident PET-CFR and echocardiography cases Consecutive clinically indicated rest/stress myocardial perfusion PET studies performed at Brigham and Women’s Hospital (Boston, MA) between January 1, 2006 and November 30, 2017, with a clinically indicated echocardiogram within 365 days were included. Patients who had undergone coronary artery bypass grafting (CABG) or cardiac transplantation prior to the PET or the echocardiogram study, were excluded. PET studies were performed with either 82Rubidium or 13N-ammonia, using computed tomography (CT) for attenuation correction as described previously28. Absolute global myocardial blood flow (MBF) was quantified in mL/min/g at rest and at peak hyperemia by a single operator blinded to patient data using a validated two-compartment tracer kinetic model as described previously29. Coronary flow reserve (CFR) was calculated as ratio between MBF at stress and MBF at rest. For contemporaneous echocardiograms, we selected all two-dimensional resting echocardiograms that occurred within 365 days of the corresponding PET study. ### Echocardiography pre-processing The video processing involved 3 steps: (1) frame scaling, (2) frame rate adjustment and (3) square padding. For each echocardiography video, frames were scaled in each dimension by the original pixel spacing factors (in mm/pixel) that were extracted from the DICOM headers to account for variations in resolution. All images were subsequently scaled by a universal scaling factor of 1.177 which was chosen so that the largest dimension of all videos used for training, testing and validation was smaller than the square model input size of 299 pixels. This universal scaling made sure that the model input size of 299 by 299 pixels could be achieved for all videos by simple zero padding while keeping the correct feature sizes. Given that dynamic information, such as velocity of the heart walls, are dependent on frame rate we then set the model input frame rate to a uniform value of 30 frames per second (fps) for all the videos. We excluded all the videos with framerates of that were smaller than 30 fps or had lengths of less than 1.33s (40 frames × 30 fps). All qualifying videos were subsampled by the formula: Pn=On·fps/30, where Pn is the n-th frame of the processed video and On is the n-th frame of the original video. When the n·fps/30 term was a non-integer value, it was rounded to the closest integer. Videos were saved together with the corresponding labels (CFR and MBF values) as 3-D arrays in the TensorFlow TFRecord serialized format to be used by input pipeline. During training, images and labels were re-shuffled before each epoch, distributed across all 8 GPUs and finally zero-padded to the required square model input size of 299 by 299 pixels without further scaling. We used TensorFlow (version 2.2.0) for implementation and deployment of the neural network models for myocardial blood flow indices. Given that the echocardiograms were time-series of multiple frames, we constructed a 3D-CNN model treating the temporal axis as the 3rd dimension, to maximize the ability of the model to use dynamic features. It consisted of 3 layers of 3D-CNN followed by 12 layers with Multi-3D-CNN-modules, which was constructed by 3 parallel multilayer 3D-CNNs and a max pooling operation concatenated at the end of the module (schematic shown in **Figure S12**). The model was trained using data from the training cohort (**Figure S1**). The echocardiography videos were labeled with the PET-measured CFR or MBF value on the study level and the model was trained to minimize the mean-squared-error between model prediction and the label using an RMSprop optimizer with an initial learning rate of 0.0001. The model was trained for 300 epochs with a batch size of 64 videos distributed across 8 NVIDIA Tesla V100 SXM2 32GB GPUs. At the end of each epoch, the Spearman correlation coefficient was calculated using the validation set. The final models were chosen as the models with the highest correlation coefficient on the validation cohort across all 300 epochs. The code to implement the models along with instructions for installation/usage are provided in a public repository [https://github.com/obi-ml-public/echoAI-PET-measurements](https://github.com/obi-ml-public/echoAI-PET-measurements). ### Cohort selection for event analysis The Massachusetts General Brigham Electronic Data Warehouse (EDW), which includes notes from >20 provider locations across a large integrated delivery network, was the source of notes and structured clinical data used in this study. All information in the electronic health record in the EDW system was gathered including free text medical notes and discharge summaries, and outcome events were adjudicated using a machine learning based model using the discharge summaries as input as described previously30. Briefly, after manually labeling 1,372 training, 592 validation and 1,003 test set discharge summaries, we trained neural network models for event adjudication for heart failure hospitalization (test set AUROC 0.97) and ACS (test set AUROC 0.98). We selected classification thresholds to maximize sensitivity at specificity over 0.90 (HF: cutoff 0.12, sensitivity 0.94, specificity 0.91. ACS: cutoff 0.04, sensitivity 0.91, specificity 0.91). The discharge summary classification models were deployed on 44,150 notes and a binary event status was determined by each model for each note. To test if the echoAI measurements can predict clinical outcomes, survival analysis was performed using patients who had echocardiography within Jun 2015 and Nov 2015 at BWH. The first echocardiography study for each patient was identified and the echoAI models were ran on those videos to obtain the corresponding measurements. Follow up for each patient was determined using hospital and clinic encounters within the system. Patients were excluded if they had (1) no notes in the system, (2) less than 30 days follow up or (3) had an event during the first 30 days. To avoid the possible bias caused by the influence of COVID-19, all the patients were censored at Feb 2019. An external validation cohort was selected with similar procedure from patients who had at least one echo cardiogram from MGH (**Figure S7**). Since MGH started to enter notes from Feb 2016, the inclusion period was set to Feb 2016 to Nov 2016 for MGH. ### Analysis of predictive accuracy of echo-AI rest MBF, echo-AI stress MBF and echo AI CFR for heart failure hospitalization Receiver operating characteristics (ROC) curves for the echoAI measurements along with ejection fractions, heart rate and 4 patient demographic variables (age, sex, heart failure history and CAD history) were calculated using timeROC package version 0.4. A multivariable Cox proportional hazard (PH) model including all 9 parameters were trained using survival package (version 3.2.3) and the ROC curve for the model output was calculated with the timeROC package. The individual hazard ratios for the multivariable Cox PH model were analyzed to assess the adjusted association of each parameter to the outcome. For external validation, two separate Cox PH models were used. To assess the ability of the 9-parameter model to predict outcomes, the original Cox PH models trained on BWH data were used without any weight adjustment. However, to analyze the individual adjusted hazard ratios for the echoAI measures along with 6 baseline characteristics information, we trained separate Cox PH models with the same input parameters on MGH data. ### Genetic association analysis Genomewide SNP array data is available for over 36,000 individuals enrolled in the Massachusetts General Brigham Biobank31, a collection of plasma, serum, and DNA samples of consented subjects with associated linkage to their electronic health records. We downloaded echocardiograms for 4,400 of these patients and were able to compute echoAI-CFR and echoAI-stressMBF on 4,145. To minimize confounding by population structure, we focused our genetic association analyses on 3,925 white study subjects. We performed a genomewide association study using age, sex, and the first 10 principal components as covariates using PLINK 2.032. Summary statistics were used to estimate narrow-sense heritability with the help of the R *HDL* package33. Genetic correlation with a prioritized selection of traits from the UK Biobank was also performed using summary statistics provided by the Neale Lab ([http://www.nealelab.is/uk-biobank/](http://www.nealelab.is/uk-biobank/)). Genetic correlation p-values for UK Biobank traits were adjusted for multiple hypothesis testing using the R *qvalue* package34. We set a false-discovery threshold for “significance” at 0.20, though any such threshold is arbitrary and should be driven by downstream decisions related to the corresponding finding. ### Statistical analysis Data were collected and stored using Numpy package version 1.19.2 with Python 3.7.3. The scatter plots and ROC curves are plotted using the *ggplot2*35 package (R 3.6.1). Continuous values are presented as mean ± standard deviation (SD) and categorical values are presented as numbers and percentages if not otherwise specified. ### Ethics statement This study complies with all ethical regulations and guidelines. The study protocol was approved by local institutional review board (IRB) of Mass General Brigham (2019P002651). This study had minimal patient risk: it collected data retrospectively, there was no direct contact with patients, and data were collected after medical care was completed. Thus, and to recruit an unbiased and representative cohort of patients, data were collected under a waiver of informed consent, which was approved by the IRB. ### Data availability The data that support the findings of this study are available on request from the corresponding author R.C.D. upon approval of the data sharing committees. The data are not publicly available due to the presence of information that could compromise research participant privacy. ### Code availability The code to run the models along with instructions for installation/usage are provided in a public repository [https://github.com/obi-ml-public/echoAI-PET-measurements](https://github.com/obi-ml-public/echoAI-PET-measurements). The model weights may contain patient personal information and thus, could not be shared. We provide a web-interface to run our model and generate predictions at [http://onebraveideaml.org](http://onebraveideaml.org) ## Supporting information Genetic Correlation Data [[supplements/251025_file02.xlsx]](pending:yes) ## Data Availability The data that support the findings of this study are available on request from the corresponding author R.C.D. upon approval of the data sharing committees. The data are not publicly available due to the presence of information that could compromise research participant privacy. [https://github.com/obi-ml-public/echoAI-PET-measurements](https://github.com/obi-ml-public/echoAI-PET-measurements) [http://onebraveideaml.org](http://onebraveideaml.org) ## Competing interests R.C.D is supported by grants from the National Institute of Health, the American Heart Association (One Brave Idea, Apple Heart and Movement Study) and GE Healthcare, has received consulting fees from Novartis and Pfizer, and is co-founder of Atman Health. C.A.M. is a consultant for Pfizer and co-founder of Atman Health. All other authors declare no competing interests. ## Supplementary Figures ![Figure S1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F5.medium.gif) [Figure S1.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F5) Figure S1. Consort diagram of echo selections for echoAI-CFR model training and testing. ![Figure S2](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F6.medium.gif) [Figure S2](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F6) Figure S2 Consort diagram of echo selections for echoAI-restMBF and echoAI-stressMBF model training and testing. ![Figure S3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F7.medium.gif) [Figure S3.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F7) Figure S3. Agreement of EchoAI measurements with the PET measurements (A) Scatter plots and (B) Bland-Altman plot for stress MBF, rest MBF and CFR. Observed represents the PET measured value and Predicted represents the echo-AI predicted value. Abbreviations, PET: positron emission tomography, MBF: myocardial blood flow, CFR: coronary flow reserve. ![Figure S4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F8.medium.gif) [Figure S4.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F8) Figure S4. Consort diagram of patient selection for survival analysis. ![Figure S5](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F9.medium.gif) [Figure S5](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F9) Figure S5 Predictive accuracy of echoAI measurements and various patient characteristics for heart failure hospitalization (A) Receiver operating characteristics curves for the echoAI parameters, ejection fraction, heart rate and other patients’ demographics information to predict heart failure hospitalization. (B) Receiver operating characteristics curve and (C) calibration plot for the 9-parameter combined model to predict heart failure hospitalization. Abbreviations, EF: ejection fraction, CAD: coronary artery disease, MBF: myocardial blood flow, CFR: coronary flow reserve. ![Figure S6.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F10.medium.gif) [Figure S6.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F10) Figure S6. Predictive accuracy of echoAI measurements and various patient characteristics for ACS (A) Receiver operating characteristics curves for the echoAI parameters, ejection fraction, heart rate and other patients’ demographics information to predict ACS. (B) Receiver operating characteristics curve and (C) calibration plot for the 9 parameters combined model to predict ACS. Abbreviations, ACS: acute coronary syndrome, EF: ejection fraction, CAD: coronary artery disease, MBF: myocardial blood flow, CFR: coronary flow reserve. ![Figure S7.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F11.medium.gif) [Figure S7.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F11) Figure S7. Consort diagram of patient selection for survival analysis in external validation cohort from MGH. ![Figure S8.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F12.medium.gif) [Figure S8.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F12) Figure S8. Cardiovascular event free survival for echo-AI measurements quartiles in external validation cohort from MGH. Kaplan-Meier plots for HF hospitalization and ACS free survival for echoAI measurement quartiles. MGH: Massachusetts General Hospital. ![Figure S9.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F13.medium.gif) [Figure S9.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F13) Figure S9. Hazard ratios of echoAI measurements and various patient characteristics for the multivariable Cox proportional hazard models in external validation cohort from MGH Forest plot showing the hazard ratio and 95% confidence intervals for each parameter in the Cox proportional hazard models for heart failure hospitalization and ACS. Hazard ratios are plotted as per standard deviation. Abbreviations, ACS: acute coronary syndrome, EF: ejection fraction, CAD: coronary artery disease, MBF: myocardial blood flow, CFR: coronary flow reserve, MGH: Massachusetts General Hospital. ![Figure S10](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F14.medium.gif) [Figure S10](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F14) Figure S10 Predictive accuracy of echoAI measurements and various patient characteristics for heart failure hospitalization in external validation cohort from MGH (A) Receiver operating characteristics curves for the echoAI parameters, ejection fraction, heart rate and other patients’ demographics information to predict heart failure hospitalization. (B) Receiver operating characteristics curve and (C) calibration plot for the 9 parameters combined model to predict heart failure hospitalization. Abbreviations, EF: ejection fraction, CAD: coronary artery disease, MBF: myocardial blood flow, CFR: coronary flow reserve. ![Figure S11.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F15.medium.gif) [Figure S11.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F15) Figure S11. Predictive accuracy of echoAI measurements and various patient characteristics for ACS in external validation cohort from MGH (A) Receiver operating characteristics curves for the echoAI parameters, ejection fraction, heart rate and other patients’ demographics information to predict ACS. (B) Receiver operating characteristics curve and (C) calibration plot for the 9 parameters combined model to predict ACS. Abbreviations, ACS: acute coronary syndrome, EF: ejection fraction, CAD: coronary artery disease, MBF: myocardial blood flow, CFR: coronary flow reserve. ![Figure S12.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/02/08/2021.02.07.21251025/F16.medium.gif) [Figure S12.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/F16) Figure S12. Schematic illustration of the model architecture. The model architecture for the (A) multi-CNN module and the (B) overall model. Abbreviations, CNN: convolutional neural network ## Supplementary Tables View this table: [Table S1.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T4) Table S1. Baseline demographic for EchoAI-restMBF quartiles (MGH cohort) View this table: [Table S2.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T5) Table S2. Baseline demographic for EchoAI-stressMBF quartiles (MGH cohort) View this table: [Table S3.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T6) Table S3. Baseline demographic for EchoAI-CFR quartiles (MGH cohort) View this table: [Table S4.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T7) Table S4. Baseline demographic for EchoAI-restMBF quartiles for heart failure survival analysis (BWH cohort) View this table: [Table S5.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T8) Table S5. Baseline demographic for EchoAI-stressMBF quartiles for heart failure survival analysis (BWH cohort) View this table: [Table S6.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T9) Table S6. Baseline demographic for EchoAI-CFR quartiles for heart failure survival analysis (BWH cohort) View this table: [Table S7.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T10) Table S7. Baseline demographic for EchoAI-restMBF quartiles for ACS survival analysis (BWH cohort) View this table: [Table S8.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T11) Table S8. Baseline demographic for EchoAI-stressMBF quartiles for ACS survival analysis (BWH cohort) View this table: [Table S9.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T12) Table S9. Baseline demographic for EchoAI-CFR quartiles for ACS survival analysis (BWH cohort) View this table: [Table S10.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T13) Table S10. Baseline demographic for EchoAI-restMBF quartiles for heart failure survival analysis (MGH cohort) View this table: [Table S11.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T14) Table S11. Baseline demographic for EchoAI-stressMBF quartiles for heart failure survival analysis (MGH cohort) View this table: [Table S12.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T15) Table S12. Baseline demographic for EchoAI-CFR quartiles for heart failure survival analysis (MGH cohort) View this table: [Table S13.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T16) Table S13. Baseline demographic for EchoAI-restMBF quartiles for ACS survival analysis (MGH cohort) View this table: [Table S14.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T17) Table S14. Baseline demographic for EchoAI-stressMBF quartiles for ACS survival analysis (MGH cohort) View this table: [Table S15.](http://medrxiv.org/content/early/2021/02/08/2021.02.07.21251025/T18) Table S15. Baseline demographic for EchoAI-CFR quartiles for ACS survival analysis (MGH cohort) ## Acknowledgement This work was supported by One Brave Idea, co-founded by the American Heart Association and Verily with significant support from AstraZeneca and pillar support from Quest Diagnostics (to CAM and RCD), NIH/NHLBI HL140731 (to RCD) SENSHIN Medical Research Foundation (to SG), the Kanae foundation for the promotion of medical science (to SG), the Uehara Memorial Foundation (to SG), SG acknowledge support from Mower fellowship. * Received February 7, 2021. * Revision received February 7, 2021. * Accepted February 8, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.McKinney, S. M. et al. International evaluation of an AI system for breast cancer screening. Nature 577, 89–94 (2020). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-019-1799-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31894144&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F08%2F2021.02.07.21251025.atom) 2. 2.Lee, H. et al. An explainable deep-learning algorithm for the detection of acute intracranial haemorrhage from small datasets. Nat Biomed Eng 3, 173–182 (2019). 3. 3.Arbabshirani, M. R. et al. Advanced machine learning in action: identification of intracranial hemorrhage on computed tomography scans of the head with clinical workflow integration. Npj Digital Medicine 1, 9 (2018). 4. 4.Tison, G. H., Zhang, J., Delling, F. N. & Deo, R. C. Automated and Interpretable Patient ECG Profiles for Disease Detection, Tracking, and Discovery. Circulation Cardiovasc Qual Outcomes 12, e005289 (2019). 5. 5.Goto, S. et al. Artificial Intelligence-Enabled, Fully Automated Detection of Cardiac Amyloidosis Using Electrocardiograms and Echocardiograms. (2020) doi:10.1101/2020.07.02.20141028. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMC4wNy4wMi4yMDE0MTAyOHYxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDIvMDgvMjAyMS4wMi4wNy4yMTI1MTAyNS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 6. 6.Zhang, J. et al. Fully Automated Echocardiogram Interpretation in Clinical Practice. Circulation 138, 1623–1635 (2018). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F08%2F2021.02.07.21251025.atom) 7. 7.Hoef, T. P. van de, Siebes, M., Spaan, J. A. E. & Piek, J. J. Fundamentals in clinical coronary physiology: why coronary flow is more important than coronary pressure. Eur Heart J 36, 3312– 3319 (2015). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/eurheartj/ehv235&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26033981&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F08%2F2021.02.07.21251025.atom) 8. 8.Gould, L. K. & Johnson, N. P. Coronary Physiology Beyond Coronary Flow Reserve in Microvascular Angina JACC State-of-the-Art Review. Journal of the American College of Cardiology 72, 2642–2662 (2018). [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo0OiJhY2NqIjtzOjU6InJlc2lkIjtzOjEwOiI3Mi8yMS8yNjQyIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDIvMDgvMjAyMS4wMi4wNy4yMTI1MTAyNS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 9. 9.Taqueti, V. R. & Carli, M. F. Coronary Microvascular Disease Pathogenic Mechanisms and Therapeutic Options JACC State-of-the-Art Review. Journal of the American College of Cardiology 72, 2625–2641 (2018). [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo0OiJhY2NqIjtzOjU6InJlc2lkIjtzOjEwOiI3Mi8yMS8yNjI1IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDIvMDgvMjAyMS4wMi4wNy4yMTI1MTAyNS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 10. 10.Gan, L.-M., Wikström, J. & Fritsche-Danielson, R. Coronary Flow Reserve from Mouse to Man—from Mechanistic Understanding to Future Interventions. J Cardiovasc Transl 6, 715–728 (2013). 11. 11.Bravo, P. E. et al. Diagnostic and prognostic value of myocardial blood flow quantification as non-invasive indicator of cardiac allograft vasculopathy. European heart journal 39, 316–323 (2018). 12. 12.Taqueti, V. R. et al. Global coronary flow reserve is associated with adverse cardiovascular events independently of luminal angiographic severity and modifies the effect of early revascularization. Circulation 131, 19–27 (2015). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjg6IjEzMS8xLzE5IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDIvMDgvMjAyMS4wMi4wNy4yMTI1MTAyNS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 13. 13.Taqueti, V. R. et al. Interaction of impaired coronary flow reserve and cardiomyocyte injury on adverse cardiovascular outcomes in patients without overt coronary artery disease. Circulation 131, 528–35 (2015). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6MTQ6ImNpcmN1bGF0aW9uYWhhIjtzOjU6InJlc2lkIjtzOjk6IjEzMS82LzUyOCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzAyLzA4LzIwMjEuMDIuMDcuMjEyNTEwMjUuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 14. 14.Hage, C. et al. Association of Coronary Microvascular Dysfunction with Heart Failure Hospitalizations and Mortality in Heart Failure with Preserved Ejection Fraction – a follow-up in the PROMIS-HFpEF study. J Card Fail 26, 1016–1021 (2020). 15. 15.Heuvel, A. F. M. van den et al. Regional myocardial blood flow reserve impairment and metabolic changes suggesting myocardial ischemia in patients with idiopathic dilated cardiomyopathy. J Am Coll Cardiol 35, 19–28 (2000). [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo0OiJhY2NqIjtzOjU6InJlc2lkIjtzOjc6IjM1LzEvMTkiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMS8wMi8wOC8yMDIxLjAyLjA3LjIxMjUxMDI1LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 16. 16.Dryer, K. et al. Coronary microvascular dysfunction in patients with heart failure with preserved ejection fraction. Am J Physiol-heart C 314, H1033–H1042 (2018). 17. 17.Lerman, A. et al. Coronary microvascular endothelial dysfunction is an independent predictor of development of osteoporosis in postmenopausal women. Vasc Heal Risk Management Volume 10, 533–538 (2014). 18. 18.Recio-Mayoral, A. et al. Chronic inflammation and coronary microvascular dysfunction in patients without risk factors for coronary artery disease. Eur Heart J 30, 1837–1843 (2009). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/eurheartj/ehp205&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19502228&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F08%2F2021.02.07.21251025.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000268806100011&link_type=ISI) 19. 19.Nakamori, S. et al. Myocardial perfusion reserve is impaired in patients with chronic obstructive pulmonary disease: a comparison to current smokers. European Hear J - Cardiovasc Imaging 15, 180–188 (2014). 20. 20.Tishby, N., Pereira, F. C. & Bialek, W. The information bottleneck method. (2000). 21. 21.Tishby, N. & Zaslavsky, N. Deep Learning and the Information Bottleneck Principle. (2015). 22. 22.Alemi, A. A., Fischer, I., Dillon, J. V. & Murphy, K. Deep Variational Information Bottleneck. (2016). 23. 23.Shah, S. J. et al. Prevalence and correlates of coronary microvascular dysfunction in heart failure with preserved ejection fraction: PROMIS-HFpEF. European heart journal 39, 3439– 3450 (2018). [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30165580&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F08%2F2021.02.07.21251025.atom) 24. 24.Neunteufl, T. et al. Systemic endothelial dysfunction is related to the extent and severity of coronary artery disease. Atherosclerosis 129, 111–118 (1997). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0021-9150(96)06018-2&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9069525&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F08%2F2021.02.07.21251025.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1997WX81300016&link_type=ISI) 25. 25.Broxterman, R. M. et al. Strong Relationship Between Vascular Function in the Coronary and Brachial Arteries: A Clinical Coming of Age for the Updated Flow-Mediated Dilation Test? Hypertension 74, 208–215 (2019). 26. 26.Schindler, T. H. et al. Relationship Between Increasing Body Weight, Insulin Resistance, Inflammation, Adipocytokine Leptin, and Coronary Circulatory Function. J Am Coll Cardiol 47, 1188–1195 (2006). [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo0OiJhY2NqIjtzOjU6InJlc2lkIjtzOjk6IjQ3LzYvMTE4OCI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzAyLzA4LzIwMjEuMDIuMDcuMjEyNTEwMjUuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 27. 27.Bajaj, N. S. et al. Coronary Microvascular Dysfunction and Cardiovascular Risk in Obese Patients. J Am Coll Cardiol 72, 707–717 (2018). [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6MzoiUERGIjtzOjExOiJqb3VybmFsQ29kZSI7czo0OiJhY2NqIjtzOjU6InJlc2lkIjtzOjg6IjcyLzcvNzA3IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDIvMDgvMjAyMS4wMi4wNy4yMTI1MTAyNS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 28. 28.Taqueti, V. R. et al. Coronary microvascular dysfunction and future risk of heart failure with preserved ejection fraction. Eur Heart J 39, 840–849 (2017). 29. 29.Fakhri, G. E. et al. Reproducibility and Accuracy of Quantitative Myocardial Blood Flow Assessment with 82Rb PET: Comparison with 13N-Ammonia PET. J Nucl Med 50, 1062–1071 (2009). [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Njoiam51bWVkIjtzOjU6InJlc2lkIjtzOjk6IjUwLzcvMTA2MiI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIxLzAyLzA4LzIwMjEuMDIuMDcuMjEyNTEwMjUuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 30. 30.Goto, S. et al. Artificial intelligence-enabled event adjudication: estimating delayed cardiovascular effects of respiratory viruses. doi:10.1101/2020.11.12.20230706. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoibWVkcnhpdiI7czo1OiJyZXNpZCI7czoyMToiMjAyMC4xMS4xMi4yMDIzMDcwNnYxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjEvMDIvMDgvMjAyMS4wMi4wNy4yMTI1MTAyNS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 31. 31.Gainer, V. S. et al. The Biobank Portal for Partners Personalized Medicine: A Query Tool for Working with Consented Biobank Samples, Genotypes, and Phenotypes Using i2b2. J Personalized Medicine 6, 11 (2016). 32. 32.Purcell, S. et al. PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am J Hum Genetics 81, 559–575 (2007). [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1086/519795&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17701901&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F02%2F08%2F2021.02.07.21251025.atom) 33. 33.Ning, Z., Pawitan, Y. & Shen, X. High-definition likelihood inference of genetic correlations across human complex traits. Nat Genet 1–6 (2020) doi:10.1038/s41588-020-0653-y. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-020-0653-y&link_type=DOI) 34. 34.Storey, J. D., Bass, A. J., Dabney, A. & Robinson, D. qvalue: Q-value estimation for false discovery rate control. (2019). 35. 35.Wickham, H. ggplot2: Elegant Graphics for Data Analysis. (Springer-Verlag New York, 2016).