Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

CARDIAC-FM: A Multimodal Foundation Model for Cardiovascular Risk Prediction Using ECG and Cardiac MRI

Fumin Li, Siting Li, Yuhan Qian, Bojun Chen, Jennifer A Brody, Vidhushei Yogeswaran, Kerri L Wiggins, Colleen M Sitlani, Joshua C Bis, Ali Shojaie, W T Longstreth Jr, Bruce M Psaty, Geoffrey H Tison, Simon Du, James S Floyd, Ting Ye
doi: https://doi.org/10.64898/2026.03.16.26348526
Fumin Li
1Department of Statistics, University of Washington, Seattle, WA, USA
2Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Siting Li
3Paul G. Allen School of Computer Science & Engineering, University of Washington, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yuhan Qian
4Department of Biostatistics, University of Washington, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bojun Chen
4Department of Biostatistics, University of Washington, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jennifer A Brody
5Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Vidhushei Yogeswaran
5Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kerri L Wiggins
5Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Colleen M Sitlani
5Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Joshua C Bis
5Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ali Shojaie
1Department of Statistics, University of Washington, Seattle, WA, USA
4Department of Biostatistics, University of Washington, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
W T Longstreth Jr
6Department of Epidemiology, University of Washington, Seattle, Washington, USA
7Department of Neurology, University of Washington, Seattle, Washington, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bruce M Psaty
5Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
6Department of Epidemiology, University of Washington, Seattle, Washington, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Geoffrey H Tison
8Division of Cardiology, University of California-San Francisco, San Francisco, California, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Simon Du
3Paul G. Allen School of Computer Science & Engineering, University of Washington, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
James S Floyd
5Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
6Department of Epidemiology, University of Washington, Seattle, Washington, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ting Ye
4Department of Biostatistics, University of Washington, Seattle, WA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: tingye1{at}uw.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Atrial fibrillation and heart failure impose substantial health burdens worldwide, yet existing prediction models lack sufficient accuracy and generalizability. We developed CARDIAC-FM, a multimodal foundation model that learns joint representations of 12-lead electrocardiogram (ECG) and cardiac magnetic resonance imaging (MRI) through contrastive learning. We trained CARDIAC-FM on 57,609 paired ECG-cardiac MRI samples from UK Biobank and evaluated it in two external cohorts: the Cardiovascular Health Study (CHS) and the Multi-Ethnic Study of Atherosclerosis (MESA). CARDIAC-FM consistently outperformed unimodal models across all cohorts, and jointly incorporating ECG features with established clinical risk scores yielded additive gains in discrimination, indicating that ECG and traditional risk factors capture complementary dimensions of cardiovascular risk. The learned representations improved prediction across a range of cardiovascular outcomes with minimal task-specific fine-tuning, reflecting real-world settings where many diseases have limited positive samples and lack dedicated risk models. Although trained on paired ECG and MRI data, CARDIAC-FM generates predictions using ECG alone or ECG combined with established risk scores, enabling broad clinical deployment without MRI. These findings demonstrate the promise of multimodal pre-training for generalizable cardiovascular risk prediction.

Introduction

Atrial fibrillation and heart failure cause substantial cardiovascular morbidity and mortality worldwide, and their prevalence is projected to rise substantially over the coming decades1–3. Early identification of individuals at elevated risk is essential for targeted prevention, clinical decision-making, risk stratification across both clinical practice and trial design. However, existing risk prediction models based on traditional cardiovascular risk factors show varying discrimination and limited generalizability across populations4–6.

Deep learning applied to the resting 12-lead electrocardiogram (ECG) has shown promise for cardiovascular risk assessment. Convolutional neural networks trained on ECGs can detect subclinical cardiovascular conditions – including structural heart disease7, left atrial abnormalities8,9, ventricular dysfunction10 – and predict incident atrial fibrillation11–13 and other cardiovascular outcomes14–16. More recently, ECG foundation models trained with self-supervised learning on unlabeled data have further improved representation learning from electrocardiographic signals17–19. In parallel, deep learning applied to cardiac magnetic resonance imaging (MRI) has enabled automated cardiovascular disease screening and diagnosis20–23.

Despite these advances, current artificial intelligence (AI) models share several limitations that constrain clinical translation. Most models rely on a single modality, even though complementary data sources capture distinct and synergistic aspects of cardiac pathophysiology. Models are often developed in highly selected clinical cohorts, which limits their utility for risk prediction and population-based screening in apparently healthy individuals. Most models are also task-specific, requiring separate training for each outcome of interest. Finally, few have demonstrated robust generalizability across populations with different demographic characteristics, risk profiles, and healthcare contexts.

Recent multimodal approaches have begun to address the first limitation by integrating ECG with additional data modalities23–25 or combining imaging with clinical text26. However, these efforts have focused largely on identification of prevalent conditions rather than prospective prediction of cardiovascular events. To date, no foundation model has combined ECG and cardiac MRI using contrastive learning to predict incident cardiovascular events with demonstrated transferability across multiple large, independent cohorts and comparative performance against traditional risk scores.

Here, we introduce CARDIAC-FM (Contrastive Alignment for Risk Detection In Cardiovascular Foundation Model), a multimodal foundation model that integrates large-scale 12-lead ECG and cardiac MRI data from the UK Biobank. CARDIAC-FM uses a two-stage training framework. In the first stage, paired ECG and MRI data from 57,609 samples are used to pretrain modality-specific encoders and align their representations in a shared latent space through contrastive learning, enabling the ECG encoder to capture structural and functional information measured by MRI. In the second stage, a lightweight prediction head is added and fine-tuned for downstream outcome prediction, allowing efficient adaptation to various clinical endpoints without requiring separate, resource-intensive model training for each target condition. This stands in contrast to conventional approaches, where a dedicated model must be trained from scratch for every new outcome, demanding substantial labeled data and computational resources each time.

At inference, CARDIAC-FM can generate predictions using ECG alone or ECG combined with MRI, and can be further augmented with established risk scores, accommodating heterogeneous data availability in real-world clinical settings. We evaluated CARDIAC-FM for prediction of incident atrial fibrillation and heart failure in the UK Biobank and in two independent, multi-site prospective cohort studies in the United States – the Cardiovascular Health Study (CHS)27 and the Multi-Ethnic Study of Atherosclerosis (MESA)28. We also evaluated predictions across a broad range of additional cardiovascular outcomes in CHS and MESA. Across all cohorts, CARDIAC-FM consistently outperformed unimodal AI models, demonstrating superior generalizability. Jointly incorporating ECG and established risk scores through CARDIAC-FM yielded additive improvements in discrimination, indicating that ECG signals and traditional risk factors capture complementary aspects of cardiovascular risk. We further evaluated the pre-trained representations on additional cardiovascular outcomes in CHS and MESA — including myocardial infarction, ischaemic stroke, cardioembolic stroke, cardiovascular death, and all-cause death — under few-shot learning, reflecting the reality that labeled samples are often scarce in external cohorts. Strong performance under this limited-data regime confirms that multimodal pre-training captures a comprehensive representation of cardiac health that transfers effectively to diverse clinical endpoints. To our knowledge, CARDIAC-FM is the first ECG-MRI multimodal foundation model for prospective cardiovascular risk prediction with external validation across multiple independent cohorts. We publicly release the model code and pretrained weights to facilitate reproducibility and accelerate clinical translation of multimodal AI in cardiovascular medicine.

Results

UK Biobank Data and Participant Characteristics

At the time of analysis, resting 12-lead ECG and cardiac MRI data were available from 71,144 UK Biobank participants who had attended the baseline imaging visit. Participants had a mean age of 65.6 years (s.d. 7.8); 51.4% were female, and a total of 6,182 (8.7%) returned for repeat imaging, with a median inter-scan interval of 3.0 years (interquartile range 2.2-3.4 years), yielding 77,326 imaging visits (Supplementary Tables 1-2). ECG and MRI were acquired on the same day at each visit.

Among all the imaging visits, 70,424 visits had evaluable MRI and 67,661 had evaluable ECG data. Most visits (80.3%) had both modalities available, 7.1% had ECG only, and 10.7% had MRI only (Supplementary Table 1). After applying pre-specified quality control criteria (Methods), a total of 57,609 visits with paired, high-quality ECG and MRI data were retained for analysis. Participants with prevalent atrial fibrillation or heart failure at baseline were retained for pretraining but excluded from outcome prediction analyses. We randomly partitioned participants into training (60%), validation (5%), and test (35%) sets at the participant level, ensuring that all visits from the same individual were assigned to a single partition.

Overview of Multimodal AI model

We developed a multimodal contrastive learning framework that integrates ECG and MRI data through self-supervised representation learning (Fig 1). During multimodal pretraining, modality-specific encoders extract features from raw ECG and MRI inputs, which are aligned in a shared latent space using a symmetric InfoNCE contrastive loss29. The model was trained on 34,596 samples with paired ECG and MRI data. For downstream outcome prediction, we excluded participants with prevalent disease at baseline, added a prediction layer to the learned representations, and fine-tuned the model for 20 epochs using supervised learning. The final model supports four input configurations: 1. ECG-only, 2.ECG + MRI, 3. ECG + Risk Score, 4. ECG + MRI + Risk Score, enabling deployment across clinical settings with varying imaging availability. Full methodological details are provided in Methods.

Figure 1:
  • Download figure
  • Open in new tab
Figure 1: Overview of the CARDIAC-FM framework.

A, Multimodal pretraining. ECG waveforms are processed through the ECG Foundation Model (ECG-FM) encoder and MRI (short-axis (SAX) stack, four-chamber (4CH), and two-chamber (2CH) views; 25 frames each) are processed through a convolutional neural network–long short-term memory (CNN-LSTM) encoder. The resulting representations are aligned in a shared latent space using contrastive loss. B, Downstream prediction. The pretrained ECG encoder can be used independently with ECG input alone (left) or combined with the pre-trained MRI encoder for multimodal prediction (right). Both configurations can further incorporate established clinical risk scores, enabling flexible deployment across clinical settings with varying data availability.

To evaluate the impact of modality combinations and clinical risk scores on predictive performance, we compared several model configurations. The comparator models were two established clinical risk scores derived from traditional risk factors (CHARGE-AF for atrial fibrillation and PREVENT-HF for heart failure)6,30, and an ECG-only foundation model without multimodal pretraining (ECG-FM). We then evaluated CARDIAC-FM under two input configurations: ECG only and combined ECG and MRI. For each AI model (ECG-FM and both CARDIAC-FM configurations), we further constructed variants that incorporated the corresponding clinical risk score (CHARGE-AF or PREVENT-HF) via late fusion. Performance was assessed on held-out test data using the area under the receiver operating characteristic curve (AUROC) and the area under the precision–recall curve (AUPRC).

Fine-Tuning for Cardiac MRI Feature Prediction

We fine-tuned CARDIAC-FM with ECG-only input to predict MRI-derived cardiac structural and functional measures: left ventricular ejection fraction (LVEF), left ventricular mass (LVM), left ventricular end-diastolic volume (LVEDV), left ventricular end-systolic volume (LVESV), left atrial ejection fraction (LAEF), left atrial minimum volume (LAVmin), and left atrial maximum volume (LAVmax). ECG-predicted values were strongly correlated with direct imaging measures across all features (Pearson r=0.51-0.79; R2= 0.25-0.62; Supplementary Fig. 6; Supplementary Table 17). Correlations were generally higher for structural measures (LVM, r = 0.79; LVEDV, r = 0.72; LVESV, r = 0.71; LAVmin, r = 0.67) than for functional indices (LVEF, r = 0.51; LAEF, r = 0.55), with LAVmax showing intermediate performance (r = 0.57). These findings demonstrate that ECG representations learned through multimodal pretraining capture structural and functional cardiac information acquired with MRI, enabling imputation of imaging-derived features when cardiac MRI is unavailable.

Fine-Tuning for Downstream Clinical Outcome Prediction

We evaluated CARDIAC-FM for prediction of incident atrial fibrillation and heart failure at 5 years, comparing ECG-only and combined ECG–MRI inputs. All models were evaluated in a held-out UK Biobank test set restricted to participants with both modalities available, enabling fair comparison across input configurations (outcome rates in Supplementary Table 2; results in Fig. 2 and Supplementary Table 5).

Figure 2:
  • Download figure
  • Open in new tab
Figure 2: Prediction of incident atrial fibrillation and heart failure in the UK Biobank.

Discrimination (AUROC, left) and precision–recall (AUPRC, right) for 5-year prediction of incident atrial fibrillation and heart failure in the held-out test set. Models compared include clinical risk scores alone (CHARGE-AF for atrial fibrillation; PREVENT-HF for heart failure), an ECG foundation model without multimodal pretraining (ECG-FM), and CARDIAC-FM under four input configurations: ECG only, ECG + MRI, ECG + clinical risk score, and ECG + MRI + clinical risk score. Error bars indicate 95% confidence intervals. CARDIAC-FM combined with clinical risk scores outperformed unimodal baselines across most comparisons.

For ECG-only models, ECG-FM (without multimodal pretraining) achieved AUROCs of 0.75 for atrial fibrillation and 0.76 for heart failure. CARDIAC-FM with ECG-only input showed comparable performance for atrial fibrillation but improved discrimination for heart failure (AUROC 0.80). Incorporating clinical risk scores into CARDIAC-FM substantially improved atrial fibrillation prediction but conferred smaller additional benefit for heart failure. Among models without MRI, CARDIAC-FM combining ECG with clinical risk score achieved the highest discrimination: AUROC 0.82 for atrial fibrillation and 0.81 for heart failure. These results indicate that ECG waveforms provide prognostic information independent of and complementary to traditional cardiovascular risk factors.

Adding MRI to CARDIAC-FM yielded minimal additional gains for atrial fibrillation prediction (AUROC and AUPRC differences <1 point) but improved heart failure prediction (AUROC gains of 1 point; AUPRC gains of 3-5 points). The full CARDIAC-FM model integrating ECG, MRI, and clinical risk scores achieved AUROCs of 0.82 for atrial fibrillation and 0.83 for heart failure.

To assess whether CARDIAC-FM generalized across clinically relevant subgroups, we evaluated model performance stratified by age (<65 versus ≥65 years) and sex (Supplementary Fig. 3; Supplementary Table 10). CARDIAC-FM demonstrated consistent discrimination across all subgroups for both atrial fibrillation and heart failure, with AUROC differences typically <5 percentage points and no substantial effect modification by age or sex.

External Validation in CHS and MESA Cohorts

We evaluated CARDIAC-FM in two independent US prospective cohort studies: the Cardiovascular Health Study (CHS) and the Multi-Ethnic Study of Atherosclerosis (MESA). CHS enrolled 5,888 adults aged 65 years or older in 1989-1990 and 1992-1993, including predominantly White participants and a smaller proportion of African American individuals, with a mean age of 73 years at baseline. MESA enrolled 6,814 adults aged 45-84 years from four racial and ethnic groups (White, African American, Hispanic, and Chinese American) in 2000-2002, all free of clinical cardiovascular disease at enrollment, with a mean age of 62 years. Both cohorts differed from UK Biobank imaging participants (mean age 65 years) in age distribution and racial and ethnic composition but provided longitudinal surveillance for atrial fibrillation events and central physician adjudication of heart failure, enabling robust assessment of generalizability. In CHS, participants underwent standardized resting 12-lead ECGs at the baseline examination, whereas in MESA participants underwent both ECG and MRI at baseline. We therefore evaluated CARDIAC-FM (ECG only) and CARDIAC-FM (ECG + Risk Score) in both external cohorts (Fig. 3; Supplementary Tables 6-7).

Figure 3:
  • Download figure
  • Open in new tab
Figure 3: External validation of CARDIAC-FM in the (a) Cardiovascular Health Study and (b) Multi-Ethnic Study of Atherosclerosis.

Zero-shot and few-shot prediction performance for 5-year incident atrial fibrillation and heart failure, evaluated using AUROC and AUPRC. Models were pretrained on UK Biobank and externally validated on CHS and MESA. Zero-shot denotes direct application without cohort-specific training; few-shot denotes fine-tuning on 20% of the external cohort data, respectively. Error bars represent 95% confidence intervals. CARDIAC-FM (ECG + Risk Score) achieved the strongest performance across all settings.

We first evaluated model generalizability in CHS, which included 10,058 ECG recordings from 5,806 unique participants at baseline and year-7 visits (outcome rates in Supplementary Table 3). In zero-shot evaluation, models trained on UK Biobank were applied directly without fine-tuning, reflecting the most common practical scenario of applying pretrained models to new populations without additional training. Traditional risk scores were evaluated in the zero-shot setting only, as they are established models with fixed coefficients. CARDIAC-FM (ECG + Risk Score) yielded the strongest performance across both outcomes (AUROC 0.73 and AUPRC 0.33 for atrial fibrillation; AUROC 0.77 and AUPRC 0.24 for heart failure). Compared with the clinical risk scores alone, this represented a 3-point increase in AUROC and 5-point increase in AUPRC for atrial fibrillation, and a 2-point increase in AUROC and 3-point increases in AUPRC for heart failure. We next evaluated few-shot fine-tuning using 20% of CHS data. This minimal adaptation improved both AUROC and AUPRC across all configurations, with CARDIAC-FM (ECG + Risk Score) achieving the strongest performance under all conditions (AUROC 0.75 and AUPRC 0.38 for atrial fibrillation; AUROC 0.82 and AUPRC 0.31 for heart failure; Fig. 3a). This level of performance is comparable to that observed in the held-out UK Biobank test set, demonstrating that the foundation model can be efficiently adapted to new populations. Across all experiments, CARDIAC-FM consistently outperformed ECG-FM given the same inputs, demonstrating the benefit of multimodal pretraining in improving generalization.

Results in MESA were broadly consistent, with CARDIAC-FM (ECG + Risk Score) achieving the highest discrimination across all settings. In particular, CARDIAC-FM yielded absolute improvements in AUPRC of 4–5% for both outcomes under zero-shot evaluation and 5–8% with few-shot fine-tuning (Fig. 4b). Subgroup analyses stratified by age and sex demonstrated stable performance across both cohorts (Supplementary Figs. 4 and 5; Supplementary Tables 11 and 14).

Figure 4:
  • Download figure
  • Open in new tab
Figure 4: Risk stratification performance of CARDIAC-FM in external validation cohorts.

Hazard ratios for 5-year incident atrial fibrillation, heart failure, heart failure with preserved ejection fraction (HFpEF), and heart failure with reduced ejection fraction (HFrEF) in the Cardiovascular Health Study (CHS; a) and 5-year incident atrial fibrillation and heart failure in the Multi-Ethnic Study of Atherosclerosis (MESA; b). Participants were stratified into risk tertiles based on clinical risk scores alone (CHARGE-AF for atrial fibrillation; PREVENT-HF for heart failure) or CARDIAC-FM (ECG + Risk Score) under zero-shot and 20% fine-tuning conditions. Hazard ratios were estimated using univariable Cox proportional hazards models with the low-risk tertile as reference. Teal points indicate high-risk versus low-risk tertile; red points indicate intermediate-risk versus low-risk tertile. Error bars indicate 95% confidence intervals. In CHS, CARDIAC-FM outperformed clinical risk scores in zero-shot evaluation, with further improvement upon fine-tuning. In MESA, lower event rates, particularly for heart failure, resulted in wider confidence intervals; CARDIAC-FM achieved similar performance to clinical risk scores for atrial fibrillation but underperformed for heart failure in zero-shot evaluation, with comparable or superior risk stratification after fine-tuning.

We also evaluated CARDIAC-FM’s risk stratification performance in CHS and MESA (Fig. 4; Supplementary Tables 12 and 15). Using predicted risk from the ECG + Risk Score model, we stratified the test set into tertiles and estimated hazard ratios via univariable Cox regression. In CHS, zero-shot CARDIAC-FM outperformed clinical risk scores alone, with high-risk tertiles showing hazard ratios of 3.83 (95% CI:3.50–4.21) for atrial fibrillation and 4.37 (95% CI: 3.88–4.91) for heart failure versus the low-risk reference. Few-shot fine-tuning further improved discrimination: with 20% fine-tuning, hazard ratios increased to 4.47 (95% CI: 4.06–4.91) for atrial fibrillation and 6.21 (95% CI: 5.49–7.03) for heart failure. Because CHS had sufficient events for heart failure subtype analysis, we examined differential benefits of ECG integration. HFrEF showed the largest improvement, consistent with prominent electrical abnormalities from structural remodeling, whereas HFpEF demonstrated more modest gains. In MESA, lower event rates, particularly for heart failure (Supplementary Table 4), yielded wider confidence intervals. Zero-shot CARDIAC-FM achieved risk stratification similar to clinical scores for atrial fibrillation but underperformed for heart failure; fine-tuning restored comparable or superior performance across all outcomes.

To assess whether ECG-predicted MRI parameters capture prognostically relevant information, we examined associations between zero-shot CARDIAC-FM-predicted MRI parameters and incident cardiovascular events in CHS, adjusting for traditional risk factors (Supplementary Fig. 1; Supplementary Table 13). Predicted left atrial volumes (LAVmax, LAVmin) and left ventricular structural measures (LVEDV, LVESV, LVM) were positively associated with incident atrial fibrillation and heart failure, whereas predicted ejection fractions (LAEF, LVEF) showed protective associations. These patterns were more pronounced for HFrEF, with predicted ventricular volumes and mass showing hazard ratios of 1.43–1.64 and LVEF demonstrating the strongest protective association (HR ∼0.67). In contrast, HFpEF showed minimal associations across all predicted parameters. These analyses were possible because CARDIAC-FM enabled inference of MRI-derived features from ECG alone, allowing examination of cardiac structure–disease associations in a cohort without cardiac imaging. In MESA, where cardiac MRI was performed, associations between predicted and measured MRI parameters with incident cardiovascular events were largely consistent (Supplementary Fig. 2; Supplementary Table 16), validating the prognostic relevance of ECG-derived structural estimates.

Evaluation of multimodal pre-trained representations across a range of cardiovascular outcomes

We hypothesized that multimodal contrastive pre-training would yield a representation of cardiac health that can improve prediction of a broad range of cardiovascular outcomes. To test this, we leveraged the rigorously adjudicated outcomes available in CHS and MESA and evaluated CARDIAC-FM (ECG-only) on five additional endpoints: myocardial infarction, ischaemic stroke, cardioembolic stroke (the ischemic stroke subtype most closely linked with atrial fibration), cardiovascular mortality, and all-cause mortality. The event rates for these outcomes are reported in Supplementary Tables 3–4. For each outcome, we assessed prediction at both 5- and 10-year horizons, except for cardioembolic stroke where only 10-year prediction was evaluated due to insufficient events at shorter follow-up. We fine-tuned the pre-trained CARDIAC-FM and ECG-FM encoders using ECG-only input and only 20% of data from each cohort, reflecting a data-scarce setting typical of deploying foundation models to new clinical populations with limited outcome annotations.

Under minimal fine-tuning, CARDIAC-FM consistently outperformed ECG-FM in both AUROC and AUPRC across all outcomes and time horizons in both cohorts, with the exception of 5-year incident IS in MESA, where performance was comparable between the two models (Fig. 5, Supplementary Tables 8-9), with AUROC improvement ranging from 0.02 to 0.09 in CHS, and 0.00 to 0.07 in MESA. These findings indicate that multimodal pre-training with cardiac MRI encodes a broad representation of cardiac structure and function into the ECG encoder that transfers to diverse clinical endpoints with minimal task-specific supervision.

Figure 5:
  • Download figure
  • Open in new tab
Figure 5: Evaluation of pre-trained representations across a broad range of cardiovascular outcomes.

AUROC (left) and AUPRC (right) for CARDIAC-FM (blue) and ECG-FM (red) across five cardiovascular outcomes at 5- and 10-year prediction horizons in CHS (a) and MESA (b). Outcomes include myocardial infarction (MI), ischaemic stroke (IS), cardioembolic stroke (CES; 10-year only due to limited events at shorter follow-up), cardiovascular death (CV death), and all-cause death (death). Both models used ECG-only input and were fine-tuned using 20% of data from each cohort. CARDIAC-FM consistently outperformed ECG-FM across all outcomes and time horizons in both cohorts. Error bars indicate 95% confidence intervals.

Discussion

In this study, we present CARDIAC-FM, the first large-scale multimodal foundation model integrating ECG waveform data and cardiac MRI for prospective cardiovascular event prediction. A key finding is that multimodal pre-training enhances model performance even when only ECG is available at inference, demonstrating that cross-modal representation learning enriches unimodal ECG features without requiring MRI during deployment. These improvements likely reflect an enhanced capacity of the ECG encoder to capture features predictive of cardiac structure and function, information learned through alignment with MRI during pre-training that is particularly relevant to heart failure pathophysiology, without prespecifying any imaging parameters. Importantly, the benefits of multimodal pre-training extended beyond the primary endpoints of atrial fibrillation and heart failure: when evaluated on a broad range of additional cardiovascular outcomes in CHS and MESA, including myocardial infarction, ischaemic stroke, cardioembolic stroke, cardiovascular death, and all-cause death, CARDIAC-FM consistently outperformed its unimodal counterpart with minimal fine-tuning, indicating that the learned representations capture a comprehensive picture of cardiac health rather than features specific to any single disease process.

In our analyses, we compared CARDIAC-FM with established clinical risk scores that are widely used in research and, to a lesser degree, in clinical practice. There are distinct advantages to estimating cardiovascular risk from a single, widely available biological measurement such as the resting 12-lead ECG. Clinical risk scores incorporate readily available variables including age, sex, and body mass index, yet other components such as diabetes assessment or documentation of clinical cardiovascular disease involve costly laboratory or diagnostic testing that may be difficult for many patients to access. Risk prediction based on a single biological measurement that synthesizes key features of cardiac structure has the potential to mitigate bias arising from incomplete ascertainment of predictors, which often relates to socioeconomic position and access to care. More importantly, beyond a small number of common conditions, most cardiovascular diseases do not have dedicated risk prediction models. By contrast, our model can use ECG as the sole input to estimate risk for a wide range of cardiovascular outcomes. Even when only a limited number of positive samples are available, minimal task-specific fine-tuning enables the model to adapt to new disease endpoints, bypassing the need to develop separate disease-specific risk models from scratch.

Beyond prediction, CARDIAC-FM functions as a cross-modality translator, inferring MRI-derived structural markers from ECG data. Cardiac MRI is expensive, requires specialized infrastructure, and was not performed in most epidemiological cohorts. By generating MRI-derived structural estimates from routine ECG recordings, CARDIAC-FM enables population-scale investigation of cardiac structure and its relationship to clinical outcomes in cohorts that lack imaging data. We demonstrated this capability in CHS, where cardiac MRI was not acquired, and validated the prognostic relevance of the predicted structural parameters against measured MRI values in MESA.

CARDIAC-FM supports two inference modes: an ECG-only mode that is broadly deployable in routine clinical settings and a comprehensive ECG+MRI mode that maximizes predictive performance when imaging is available. Both modes can incorporate established clinical risk scores via late fusion. Across three independent cohorts, CARDIAC-FM generally outperformed both unimodal deep learning models and clinical risk scores, highlighting the robustness and clinical relevance of multimodal training. The model demonstrated strong generalizability across diverse populations, and its efficient fine-tuning paradigm enabled rapid adaptation with limited labeled data.

Our results also clarify the distinct and complementary roles of ECG and MRI in predicting different cardiovascular conditions. Raw ECG waveforms exhibit independent predictive value that extends beyond traditional ECG markers; for example, PR interval is not a strong predictor of atrial fibrillation, yet raw ECG waveforms contain richer temporal and morphological information that contributes to prediction. We further find that adding MRI on top of raw ECG contributes little additional value for atrial fibrillation prediction. A likely explanation is that atrial fibrillation is an electrical disorder, and ECG waveforms can capture early electrical instability and subclinical atrial fibrillation episodes that precede or occur without structural abnormalities on MRI. In contrast, for heart failure, multimodal learning provides clear benefits. Heart failure encompasses several phenotypes and often involves subtle, subclinical structural changes, such as early remodeling, fibrosis, or impaired contractility that are challenging to detect from ECG alone. MRI is highly sensitive to these structural features, and training with combined ECG and MRI data enhances heart failure prediction even when only ECG is available at inference. Although our current MRI model provides informative features, future improvements in imaging encoders may further strengthen heart failure prediction.

These findings suggest that multimodal foundation modelling is a promising approach to cardiovascular risk assessment. By supporting flexible inference modes, leveraging complementary information across modalities, and maintaining strong performance with minimal labeled data across a range of cardiovascular endpoints, CARDIAC-FM provides a scalable framework for integrating heterogeneous clinical data into predictive cardiovascular medicine.

Data availability

UK Biobank data are made available to researchers from research institutions as described at this site: https://www.ukbiobank.ac.uk/enable-your-research. Data for Cardiovascular Health Study and Multi-Ethnic Study of Atherosclerosis can be requested at https://internal.mesa-nhlbi.org/ and https://chs-nhlbi.org/node/6222, respectively.

Code availability

The fully trained model and weights are available for download at https://github.com/lst627/CARDIAC-FM.

Author contributions

T.Y., J.S.F. and S.D. designed the study. F.L., S.L., Y.Q., B.C., J.A.B., C.M.S and K.L.W. performed analyses, W.L. contributed data from MESA. B.M.P and W.L. contributed data from CHS. T.Y., F.L., S.L., and J.S.F. drafted the manuscript. All authors reviewed the manuscript and provided critical revision.

Online Methods

UK Biobank

The UK Biobank (UKB) is a cohort study that recruited 500,000 participants aged 40 to 69 years from the United Kingdom between 2006 and 201031. The baseline assessment included comprehensive questionnaires, blood and urine collection for biochemical measurements, anthropometry, and interviews. All participants provided electronic informed consent at the UK Biobank assessment centers. The UKB received ethics approval from the North West Multi-Centre Research Ethics Committee.

Longitudinal follow-up for health outcomes of atrial fibrillation, and heart failure was performed through linkages with national databases on hospitalizations, primary care records, and death registries. Incident atrial fibrillation events were identified from self-reports, nurse interviews, hospital admission codes, death register cause-of-death codes, and UK Biobank first occurrence data. Heart failure was identified solely from diagnosis codes and cause-of-death codes (Supplementary Table 18).

We defined binary prediction outcomes at a 5-year time horizon following the imaging visit. For each participant, we coded atrial fibrillation status as positive if the participant had no prior diagnosis of atrial fibrillation at the imaging visit and developed incident atrial fibrillation within 5 years, as negative if the participant completed 5-year follow-up without an event, and as missing otherwise. We excluded participants with prevalent atrial fibrillation at the imaging visit from atrial fibrillation prediction analyses. We defined heart failure analogously and excluded participants with prevalent disease at the imaging visit from respective analyses.

Participants whose imaging visits occurred after the UKB data censoring dates were excluded. Follow-up for this study extended through 2022. Hospital data were available until October 2022 for England, August 2022 for Scotland, and May 2022 for Wales. Death record data was censored in November 2022 across all sites.

Traditional cardiovascular risk factors were measured at the imaging visit or, when unavailable, at the most recent prior visit. Covariates included age, sex, height, weight, body mass index (BMI), systolic and diastolic blood pressure, total cholesterol, high-density lipoprotein cholesterol, estimated glomerular filtration rate (eGFR; calculated using the 2009 Chronic Kidney Disease Epidemiology Collaboration creatinine equation), diabetes mellitus, current smoking status, prevalent myocardial infarction, and current use of antihypertensive and lipid-lowering medications. These variables comprise the CHARGE-AF and American Heart Association PREVENT-HF (Predicting Risk of Cardiovascular Disease EVENTs equation for heart failure).

Cardiovascular Health Study

The Cardiovascular Health Study (CHS) is a prospective cohort study of risk factors for cardiovascular diseases in adults aged 65 years and older, recruited from four field centers in the United States (Sacramento, CA; Hagerstown, MD; Winston-Salem, NC; Pittsburgh, PA)27. Between June 1989 and June 1990, 5,201 participants were recruited from random samples of Medicare eligibility lists. An additional 687 Black participants were recruited between November 1992 and June 1993. Study participants were seen in the clinic annually between enrollment and 1998-99 and were contacted by telephone at 6-month intervals through June 2015 to collect information about health status, hospitalizations, and medication use32. The study was approved by the institutional review boards of all participating sites, and all study participants provided informed consent.

Atrial fibrillation events were identified using a combination of ECG findings from clinic visits through 1999, hospital discharge records indicating atrial fibrillation, and Medicare inpatient, outpatient, and physician (carrier) claims data. Resting 12-lead ECGs were performed at baseline and follow-up visits, with centralized readings to classify atrial fibrillation according to standardized criteria. For atrial fibrillation identified through Medicare data, diagnosis required either ECG evidence, one inpatient claim, or two outpatient or physician claims within 365 days using International Classification of Diseases, Ninth Revision (ICD-9) codes 427.31 or 427.32 in any diagnostic position.

Assessment of heart failure events in CHS has been previously described33. All potential heart failure events were identified through in-person visits or telephone interviews. These events were then adjudicated by the central events committee. Adjudication of heart failure was based on a physician diagnosis of heart failure, documented symptoms and signs of heart failure, use of medical treatment for heart failure, and supportive findings on diagnostic testing.

Methods for clinical ischemic stroke adjudication in CHS has been previously described34. In summary, an expert events committee, including neurologists from the study sites and CHS coordinating center, along with a neuroradiologist from the imaging reading center, reviewed the cases. Ischemic stroke was characterized by the sudden onset of a focal neurological deficit persisting for at least 24 hours, supported by compatible imaging findings on computed tomography scan or cardiac magnetic resonance imaging, and without evidence of intracranial hemorrhage or a nonvascular underlying cause. The adjudication committee categorized ischemic strokes into four subtypes based on the presumed infarction mechanism: cardioembolic, small vessel, large-artery atherosclerotic, or unknown/other causes.

Myocardial infarction (MI) was adjudicated to have occurred in the presence of evolving Q-wave MI or cardiac pain plus abnormal enzymes and either an evolving ST-T pattern or new left bundle branch block.

Details of surveillance methods and disease classification in CHS have been described previously35. Since participants were enrolled in the study, deaths were identified through phone calls, annual clinic visits, or local obituaries. Cause of death was adjudicated by a centralized adjudication committee, which reviewed information from obituaries, death certificates, medical records, and proxy interviews36. Cardiovascular death is defined as death from coronary artery disease.

Multi-Ethnic Study of Atherosclerosis

The Multi-Ethnic Study of Atherosclerosis (MESA) is a prospective, population-based observational cohort study of 6,814 men and women representing four racial/ethnic groups (white, African American, Hispanic, and Chinese-American), aged 45–84 years and free of clinical cardiovascular disease at enrollment28. Study participants were recruited in 2000-2 at 6 field centers in the US (Baltimore, MD; Chicago, IL; Forsyth County, NC; Los Angeles, CA; New York, NY; and St. Paul, MN). At telephone contacts every 9–12 months during follow-up, participants were asked to identify new hospitalizations and diagnoses, and medical records were obtained. Institutional review boards of all field centers approved the study protocol, and all participants gave written informed consent.

Clinically recognized atrial fibrillation was identified by an ICD hospital discharge diagnosis code (version 9: 427.31 or 427.32; version 10: I48) in any position; and for those enrolled in fee-for-service Medicare, by an inpatient, outpatient, or physician claim with an atrial fibrillation code37. A panel reviewed medical records and adjudicated heart failure events according to standardized criteria. The present study included probable or definite heart failure events. Probable heart failure required heart failure symptoms, a physician diagnosis of heart failure, and treatment. Definite heart failure required at least one objective feature of heart failure, including abnormalities on chest X-ray, echocardiography, or ventriculography.

Methods for clinical ischemic stroke adjudication in MESA has been previously described38. In brief, information about all new cardiovascular conditions, hospital admissions, cardiovascular outpatient diagnoses, treatments, and deaths were obtained. Two vascular neurologists from the MESA study events committee independently reviewed all medical records for end point classification and assignment of incidence dates. They reviewed and classified stroke as present if there was a focal neurologic deficit lasting 24 hours or until death or, if <24h, there was a clinically relevant lesion on brain imaging and there was no nonvascular cause. Patients with focal neurological deficits secondary to brain trauma, tumor, infection, or other nonvascular cause were excluded. Ischemic strokes were distinguished from hemorrhagic stroke using findings on imaging, surgery, autopsy, or some combination of these. Ischemic stroke subtypes were assigned based on an extension of the Trial of Org 10172 in Acute Stroke Treatment (TOAST) criteria to try to reduce the number classified as undetermined.

An myocardial infarction in MESA required either abnormal cardiac biomarkers (2 times upper limits of normal) regardless of pain or ECG findings; evolving Q waves regardless of pain or biomarker findings; or a combination of chest pain, ST-T evolution or new left bundle branch block, and biomarker levels 1 to 2 times upper limits of normal39.

Hard cardiovascular disease events in MESA were required to be symptomatic and included myocardial infarction (MI), resuscitated cardiac arrest, stroke, and cardiovascular death. Incident heart failure events were recorded. All deaths were identified. For potential cardiovascular deaths, cause was assigned through committee review. For other deaths, the underlying cause was obtained through state or city vital statistics departments40.

Cardiac magnetic resonance imaging

The UK Biobank Imaging Study, initiated in 2014, includes magnetic resonance imaging of the brain, abdomen, and heart41. The first 5,065 cardiac MRI scans were manually analyzed, and LA contours at end-systole and end-diastole were traced42. Bai et al43. used these annotated images to develop a fully automated convolutional network pipeline for large-scale quantification of cardiac structure and function. The algorithm’s accuracy was evaluated using technical metrics (Dice coefficient, contour distance, and Hausdorff distance) and validated by clinical experts. This publicly available algorithm was applied to derive MRI parameters, including left ventricular ejection fraction (LVEF), end-diastolic and end-systolic volumes (LVEDV, LVESV), left ventricular mass (LVM), left atrial emptying fraction (LAEF), and minimum and maximum left atrial volumes (LAVmin, LAVmax).

We preprocessed multi-view cine MRI scans (two-chamber [2CH], four-chamber [4CH], and short-axis [SAX]) before input to our multimodal AI pipeline. The 2CH and 4CH views each comprised a single slice, whereas the SAX view usually comprised 11–12 slices. Each MRI slice contained 50 time points; we subsampled every other time point to reduce computational requirements, yielding 25 time points per slice. Using segmentation masks to identify the left ventricular center, we extracted a 128×128-pixel patch from each image, removing background information while preserving the cardiac region.

We then applied standardized preprocessing to all slices to match the input format of a pretrained CNN-LSTM model20: (1) zero-padding to 210×210 pixels to ensure uniform dimensions; (2) normalization using the mean and standard deviation computed across all views within each sample; and (3) resizing to 64×64 pixels using bilinear interpolation. Our model required six slices per sample; therefore, we retained only samples containing 2CH, 4CH, and at least four SAX slices (after sampling every other slice from all SAX views) with valid anatomical segmentation masks, yielding 65,480 samples with evaluable MRI data.

Electrocardiograms

In UK Biobank, each ECG trace comprises 5,000 uniformly sampled data points per lead over 10 seconds at 500 Hz. We parsed 12-lead ECG signals, including all standard leads (I, II, III, aVR, aVL, aVF, V1–V6).

We excluded ECGs with poor quality, suspected lead reversal, undetermined rhythm, or implausible ventricular rates (<20 or >140 bpm). We also excluded ECGs showing atrial fibrillation, atrial flutter, or pacemaker activity. After quality control, 67,661 samples with valid 12-lead ECG data were retained for analysis.

To remove baseline wander, we applied locally weighted scatterplot smoothing (LOWESS) independently to each lead. For each 5,000-point trace, we fitted a LOWESS curve using the signal as the response variable and sample index as the predictor with a smoothing fraction of 0.1. The resulting smoothed curve, representing low-frequency drift, was subtracted from the raw signal to obtain baseline-corrected ECG data.

In CHS and MESA, resting 10-second 12-lead ECGs were recorded at 500 Hz using MAC PC ECG machines (Marquette Electronics, Milwaukee, WI) at baseline and selected follow-up visits. All ECGs were centrally reviewed for technical errors and quality at the Epidemiological Cardiology Research Center (EPICARE), Wake Forest University School of Medicine (Winston-Salem, NC), using General Electric 12-SL software (GE Healthcare, Milwaukee, WI) on MUSE and Magellan workstations. In the ECG signals from both the CHS and MESA cohorts, no significant baseline wander was observed. ECG signals from both cohorts showed no significant baseline wander. CHS ECG amplitudes were rescaled to align with UK Biobank reference values, whereas MESA ECG amplitudes were already comparable and required no adjustment.

Multimodal AI development

CARDIAC-FM framework

The CARDIAC-FM framework comprised a two-stage pipeline. In the first stage, modality-specific encoders extracted features from ECG and MRI inputs, which were aligned in a shared latent space using symmetric InfoNCE contrastive loss29. This objective encouraged both encoders to learn shared, cross-modal representations. We used state-of-the-art open-source models as encoders: ECG Foundation Model (ECG-FM) for ECG data and a convolutional neural network–long short-term memory (CNN-LSTM) architecture for MRI data. The ECG-FM encoder was initialized with pretrained weights and used the global representation after average pooling. The MRI encoder was initialized with weights pretrained for binary classification of left ventricular ejection fraction (LVEF ≤50% versus >50%) and used the final LSTM layer output as the representation. In the second stage, a prediction layer was added to the learned representations and the entire model was fine-tuned with supervised learning for clinical outcome prediction. The final model accepts two input configurations—ECG-only or combined ECG+MRI—enabling broad clinical applicability across settings with varying data availability.

Loss objectives for multimodal pretraining

The learned representations were trained using symmetric InfoNCE loss across modalities29. Consider a batch containing B paired samples, where each sample consists of one ECG input and one MRI input, denoted by Embedded Image The modality-specific encoders produce feature representations: Embedded Image The similarity function between representations is defined as: Embedded Image Where τ denotes the temperature parameter. The InfoNCE Loss for contrastive learning is then computed as: Embedded Image This loss is computed across ECG-MRI pairs within each batch, pulling together embeddings from paired modalities of the same subject while pushing apart embeddings from different individuals. This contrastive learning step established the foundation for subsequent fine-tuning on downstream prediction tasks.

Model architecture

The ECG-FM encoder comprised a multi-layer convolutional neural network (CNN) feature extractor and a BERT-Large transformer encoder (https://github.com/bowang-lab/ECG-FM). The CNN-LSTM encoder consisted of two DenseNet-40-12 CNNs and two LSTM encoders (https://github.com/MedAI-Vision/CMR-AI). During multimodal pretraining, only the transformer component of ECG-FM and the LSTM component of CNN-LSTM were trainable. A linear projection layer was appended to each encoder to align feature dimensions to 512; for ECG-FM, this layer reduced dimensionality from 768 to 512.

Training Details

Multimodal pretraining was performed on four RTX 2070 8GB GPUs with a batch size of 32 for 20 epochs using the AdamW optimizer (β₁ = 0.9, β₂ = 0.98, ε = 1×10⁻⁶, weight decay = 0.05). Learning rate followed a cosine annealing schedule with base rate 5×10⁻⁶ and warm-up length of 50 steps. The temperature parameter in the InfoNCE loss was trainable and initialized at 0.07.

For downstream supervised fine-tuning, all ECG encoder parameters were trainable, along with the LSTM component and projection layer of the MRI encoder. Downstream fine-tuning on UK Biobank data was performed on a single RTX 2070 8GB GPU with a batch size of 4 for 20 epochs using the AdamW optimizer (β₁ = 0.9, β₂ = 0.98, ε = 1×10⁻⁶, weight decay = 0.01) with a linear warm-up schedule (base learning rate 5×10⁻⁶, warm-up length 50 steps). Model checkpoints were selected based on the highest AUROC on the validation set. No hyperparameter tuning was performed beyond optimizer and learning rate scheduler selection.

For the risk factors in the clinical risk models, all predictors except sex contained varying degrees of missingness (approximately 1–18%). We applied multiple imputation using chained equations (MICE) before model fitting, generating four imputed datasets. When combining risk factors with the ECG or ECG+MRI model, we employed a late-fusion strategy in which established risk scores and the ECG-based or ECG+MRI-based model were integrated using a final logistic regression classifier.

For all models, we repeated training using four different random seeds, yielding four independent sets of predicted values. For the established risk scores, we derived the four sets of predictions from the four independently completed MICE-imputed datasets. We calculated confidence intervals for AUC estimates from the standard deviation across the four realizations, using the mean ± 1.96 × SD to approximate 95% intervals. These confidence intervals quantify variability due to random seed, conditional on the dataset.

Subgroup analysis

We performed subgroup analyses stratified by sex (male versus female) and age (<65 versus ≥65 years). Model performance was evaluated separately within each subgroup across the UK Biobank, Cardiovascular Health Study, and Multi-Ethnic Study of Atherosclerosis cohorts.

Statistical Analysis

Cox proportional hazards analysis across risk strata

We performed Cox regression analyses to evaluate the prognostic value of three model configurations: risk scores alone, CARDIAC-FM (ECG + risk scores), and CARDIAC-FM (ECG + MRI + risk scores). For each model and outcome (atrial fibrillation and heart failure), predicted risk scores (averaged across four random seeds) were stratified into tertiles representing high-, intermediate-, and low-risk groups, with the low-risk tertile serving as the reference. We estimated hazard ratios and 95% confidence intervals for the high- and intermediate-risk groups using univariable Cox proportional hazards models.

Cox proportional hazards analysis across MRI features

We performed Cox regression analyses to evaluate the prognostic value of cardiac MRI features, including left atrial volumes (LAVmax, LAVmin), left atrial emptying fraction (LAEF), left ventricular volumes (LVEDV, LVESV), left ventricular ejection fraction (LVEF), and left ventricular mass (LVM). We estimated hazard ratios using Cox proportional hazards models adjusted for traditional risk factors. Because these models included multiply imputed risk factors, we calculated confidence intervals using Rubin’s rules.

In UK Biobank, Cox models for atrial fibrillation and heart failure were constructed separately using two distinct sets of predictors: measured MRI features and CARDIAC-FM-predicted MRI features derived from ECG. Hazard ratios were estimated from both model sets to compare prognostic performance.

In the Cardiovascular Health Study and Multi-Ethnic Study of Atherosclerosis, Cox models were fitted for atrial fibrillation, heart failure and their respective subtypes using CARDIAC-FM-predicted MRI features derived from models trained on UK Biobank. In MESA, where cardiac MRI was performed, models using measured MRI features were also constructed, enabling direct comparison between predicted and measured structural parameters.

Acknowledgements

UK Biobank: This research has been conducted using the UK Biobank Resource under Application Number #40713, and this work uses data provided by patients and collected by the NHS as part of their care and support. The authors would like to thank the participants and researchers involved in creating and maintaining the UK Biobank as a valuable resource for health-related research.

Cardiovascular Health Study: This Cardiovascular Health Study research was supported by NHLBI contracts HHSN268201200036C, HHSN268200800007C, HHSN268201800001C, N01HC55222, N01HC85079, N01HC85080, N01HC85081, N01HC85082, N01HC85083, N01HC85086, N01HC85084, N01HC35129, R01AG15928, R01AG20098, 75N92021D00006; and NHLBI grants U01HL080295, R01HL087652, R01HL103612, R01HL105756, R01HL120393, U01HL130114, and R01HL172803 with additional contribution from the National Institute of Neurological Disorders and Stroke (NINDS). Additional support was provided through R01AG023629 from the National Institute on Aging (NIA). A full list of principal CHS investigators and institutions can be found at CHS-NHLBI.org. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Multi-Ethnic Study of Atherosclerosis: This research was supported by contracts 75N92020D00001, HHSN268201500003I, N01-HC-95159, 75N92020D00005, N01-HC-95160, 75N92020D00002, N01-HC-95161, 75N92020D00003, N01-HC-95162, 75N92020D00006, N01-HC-95163, 75N92020D00004, N01-HC-95164, 75N92020D00007, N01-HC-95165, N01-HC-95166, N01-HC-95167, N01-HC-95168 and N01-HC-95169 from the National Heart, Lung, and Blood Institute, grants UL1-TR-000040 and UL1-TR-001420 from NCATS, and UL1-RR-025005 from NCRR and R01-HL-127659, R01-HL-107577 and R01 HL127659 from NHLBI. The authors thank the other investigators, the staff, and the participants of the MESA study for their valuable contributions. A full list of participating MESA investigators and institutions can be found at http://www.mesa-nhlbi.org.

Footnotes

  • ↵# Joint last author

References

  1. (1).↵
    Ball, J.; Carrington, M. J.; McMurray, J. J. V.; Stewart, S. Atrial Fibrillation: Profile and Burden of an Evolving Epidemic in the 21st Century. Int. J. Cardiol. 2013, 167 (5), 1807–1824. doi:10.1016/j.ijcard.2012.12.093.
    OpenUrlCrossRefPubMed
  2. (2).
    Kornej, J.; Börschel, C. S.; Benjamin, E. J.; Schnabel, R. B. Epidemiology of Atrial Fibrillation in the 21st Century: Novel Methods and New Insights. Circ. Res. 2020, 127 (1), 4–20. doi:10.1161/CIRCRESAHA.120.316340.
    OpenUrlCrossRefPubMed
  3. (3).↵
    Bozkurt, B.; Ahmad, T.; Alexander, K. M.; Baker, W. L.; Bosak, K.; Breathett, K.; Fonarow, G. C.; Heidenreich, P.; Ho, J. E.; Hsich, E.; Ibrahim, N. E.; Jones, L. M.; Khan, S. S.; Khazanie, P.; Koelling, T.; Krumholz, H. M.; Khush, K. K.; Lee, C.; Morris, A. A.; Page, R. L.; Pandey, A.; Piano, M. R.; Stehlik, J.; Stevenson, L. W.; Teerlink, J. R.; Vaduganathan, M.; Ziaeian, B. Heart Failure Epidemiology and Outcomes Statistics: A Report of the Heart Failure Society of America. J. Card. Fail. 2023, 29 (10), 1412–1451. doi:10.1016/j.cardfail.2023.07.006.
    OpenUrlCrossRefPubMed
  4. (4).↵
    Gulati, G.; Upshaw, J.; Wessler, B. S.; Brazil, R. J.; Nelson, J.; van Klaveren, D.; Lundquist, C. M.; Park, J. G.; McGinnes, H.; Steyerberg, E. W.; Van Calster, B.; Kent, D. M. Generalizability of Cardiovascular Disease Clinical Prediction Models: 158 Independent External Validations of 104 Unique Models. Circ. Cardiovasc. Qual. Outcomes 2022, 15 (4), e008487. doi:10.1161/CIRCOUTCOMES.121.008487.
    OpenUrlCrossRefPubMed
  5. (5).
    Himmelreich, J. C. L.; Veelers, L.; Lucassen, W. A. M.; Schnabel, R. B.; Rienstra, M.; van Weert, H. C. P. M.; Harskamp, R. E. Prediction Models for Atrial Fibrillation Applicable in the Community: A Systematic Review and Meta-Analysis. Eur. Eur. Pacing Arrhythm. Card. Electrophysiol. J. Work. Groups Card. Pacing Arrhythm. Card. Cell. Electrophysiol. Eur. Soc. Cardiol. 2020, 22 (5), 684–694. doi:10.1093/europace/euaa005.
    OpenUrlCrossRef
  6. (6).↵
    Cho, S. M. J.; Levin, M.; Chen, R.; Samarakoon, R.; Judy, R.; Hartmann, K.; Postupaka, D.; Viscosi, V.; Farber-Eger, E.; Fahed, A. C.; Honigberg, M. C.; Patel, C. J.; Hornsby, W.; Wells, Q. S.; Damrauer, S. M.; Do, R.; Natarajan, P. AHA PREVENT Equations and Cardiovascular Disease Risk in Diverse Health Care Populations. J. Am. Coll. Cardiol. 2025, 86 (3), 181–192. doi:10.1016/j.jacc.2025.04.066.
    OpenUrlCrossRefPubMed
  7. (7).↵
    Poterucha, T. J.; Jing, L.; Ricart, R. P.; Adjei-Mosi, M.; Finer, J.; Hartzel, D.; Kelsey, C.; Long, A.; Rocha, D.; Ruhl, J. A.; vanMaanen, D.; Probst, M. A.; Daniels, B.; Joshi, S. D.; Tastet, O.; Corbin, D.; Avram, R.; Barrios, J. P.; Tison, G. H.; Chiu, I.-M.; Ouyang, D.; Volodarskiy, A.; Castillo, M.; Roedan Oliver, F. A.; Malta, P. P.; Ye, S.; Rosner, G. F.; Dizon, J. M.; Ali, S. R.; Liu, Q.; Bradley, C. K.; Vaishnava, P.; Waksmonski, C. A.; DeFilippis, E. M.; Agarwal, V.; Lebehn, M.; Kampaktsis, P. N.; Shames, S.; Beecy, A. N.; Kumaraiah, D.; Homma, S.; Schwartz, A.; Hahn, R. T.; Leon, M.; Einstein, A. J.; Maurer, M. S.; Hartman, H. S.; Hughes, J. W.; Haggerty, C. M.; Elias, P. Detecting Structural Heart Disease from Electrocardiograms Using AI. Nature 2025, 644 (8075), 221–230. doi:10.1038/s41586-025-09227-0.
    OpenUrlCrossRefPubMed
  8. (8).↵
    Brody, J. A.; Yogeswaran, V.; Wiggins, K. L.; Sitlani, C. M.; Bis, J. C.; Chen, L. Y.; Heckbert, S. R.; Lima, J. A. C.; Longstreth, W. T.; Soliman, E. Z.; Tison, G. H.; Ye, T.; Psaty, B. M.; Shojaie, A.; Floyd, J. S. Deep Learning Prediction of Left Atrial Structure and Function from 12-Lead Electrocardiograms. MedRxiv Prepr. Serv. Health Sci. 2025, 2025.09.29.25336490. doi:10.1101/2025.09.29.25336490.
    OpenUrlAbstract/FREE Full Text
  9. (9).↵
    Raghunath, S.; Pfeifer, J. M.; Ulloa-Cerna, A. E.; Nemani, A.; Carbonati, T.; Jing, L.; vanMaanen, D. P.; Hartzel, D. N.; Ruhl, J. A.; Lagerman, B. F.; Rocha, D. B.; Stoudt, N. J.; Schneider, G.; Johnson, K. W.; Zimmerman, N.; Leader, J. B.; Kirchner, H. L.; Griessenauer, C. J.; Hafez, A.; Good, C. W.; Fornwalt, B. K.; Haggerty, C. M. Deep Neural Networks Can Predict New-Onset Atrial Fibrillation From the 12-Lead ECG and Help Identify Those at Risk of Atrial Fibrillation-Related Stroke. Circulation 2021, 143 (13), 1287–1298. doi:10.1161/CIRCULATIONAHA.120.047829.
    OpenUrlCrossRefPubMed
  10. (10).↵
    Attia, Z. I.; Kapa, S.; Lopez-Jimenez, F.; McKie, P. M.; Ladewig, D. J.; Satam, G.; Pellikka, P. A.; Enriquez-Sarano, M.; Noseworthy, P. A.; Munger, T. M.; Asirvatham, S. J.; Scott, C. G.; Carter, R. E.; Friedman, P. A. Screening for Cardiac Contractile Dysfunction Using an Artificial Intelligence-Enabled Electrocardiogram. Nat. Med. 2019, 25 (1), 70–74. doi:10.1038/s41591-018-0240-2.
    OpenUrlCrossRefPubMed
  11. (11).↵
    Attia, Z. I.; Noseworthy, P. A.; Lopez-Jimenez, F.; Asirvatham, S. J.; Deshmukh, A. J.; Gersh, B. J.; Carter, R. E.; Yao, X.; Rabinstein, A. A.; Erickson, B. J.; Kapa, S.; Friedman, P. A. An Artificial Intelligence-Enabled ECG Algorithm for the Identification of Patients with Atrial Fibrillation during Sinus Rhythm: A Retrospective Analysis of Outcome Prediction. The Lancet 2019, 394 (10201), 861–867. doi:10.1016/S0140-6736(19)31721-0.
    OpenUrlCrossRefPubMed
  12. (12).
    Baek, Y.-S.; Lee, S.-C.; Choi, W.; Kim, D.-H. A New Deep Learning Algorithm of 12-Lead Electrocardiogram for Identifying Atrial Fibrillation during Sinus Rhythm. Sci. Rep. 2021, 11 (1), 12818. doi:10.1038/s41598-021-92172-5.
    OpenUrlCrossRefPubMed
  13. (13).↵
    Yuan, N.; Duffy, G.; Dhruva, S. S.; Oesterle, A.; Pellegrini, C. N.; Theurer, J.; Vali, M.; Heidenreich, P. A.; Keyhani, S.; Ouyang, D. Deep Learning of Electrocardiograms in Sinus Rhythm From US Veterans to Predict Atrial Fibrillation. JAMA Cardiol. 2023, 8 (12), 1131–1139. doi:10.1001/jamacardio.2023.3701.
    OpenUrlCrossRef
  14. (14).↵
    Raghunath, S.; Ulloa Cerna, A. E.; Jing, L.; vanMaanen, D. P.; Stough, J.; Hartzel, D. N.; Leader, J. B.; Kirchner, H. L.; Stumpe, M. C.; Hafez, A.; Nemani, A.; Carbonati, T.; Johnson, K. W.; Young, K.; Good, C. W.; Pfeifer, J. M.; Patel, A. A.; Delisle, B. P.; Alsaid, A.; Beer, D.; Haggerty, C. M.; Fornwalt, B. K. Prediction of Mortality from 12-Lead Electrocardiogram Voltage Data Using a Deep Neural Network. Nat. Med. 2020, 26 (6), 886–891. doi:10.1038/s41591-020-0870-z.
    OpenUrlCrossRef
  15. (15).
    Hughes, J. W.; Tooley, J.; Torres Soto, J.; Ostropolets, A.; Poterucha, T.; Christensen, M. K.; Yuan, N.; Ehlert, B.; Kaur, D.; Kang, G.; Rogers, A.; Narayan, S.; Elias, P.; Ouyang, D.; Ashley, E.; Zou, J.; Perez, M. V. A Deep Learning-Based Electrocardiogram Risk Score for Long Term Cardiovascular Death and Disease. NPJ Digit. Med. 2023, 6 (1), 169. doi:10.1038/s41746-023-00916-6.
    OpenUrlCrossRefPubMed
  16. (16).↵
    Lin, C.-H.; Liu, Z.-Y.; Chu, P.-H.; Chen, J.-S.; Wu, H.-H.; Wen, M.-S.; Kuo, C.-F.; Chang, T.-Y. A Multitask Deep Learning Model Utilizing Electrocardiograms for Major Cardiovascular Adverse Events Prediction. Npj Digit. Med. 2025, 8 (1), 1. doi:10.1038/s41746-024-01410-3.
    OpenUrlCrossRefPubMed
  17. (17).↵
    Oh, J.; Chung, H.; Kwon, J.; Hong, D.; Choi, E. Lead-Agnostic Self-Supervised Learning for Local and Global Representations of Electrocardiogram. arXiv March 18, 2022. doi:10.48550/arXiv.2203.06889.
    OpenUrlCrossRef
  18. (18).
    McKeen, K.; Masood, S.; Toma, A.; Rubin, B.; Wang, B. ECG-FM: An Open Electrocardiogram Foundation Model. arXiv May 30, 2025. doi:10.48550/arXiv.2408.05178.
    OpenUrlCrossRef
  19. (19).↵
    Nolin-Lapalme, A.; Sowa, A.; Delfrate, J.; Tastet, O.; Corbin, D.; Kulbay, M.; Ozdemir, D.; Noël, M.-J.; Marois-Blanchet, F.-C.; Harvey, F.; Sharma, S.; Ansari, M.; Chiu, I. M.; D’souza, V.; Friedman, S. F.; Chassé, M.; Potter, B. J.; Afilalo, J.; Elias, P. A.; Jabbour, G.; Bahani, M.; Dubé, M.-P.; Boyle, P. M.; Chatterjee, N. A.; Barrios, J.; Tison, G. H.; Ouyang, D.; Maddah, M.; Khurshid, S.; Cadrin-Tourigny, J.; Tadros, R.; Hussin, J.; Avram, R. Foundation Models for Electrocardiogram Interpretation: Clinical Implications. Eur. Heart J. 2026, ehaf1119. doi:10.1093/eurheartj/ehaf1119.
    OpenUrlCrossRef
  20. (20).↵
    Wang, Y.-R. (Joyce); Yang, K.; Wen, Y.; Wang, P.; Hu, Y.; Lai, Y.; Wang, Y.; Zhao, K.; Tang, S.; Zhang, A.; Zhan, H.; Lu, M.; Chen, X.; Yang, S.; Dong, Z.; Wang, Y.; Liu, H.; Zhao, L.; Huang, L.; Li, Y.; Wu, L.; Chen, Z.; Luo, Y.; Liu, D.; Zhao, P.; Lin, K.; Wu, J. C.; Zhao, S. Screening and Diagnosis of Cardiovascular Disease Using Artificial Intelligence-Enabled Cardiac Magnetic Resonance Imaging. Nat. Med. 2024, 30 (5), 1471–1480. doi:10.1038/s41591-024-02971-2.
    OpenUrlCrossRefPubMed
  21. (21).
    Shad, R.; Zakka, C.; Kaur, D.; Fong, R.; Filice, R. W.; Mongan, J.; Kalianos, K.; Khandwala, N.; Eng, D.; Leipzig, M.; Witschey, W.; Feria, A. de; Ferrari, V.; Ashley, E.; Acker, M. A.; Langlotz, C.; Hiesinger, W. A Generalizable Deep Learning System for Cardiac MRI. arXiv December 1, 2023. doi:10.48550/arXiv.2312.00357.
    OpenUrlCrossRef
  22. (22).
    Jacob, A. J.; Borgohain, I.; Chitiboi, T.; Sharma, P.; Comaniciu, D.; Rueckert, D. Towards a Vision Foundation Model for Comprehensive Assessment of Cardiac MRI. arXiv October 6, 2024. doi:10.48550/arXiv.2410.01665.
    OpenUrlCrossRef
  23. (23).↵
    Lai, C.; Yin, M.; Kholmovski, E. G.; Popescu, D. M.; Lu, D.-Y.; Scherer, E.; Binka, E.; Zimmerman, S. L.; Chrispin, J.; Hays, A. G.; Phelan, D. M.; Abraham, M. R.; Trayanova, N. A. Multimodal AI to Forecast Arrhythmic Death in Hypertrophic Cardiomyopathy. Nat. Cardiovasc. Res. 2025, 4 (7), 891–903. doi:10.1038/s44161-025-00679-1.
    OpenUrlCrossRefPubMed
  24. (24).
    Radhakrishnan, A.; Friedman, S. F.; Khurshid, S.; Ng, K.; Batra, P.; Lubitz, S. A.; Philippakis, A. A.; Uhler, C. Cross-Modal Autoencoder Framework Learns Holistic Representations of Cardiovascular State. Nat. Commun. 2023, 14 (1), 2436. doi:10.1038/s41467-023-38125-0.
    OpenUrlCrossRef
  25. (25).↵
    Soto, J. T.; Weston Hughes, J.; Sanchez, P. A.; Perez, M.; Ouyang, D.; Ashley, E. A. Multimodal Deep Learning Enhances Diagnostic Precision in Left Ventricular Hypertrophy. Eur. Heart J. Digit. Health 2022, 3 (3), 380–389. doi:10.1093/ehjdh/ztac033.
    OpenUrlCrossRefPubMed
  26. (26).↵
    Vukadinovic, M.; Chiu, I.-M.; Tang, X.; Yuan, N.; Chen, T.-Y.; Cheng, P.; Li, D.; Cheng, S.; He, B.; Ouyang, D. Comprehensive Echocardiogram Evaluation with View Primed Vision Language AI. Nature 2025, 1–3. doi:10.1038/s41586-025-09850-x.
    OpenUrlCrossRef
  27. (27).↵
    Fried, L. P.; Borhani, N. O.; Enright, P.; Furberg, C. D.; Gardin, J. M.; Kronmal, R. A.; Kuller, L. H.; Manolio, T. A.; Mittelmark, M. B.; Newman, A. The Cardiovascular Health Study: Design and Rationale. Ann. Epidemiol. 1991, 1 (3), 263–276. doi:10.1016/1047-2797(91)90005-w.
    OpenUrlCrossRefPubMed
  28. (28).↵
    Bild, D. E.; Bluemke, D. A.; Burke, G. L.; Detrano, R.; Diez Roux, A. V.; Folsom, A. R.; Greenland, P.; Jacob, D. R.; Kronmal, R.; Liu, K.; Nelson, J. C.; O’Leary, D.; Saad, M. F.; Shea, S.; Szklo, M.; Tracy, R. P. Multi-Ethnic Study of Atherosclerosis: Objectives and Design. Am. J. Epidemiol. 2002, 156 (9), 871–881. doi:10.1093/aje/kwf113.
    OpenUrlCrossRefPubMedWeb of Science
  29. (29).↵
    Oord, A. van den; Li, Y.; Vinyals, O. Representation Learning with Contrastive Predictive Coding. arXiv January 22, 2019. doi:10.48550/arXiv.1807.03748.
    OpenUrlCrossRef
  30. (30).↵
    Alonso, A.; Krijthe, B. P.; Aspelund, T.; Stepas, K. A.; Pencina, M. J.; Moser, C. B.; Sinner, M. F.; Sotoodehnia, N.; Fontes, J. D.; Janssens, A. C. J. W.; Kronmal, R. A.; Magnani, J. W.; Witteman, J. C.; Chamberlain, A. M.; Lubitz, S. A.; Schnabel, R. B.; Agarwal, S. K.; McManus, D. D.; Ellinor, P. T.; Larson, M. G.; Burke, G. L.; Launer, L. J.; Hofman, A.; Levy, D.; Gottdiener, J. S.; Kääb, S.; Couper, D.; Harris, T. B.; Soliman, E. Z.; Stricker, B. H. C.; Gudnason, V.; Heckbert, S. R.; Benjamin, E. J. Simple Risk Model Predicts Incidence of Atrial Fibrillation in a Racially and Geographically Diverse Population: The CHARGE-AF Consortium. J. Am. Heart Assoc. 2013, 2 (2), e000102. doi:10.1161/JAHA.112.000102.
    OpenUrlAbstract/FREE Full Text
  31. (31).↵
    Littlejohns, T. J.; Sudlow, C.; Allen, N. E.; Collins, R. UK Biobank: Opportunities for Cardiovascular Research. Eur. Heart J. 2019, 40 (14), 1158–1166. doi:10.1093/eurheartj/ehx254.
    OpenUrlCrossRefPubMed
  32. (32).↵
    Psaty, B. M.; Delaney, J. A.; Arnold, A. M.; Curtis, L. H.; Fitzpatrick, A. L.; Heckbert, S. R.; McKnight, B.; Ives, D.; Gottdiener, J. S.; Kuller, L. H.; Longstreth, W. T. Study of Cardiovascular Health Outcomes in the Era of Claims Data: The Cardiovascular Health Study. Circulation 2016, 133 (2), 156–164. doi:10.1161/CIRCULATIONAHA.115.018610.
    OpenUrlAbstract/FREE Full Text
  33. (33).↵
    Gottdiener, J. S.; McClelland, R. L.; Marshall, R.; Shemanski, L.; Furberg, C. D.; Kitzman, D. W.; Cushman, M.; Polak, J.; Gardin, J. M.; Gersh, B. J.; Aurigemma, G. P.; Manolio, T. A. Outcome of Congestive Heart Failure in Elderly Persons: Influence of Left Ventricular Systolic Function. The Cardiovascular Health Study. Ann. Intern. Med. 2002, 137 (8), 631–639. doi:10.7326/0003-4819-137-8-200210150-00006.
    OpenUrlCrossRefPubMedWeb of Science
  34. (34).↵
    Longstreth, W. T.; Bernick, C.; Fitzpatrick, A.; Cushman, M.; Knepper, L.; Lima, J.; Furberg, C. D. Frequency and Predictors of Stroke Death in 5,888 Participants in the Cardiovascular Health Study. Neurology 2001, 56 (3), 368–375. doi:10.1212/wnl.56.3.368.
    OpenUrlCrossRefPubMed
  35. (35).↵
    Ives, D. G.; Fitzpatrick, A. L.; Bild, D. E.; Psaty, B. M.; Kuller, L. H.; Crowley, P. M.; Cruise, R. G.; Theroux, S. Surveillance and Ascertainment of Cardiovascular Events. The Cardiovascular Health Study. Ann. Epidemiol. 1995, 5 (4), 278–285. doi:10.1016/1047-2797(94)00093-9.
    OpenUrlCrossRefPubMed
  36. (36).↵
    Newman, A. B.; Sachs, M. C.; Arnold, A. M.; Fried, L. P.; Kronmal, R.; Cushman, M.; Psaty, B. M.; Harris, T. B.; Robbins, J. A.; Burke, G. L.; Kuller, L. H.; Lumley, T. Total and Cause-Specific Mortality in the Cardiovascular Health Study. J. Gerontol. A. Biol. Sci. Med. Sci. 2009, 64 (12), 1251–1261. doi:10.1093/gerona/glp127.
    OpenUrlCrossRefPubMed
  37. (37).↵
    Heckbert, S. R.; Wiggins, K. L.; Blackshear, C.; Yang, Y.; Ding, J.; Liu, J.; McKnight, B.; Alonso, A.; Austin, T. R.; Benjamin, E. J.; Curtis, L. H.; Sotoodehnia, N.; Correa, A. Pericardial Fat Volume and Incident Atrial Fibrillation in the Multi-Ethnic Study of Atherosclerosis and Jackson Heart Study. Obes. Silver Spring Md 2017, 25 (6), 1115–1121. doi:10.1002/oby.21835.
    OpenUrlCrossRef
  38. (38).↵
    Longstreth, W. T.; Gasca, N. C.; Gottesman, R. F.; Pearce, J. B.; Sacco, R. L. Adjudication of Transient Ischemic Attack and Stroke in the Multi-Ethnic Study of Atherosclerosis. Neuroepidemiology 2018, 50 (1–2), 23–28. doi:10.1159/000486174.
    OpenUrlCrossRefPubMed
  39. (39).↵
    McClelland, R. L.; Jorgensen, N. W.; Budoff, M.; Blaha, M. J.; Post, W. S.; Kronmal, R. A.; Bild, D. E.; Shea, S.; Liu, K.; Watson, K. E.; Folsom, A. R.; Khera, A.; Ayers, C.; Mahabadi, A.-A.; Lehmann, N.; Jöckel, K.-H.; Moebus, S.; Carr, J. J.; Erbel, R.; Burke, G. L. 10-Year Coronary Heart Disease Risk Prediction Using Coronary Artery Calcium and Traditional Risk Factors: Derivation in the MESA (Multi-Ethnic Study of Atherosclerosis) With Validation in the HNR (Heinz Nixdorf Recall) Study and the DHS (Dallas Heart Study). J. Am. Coll. Cardiol. 2015, 66 (15), 1643–1653. doi:10.1016/j.jacc.2015.08.035.
    OpenUrlFREE Full Text
  40. (40).↵
    Redheuil, A.; Wu, C. O.; Kachenoura, N.; Ohyama, Y.; Yan, R. T.; Bertoni, A. G.; Hundley, G. W.; Duprez, D. A.; Jacobs, D. R.; Daniels, L. B.; Darwin, C.; Sibley, C.; Bluemke, D. A.; Lima, J. A. C. Proximal Aortic Distensibility Is an Independent Predictor of All-Cause Mortality and Incident CV Events: The MESA Study. J. Am. Coll. Cardiol. 2014, 64 (24), 2619–2629. doi:10.1016/j.jacc.2014.09.060.
    OpenUrlFREE Full Text
  41. (41).↵
    Littlejohns, T. J.; Holliday, J.; Gibson, L. M.; Garratt, S.; Oesingmann, N.; Alfaro-Almagro, F.; Bell, J. D.; Boultwood, C.; Collins, R.; Conroy, M. C.; Crabtree, N.; Doherty, N.; Frangi, A. F.; Harvey, N. C.; Leeson, P.; Miller, K. L.; Neubauer, S.; Petersen, S. E.; Sellors, J.; Sheard, S.; Smith, S. M.; Sudlow, C. L. M.; Matthews, P. M.; Allen, N. E. The UK Biobank Imaging Enhancement of 100,000 Participants: Rationale, Data Collection, Management and Future Directions. Nat. Commun. 2020, 11 (1), 2624. doi:10.1038/s41467-020-15948-9.
    OpenUrlCrossRefPubMed
  42. (42).↵
    Petersen, S. E.; Aung, N.; Sanghvi, M. M.; Zemrak, F.; Fung, K.; Paiva, J. M.; Francis, J. M.; Khanji, M. Y.; Lukaschuk, E.; Lee, A. M.; Carapella, V.; Kim, Y. J.; Leeson, P.; Piechnik, S. K.; Neubauer, S. Reference Ranges for Cardiac Structure and Function Using Cardiovascular Magnetic Resonance (CMR) in Caucasians from the UK Biobank Population Cohort. J. Cardiovasc. Magn. Reson. Off. J. Soc. Cardiovasc. Magn. Reson. 2017, 19 (1), 18. doi:10.1186/s12968-017-0327-9.
    OpenUrlCrossRefPubMed
  43. (43).↵
    Bai, W.; Sinclair, M.; Tarroni, G.; Oktay, O.; Rajchl, M.; Vaillant, G.; Lee, A. M.; Aung, N.; Lukaschuk, E.; Sanghvi, M. M.; Zemrak, F.; Fung, K.; Paiva, J. M.; Carapella, V.; Kim, Y. J.; Suzuki, H.; Kainz, B.; Matthews, P. M.; Petersen, S. E.; Piechnik, S. K.; Neubauer, S.; Glocker, B.; Rueckert, D. Automated Cardiovascular Magnetic Resonance Image Analysis with Fully Convolutional Networks. J. Cardiovasc. Magn. Reson. 2018, 20 (1), 65. doi:10.1186/s12968-018-0471-x. rial fibrillation, heart failure and their respective subtypes using CARDIAC-FM-predicted MRI features derived from models trained on UK Biobank. In MESA, where cardiac MRI was performed, models using measured MRI features were also constructed, enabling direct comparison between predicted and measured structural parameters.
    OpenUrlCrossRefPubMed
Back to top
PreviousNext
Posted March 18, 2026.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
CARDIAC-FM: A Multimodal Foundation Model for Cardiovascular Risk Prediction Using ECG and Cardiac MRI
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
CARDIAC-FM: A Multimodal Foundation Model for Cardiovascular Risk Prediction Using ECG and Cardiac MRI
Fumin Li, Siting Li, Yuhan Qian, Bojun Chen, Jennifer A Brody, Vidhushei Yogeswaran, Kerri L Wiggins, Colleen M Sitlani, Joshua C Bis, Ali Shojaie, W T Longstreth Jr, Bruce M Psaty, Geoffrey H Tison, Simon Du, James S Floyd, Ting Ye
medRxiv 2026.03.16.26348526; doi: https://doi.org/10.64898/2026.03.16.26348526
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
CARDIAC-FM: A Multimodal Foundation Model for Cardiovascular Risk Prediction Using ECG and Cardiac MRI
Fumin Li, Siting Li, Yuhan Qian, Bojun Chen, Jennifer A Brody, Vidhushei Yogeswaran, Kerri L Wiggins, Colleen M Sitlani, Joshua C Bis, Ali Shojaie, W T Longstreth Jr, Bruce M Psaty, Geoffrey H Tison, Simon Du, James S Floyd, Ting Ye
medRxiv 2026.03.16.26348526; doi: https://doi.org/10.64898/2026.03.16.26348526

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Cardiovascular Medicine
Subject Areas
All Articles
  • Addiction Medicine (576)
  • Allergy and Immunology (867)
  • Anesthesia (306)
  • Cardiovascular Medicine (4480)
  • Dentistry and Oral Medicine (449)
  • Dermatology (384)
  • Emergency Medicine (614)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1528)
  • Epidemiology (15271)
  • Forensic Medicine (31)
  • Gastroenterology (1133)
  • Genetic and Genomic Medicine (6643)
  • Geriatric Medicine (670)
  • Health Economics (1005)
  • Health Informatics (4597)
  • Health Policy (1378)
  • Health Systems and Quality Improvement (1621)
  • Hematology (544)
  • HIV/AIDS (1273)
  • Infectious Diseases (except HIV/AIDS) (15955)
  • Intensive Care and Critical Care Medicine (1110)
  • Medical Education (624)
  • Medical Ethics (147)
  • Nephrology (673)
  • Neurology (6685)
  • Nursing (346)
  • Nutrition (1005)
  • Obstetrics and Gynecology (1152)
  • Occupational and Environmental Health (961)
  • Oncology (3365)
  • Ophthalmology (987)
  • Orthopedics (370)
  • Otolaryngology (421)
  • Pain Medicine (437)
  • Palliative Medicine (131)
  • Pathology (667)
  • Pediatrics (1701)
  • Pharmacology and Therapeutics (698)
  • Primary Care Research (717)
  • Psychiatry and Clinical Psychology (5489)
  • Public and Global Health (9283)
  • Radiology and Imaging (2219)
  • Rehabilitation Medicine and Physical Therapy (1375)
  • Respiratory Medicine (1201)
  • Rheumatology (598)
  • Sexual and Reproductive Health (720)
  • Sports Medicine (535)
  • Surgery (717)
  • Toxicology (100)
  • Transplantation (290)
  • Urology (266)