Multimodal prediction of visual improvement in diabetic macular edema using real-world electronic health records and optical coherence tomography images

Siqi Sun, Cindy X. Cai, Ruochong Fan, Saiyu You, Diep Tran, P. Kumar Rao, Marc A. Suchard, Yixin Wang, Cecilia S. Lee, Aaron Y. Lee, Linying Zhang
doi: https://doi.org/10.64898/2026.04.23.26351616
Siqi Sun
1 Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, MO
Cindy X. Cai
2 Wilmer Eye Institute, Johns Hopkins School of Medicine, Baltimore, MD
3 Department of Biomedical Informatics and Data Science, Johns Hopkins School of Medicine, Baltimore, MD
Ruochong Fan
1 Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, MO
Saiyu You
1 Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, MO
Diep Tran
2 Wilmer Eye Institute, Johns Hopkins School of Medicine, Baltimore, MD
P. Kumar Rao
4 John F. Hardesty MD, Department of Ophthalmology and Visual Sciences, Washington University in St. Louis, St. Louis, MO
Marc A. Suchard
5 Department of Biostatistics, University of California, Los Angeles, Los Angeles, CA
Yixin Wang
6 Department of Statistics, University of Michigan, Ann Arbor, Ann Arbor, MI
Cecilia S. Lee
4 John F. Hardesty MD, Department of Ophthalmology and Visual Sciences, Washington University in St. Louis, St. Louis, MO
Aaron Y. Lee
4 John F. Hardesty MD, Department of Ophthalmology and Visual Sciences, Washington University in St. Louis, St. Louis, MO
Linying Zhang
1 Institute for Informatics, Data Science and Biostatistics, Washington University in St. Louis, St. Louis, MO
7 Department of Medicine, Washington University in St. Louis, St. Louis, MO
For correspondence: linyingz{at}wustl.edu

Abstract

Multimodal learning has the potential to improve clinical prediction by integrating complementary data sources, but the incremental value of imaging beyond structured electronic health record (EHR) data remains unclear in real-world settings. We developed a multimodal survival modeling framework integrating optical coherence tomography (OCT) and EHR data to predict time to visual improvement in patients with diabetic macular edema (DME), and evaluated how different ophthalmic foundation model representations contribute to prognostic performance.

In a retrospective cohort of 973 patients (1,450 eyes) receiving anti-vascular endothelial growth factor therapy, we compared multimodal models combining 22,227 EHR variables with 196,402 OCT images, with OCT embeddings derived from three ophthalmic foundation models (RETFound, EyeCLIP, and VisionFM). The EHR-only model showed minimal prognostic discrimination (C-index 0.50 [95% CI, 0.45–0.55]). Incorporating OCT improved performance, with the magnitude of improvement depending on the representation. EHR+RETFound achieved the strongest performance (C-index 0.59 [0.54–0.65]), followed by EHR+EyeCLIP (0.57 [0.52–0.62]) and EHR+VisionFM (0.56 [0.51–0.61]). Multimodal models, particularly EHR+RETFound, demonstrated improved risk stratification with clearer separation of Kaplan–Meier curves.
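
As context for the C-index values reported above, the following is a minimal sketch of Harrell's concordance index in plain Python. It is an illustration only, not the authors' evaluation code: the function name `concordance_index`, the toy data, and the convention that a higher score means earlier predicted visual improvement are assumptions made here. Pairs with tied event times are skipped for simplicity.

```python
def concordance_index(times, events, scores):
    """Harrell's C-index for right-censored data (illustrative sketch).

    times  : follow-up time for each eye
    events : 1 if the event (here, visual improvement) was observed, 0 if censored
    scores : model output; higher is assumed to mean the event is expected sooner
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored subject cannot be the "earlier event" of a pair
        for j in range(n):
            # A pair is comparable when i's observed event precedes j's time.
            if times[i] < times[j]:
                comparable += 1
                if scores[i] > scores[j]:
                    concordant += 1.0   # ranking agrees with the observed order
                elif scores[i] == scores[j]:
                    concordant += 0.5   # tied scores count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy data (hypothetical): four eyes, the third one censored.
times = [1, 2, 3, 4]
events = [1, 1, 0, 1]

perfect = concordance_index(times, events, [4, 3, 2, 1])
reversed_ = concordance_index(times, events, [1, 2, 3, 4])
constant = concordance_index(times, events, [0, 0, 0, 0])
print(perfect, reversed_, constant)
```

A constant score yields 0.5, which is the chance-level value against which the EHR-only result (0.50) and the multimodal results (0.56 to 0.59) can be read.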

Partial information decomposition revealed that prognostic information was dominated by modality-specific contributions, with OCT and EHR providing largely distinct signals and minimal shared information. The magnitude of OCT-specific contribution varied across foundation models and aligned with observed performance differences.
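
To make the terms "modality-specific" and "shared" concrete, here is a toy sketch of a partial information decomposition for two discrete sources and one target. It uses the Williams-Beer minimum-specific-information redundancy measure, which is an assumption chosen for simplicity; the abstract does not state which estimator the authors used, and real OCT embeddings and EHR features would require a different, continuous-data estimator. All function names and the two toy distributions are hypothetical.

```python
from collections import defaultdict
from math import log2


def marginal(joint, idx):
    """Marginalise a joint pmf {(x1, x2, y): p} onto the positions in idx."""
    out = defaultdict(float)
    for outcome, p in joint.items():
        out[tuple(outcome[i] for i in idx)] += p
    return dict(out)


def mutual_information(joint, src_idx):
    """I(X_src; Y) in bits, with Y stored at position 2."""
    p_s = marginal(joint, src_idx)
    p_y = marginal(joint, (2,))
    p_sy = marginal(joint, src_idx + (2,))
    total = 0.0
    for key, p in p_sy.items():
        if p > 0:
            s, y = key[:-1], key[-1:]
            total += p * log2(p / (p_s[s] * p_y[y]))
    return total


def specific_information(joint, src_idx, y):
    """I(Y=y; X_src) = sum_x p(x|y) * log2(p(y|x) / p(y))."""
    p_y = marginal(joint, (2,))[(y,)]
    p_s = marginal(joint, src_idx)
    p_sy = marginal(joint, src_idx + (2,))
    total = 0.0
    for key, p in p_sy.items():
        if key[-1] == y and p > 0:
            total += (p / p_y) * log2((p / p_s[key[:-1]]) / p_y)
    return total


def pid_williams_beer(joint):
    """Split I(X1, X2; Y) into shared, unique, and synergistic parts."""
    p_y = marginal(joint, (2,))
    # Redundancy: expected minimum specific information over the two sources.
    shared = sum(
        p * min(specific_information(joint, (0,), y),
                specific_information(joint, (1,), y))
        for (y,), p in p_y.items()
    )
    unique_x1 = mutual_information(joint, (0,)) - shared
    unique_x2 = mutual_information(joint, (1,)) - shared
    synergy = mutual_information(joint, (0, 1)) - shared - unique_x1 - unique_x2
    return {"shared": shared, "unique_x1": unique_x1,
            "unique_x2": unique_x2, "synergy": synergy}


# Toy case 1: Y copies X1 and X2 is independent noise, so all information is unique to X1.
copy_x1 = {(a, b, a): 0.25 for a in (0, 1) for b in (0, 1)}
# Toy case 2: Y = X1 XOR X2, so information is available only from both sources jointly.
xor = {(a, b, a ^ b): 0.25 for a in (0, 1) for b in (0, 1)}

print(pid_williams_beer(copy_x1))
print(pid_williams_beer(xor))
```

In the first toy case the decomposition assigns everything to one source and nothing to the shared term, which is the qualitative pattern the abstract describes for OCT and EHR (largely distinct signals, minimal shared information).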

These findings indicate that OCT provides complementary prognostic value beyond structured clinical data, but gains are modest and depend strongly on representation choice. Our results highlight both the promise of multimodal modeling for personalized prognosis and the need for rigorous, context-specific evaluation of foundation models in real-world clinical settings.

Competing Interest Statement

C.X.C. receives equipment support from Optomed USA, Inc outside the scope of this work. M.A.S. receives contracts and grants from US Department of Veterans Affairs, Johnson & Johnson and Gilead Sciences outside the scope of this work. A.Y.L. reports personal fees from Astellas, personal fees from Genentech, personal fees from Johnson and Johnson, personal fees from Alcon, personal fees from Apellis, non-financial support from iCareWorld, non-financial support from Topcon, grants and non-financial support from Carl Zeiss Meditec, non-financial support from Optomed, non-financial support from Heidelberg, non-financial support from Microsoft, non-financial support from Amazon, and non-financial support from Meta outside the submitted work. The remaining authors declare no competing interests.

Author Contributions

S.S. contributed to conceptualization, methodology, analysis and interpretation of results, and manuscript writing. C.X.C. contributed to data curation of visual acuity data and manuscript review and editing. R.F. contributed to data extraction and manuscript editing. S.Y. contributed to OCT image extraction and data curation. D.T. contributed to data curation of visual acuity data. P.K.R., C.S.L., and A.Y.L. contributed to clinical interpretation and manuscript review. M.A.S. and Y.W. contributed to methodology and manuscript review and editing. L.Z. contributed to conceptualization, methodology, supervision, and manuscript review and editing. All authors reviewed and approved the final manuscript.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The study was conducted in accordance with the Declaration of Helsinki and was approved by the Institutional Review Board of Washington University School of Medicine in St. Louis (IRB No. 202502158). A waiver of informed consent was granted because of the retrospective design and use of de-identified data.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Posted April 24, 2026.

Subject Area

  • Health Informatics