Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Physician-level classification performance across multiple imaging domains with a diagnostic medical foundation model and a large dataset of annotated medical images

View ORCID ProfileAlexander Henry Thieme, Tahir Miri, Alexandre R Marra, Takaaki Kobayashi, Guillermo Rodriguez-Nava, View ORCID ProfileYiheng Li, Thomas Barba, Ahmet Görkem Er, Justus Benzler, Maximilian Gertler, Mareike Riechers, Christian Hinze, View ORCID ProfileYuanning Zheng, Konstantin Pelz, Divya Nagaraj, Alexa Chen, Anastassia Löser, Alexander Rühle, Constantinos Zamboglou, Lujain Alyahya, Maximilian Uhlig, Gautam Machiraju, Kuba Weimann, View ORCID ProfileChristoph Lippert, Tim Conrad, Jackie Ma, Roberto Novoa, Michael Moor, Tina Hernandez-Boussard, Mohammed Alawad, Jorge L. Salinas, Mirja Mittermaier, View ORCID ProfileOlivier Gevaert
doi: https://doi.org/10.1101/2025.05.30.25328646
Alexander Henry Thieme
1Stanford Center for Biomedical Informatics Research (BMIR), Department of Medicine, Stanford University, Stanford, CA 94305, United States, Department of Radiation Oncology, Charité - Universitätsmedizin Berlin, Berlin, Germany and Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Telephone: +1 650 518 8715
MD MSc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Alexander Henry Thieme
  • For correspondence: thieme{at}stanford.edu
Tahir Miri
2Hasso Plattner Institute (HPI), Department of Digital Health - Machine Learning, 14482 Potsdam, Germany and Stanford University, Center for Biomedical Informatics Research (BMIR), 3180 Porter Dr, Palo Alto, CA 94304, USA
MSc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: tahir.miriyev{at}hpi.de
Alexandre R Marra
3Faculdade Israelita de Ciências da Saúde Albert Einstein, Hospital Israelita Albert Einstein, São Paulo, SP, Brazil and Department of Internal Medicine, University of Iowa Carver College of Medicine, Iowa City, IA, USA
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: alexandre-rodriguesmarra{at}uiowa.edu
Takaaki Kobayashi
4Division of Infectious Diseases, Department of Internal Medicine, University of Kentucky College of Medicine, Lexington, KY, USA
MD MPH
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: TKO240{at}uky.edu
Guillermo Rodriguez-Nava
5Division of Infectious Diseases & Geographic Medicine, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: guiro{at}stanford.edu
Yiheng Li
6Stanford Center for Biomedical Informatics Research (BMIR), Department of Medicine, Stanford University, Stanford, CA 94305
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yiheng Li
  • For correspondence: yyhhli{at}stanford.edu
Thomas Barba
7Department of Internal Medicine, Edouard Herriot Hospital, Lyon University Hospital, Lyon, France
MD PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: thomas.barba{at}chu-lyon.fr
Ahmet Görkem Er
8Stanford Center for Biomedical Informatics Research (BMIR), Department of Medicine, Stanford University, Stanford, CA, 94305, USA; Department of Health Informatics, Graduate School of Informatics, Middle East Technical University, 06800, Ankara, Turkey; Department of Infectious Diseases and Clinical Microbiology, Hacettepe University Faculty of Medicine, 06230, Ankara, Turkey
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: ahmetgorkemer{at}gmail.com
Justus Benzler
9Dept of Infectious Disease Epidemiology, Robert Koch Institute, Berlin, Germany
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: benzlerj{at}rki.de
Maximilian Gertler
10Institute of International Health, Charité Centre Global Health, Charité - Universitätsmedizin Berlin, Berlin, Germany
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: maximilian.gertler{at}charite.de
Mareike Riechers
11Department of Nephrology and Hypertension Hannover Medical School, Hannover, Germany
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: riechers.mareike{at}mh-hannover.de
Christian Hinze
12Department of Nephrology and Hypertension, Hannover Medical School, Hannover, Germany
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: hinze.christian{at}mh-hannover.de
Yuanning Zheng
13Stanford Center for Biomedical Informatics Research (BMIR), Department of Medicine, Stanford University, Stanford, CA 94305
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yuanning Zheng
  • For correspondence: eric2021{at}stanford.edu
Konstantin Pelz
14Data Science in Systems Biology, TUM School of Life Sciences, Technical University of Munich, 85354 Freising, Germany
MSc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: konstantin.pelz{at}tum.de
Divya Nagaraj
15Department of Computer Science, Stanford University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: dnagaraj{at}stanford.edu
Alexa Chen
16Stanford Center for Biomedical Informatics Research (BMIR), Department of Medicine, Stanford University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: achen333{at}stanford.edu
Anastassia Löser
17Department of Radiotherapy, University Medical Center Schleswig-Holstein (Campus Lübeck), Lübeck, Germany
MD PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: anastassia.loeser{at}uksh.de
Alexander Rühle
18Department of Radiation Oncology, University of Leipzig Medical Center, Leipzig, Germany and Department of Radiation Oncology, Medical Center – University of Freiburg, Faculty of Medicine, University of Freiburg, German Cancer Consortium (DKTK), partner site DKTK-Freiburg, Germany
MD MHBA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: alexander.ruehle{at}medizin.uni-leipzig.de
Constantinos Zamboglou
19German Oncology Center, European University Cyprus, Limassol, Cyprus and Department of Radiation Oncology, Medical Center – University of Freiburg, Faculty of Medicine, University of Freiburg, German Cancer Consortium (DKTK), partner site DKTK-Freiburg, Germany
MD MHBA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: constantinos.zamboglou{at}goc.com.cy
Lujain Alyahya
20National Center for AI (NCAI), Saudi Data and AI Authority (SDAIA) Riyadh, Saudi Arabia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: lyahya{at}sdaia.gov.sa
Maximilian Uhlig
21Department of Hematology, Oncology and Rheumatology, Heidelberg University Hospital
MD MSc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: maximilian.uhlig{at}med.uni-heidelberg.de
Gautam Machiraju
22Department of Biomedical Data Science, Stanford University
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: gmachi{at}stanford.edu
Kuba Weimann
23Department of Visual and Data-Centric Computing, Zuse Institute Berlin, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: weimann{at}zib.de
Christoph Lippert
24Digital Health & Machine Learning, Hasso Plattner Institute, University of Potsdam, Potsdam, Germany, Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Christoph Lippert
  • For correspondence: christoph.lippert{at}hpi.de
Tim Conrad
25Department of Visual and Data-Centric Computing, Zuse Institute Berlin, Berlin, Germany
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: conrad{at}zib.de
Jackie Ma
26Department of Artificial Intelligence, Fraunhofer Heinrich Hertz Institute, Berlin, Germany
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: jackie.ma{at}hhi.fraunhofer.de
Roberto Novoa
27Department of Pathology, Stanford University, Stanford, USA
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: rnovoa{at}stanford.edu
Michael Moor
28Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
MD PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: michael.moor{at}bsse.ethz.ch
Tina Hernandez-Boussard
29Department of Medicine, Stanford University, Stanford, CA, USA, and Stanford Center for Biomedical Informatics Research (BMIR), Department of Biomedical Data Science, Stanford University, Stanford, USA, and Department of Surgery, Stanford University, Stanford, CA, USA
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: boussard{at}stanford.edu
Mohammed Alawad
30National Center for AI (NCAI), Saudi Data and AI Authority (SDAIA) Riyadh, Saudi Arabia
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: malawad{at}sdaia.gov.sa
Jorge L. Salinas
31Division of Infectious Diseases & Geographic Medicine, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: jlsalinas{at}stanford.edu
Mirja Mittermaier
32Berlin Institute of Health (BIH), 10178, Berlin, Germany. Department of Infectious Diseases, Respiratory Medicine and Critical Care, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: mirja.mittermaier{at}charite.de
Olivier Gevaert
33Stanford Center for Biomedical Informatics Research (BMIR), Department of Medicine, Stanford University, Stanford, CA 94305
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Olivier Gevaert
  • For correspondence: ogevaert{at}stanford.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

A diagnostic medical foundation model (MedFM) is an artificial intelligence (AI) system engineered to accurately determine diagnoses across various medical imaging modalities and specialties. To train MedFM, we created the PubMed Central Medical Images Dataset (PMCMID), the largest annotated medical image dataset to date, comprising 16,126,659 images from 3,021,780 medical publications. Using AI- and ontology-based methods, we identified 4,482,237 medical images (e.g., clinical photos, X-rays, ultrasounds) and generated comprehensive annotations. To optimize MedFM’s performance and assess biases, 13,266 images were manually annotated to establish a multimodal benchmark. MedFM achieved physician-level performance in diagnosis tasks spanning radiology, dermatology, and infectious diseases without requiring specific training. Additionally, we developed the Image2Paper app, allowing clinicians to upload medical images and retrieve relevant literature. The correct diagnoses appeared within the top ten results in 88.4% and at least one relevant differential diagnosis in 93.0%. MedFM and PMCMID were made publicly available.

Funding Research reported here was partially supported by the National Cancer Institute (NCI) (R01 CA260271), the Saudi Company for Artificial Intelligence (SCAI) Authority, and the German Federal Ministry for Economic Affairs and Climate Action (BMWK) under the project DAKI-FWS (01MK21009E). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

Research reported here was partially supported by the National Cancer Institute (NCI) (R01 CA260271), the Saudi Company for Artificial Intelligence (SCAI) Authority, and the German Federal Ministry for Economic Affairs and Climate Action (BMWK) under the project DAKI-FWS (01MK21009E). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Open Access Publications

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data Availability

All data produced in the present study are available upon reasonable request to the authors.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Back to top
PreviousNext
Posted May 31, 2025.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Physician-level classification performance across multiple imaging domains with a diagnostic medical foundation model and a large dataset of annotated medical images
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Physician-level classification performance across multiple imaging domains with a diagnostic medical foundation model and a large dataset of annotated medical images
Alexander Henry Thieme, Tahir Miri, Alexandre R Marra, Takaaki Kobayashi, Guillermo Rodriguez-Nava, Yiheng Li, Thomas Barba, Ahmet Görkem Er, Justus Benzler, Maximilian Gertler, Mareike Riechers, Christian Hinze, Yuanning Zheng, Konstantin Pelz, Divya Nagaraj, Alexa Chen, Anastassia Löser, Alexander Rühle, Constantinos Zamboglou, Lujain Alyahya, Maximilian Uhlig, Gautam Machiraju, Kuba Weimann, Christoph Lippert, Tim Conrad, Jackie Ma, Roberto Novoa, Michael Moor, Tina Hernandez-Boussard, Mohammed Alawad, Jorge L. Salinas, Mirja Mittermaier, Olivier Gevaert
medRxiv 2025.05.30.25328646; doi: https://doi.org/10.1101/2025.05.30.25328646
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Physician-level classification performance across multiple imaging domains with a diagnostic medical foundation model and a large dataset of annotated medical images
Alexander Henry Thieme, Tahir Miri, Alexandre R Marra, Takaaki Kobayashi, Guillermo Rodriguez-Nava, Yiheng Li, Thomas Barba, Ahmet Görkem Er, Justus Benzler, Maximilian Gertler, Mareike Riechers, Christian Hinze, Yuanning Zheng, Konstantin Pelz, Divya Nagaraj, Alexa Chen, Anastassia Löser, Alexander Rühle, Constantinos Zamboglou, Lujain Alyahya, Maximilian Uhlig, Gautam Machiraju, Kuba Weimann, Christoph Lippert, Tim Conrad, Jackie Ma, Roberto Novoa, Michael Moor, Tina Hernandez-Boussard, Mohammed Alawad, Jorge L. Salinas, Mirja Mittermaier, Olivier Gevaert
medRxiv 2025.05.30.25328646; doi: https://doi.org/10.1101/2025.05.30.25328646

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (576)
  • Allergy and Immunology (868)
  • Anesthesia (306)
  • Cardiovascular Medicine (4483)
  • Dentistry and Oral Medicine (449)
  • Dermatology (385)
  • Emergency Medicine (615)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1528)
  • Epidemiology (15283)
  • Forensic Medicine (31)
  • Gastroenterology (1134)
  • Genetic and Genomic Medicine (6651)
  • Geriatric Medicine (671)
  • Health Economics (1006)
  • Health Informatics (4606)
  • Health Policy (1378)
  • Health Systems and Quality Improvement (1624)
  • Hematology (545)
  • HIV/AIDS (1276)
  • Infectious Diseases (except HIV/AIDS) (15965)
  • Intensive Care and Critical Care Medicine (1111)
  • Medical Education (626)
  • Medical Ethics (147)
  • Nephrology (675)
  • Neurology (6699)
  • Nursing (346)
  • Nutrition (1006)
  • Obstetrics and Gynecology (1153)
  • Occupational and Environmental Health (961)
  • Oncology (3370)
  • Ophthalmology (989)
  • Orthopedics (370)
  • Otolaryngology (421)
  • Pain Medicine (437)
  • Palliative Medicine (131)
  • Pathology (670)
  • Pediatrics (1704)
  • Pharmacology and Therapeutics (700)
  • Primary Care Research (717)
  • Psychiatry and Clinical Psychology (5497)
  • Public and Global Health (9288)
  • Radiology and Imaging (2225)
  • Rehabilitation Medicine and Physical Therapy (1375)
  • Respiratory Medicine (1202)
  • Rheumatology (598)
  • Sexual and Reproductive Health (721)
  • Sports Medicine (536)
  • Surgery (722)
  • Toxicology (100)
  • Transplantation (290)
  • Urology (267)