medRxiv
DISCERN: A Clinical Impact-aware Framework for Radiology Report Comparison

Rakesh Sharma, Cameron Beeche, Jessie Dong, Richard Zhuang, Huaizhi Qu, Ruichen Zhang, Vineeth Gangaram, Pulak Goswami, Jiayi Xin, Jenna Ballard, Jeffery Duda, Charles E. Kahn Jr, Ari Goldberg, Hersh Sagreiya, Qi Long, Tianlong Chen, Walter Witschey
doi: https://doi.org/10.64898/2026.05.26.26353612
Rakesh Sharma, MTech (1). For correspondence: rakesh.sharma{at}pennmedicine.upenn.edu
Cameron Beeche, BS (1)
Jessie Dong, MS (1)
Richard Zhuang (1)
Huaizhi Qu, BE (2)
Ruichen Zhang, MS (2)
Vineeth Gangaram, MD (1)
Pulak Goswami, MD (1)
Jiayi Xin, BS (1)
Jenna Ballard, MA (1)
Jeffery Duda, PhD (1)
Charles E. Kahn Jr, MD, MS (1)
Ari Goldberg, MD (1)
Hersh Sagreiya, MD (1)
Qi Long, PhD (1)
Tianlong Chen, PhD (2)
Walter Witschey, PhD (1)

(1) University of Pennsylvania, Philadelphia, PA, USA
(2) University of North Carolina at Chapel Hill, Chapel Hill, NC, USA

Abstract

The surge in medical imaging has spurred the development of vision-language models (VLMs) to alleviate radiologist workloads. However, clinical deployment is hindered by the lack of meaningful evaluation frameworks. Current metrics, ranging from semantic similarity to large language model (LLM)-based judges, often fail to distinguish clinically trivial discrepancies from critical ones and therefore reflect real-world clinical judgment poorly. To address this, we introduce DISCERN (Discordance and Significance-aware Entity-level Radiology Report Comparison), a significance-aware framework that weights report errors by their potential impact on patient care. Our results demonstrate that DISCERN powered by closed-source LLMs aligns more closely with expert radiologist assessments than traditional metrics or current LLM evaluators, providing a more interpretable and clinically relevant benchmark. By modeling radiologist prioritization and entity-level feedback, DISCERN facilitates targeted model refinement and ensures the safer integration of generative AI into clinical workflows.
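
To make the idea of significance-aware weighting concrete, the following is a minimal illustrative sketch, not the DISCERN implementation. The abstract does not specify the scoring rule, so the significance categories, the weight values, and the names `SIGNIFICANCE_WEIGHTS`, `Discrepancy`, and `weighted_error` are all assumptions introduced here for illustration only.

```python
from dataclasses import dataclass

# Hypothetical significance categories and weights (assumed for illustration;
# the abstract does not state the actual categories or values).
SIGNIFICANCE_WEIGHTS = {"critical": 1.0, "moderate": 0.5, "trivial": 0.1}


@dataclass(frozen=True)
class Discrepancy:
    entity: str        # finding the discrepancy concerns, e.g. "pneumothorax"
    significance: str  # key into SIGNIFICANCE_WEIGHTS


def weighted_error(discrepancies):
    """Sum the significance weights of all discrepancies between two reports."""
    return sum(SIGNIFICANCE_WEIGHTS[d.significance] for d in discrepancies)


# Candidate A misses one critical finding; candidate B has two trivial differences.
report_a = [Discrepancy("pneumothorax", "critical")]
report_b = [
    Discrepancy("lead position wording", "trivial"),
    Discrepancy("old rib fracture", "trivial"),
]

# A plain error count would rank B as worse (2 errors vs. 1);
# the weighted score ranks A as worse because its single error is critical.
print(len(report_a) < len(report_b))
print(weighted_error(report_a) > weighted_error(report_b))
```

Under these assumed weights, a single missed critical finding outweighs several trivial wording differences, which is the behavior the abstract says unweighted metrics fail to capture.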

Competing Interest Statement

The authors have declared no competing interest.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

PhysioNet approved access to the ReXVal dataset (https://physionet.org/content/rexval-dataset/1.0.0/). RadEvalX is publicly accessible (https://physionet.org/content/rad-eval-x/1.0.0/).

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • Two co-authors were omitted from the earlier version. The current version corrects that mistake.

Data Availability

All data used for the evaluation are available online at https://physionet.org/content/rexval-dataset/1.0.0/ and https://physionet.org/content/rad-eval-x/1.0.0/. The codebase is available at https://github.com/rakshrma/discern/.

Funder Information Declared

NIH Common Fund, https://ror.org/001d55x84, P41-EB029460, R01-HL169378, R01-HL171709, UL1-TR001878, OT2-OD038048, R21-EB036734
Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Posted May 28, 2026.
Subject Area

  • Radiology and Imaging