medRxiv

Interoperability of phenome-wide multimorbidity patterns: a comparative study of two large-scale EHR systems

Nick Strayer, Tess Vessels, Karmel Choi, Siwei Zhang, Yajing Li, Lide Han, Brian Sharber, Ryan S Hsi, Cosmin A Bejan, Alexander G. Bick, Justin M Balko, Douglas B Johnson, Lee E Wheless, Quinn S Wells, Elizabeth J Philips, Jill M Pulley, Wesley H Self, Qingxia Chen, Tina Hartert, Consuelo H Wilkins, Michael R Savona, Yu Shyr, Dan M Roden, Jordan W Smoller, Douglas M Ruderfer, Yaomin Xu
doi: https://doi.org/10.1101/2024.03.28.24305045
Nick Strayer
1Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA
PhD
Tess Vessels
2Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
3Center for Digital Genomic Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
6Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
BS
Karmel Choi
7Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston MA
8Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston MA
PhD
Siwei Zhang
1Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA
MS
Yajing Li
1Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA
MS
Lide Han
2Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
3Center for Digital Genomic Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
6Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
PhD
Brian Sharber
6Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
BS
Ryan S Hsi
9Department of Urology, Vanderbilt University Medical Center, Nashville, TN, USA
MD
Cosmin A Bejan
10Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
PhD
Alexander G. Bick
6Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
MD, PhD
Justin M Balko
6Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
PhD
Douglas B Johnson
6Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
MD
Lee E Wheless
4Tennessee Valley Health System VA Hospital, Nashville, TN, USA
5Department of Dermatology, Vanderbilt University Medical Center, Nashville, TN, USA
6Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
MD
Quinn S Wells
6Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
MD
Elizabeth J Philips
6Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
11Institute for Immunology and Infectious Diseases, Murdoch University, Murdoch, Western Australia, Australia
MD
Jill M Pulley
12Department of Allergy, Pulmonary and Critical Care Medicine, Vanderbilt University School of Medicine, Nashville, TN, USA
14Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN, USA
MBA
Wesley H Self
13Department of Emergency Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
14Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN, USA
MD
Qingxia Chen
1Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA
PhD
Tina Hartert
6Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
MD
Consuelo H Wilkins
6Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
14Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, TN, USA
MD
Michael R Savona
6Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
MD
Yu Shyr
1Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA
PhD
Dan M Roden
15Department of Pharmacology, Vanderbilt University Medical Center, Nashville, TN, USA
MD
Jordan W Smoller
7Psychiatric & Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston MA
8Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston MA
16Stanley Center for Psychiatric Research, Broad Institute, Cambridge, MA
MD
Douglas M Ruderfer
2Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
3Center for Digital Genomic Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
6Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA
10Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
17Department of Psychiatry and Behavioral Sciences, Vanderbilt University Medical Center, Nashville, TN, USA
PhD
  • For correspondence: yaomin.xu{at}vumc.org douglas.ruderfer{at}vumc.org
Yaomin Xu
1Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA
10Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
PhD
  • For correspondence: yaomin.xu{at}vumc.org douglas.ruderfer{at}vumc.org

Abstract

Background Electronic health records (EHRs) are increasingly used for studying multimorbidities. However, concerns about their accuracy and completeness, and the fact that EHRs are designed primarily for billing and administrative purposes, raise questions about the consistency and reproducibility of EHR-based multimorbidity research.

Methods Utilizing phecodes to represent the disease phenome, we analyzed pairwise comorbidity strengths using a dual logistic regression approach and constructed multimorbidity as an undirected weighted graph. We assessed the consistency of the multimorbidity networks within and between two major EHR systems at local (nodes and edges), meso (neighboring patterns), and global (network statistics) scales. We present case studies to identify disease clusters and uncover clinically interpretable disease relationships. We provide an interactive web tool and a knowledge base combining data from multiple sources for online multimorbidity analysis.
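
As a rough illustration of the graph representation described in the Methods, the sketch below stores pairwise comorbidity strengths as an undirected weighted graph and computes a node-level summary (the weighted degree). It is a minimal sketch under stated assumptions, not the authors' implementation: the dual logistic regression step is not reproduced, the identifiers `P1` to `P4` are placeholder disease codes, and the strengths are invented toy values rather than study results. The function names `build_graph` and `weighted_degree` are hypothetical.

```python
from collections import defaultdict

# Hypothetical pairwise comorbidity strengths keyed by a pair of
# placeholder disease codes. Illustrative values only, not study data.
toy_strengths = {
    ("P1", "P2"): 4.2,
    ("P1", "P3"): 3.1,
    ("P2", "P3"): 2.5,
    ("P1", "P4"): 0.4,
}


def build_graph(strengths):
    """Return an undirected weighted graph as a nested adjacency dict."""
    graph = defaultdict(dict)
    for (a, b), weight in strengths.items():
        graph[a][b] = weight
        graph[b][a] = weight  # undirected: store the edge in both directions
    return dict(graph)


def weighted_degree(graph):
    """Sum of incident edge weights per node (a simple local-scale statistic)."""
    return {node: sum(nbrs.values()) for node, nbrs in graph.items()}


graph = build_graph(toy_strengths)
degrees = weighted_degree(graph)

# Rank the placeholder codes by how strongly they connect to the others.
for node in sorted(degrees, key=degrees.get, reverse=True):
    print(node, round(degrees[node], 2))
```

A nested dictionary keeps the sketch dependency-free; a dedicated graph library would be a natural choice for network statistics at larger scales.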

Findings Analyzing data from 500,000 patients across Vanderbilt University Medical Center and Mass General Brigham health systems, we observed a strong correlation in disease frequencies (Kendall’s τ = 0.643) and comorbidity strengths (Pearson ρ = 0.79). Consistent network statistics across EHRs suggest similar structures of multimorbidity networks at various scales. Comorbidity strengths and similarities of multimorbidity connection patterns align with the disease genetic correlations. Graph-theoretic analyses revealed a consistent core-periphery structure, implying efficient network clustering through threshold graph construction. Using hydronephrosis as a case study, we demonstrated the network’s ability to uncover clinically relevant disease relationships and provide novel insights.
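
The two agreement measures reported in the Findings can be illustrated with a small standard-library sketch. It computes Kendall's τ (here the tau-a variant, which is an assumption, since the abstract does not say which variant was used) and the Pearson correlation between two vectors. The vectors `site_a` and `site_b` are hypothetical disease frequencies for five diseases at two sites, invented for illustration; they are not values from either EHR system.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs.

    Tied pairs count as neither concordant nor discordant.
    """
    concordant = discordant = 0
    for i, j in combinations(range(len(x)), 2):
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    n_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / n_pairs


def pearson(x, y):
    """Pearson correlation: covariance scaled by both standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


# Hypothetical per-disease frequencies at two sites (toy values).
site_a = [0.12, 0.05, 0.30, 0.08, 0.02]
site_b = [0.10, 0.07, 0.25, 0.06, 0.03]

tau = kendall_tau_a(site_a, site_b)
r = pearson(site_a, site_b)
print(f"Kendall tau-a = {tau:.3f}, Pearson = {r:.3f}")
```

Kendall's τ depends only on the rank ordering of the diseases, so it is less sensitive than the Pearson correlation to a few very common diagnoses dominating the comparison.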

Interpretation Our findings demonstrate the robustness of large-scale EHR data for studying phenome-wide multimorbidities. The alignment of multimorbidity patterns with genetic data suggests the potential utility for uncovering shared biology of diseases. The consistent core-periphery structure offers analytical insights to discover complex disease interactions. This work also sets the stage for advanced disease modeling, with implications for precision medicine.

Funding VUMC Biostatistics Development Award, the National Institutes of Health, and the VA CSRD

Competing Interest Statement

JWS is a member of the Scientific Advisory Board of Sensorium Therapeutics (with equity) and has received grant support from Biogen, Inc. He is the principal investigator of a collaborative study of the genetics of depression and bipolar disorder sponsored by 23andMe, for which 23andMe provides analysis time as in-kind support but no payments. DMR has served on advisory boards for Illumina and Alkermes and has received research funds unrelated to this work from PTC Therapeutics. All other authors declare no competing interests.

Funding Statement

NS and YX are supported by the Vanderbilt University Department of Biostatistics Development Award; YX, CB and RH are supported by R21DK127075; YX, DE, EP and DR are supported by P50GM115305; JWS is supported in part by R01 MH118233. The Vanderbilt University Medical Center dataset(s) used for the analyses described were obtained from Vanderbilt University Medical Center's SD/BioVU, which is supported by numerous sources: institutional funding, private agencies, and federal grants. These include the NIH-funded Shared Instrumentation Grant S10RR025141 and CTSA grants UL1TR002243, UL1TR000445, and UL1RR024975. Genomic data are also supported by investigator-led projects that include U01HG004798, R01NS032830, RC2GM092618, P50GM115305, U01HG006378, U19HL065962, R01HD074711, and additional funding sources listed at https://victr.vanderbilt.edu/pub/biovu/. This research has been conducted using the UK Biobank Resource under Application Number 43397.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

IRB# 172041 of Vanderbilt University Medical Center (VUMC) gave ethical approval for this work. IRB# 2009P002312 of Mass General Brigham (MGB) gave ethical approval for this work.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • ↵* Contributed equally

  • The manuscript was revised; Figure 7 was revised; the authors and affiliations were updated; the supplemental file was updated.

Data Availability

All coding details associated with the models have been shared. Results have been aggregated and reported within this Article to the maximum extent possible, while maintaining privacy from personal health information as required by law. All dynamic online analysis results are available from the PheMIME App (https://prod.tbilab.org/PheMIME/). All data are archived within TBILab systems in an audited computing environment secured by the Health Insurance Portability and Accountability Act to facilitate verification of study conclusions. The open-source code for PheMIME is publicly available on our GitHub repository at https://github.com/tbilab/PheMIME.


Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Posted May 27, 2024.
Subject Area

  • Health Informatics