medRxiv

Comorbidity analysis and clustering of endometriosis patients using electronic health records

Umair Khan, Tomiko T. Oskotsky, Bahar D. Yilmaz, Jacquelyn Roger, Ketrin Gjoni, Juan C. Irwin, Jessica Opoku-Anane, Noémie Elhadad, Linda C. Giudice, Marina Sirota
doi: https://doi.org/10.1101/2025.02.13.25322244
Umair Khan
1 Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA
2 Biological and Medical Informatics Graduate Program, University of California, San Francisco, San Francisco, CA
BS
Tomiko T. Oskotsky
1 Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA
MD
Bahar D. Yilmaz
3 Department of Obstetrics, Gynecology, and Reproductive Sciences, Center for Reproductive Sciences, University of California, San Francisco, San Francisco, CA
MD
Jacquelyn Roger
2 Biological and Medical Informatics Graduate Program, University of California, San Francisco, San Francisco, CA
BS
Ketrin Gjoni
4 Pharmaceutical Sciences and Pharmacogenomics Graduate Program, University of California, San Francisco, San Francisco, CA
BS
Juan C. Irwin
3 Department of Obstetrics, Gynecology, and Reproductive Sciences, Center for Reproductive Sciences, University of California, San Francisco, San Francisco, CA
MD, PhD
Jessica Opoku-Anane
5 Robert Wood Johnson Medical School, Rutgers University, New Brunswick, NJ
MD
Noémie Elhadad
6 Department of Biomedical Informatics, Columbia University, New York, NY
PhD
Linda C. Giudice
3 Department of Obstetrics, Gynecology, and Reproductive Sciences, Center for Reproductive Sciences, University of California, San Francisco, San Francisco, CA
MD, PhD
Marina Sirota
1 Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA
PhD
  • For correspondence: marina.sirota{at}ucsf.edu

Abstract

Endometriosis is a prevalent, complex, inflammatory condition associated with a diverse range of symptoms and comorbidities. Despite its substantial burden on patients, population-level studies that explore its comorbid patterns and heterogeneity are limited. In this retrospective case-control study, we analyzed comorbidities from over forty thousand endometriosis patients across six University of California medical centers using de-identified electronic health record (EHR) data. We found hundreds of conditions significantly associated with endometriosis, including genitourinary disorders, neoplasms, and autoimmune diseases, with strong replication across datasets. Clustering analyses identified patient subpopulations with distinct comorbidity patterns, including psychiatric and autoimmune conditions. This study provides a comprehensive analysis of endometriosis comorbidities and highlights the heterogeneity within the patient population. Our findings demonstrate the utility of EHR data in uncovering clinically meaningful patterns and suggest pathways for personalized disease management and future research on biological mechanisms underlying endometriosis.

Competing Interest Statement

L.C.G. is a consultant to Myovant Sciences, Gensyta Pharma, Celmatix, NextGen Jane, and Chugai Pharmaceutical Co. The remaining authors declare no competing interests.

Funding Statement

This manuscript was supported by the Eunice Kennedy Shriver National Institute for Child Health and Human Development, P01HD106414 (UK, TTO, JCI, JO, LCG, MS), and the National Institute of General Medical Sciences, T32GM067547 (UK) and T32GM142516 (KG).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

All analysis of University of California electronic health record data was performed under the approval of the Institutional Review Board from the University of California, San Francisco. All clinical data were de-identified and written informed consent was waived by the institution.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • References to specific University of California locations have been removed.

Data Availability

The data that support the findings of this study are not openly available to individuals unaffiliated with UCSF, with the exception of collaborators, due to the sensitivity of medical records. Individuals not affiliated with UCSF may set up an official collaboration with a UCSF-affiliated investigator by reaching out to the lead contact, Marina Sirota (marina.sirota{at}ucsf.edu). UCSF-affiliated individuals may contact UCSF’s Clinical and Translational Science Institute (ctsi{at}ucsf.edu) or UCSF’s Information Commons team (info.commons{at}ucsf.edu) for more information. UC-wide data are available only to UC researchers who have first completed analyses at their own UC institution and have provided justification for scaling their analyses across UC health centers. Censored code for the analysis and visualizations in this study can be found at https://github.com/khanu263/comorbidities-clustering-endo.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Posted February 19, 2025.
Subject Area

  • Sexual and Reproductive Health