Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C)

View ORCID ProfileSara Jones, View ORCID ProfileKatie R. Bradwell, View ORCID ProfileLauren E. Chan, View ORCID ProfileCourtney Olson-Chen, Jessica Tarleton, View ORCID ProfileKenneth J. Wilkins, View ORCID ProfileQiuyuan Qin, View ORCID ProfileEmily Groene Faherty, View ORCID ProfileYan Kwan Lau, Catherine Xie, View ORCID ProfileYu-Han Kao, View ORCID ProfileMichael N. Liebman, Federico Mariona, View ORCID ProfileAnup Challa, View ORCID ProfileLi Li, View ORCID ProfileSarah J. Ratcliffe, View ORCID ProfileJulie A. McMurry, View ORCID ProfileMelissa A. Haendel, View ORCID ProfileRena C. Patel, View ORCID ProfileElaine L. Hill the N3C Consortium
doi: https://doi.org/10.1101/2022.08.04.22278439
Sara Jones
1Office of Data Science and Emerging Technologies, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Sara Jones
Katie R. Bradwell
2Palantir Technologies, Denver, CO
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Katie R. Bradwell
Lauren E. Chan
3College of Public Health and Human Sciences, Oregon State University, Corvallis, OR
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lauren E. Chan
Courtney Olson-Chen
4Department of Obstetrics and Gynecology, University of Rochester Medical Center, Rochester, NY
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Courtney Olson-Chen
Jessica Tarleton
5Department of Obstetrics and Gynecology, Medical University of South Carolina, Charleston, SC
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kenneth J. Wilkins
6Biostatistics Program, Office of the Director, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Kenneth J. Wilkins
Qiuyuan Qin
7Department of Public Health Sciences, University of Rochester Medical Center, Rochester, NY
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Qiuyuan Qin
Emily Groene Faherty
8University of Minnesota School of Public Health, Minneapolis, MN
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Emily Groene Faherty
Yan Kwan Lau
9Sema4, Stamford, CT
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yan Kwan Lau
Catherine Xie
7Department of Public Health Sciences, University of Rochester Medical Center, Rochester, NY
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yu-Han Kao
9Sema4, Stamford, CT
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yu-Han Kao
Michael N. Liebman
10IPQ Analytics, LLC, Kennett Square, PA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Michael N. Liebman
Federico Mariona
11Beaumont Hospital, Dearborn, MI
12Wayne State University, Detroit, MI
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Anup Challa
13Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, TN
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Anup Challa
Li Li
9Sema4, Stamford, CT
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Li Li
Sarah J. Ratcliffe
14Department of Public Health Sciences, University of Virginia, Charlottesville, VA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Sarah J. Ratcliffe
Julie A. McMurry
15Department of Biomedical Informatics, University of Colorado, Anschutz Medical Campus, Aurora, CO
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Julie A. McMurry
Melissa A. Haendel
15Department of Biomedical Informatics, University of Colorado, Anschutz Medical Campus, Aurora, CO
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Melissa A. Haendel
Rena C. Patel
16Department of Medicine and Global Health, University of Washington, Seattle, WA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Rena C. Patel
Elaine L. Hill
4Department of Obstetrics and Gynecology, University of Rochester Medical Center, Rochester, NY
7Department of Public Health Sciences, University of Rochester Medical Center, Rochester, NY
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Elaine L. Hill
  • For correspondence: elaine_hill@urmc.rochester.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

Objective To define pregnancy episodes and estimate gestational aging within electronic health record (EHR) data from the National COVID Cohort Collaborative (N3C).

Materials and Methods We developed a comprehensive approach, named Hierarchy and rule-based pregnancy episode Inference integrated with Pregnancy Progression Signatures (HIPPS) and applied it to EHR data in the N3C from 1 January 2018 to 7 April 2022. HIPPS combines: 1) an extension of a previously published pregnancy episode algorithm, 2) a novel algorithm to detect gestational aging-specific signatures of a progressing pregnancy for further episode support, and 3) pregnancy start date inference. Clinicians performed validation of HIPPS on a subset of episodes. We then generated three types of pregnancy cohorts based on the level of precision for gestational aging and pregnancy outcomes for comparison of COVID-19 and other characteristics.

Results We identified 628,165 pregnant persons with 816,471 pregnancy episodes, of which 52.3% were live births, 24.4% were other outcomes (stillbirth, ectopic pregnancy, spontaneous abortions), and 23.3% had unknown outcomes. We were able to estimate start dates within one week of precision for 431,173 (52.8%) episodes. 66,019 (8.1%) episodes had incident COVID-19 during pregnancy. Across varying COVID-19 cohorts, patient characteristics were generally similar though pregnancy outcomes differed.

Discussion HIPPS provides support for pregnancy-related variables based on EHR data for researchers to define pregnancy cohorts. Our approach performed well based on clinician validation.

Conclusion We have developed a novel and robust approach for inferring pregnancy episodes and gestational aging that addresses data inconsistency and missingness in EHR data.

Competing Interest Statement

KRB is an employee of Palantir Technologies. YK and LL are employees of Sema4, ML is Managing Director of IPQ Analytics, LLC.

Funding Statement

The analyses described in this publication were conducted with data or tools accessed through the NCATS N3C Data Enclave covid.cd2h.org/enclave and N3C Attribution & Publication Policy v1.2-2020-08-25b, and supported by NCATS U24 TR002306, and NIGMS National Institute of General Medical Sciences, 5U54GM104942-04. Individual authors were supported by the following funding sources: NIMH R01131542 (Rena C. Patel), NICHD R21105304 (Anup P. Challa).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Data partner sites transfer their N3C-eligible data to NCATS/NIH under a Johns Hopkins University Reliance Protocol (IRB00249128) or via individual site agreements with NCATS (see below). Managed under the NIH authority, the N3C Data Enclave can be accessed as previously described [10] and at ncats.nih.gov/n3c/resources, https://covid.cd2h.org/for-researchers. SiteIRB NameExempted vs approvedProtocol number Medical University of South CarolinaHealth Sciences South Carolina Institutional Review BoardexemptPro00111335 National Institutes of HealthNIH Office of IRB OperationsexemptN/A University of MinnesotaUniversity of Minnesota Institutional Review BoardapprovedSTUDY00012706 University of RochesterUniversity of Rochester Research Subjects Review BoardexemptSTUDY00005366 University of WashingtonHuman Subjects DivisionapprovedSTUDY00013147 Institutional IRBs determine that it is not human subjects research, it is research on the data

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

N3C Data Enclave can be accessed at ncats.nih.gov/n3c/resources, https://covid.cd2h.org/for-researchers

https://ncats.nih.gov/n3c/resources

https://covid.cd2h.org/for-researchers

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted August 08, 2022.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C)
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C)
Sara Jones, Katie R. Bradwell, Lauren E. Chan, Courtney Olson-Chen, Jessica Tarleton, Kenneth J. Wilkins, Qiuyuan Qin, Emily Groene Faherty, Yan Kwan Lau, Catherine Xie, Yu-Han Kao, Michael N. Liebman, Federico Mariona, Anup Challa, Li Li, Sarah J. Ratcliffe, Julie A. McMurry, Melissa A. Haendel, Rena C. Patel, Elaine L. Hill
medRxiv 2022.08.04.22278439; doi: https://doi.org/10.1101/2022.08.04.22278439
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C)
Sara Jones, Katie R. Bradwell, Lauren E. Chan, Courtney Olson-Chen, Jessica Tarleton, Kenneth J. Wilkins, Qiuyuan Qin, Emily Groene Faherty, Yan Kwan Lau, Catherine Xie, Yu-Han Kao, Michael N. Liebman, Federico Mariona, Anup Challa, Li Li, Sarah J. Ratcliffe, Julie A. McMurry, Melissa A. Haendel, Rena C. Patel, Elaine L. Hill
medRxiv 2022.08.04.22278439; doi: https://doi.org/10.1101/2022.08.04.22278439

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (271)
  • Allergy and Immunology (553)
  • Anesthesia (135)
  • Cardiovascular Medicine (1761)
  • Dentistry and Oral Medicine (238)
  • Dermatology (173)
  • Emergency Medicine (312)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (660)
  • Epidemiology (10803)
  • Forensic Medicine (8)
  • Gastroenterology (593)
  • Genetic and Genomic Medicine (2953)
  • Geriatric Medicine (287)
  • Health Economics (534)
  • Health Informatics (1930)
  • Health Policy (836)
  • Health Systems and Quality Improvement (745)
  • Hematology (293)
  • HIV/AIDS (631)
  • Infectious Diseases (except HIV/AIDS) (12520)
  • Intensive Care and Critical Care Medicine (693)
  • Medical Education (299)
  • Medical Ethics (86)
  • Nephrology (324)
  • Neurology (2801)
  • Nursing (151)
  • Nutrition (433)
  • Obstetrics and Gynecology (559)
  • Occupational and Environmental Health (597)
  • Oncology (1469)
  • Ophthalmology (444)
  • Orthopedics (172)
  • Otolaryngology (257)
  • Pain Medicine (190)
  • Palliative Medicine (56)
  • Pathology (381)
  • Pediatrics (867)
  • Pharmacology and Therapeutics (366)
  • Primary Care Research (338)
  • Psychiatry and Clinical Psychology (2641)
  • Public and Global Health (5374)
  • Radiology and Imaging (1014)
  • Rehabilitation Medicine and Physical Therapy (596)
  • Respiratory Medicine (726)
  • Rheumatology (330)
  • Sexual and Reproductive Health (289)
  • Sports Medicine (279)
  • Surgery (327)
  • Toxicology (47)
  • Transplantation (150)
  • Urology (126)