Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Ultra-low coverage genome-wide association study - insights into gestational age using 17,844 embryo samples with preimplantation genetic testing

View ORCID ProfileShumin Li, Bin Yan, Thomas K.T. Li, Jianliang Lu, Yifan Gu, Yueqiu Tan, Fei Gong, Tak-Wah Lam, Pingyuan Xie, Yuexuan Wang, Ge Lin, View ORCID ProfileRuibang Luo
doi: https://doi.org/10.1101/2022.06.15.22276464
Shumin Li
1Department of Computer Science, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Shumin Li
Bin Yan
1Department of Computer Science, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Thomas K.T. Li
6Department of Obstetrics & Gynecology, Queen Mary Hospital, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jianliang Lu
1Department of Computer Science, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yifan Gu
4NHC Key Laboratory of Human Stem Cell and Reproductive Engineering, School of Basic Medical Science, Institute of Reproductive and Stem Cell Engineering, Central South University, Changsha 410008, Hunan, China
5Clinical Research Center for Reproduction and Genetics in Hunan Province, Reproductive and Genetic Hospital of CITIC-Xiangya, Changsha 410013, Hunan, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yueqiu Tan
4NHC Key Laboratory of Human Stem Cell and Reproductive Engineering, School of Basic Medical Science, Institute of Reproductive and Stem Cell Engineering, Central South University, Changsha 410008, Hunan, China
5Clinical Research Center for Reproduction and Genetics in Hunan Province, Reproductive and Genetic Hospital of CITIC-Xiangya, Changsha 410013, Hunan, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Fei Gong
4NHC Key Laboratory of Human Stem Cell and Reproductive Engineering, School of Basic Medical Science, Institute of Reproductive and Stem Cell Engineering, Central South University, Changsha 410008, Hunan, China
5Clinical Research Center for Reproduction and Genetics in Hunan Province, Reproductive and Genetic Hospital of CITIC-Xiangya, Changsha 410013, Hunan, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tak-Wah Lam
1Department of Computer Science, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Pingyuan Xie
2Hunan Normal University School of Medicine, Changsha, 410013, Hunan, China
3National Engineering and Research Center of Human Stem Cell, Changsha, Hunan, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: rbluo@cs.hku.hk linggf@hotmail.com amywang@hku.hk plainxie192@126.com
Yuexuan Wang
1Department of Computer Science, The University of Hong Kong, Hong Kong, China
7College of Computer Science and Technology, Zhejiang University, Hangzhou, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: rbluo@cs.hku.hk linggf@hotmail.com amywang@hku.hk plainxie192@126.com
Ge Lin
3National Engineering and Research Center of Human Stem Cell, Changsha, Hunan, China
4NHC Key Laboratory of Human Stem Cell and Reproductive Engineering, School of Basic Medical Science, Institute of Reproductive and Stem Cell Engineering, Central South University, Changsha 410008, Hunan, China
5Clinical Research Center for Reproduction and Genetics in Hunan Province, Reproductive and Genetic Hospital of CITIC-Xiangya, Changsha 410013, Hunan, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: rbluo@cs.hku.hk linggf@hotmail.com amywang@hku.hk plainxie192@126.com
Ruibang Luo
1Department of Computer Science, The University of Hong Kong, Hong Kong, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ruibang Luo
  • For correspondence: rbluo@cs.hku.hk linggf@hotmail.com amywang@hku.hk plainxie192@126.com
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Background Very low coverage (0.1 to 1x) whole genome sequencing (WGS) has become a promising and affordable approach to discover genomic variants of human populations for Genome-Wide Association Study (GWAS). To support genetic screening using Preimplantation Genetic Testing (PGT) in a large population, the sequencing coverage goes below 0.1x to an ultra-low level. However, its feasibility and effectiveness for GWAS remains undetermined.

Methods We devised a pipeline to process ultra-low coverage WGS data and benchmarked the accuracy of genotype imputation at the combination of different coverages below 0.1x and sample sizes from 2,000 to 16,000, using 17,844 embryo PGT with approximately 0.04x average coverage and the standard Chinese sample HG005 with known genotypes. We then applied the imputed genotypes of 1,744 transferred embryos who have gestational ages and complete follow-up records to GWAS.

Results The accuracy of genotype imputation under ultra-low coverage can be improved by increasing the sample size and applying a set of filters. From 1,744 born embryos, we identified 11 genomic risk loci associated with gestational ages and 166 genes mapped to these loci according to positional, expression quantitative trait locus and chromatin interaction strategies. Among these mapped genes, CRHBP, ICAM1 and OXTR were more frequently reported as preterm birth related. By joint analysis of gene expression data from previous studies, we constructed interrelationships of mainly CRHBP, ICAM1, PLAGL1, DNMT1, CNTLN, DKK1 and EGR2 with preterm birth, infant disease and breast cancer.

Conclusions This study not only demonstrates that ultra-low coverage WGS could achieve relatively high accuracy of adequate genotype imputation and is capable of GWAS, but also provides insights into uncovering genetic associations of gestational age trait existed in the fetal embryo samples from Chinese or Eastern Asian populations.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

The study was supported by the Early Career Schema (27204518) of the Hong Kong Research Grants Council for RL, partially by General Research Funding of Hong Kong Research Grants Council (17113721) for RL and (17117918) for BY, by the University Grants Committees Fund from the University of Hong Kong for RL, partially by National Key Research and Developmental Program of China (2018YFC1004900) for YT, by the Innovation and Technology Fund (ITF/331/17FP) of Innovation and Technology Commission of the Hong Kong SAR government for TL.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

All individual samples in the technical concordance cohort and clinical cohort, and protocols used in this study have been reviewed and approved by the Institutional Review Board (IRB) of China International Trust Investment Corporation - Xianngya (IRB Reference No. LL-SC-2020-004). The need to obtain informed consent has been waived by the IRB due to the study's retrospective nature. Following the regulations of the Human Genetic Resources Administration of China, all genetic materials involved in this study have been reviewed and approved by Ministry of Science and Technology of the People's Republic of China (Approval No. [2022] GH1831). The experimental methods were in accordance with the principles of the Helsinki Declaration.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

The datasets supporting the conclusions of this article are included within the article and its additional files. The significant SNPs and candidate SNPs are listed in Additional file 1: Tables S3 and S4, respectively. The processed or raw counts of mRNA expression datasets shown in Additional file 1: Table S8 was downloaded from Gene Expression Omnibus (GEO)/National Center for Biotechnology Information (NCBI).

  • Abbreviations

    WGS
    whole genome sequencing
    NGS
    next-generation sequencing
    GWAS
    genome-wide association study
    bp
    base pair
    SNP
    single nucleotide polymorphism
    PTB
    preterm birth
    PGT
    preimplantation genetic testing
    MAF
    minor allele frequency
    LD
    linkage disequilibrium
    HWE
    Hardy-Weinberg equilibrium
    WGA
    whole genome amplification
    PCA
    principal component analysis
    eQTL
    expression quantitative trait locus
    FDR
    false discovery rate
    GO
    gene ontology
    BPD
    bronchopulmonary dysplasia
    ER
    Estrogen receptor
    PR
    Progesterone receptor
    HER2
    human epidermal growth factor receptor 2
    TNBC
    Triple-negative breast cancer
    DEG
    differentially expressed gene
    AF
    allele frequency
    GIAB
    Genome in a Bottle
    BWA
    Burrows-Wheeler Aligner
    GEO
    Gene Expression Omnibus
    NCBI
    National Center for Biotechnology Information
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
    Back to top
    PreviousNext
    Posted June 16, 2022.
    Download PDF

    Supplementary Material

    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Ultra-low coverage genome-wide association study - insights into gestational age using 17,844 embryo samples with preimplantation genetic testing
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Ultra-low coverage genome-wide association study - insights into gestational age using 17,844 embryo samples with preimplantation genetic testing
    Shumin Li, Bin Yan, Thomas K.T. Li, Jianliang Lu, Yifan Gu, Yueqiu Tan, Fei Gong, Tak-Wah Lam, Pingyuan Xie, Yuexuan Wang, Ge Lin, Ruibang Luo
    medRxiv 2022.06.15.22276464; doi: https://doi.org/10.1101/2022.06.15.22276464
    Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
    Citation Tools
    Ultra-low coverage genome-wide association study - insights into gestational age using 17,844 embryo samples with preimplantation genetic testing
    Shumin Li, Bin Yan, Thomas K.T. Li, Jianliang Lu, Yifan Gu, Yueqiu Tan, Fei Gong, Tak-Wah Lam, Pingyuan Xie, Yuexuan Wang, Ge Lin, Ruibang Luo
    medRxiv 2022.06.15.22276464; doi: https://doi.org/10.1101/2022.06.15.22276464

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Genetic and Genomic Medicine
    Subject Areas
    All Articles
    • Addiction Medicine (215)
    • Allergy and Immunology (495)
    • Anesthesia (106)
    • Cardiovascular Medicine (1093)
    • Dentistry and Oral Medicine (195)
    • Dermatology (141)
    • Emergency Medicine (274)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (499)
    • Epidemiology (9758)
    • Forensic Medicine (5)
    • Gastroenterology (480)
    • Genetic and Genomic Medicine (2304)
    • Geriatric Medicine (222)
    • Health Economics (462)
    • Health Informatics (1554)
    • Health Policy (732)
    • Health Systems and Quality Improvement (602)
    • Hematology (236)
    • HIV/AIDS (501)
    • Infectious Diseases (except HIV/AIDS) (11634)
    • Intensive Care and Critical Care Medicine (616)
    • Medical Education (236)
    • Medical Ethics (67)
    • Nephrology (257)
    • Neurology (2140)
    • Nursing (134)
    • Nutrition (335)
    • Obstetrics and Gynecology (426)
    • Occupational and Environmental Health (517)
    • Oncology (1172)
    • Ophthalmology (363)
    • Orthopedics (128)
    • Otolaryngology (220)
    • Pain Medicine (145)
    • Palliative Medicine (50)
    • Pathology (309)
    • Pediatrics (694)
    • Pharmacology and Therapeutics (298)
    • Primary Care Research (265)
    • Psychiatry and Clinical Psychology (2173)
    • Public and Global Health (4648)
    • Radiology and Imaging (775)
    • Rehabilitation Medicine and Physical Therapy (455)
    • Respiratory Medicine (623)
    • Rheumatology (274)
    • Sexual and Reproductive Health (225)
    • Sports Medicine (210)
    • Surgery (250)
    • Toxicology (43)
    • Transplantation (120)
    • Urology (94)