Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Improving Polygenic Prediction in Ancestrally Diverse Populations

Yunfeng Ruan, Yen-Chen Anne Feng, Chia-Yen Chen, Max Lam, Stanley Global Asia Initiatives, Akira Sawa, Alicia R. Martin, Shengying Qin, Hailiang Huang, Tian Ge
doi: https://doi.org/10.1101/2020.12.27.20248738
Yunfeng Ruan
1Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
2Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yen-Chen Anne Feng
1Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
3Department of Psychiatry, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
4Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
5Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Chia-Yen Chen
6Translational Biology, Biogen Inc., Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Max Lam
1Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
5Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
7Division of Psychiatry Research, The Zucker Hillside Hospital, Northwell Health, Glen Oaks, NY, USA
8Research Division, Institute of Mental Health Singapore, Singapore, Singapore
9Human Genetics, Genome Institute of Singapore, Singapore, Singapore
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Akira Sawa
10Departments of Psychiatry, Neuroscience, and Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD, USA
11Department of Mental Health, Johns Hopkins University Bloomberg School of Public Health, Baltimore, MD, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alicia R. Martin
1Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
5Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shengying Qin
2Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Shanghai Jiao Tong University, Shanghai, China
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hailiang Huang
1Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
5Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
12Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: tge1@mgh.harvard.edu
Tian Ge
1Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
3Department of Psychiatry, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
4Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

Polygenic risk scores (PRS) have attenuated cross-population predictive performance. As existing genome-wide association studies (GWAS) were predominantly conducted in individuals of European descent, the limited transferability of PRS reduces its clinical value in non-European populations and may exacerbate healthcare disparities. Recent efforts to level ancestry imbalance in genomic research have expanded the scale of non-European GWAS, although they remain under-powered. Here we present a novel PRS construction method, PRS-CSx, which improves cross-population polygenic prediction by integrating GWAS summary statistics from multiple populations. PRS-CSx couples genetic effects across populations via a shared continuous shrinkage prior, enabling more accurate effect size estimation by sharing information between summary statistics and leveraging linkage disequilibrium (LD) diversity across discovery samples, while inheriting computational efficiency and robustness from PRS-CS. We show that PRS-CSx outperforms alternative methods across traits with a wide range of genetic architectures and cross-population genetic correlations in simulations, and substantially improves the prediction of quantitative traits and schizophrenia risk in non-European populations.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

T.G. is supported by NIA K99/R00AG054573. H.H. acknowledges support from NIDDK K01DK114379, NIMH U01MH109539, Brain & Behavior Research Foundation Young Investigator Grant, the Zhengxu and Ying He Foundation, and the Stanley Center for Psychiatric Research. A.R.M. is supported by NIMH K99/R00MH117229.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Collection of the UK Biobank (UKBB) data was approved by the UKBB's Research Ethics Committee. UKBB GWAS summary statistics were released by the Neale Lab and are publicly available (http://www.nealelab.is/uk-biobank). UKBB individual-level data used in the present work were obtained under application #32568. Biobank Japan (BBJ) GWAS summary statistics are publicly available on the BBJ website (http://jenger.riken.jp/en/result). The PAGE study GWAS summary statistics are publicly available on the GWAS Catalog (https://www.ebi.ac.uk/gwas/downloads/summary-statistics). Schizophrenia GWAS summary statistics for the European and East Asian populations are publicly available on the Psychiatric Genomics Consortium (PGC) website (https://www.med.unc.edu/pgc/download-results/). The use of schizophrenia individual-level data of East Asian ancestry in the present work was approved by the Stanley Global Asia Initiatives. The following institutions provided ethics oversight for the collection of schizophrenia East Asian data: Domain Specific Review Board (Singapore); University of Hong Kong; Fujita Health University; RIKEN Center for Integrative Medical Sciences; Nagoya University; Osaka University; Niigata University; Bio-X Institutes of Shanghai Jiao Tong University; University Medical Center Utrecht; The University of Western Australia; The University of Indonesia; Peking University Sixth Hospital; National Taiwan University; Samsung Medical Center; Chonnam National University Hospital; Tokyo Metropolitan Institute of Medical Science and affiliated institutions; Harvard Harvard T.H. Chan School of Public Health.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • ↵* Email: chinsir{at}sjtu.edu.cn (S.Q.); hhuang{at}broadinstitute.org (H.H.); tge1{at}mgh.harvard.edu (T.G.)

Data Availability

Publicly available data were downloaded from the following databases: 1000 Genomes Phase 3 reference panels: https://mathgen.stats.ox.ac.uk/impute/1000GP_Phase3.html; Genetic map for each subpopulation: ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/working/20130507_omni_recombination_rates; UKBB summary statistics: http://www.nealelab.is/uk-biobank (GWAS round 2 was used in this study); BBJ summary statistics: http://jenger.riken.jp/en/result; PAGE summary statistics were downloaded from the GWAS Catalog: https://www.ebi.ac.uk/gwas/downloads/summary-statistics; PGC wave 2 schizophrenia EUR GWAS (49 EUR samples): https://www.med.unc.edu/pgc/download-results/scz; Schizophrenia EAS summary statistics are available upon request to PGC Schizophrenia Work Group. Schizophrenia individual-level data of East Asian ancestry are available upon reasonable request with regulatory/compliance review to the Stanley Global Asia Initiatives: zguo@broadinstitute.org.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted January 02, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Improving Polygenic Prediction in Ancestrally Diverse Populations
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Improving Polygenic Prediction in Ancestrally Diverse Populations
Yunfeng Ruan, Yen-Chen Anne Feng, Chia-Yen Chen, Max Lam, Stanley Global Asia Initiatives, Akira Sawa, Alicia R. Martin, Shengying Qin, Hailiang Huang, Tian Ge
medRxiv 2020.12.27.20248738; doi: https://doi.org/10.1101/2020.12.27.20248738
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Improving Polygenic Prediction in Ancestrally Diverse Populations
Yunfeng Ruan, Yen-Chen Anne Feng, Chia-Yen Chen, Max Lam, Stanley Global Asia Initiatives, Akira Sawa, Alicia R. Martin, Shengying Qin, Hailiang Huang, Tian Ge
medRxiv 2020.12.27.20248738; doi: https://doi.org/10.1101/2020.12.27.20248738

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetic and Genomic Medicine
Subject Areas
All Articles
  • Addiction Medicine (227)
  • Allergy and Immunology (500)
  • Anesthesia (110)
  • Cardiovascular Medicine (1226)
  • Dentistry and Oral Medicine (205)
  • Dermatology (147)
  • Emergency Medicine (282)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (528)
  • Epidemiology (9999)
  • Forensic Medicine (5)
  • Gastroenterology (497)
  • Genetic and Genomic Medicine (2442)
  • Geriatric Medicine (236)
  • Health Economics (479)
  • Health Informatics (1634)
  • Health Policy (750)
  • Health Systems and Quality Improvement (633)
  • Hematology (247)
  • HIV/AIDS (530)
  • Infectious Diseases (except HIV/AIDS) (11855)
  • Intensive Care and Critical Care Medicine (625)
  • Medical Education (251)
  • Medical Ethics (74)
  • Nephrology (267)
  • Neurology (2272)
  • Nursing (139)
  • Nutrition (349)
  • Obstetrics and Gynecology (451)
  • Occupational and Environmental Health (532)
  • Oncology (1244)
  • Ophthalmology (375)
  • Orthopedics (133)
  • Otolaryngology (226)
  • Pain Medicine (154)
  • Palliative Medicine (50)
  • Pathology (324)
  • Pediatrics (729)
  • Pharmacology and Therapeutics (311)
  • Primary Care Research (281)
  • Psychiatry and Clinical Psychology (2280)
  • Public and Global Health (4823)
  • Radiology and Imaging (833)
  • Rehabilitation Medicine and Physical Therapy (488)
  • Respiratory Medicine (650)
  • Rheumatology (283)
  • Sexual and Reproductive Health (237)
  • Sports Medicine (224)
  • Surgery (266)
  • Toxicology (44)
  • Transplantation (124)
  • Urology (99)