Abstract
Introduction Linkage disequilibrium (LD) score regression is widely used for estimating common variant heritability and genetic correlations from genome-wide association study (GWAS) summary statistics. We hypothesise that segmented regression (also known as piecewise regression) improves on previous LD score regression implementations, when estimating both genetic covariance and its standard error.
Methods We present novel extensions to LD score regression (LDSC++) improving I.) handling of varying numbers of shared genetic variants across trait pairs and reference panels, II.) estimation of genetic covariance and its variance, and III.) handling of imputation quality. We propose supporting statistical tests that use our novel extensions to improve sensitivity, and are further aimed at comparing parameter estimates that are highly correlated, such as those obtained from the same trait but from different methods. We validate LDSC++ first on real-world individual level data from the Genetic Links to Anxiety and Depression study and the United Kingdom National Institute of Health and Social Care Research BioResource (N: 14,190 - 20,144), second on simulated data with different degrees of shared QTL, and third on a battery of publicly available GWASs of ten diverse traits of varying statistical power and heritability.
Results Using variance-component method (GCTA-GREML) estimates for reference, LDSC++ extensions were found to yield heritability estimates with a bias of about -10% to -20% while standard LD score regression yielded a bias of -30%, and heritability variability estimates with a bias of -1% to -7% while standard LD score regression yielded a bias of 8%. For ten external trait GWASs, LDSC++ was shown to recover 5% to 8% larger heritabilities with 4% smaller variability on average compared to standard LD score regression. Weighting by imputation quality in the model, rather than excluding genetic variants of low imputation quality, contributed to retaining information. Our supporting statistical tests enabled us to detect statistically significant differences in genetic covariance and its standard error while considering the varying number of shared genetic variants across bivariate trait pairs.
Conclusion LDSC++ was confirmed to produce less biassed estimates of genetic covariance and its variability in our GLAD+ sample compared to standard LD score regression, using GCTA-REML as reference. This performance was supported by results from external trait GWASs of varying character, also implying an important performance of our extended weighting schemes. Our proposed extensions to LD score regression, among which genome-wide parameters are constructed as aggregates of heterogeneous local parameters, may prove important for large-scale multivariate studies such as genomic structural equation models or local genetic covariance analyses.
Competing Interest Statement
Prof Breen has received honoraria, research or conference grants and consulting fees from Illumina, Otsuka, and COMPASS Pathfinder Ltd.
Funding Statement
This study/research is funded by the National Institute for Health and Care Research (NIHR) Maudsley Biomedical Research Centre (BRC). The views expressed are those of the author(s) and not necessarily those of the NIHR or the Department of Health and Social Care. Christopher Hübel acknowledges funding by Lundbeckfonden (R276-2018-4581). Alexandra Gillett is supported by the Medical Research Council (MR/X009815/1).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The London-Fulham Research Ethics Committee approved the GLAD Study on 21st August 2018 (REC reference: 18/LO/1218). The London-Fulham Research Ethics Committee approved the EDGI UK Study on 29th July 2019 (REC reference: 19/LO/1254). The East of England-Cambridge Central Committee approved the NIHR BioResource as a Research Tissue Bank (REC reference: 17/EE/0025). The South West-Central Bristol Research Ethics Committee approved the COVID-19 Psychiatry and Neurological Genetics study on 27th April 2020 (REC reference: 20/SW/0078).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes





