Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Meta-analysis fine-mapping is often miscalibrated at single-variant resolution

View ORCID ProfileMasahiro Kanai, Roy Elzur, Wei Zhou, Global Biobank Meta-analysis Initiative, Mark J Daly, Hilary K Finucane
doi: https://doi.org/10.1101/2022.03.16.22272457
Masahiro Kanai
1Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
3Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
4Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
5Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Masahiro Kanai
  • For correspondence: mkanai@broadinstitute.org finucane@broadinstitute.org
Roy Elzur
1Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
3Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Wei Zhou
1Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
3Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Mark J Daly
1Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
3Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
6Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hilary K Finucane
1Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
2Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
3Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: mkanai@broadinstitute.org finucane@broadinstitute.org
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Meta-analysis is pervasively used to combine multiple genome-wide association studies (GWAS) into a more powerful whole. To resolve causal variants, meta-analysis studies typically apply summary statistics-based fine-mapping methods as they are applied to single-cohort studies. However, it is unclear whether heterogeneous characteristics of each cohort (e.g., ancestry, sample size, phenotyping, genotyping, or imputation) affect fine-mapping calibration and recall. Here, we first demonstrate that meta-analysis fine-mapping is substantially miscalibrated in simulations when different genotyping arrays or imputation panels are included. To mitigate these issues, we propose a summary statistics-based QC method, SLALOM, that identifies suspicious loci for meta-analysis fine-mapping by detecting outliers in association statistics based on ancestry-matched local LD structure. Having validated SLALOM performance in simulations and the GWAS Catalog, we applied it to 14 disease endpoints from the Global Biobank Meta-analysis Initiative and found that 68% of loci showed suspicious patterns that call into question fine-mapping accuracy. These predicted suspicious loci were significantly depleted for having likely causal variants, such as nonsynonymous variants, as a lead variant (2.8x; Fisher’s exact P = 6.2 × 10−4). Compared to fine-mapping results in individual biobanks, we found limited evidence of fine-mapping improvement in the GBMI meta-analyses. Although a full solution requires complete synchronization across cohorts, our approach identifies likely spurious results in meta-analysis fine-mapping. We urge extreme caution when interpreting fine-mapping results from meta-analysis.

Introduction

Meta-analysis is pervasively used to combine multiple genome-wide association studies (GWAS) from different cohorts1. Previous GWAS meta-analyses have identified thousands of loci associated with complex diseases and traits, such as type 2 diabetes2,3, schizophrenia4,5, rheumatoid arthritis6,7, body mass index8, and lipid levels9. These meta-analyses are typically conducted in large-scale consortia (e.g., the Psychiatric Genomics Consortium [PGC], the Global Lipids Genetics Consortium [GLGC], and the Genetic Investigation of Anthropometric Traits [GIANT] consortium) to increase sample size while harmonizing analysis plans across participating cohorts in every possible aspect (e.g., phenotype definition, quality-control [QC] criteria, statistical model, and analytical software) by sharing summary statistics as opposed to individual-level data, thereby avoiding data protection issues and variable legal frameworks governing individual genome and medical data around the world. The Global Biobank Meta-analysis Initiative (GBMI)10 is one such large-scale, international effort, which aims to establish a collaborative network spanning 19 biobanks from four continents (total n = 2.1 million) for coordinated GWAS meta-analyses, while addressing the many benefits and challenges in meta-analysis and subsequent downstream analyses.

One such challenging downstream analysis is statistical fine-mapping11–13. Despite the great success of past GWAS meta-analyses in locus discovery, individual causal variants in associated loci are largely unresolved. Identifying causal variants from GWAS associations (i.e., fine-mapping) is challenging due to extensive linkage disequilibrium (LD, the correlation among genetic variants), the presence of multiple causal variants, and limited sample sizes, but is rapidly becoming achievable with high confidence in individual cohorts14–17 owing to the recent development of large-scale biobanks18–20 and scalable fine-mapping methods21–23 that enable well-powered, accurate fine-mapping using in-sample LD from large-scale individual-level data.

After conducting GWAS meta-analysis, previous studies2,7,9,24–30 have applied existing summary statistics-based fine-mapping methods (e.g., approximate Bayes factor [ABF]31,32, CAVIAR33, PAINTOR34,35, FINEMAP21,22, and SuSiE23) just as they are applied to single-cohort studies, without considering or accounting for the unavoidable heterogeneity among cohorts (e.g. differences in sample size, phenotyping, genotyping, or imputation). Such heterogeneity could lead to false positives and miscalibration in meta-analysis fine-mapping (Fig. 1). For example, case-control studies enriched with more severe cases or ascertained with different phenotyping criteria may disproportionately contribute to genetic discovery, even when true causal effects for genetic liability are exactly the same between these studies and less severe or unascertained ones. Quantitative traits like biomarkers could have phenotypic heterogeneity arising from different measurement protocols and errors across studies. There might be genuine biological mechanisms too, such as gene–gene (GxG) and gene–environment (GxE) interactions and (population-specific) dominance variation (e.g., rs671 and alcohol dependence36), that introduce additional heterogeneity across studies37,38. In addition to phenotyping, differences in genotyping and imputation could dramatically undermine fine-mapping calibration and recall at single-variant resolution, because differential patterns of missingness and imputation quality across constituent cohorts of different sample sizes can disproportionately diminish association statistics of potentially causal variants. Finally, although more easily harmonized than phenotyping and genotyping data, subtle differences in QC criteria and analytical software may further exacerbate the effect of heterogeneity on fine-mapping.

Fig. 1.
  • Download figure
  • Open in new tab
Fig. 1. Schematic overview of meta-analysis fine-mapping.

An illustrative example of such issues can be observed in the TYK2 locus (19p13.2) in the recent meta-analysis from the COVID-19 Host Genetics Initiative (COVID-19 HGI; Supplementary Fig. 1)39. This locus is known for protective associations against autoimmune diseases6,24, while a complete TYK2 loss of function results in a primary immunodeficiency40. Despite strong LD (r2 = 0.82) with a lead variant in a locus (rs74956615; P = 9.7 × 10−12), a known functional missense variant rs34536443 (p.Pro1104Ala) that reduces TYK2 function41,42 did not achieve genome-wide significance (P = 7.5 × 10−7), primarily due to its missingness in two more cohorts than rs74956615. This serves as just one example of the major difficulties with meta-analysis fine-mapping at single-variant resolution. Indeed, the COVID-19 HGI cautiously avoided an in-silico fine-mapping in the flagship to prevent spurious results39.

Only a few studies have carefully addressed these concerns in their downstream analyses. The Schizophrenia Working Group of PGC, for example, recently updated their largest meta-analysis of schizophrenia5 (69,369 cases and 236,642 controls), followed by a downstream fine-mapping analysis using FINEMAP21. Unlike many other GWAS consortia, since PGC has access to individual-level genotypes for a majority of samples, they were able to apply standardized sample and variant QC criteria and impute variants using the same reference panel, all uniformly processed using the RICOPILI pipeline43. This harmonized procedure was crucial for properly controlling inter-cohort heterogeneity and thus allowing more robust meta-analysis fine-mapping at single-variant resolution. Furthermore, PGC’s direct access to individual-level data enabled them to compute in-sample LD matrices for multiple causal variant fine-mapping, which prevents the significant miscalibration that results from using an external LD reference14–16. A 2017 fine-mapping study of inflammatory bowel disease also benefited from access to individual-level genotypes and careful pre-and post-fine-mapping QC44. For a typical meta-analysis consortium, however, many of these steps are infeasible as full genotype data from all cohorts is not available. For such studies, a new approach to meta-analysis fine-mapping in the presence of the many types of heterogeneity is needed. Until such a method is developed, QC of meta-analysis fine-mapping results deserves increased attention.

While existing variant-level QC procedures are effective for limiting spurious associations in GWAS (Supplementary Box)45, they do not suffice for ensuring high-quality fine-mapping results. In some cases, they even hurt fine-mapping quality, because they can i) cause or exacerbate differential patterns of missing variants across cohorts, and ii) remove true causal variants as well as suspicious variants. Thus, additional QC procedures that retain consistent variants across cohorts for consideration but limit poor-quality fine-mapping results are needed. A recently proposed method called DENTIST46, for example, performs summary statistics QC to improve GWAS downstream analyses, such as conditional and joint analysis (GCTA-COJO47), by removing variants based on estimated heterogeneity between summary statistics and reference LD. Although DENTIST was also applied prior to fine-mapping (FINEMAP21), simulations only demonstrated that it could improve power for detecting the correct number of causal variants in a locus, not true causal variants. This motivated us to develop a new fine-mapping QC method for better calibration and recall at single-variant resolution and to demonstrate its performance in large-scale meta-analysis.

Here, we first demonstrate the effect of inter-cohort heterogeneity in meta-analysis fine-mapping via realistic simulations with multiple heterogeneous cohorts, each with different combinations of genotyping platforms, imputation reference panels, and genetic ancestries. We propose a summary statistics-based QC method, SLALOM (suspicious loci analysis of meta-analysis summary statistics), that identifies suspicious loci for meta-analysis fine-mapping by detecting association statistics outliers based on local LD structure, building on the DENTIST method. Applying SLALOM to 14 disease endpoints from the Global Biobank Meta-analysis Initiative10 as well as 467 meta-analysis summary statistics from the GWAS Catalog48, we demonstrate that suspicious loci for fine-mapping are widespread in meta-analysis and urge extreme caution when interpreting fine-mapping results from meta-analysis.

Results

Large-scale simulations demonstrate miscalibration in meta-analysis fine-mapping

Existing fine-mapping methods21,23,31 assume that all association statistics are derived from a single-cohort study, and thus do not model the per-variant heterogeneity in effect sizes and sample sizes that arise when meta-analyzing multiple cohorts (Figure 1). To evaluate how different characteristics of constituent cohorts in a meta-analysis affect fine-mapping calibration and recall, we conducted a series of large-scale GWAS meta-analysis and fine-mapping simulations (Supplementary Table 1–4; Methods). Briefly, we simulated multiple GWAS cohorts of different ancestries (10 European ancestry, one African ancestry and one East Asian ancestry cohorts; n = 10,000 each) that were genotyped and imputed using different genotyping arrays (Illumina Omni2.5, Multi-Ethnic Global Array [MEGA], and Global Screening Array [GSA]) and imputation reference panels (the 1000 Genomes Project Phase 3 [1000GP3]49, the Haplotype Reference Consortium [HRC]50, and the TOPMed51). For each combination of cohort, genotyping array, and imputation panel, we conducted 300 GWAS with randomly simulated causal variants that resemble the genetic architecture of a typical complex trait, including minor allele frequency (MAF) dependent causal effect sizes52, total SNP heritability53, functional consequences of causal variants17, and levels of genetic correlation across cohorts (i.e., true effect size heterogeneity; rg = 1, 0.9, and 0.5; see Methods). We then meta-analyzed the single-cohort GWAS results across 10 independent cohorts based on multiple configurations (different combinations of genotyping arrays and imputation panels for each cohort) to resemble realistic meta-analysis of multiple heterogeneous cohorts (Supplementary Table 4). We applied ABF fine-mapping to compute a posterior inclusion probability (PIP) for each variant and to derive 95% and 99% credible sets (CS) that contain the smallest set of variants covering 95% and 99% of probability of causality. We evaluated the false discovery rate (FDR, defined as the proportion of variants with PIP > 0.9 that are non-causal) and compared against the expected proportion of non-causal variants if the meta-analysis fine-mapping method were calibrated, based on PIP. More details of our simulation pipeline are described in Methods and visually summarized in Supplementary Fig. 2.

We found that FDR varied widely over the different configurations, reaching as high as 37% for the most heterogeneous configurations (Fig. 2). We characterized the contributing factors to the miscalibration. We first found that lower true effect size correlation rg (i.e., larger phenotypic heterogeneity) always caused higher miscalibration and lower recall. Second, when using the same imputation panel (1000GP3), use of less dense arrays (MEGA or GSA) led to moderately inflated FDR (up to FDR = 11% vs. expected 1%), while use of multiple genotyping array did not cause further FDR inflation (Fig. 2a). Third, when using the same genotyping array (Omni2.5), use of imputation panels (HRC or TOPMed) that does not match our simulation reference significantly affects miscalibration (up to FDR = 17% vs. expected 1%), and using multiple imputation panels further increased miscalibration (up to FDR = 35% vs. expected 2%, Fig. 2c); this setup is as bad as the most heterogeneous configuration using multiple genotyping arrays and imputation panels (FDR = 37%). When TOPMed-imputed variants were lifted over from GRCh38 to GRCh37, we observed FDR increases of up to 10%, likely due to genomic build conversion failures (Supplementary Note)54. Fourth, recall was not significantly affected by heterogeneous genotyping arrays or imputation panels (Fig. 2b,d). Fifth, including multiple genetic ancestries did not affect calibration when using the same genotyping array and imputation panel (Omni 2.5 and 1000GP3; Fig. 2e) but significantly improved recall if African ancestry was included (Fig. 2f). This is expected, given the shorter LD length in the African population compared to other populations, which improves fine-mapping resolution55. Finally, in the most heterogeneous configurations where multiple genotyping arrays and imputation panels existed, we observed a FDR of up to 37% and 28% for European and multi-ancestry meta-analyses, respectively (vs. expected 2% for both), demonstrating that inter-cohort heterogeneity can substantially undermine meta-analysis fine-mapping (Fig. 2g,h).

Fig. 2.
  • Download figure
  • Open in new tab
Fig. 2. Evaluation of false discovery rate (FDR) and recall in meta-analysis fine-mapping simulations.

We evaluated FDR and recall in meta-analysis fine-mapping using different genotyping arrays (a,b), imputation reference panels (c,d), genetic ancestries (e, f), and more heterogeneous settings by combining these (g, h). As shown in top-right gray labels, the EUR ancestry, the Omni2.5 genotyping array and/or the 1000GP3 reference panel were used unless otherwise stated. FDR is defined as the proportion of non-causal variants with PIP > 0.9. Horizontal gray lines represent 1 – mean PIP, i.e. expected FDR were the method calibrated. Recall is defined as the proportion of true causal variants in the top 1% PIP bin. Shapes correspond to the true effect size correlation rg across cohorts which represent a phenotypic heterogeneity parameter (the lower rg, the higher phenotypic heterogeneity).

To further characterize observed miscalibration in meta-analysis fine-mapping, we investigated the availability of GWAS variants in each combination of ancestry, genotyping array, and imputation panel. Out of 3,285,617 variants on chromosome 3 that passed variant QC in at least one combination (per-combination MAF > 0.001 and Rsq > 0.6; Methods), 574,261 variants (17%) showed population-level gnomAD MAF > 0.001 in every ancestry that we simulated (African, East Asian, and European). Because we used a variety of imputation panels, we retrieved population-level MAF from gnomAD. Of these 574,261 variants, 389,219 variants (68%) were available in every combination (Supplementary Fig. 3a). This fraction increased from 68% to 73%, 74%, and 76% as we increased gnomAD MAF thresholds to > 0.005, 0.01, and 0.05, respectively, but never reached 100% (Supplementary Fig. 4). Notably, we observed a substantial number of variants that are unique to a certain genotyping array and an imputation panel, even when we restricted to 344,497 common variants (gnomAD MAF > 0.05) in every ancestry (Supplementary Fig. 3b). For example, there are 34,317 variants (10%) that were imputed in the 1000GP3 and TOPMed reference but not in the HRC. Likewise, we observed 33,106 variants (10%) that were specific to the 1000GP3 reference and even 3,066 variants (1%) that were imputed in every combination except for East Asian ancestry with the GSA array and the TOPMed reference. When using different combinations of gnomAD MAF thresholds (> 0.001, 0.005, 0.01, or 0.05 in every ancestry) and Rsq thresholds (> 0.2, 0.4, 0.6, or 0.8), we observed the largest fraction of shared variants (78%) was achieved with gnomAD MAF > 0.01 and Rsq > 0.2 while the largest number of the shared variants (427,494 variants) was achieved with gnomAD MAF > 0.001 and Rsq > 0.2, leaving it unclear which thresholds would be preferable in the context of fine-mapping (Supplementary Fig. 4).

The remaining 2,711,356 QC-passing variants in our simulations (gnomAD MAF ≤ 0.001 in at least one ancestry) further exacerbate variable coverage of the available variants (Supplementary Fig. 3c). Of these, the largest proportion of variants (39%) were only available in African ancestry, followed by African and European (but not in East Asian) available variants (7%), European-specific variants (6%), and East Asian-specific variants (5%). Furthermore, similar to the aforementioned common variants, we found a substantial number of variants that are unique to a certain combination. Altogether, we observed that only 393,471 variants (12%) out of all the QC-passing 3,285,617 variants were available in every combination (Supplementary Fig. 3d). These observations recapitulate that different combinations of genetic ancestry, genotyping array, imputation panels, and QC thresholds substantially affect the availability of common, well-imputed variants for association testing53.

Thus, the different combinations of genotyping and imputation cause each cohort in a meta-analysis to have a different set of variants, and consequently variants can have very different overall sample sizes. In our simulations with the most heterogeneous configurations, we found that 66% of the false positive loci (where a non-causal [false positive] variant was assigned PIP 1.0.9) had different sample sizes for true causal and false positive variants (median maximum/minimum sample size ratio = 1.4; Supplementary Fig. 5). Analytically, we found that at common meta-analysis sample sizes and genome-wide significant effect size regimes, when two variants have similar marginal effects, the one with the larger sample size will usually achieve a higher ABF PIP (Supplementary Note). This elucidates the mechanism by which sample size imbalance can lead to miscalibration.

Overview of the SLALOM method

To address the challenges in meta-analysis fine-mapping discussed above, we developed SLALOM (suspicious loci analysis of meta-analysis summary statistics), a method that flags suspicious loci for meta-analysis fine-mapping by detecting outliers in association statistics based on deviations from expectation, estimated with local LD structure (Methods). SLALOM consists of three steps, 1) defining loci and lead variants based on a 1 Mb window, 2) detecting outlier variants in each locus using meta-analysis summary statistics and an external LD reference panel, and 3) identifying suspicious loci for meta-analysis fine-mapping (Fig. 3a,b).

Fig. 3.
  • Download figure
  • Open in new tab
Fig. 3. Overview of the SLALOM method.

a,b. An illustrative example of the SLALOM application. a. In an example locus, two independent association signals are depicted: i) the most significant signal that contains a lead variant (purple diamond) and five additional variants that are in strong LD (r2 > 0.9) with the lead variant, and ii) an additional independent signal (r2 < 0.05). There is one outlier variant (orange diamond) in the first signal that deviates from the expected association based on LD. b. Step-by-step procedure of the SLALOM method. For outlier variant detection in a locus, a diagnosis plot of r2 values to the lead variant vs. marginal χ2 is shown to aid interpretation. Background color represents a theoretical distribution of –log10 PDENTIST-S values when a lead variant has a marginal χ2 of 50, assuming no allele flipping. Points represent the variants depicted in the example locus (a), where the lead variant (purple diamond) and the outlier variant (white diamond) were highlighted. Diagonal line represents an expected marginal association. Horizontal dotted lines represent the genome-wide significance threshold (P < 5.0 × 10−8). c. The ROC curve of SLALOM prediction for identifying suspicious loci in the simulations. Positive conditions were defined as whether a true causal variant in a locus is 1) a lead PIP variant, 2) in 95% CS, and 3) in 99% CS. AUROC values were shown in the labels. Black points represent the performance of our adopted metric, i.e., whether a locus contains at least one outlier variant (PDENTIST-S < 1.0 × 10−4 and r2 > 0.6). d. Calibration plot in the simulations under different PIP thresholds. Calibration was measured as the mean PIP – fraction of true causal variants among variants above the threshold. Shadows around the lines represent 95% confidence intervals. e. The fraction of variants in predicted suspicious and non-suspicious loci under different PIP thresholds. Gray shadows in the panels d,e represent a PIP ≤ 0.1 region as we excluded loci with maximum PIP ≤ 0.1 in the actual SLALOM analysis based on these panels.

To detect outlier variants, we first assume a single causal variant per associated locus. Then the marginal z-score zi for a variant i should be approximately equal to ri,c · zc where zc is the z-score of the causal variant c, and ri,c is a correlation between variants i and c. For each variant in meta-analysis summary statistics, we first test this relationship using a simplified version of the DENTIST statistics46, DENTIST-S, based on the assumption of a single causal variant. The DENTIST-S statistics for a given variant i is written as Embedded Image which approximately follows a χ2 distribution with 1 degree of freedom46. Since the true causal variant and LD structure are unknown in real data, we approximate the causal variant as the lead PIP variant in the locus (the variant with the highest PIP) and use a large-scale external LD reference from gnomAD57, either an ancestry-matched LD for a single-ancestry meta-analysis or a sample-size-weighted LD by ancestries for a multi-ancestry meta-analysis (Methods).

SLALOM then evaluates whether each locus is “suspicious”—that is, has a pattern of meta-analysis statistics and LD that appear inconsistent and therefore call into question the fine-mapping accuracy. By training on loci with maximum PIP > 0.9 in the simulations, we determined that the best-performing criterion for classifying loci as true or false positives is whether a locus has a variant with r2 > 0.6 to the lead and DENTIST-S P-value < 1.0 × 10−4 (Methods). Using this criterion we achieved an area under the receiver operating characteristic curve (AUROC) of 0.74, 0.76, and 0.80 for identifying whether a true causal variant is a lead PIP variant, in 95% credible set (CS), and in 99% CS, respectively (Fig. 3c). We further validated the performance of SLALOM using all the loci in the simulations and observed significantly higher miscalibration in predicted suspicious loci than in non-suspicious loci (up to 16% difference in FDR at PIP > 0.9; Fig. 3d). Given the relatively lower miscalibration and specificity at low PIP thresholds (Fig. 3d,e), in subsequent real data analysis we restricted the application of SLALOM to loci with maximum PIP >0.1 (Methods).

Widespread suspicious loci for fine-mapping in existing meta-analysis summary statistics

Having assessed the performance of SLALOM in simulations, we applied SLALOM to 467 meta-analysis summary statistics in the GWAS Catalog48 that are publicly available with a sufficient discovery sample size (N > 10,000; Supplementary Table 5; Methods) to quantify the prevalence of suspicious loci in existing studies. These summary statistics were mostly European ancestry-only meta-analysis (63%), followed by multi-ancestry (31%), East Asian ancestry-only (3%), and African ancestry-only (2%) meta-analyses. Across 467 summary statistics from 96 publications, we identified 28,925 loci with maximum PIP > 0.1 (out of 35,864 genome-wide significant loci defined based on 1 Mb window around lead variants; Methods) for SLALOM analysis, of which 8,137 loci (28%) were predicted suspicious (Supplementary Table 6).

To validate SLALOM performance in real data, we restricted our analysis to 6,065 loci that have maximum PIP > 0.1 and that contain nonsynonymous coding variants (predicted loss-of-function [pLoF] and missense) in LD with the lead variant (r2 > 0.6). Given prior evidence16,17,44 that such nonsynonymous variants are highly enriched for being causal, we tested the validity of our method by whether they achieve the highest PIP in the locus (i.e., successful fine-mapping) in suspicious vs. non-suspicious loci (Methods). While 40% (1,557 / 3,860) of non-suspicious loci successfully fine-mapped nonsynonymous variants, only 17% (384 / 2,205) of suspicious loci did, demonstrating a significant depletion (2.3x) of successfully fine-mapped nonsynonymous variants in suspicious loci (Fisher’s exact P = 3.6 × 10−79; Fig. 4a). We also tested whether nonsynonymous variants belonged to 95% and 99% CS and again observed significant depletion (1.4x and 1.3x, respectively; Fisher’s exact P < 4.6 × 10−100). In addition, when we used a more stringent r2 threshold (> 0.8) for selecting loci that contain nonsynonymous variants, we also confirmed significant enrichment (Fisher’s exact P < 6.1 × 10−65; Supplementary Figure 6). To quantify potential fine-mapping miscalibration in the GWAS Catalog, we investigated the difference between mean PIP for lead variants and fraction of lead variants that are nonsynonymous; assuming that nonsynonymous variants in these loci are truly causal, this difference equals the difference between the true and reported fraction of lead PIP variants that are causal. We observed differences between 26–51% and 10–18% under different PIP thresholds in suspicious and non-suspicious loci, respectively (Fig. 4b), marking 45% and 15% for high-PIP (> 0.9) variants.

Fig. 4.
  • Download figure
  • Open in new tab
Fig. 4. Evaluation of SLALOM performance in the GWAS Catalog summary statistics.

a,c,d. Depletion of likely causal variants in predicted suspicious loci. We evaluated whether (a) nonsynonymous coding variants (pLoF and missense), (c) high-PIP (> 0.9) complex trait variants in biobank fine-mapping, and (d) high-PIP (> 0.9) cis-eQTL variants in GTEx v8 and eQTL Catalogue were lead PIP variants, in 95% CS, or in 99% CS in suspicious vs. non-suspicious loci. Depletion was calculated by relative risk (i.e. a ratio of proportions; Methods). Error bars correspond to 95% confidence intervals using bootstrapping. Significance represents a Fisher’s exact test P-value (*, P < 0.05; **, < 0.01; ***, < 0.001; ***, < 10−4). b. Plot of the estimated difference between true and reported proportion of causal variants in the loci tagging nonsynonymous variants (r2 > 0.6 with the lead variants) in the GWAS Catalog under different PIP thresholds. Analogous to Fig. 3b, assuming nonsynonymous variants in these loci are truly causal, the mean PIP for lead variants minus the fraction of lead variants that are nonsynonymous above the threshold is equal to the difference between true and reported proportion of causal variants..

We further assessed SLALOM performance in the GWAS Catalog meta-analyses by leveraging high-PIP (> 0.9) complex trait and cis-eQTL variants that were rigorously fine-mapped16,17 in large-scale biobanks (Biobank Japan [BBJ]58, FinnGen20, and UK Biobank [UKBB]19) and eQTL resources (GTEx59 v8 and eQTL Catalogue60). Among the 27,713 loci analyzed by SLALOM (maximum PIP > 0.1) that contain a lead variant that was included in biobank fine-mapping, 17% (3,266 / 19,692) of the non-suspicious loci successfully fine-mapped one of the high-PIP GWAS variants in biobank fine-mapping, whereas 7% (589 / 8,021) of suspicious loci did, showing a significant depletion (2.3x) of the high-PIP complex trait variants in suspicious loci (Fisher’s exact P = 4.6 × 10−100; Fig. 4c). Similarly, among 26,901 loci analyzed by SLALOM that contain a lead variant that was included in cis-eQTL fine-mapping, we found a significant depletion (1.9x) of the high-PIP cis-eQTL variants in suspicious loci, where 7% (1,247 / 18,976) of non-suspicious loci vs. 4% (281 / 7,925) of suspicious loci successfully fine-mapped one of the high-PIP cis-eQTL variants (Fisher’s exact P = 2.6 × 10−24; Fig. 4d). We observed the same significant depletions of the high-PIP complex trait and cis-eQTL variants in suspicious loci that belonged to 95% and 99% CS set (Fig. 4c,d).

Suspicious loci for fine-mapping in the GBMI summary statistics

Next, we applied SLALOM to meta-analysis summary statistics of 14 disease endpoints from the GBMI10. These summary statistics were generated from a meta-analysis of 2.1 million individuals in total across 19 biobanks, representing six different genetic ancestry groups of approximately 33,000 African, 18,000 Admixed American, 31,000 Central and South Asian, 341,000 East Asian, 1.8 million European, and 1,600 Middle Eastern individuals (Supplementary Table 7). Among 509 genome-wide significant loci across the 14 traits, we found that 87 loci (17%) showed maximum PIP < 0.1, thus not being further considered by SLALOM. Of the remaining 422 loci with maximum PIP > 0.1, SLALOM identified that 285 loci (68%) were suspicious loci for fine-mapping (Fig. 5a; Supplementary Table 8). The fraction of suspicious loci and their maximum PIP varied by trait, reflecting different levels of statistical power (e.g., sample sizes, heritability, and local LD structure) as well as inter-cohort heterogeneity (Fig. 5b–o).

Fig. 5.
  • Download figure
  • Open in new tab
Fig. 5. SLALOM prediction results in the GBMI summary statistics.

For (a) all 14 traits and (b–o) individual traits, a number of predicted suspicious (SL), non-suspicious (NSL), and non-applicable (NA; maximum PIP < 0.1) loci were summarized. Individual traits are ordered by the total number of loci. Color represents the maximum PIP in a locus. Label represents the fraction of loci in each prediction category. AAA, abdominal aortic aneurysm. AcApp, acute appendicitis. COPD, chronic obstructive pulmonary disease. HCM, hypertrophic cardiomyopathy. HF, heart failure. IPF, idiopathic pulmonary fibrosis. POAG, primary open angle glaucoma. ThC, thyroid cancer. UtC, uterine cancer. VTE, venous thromboembolism.

While the fraction of suspicious loci (68%) in the GBMI meta-analyses is higher than in the GWAS Catalog (28%), there might be multiple reasons for this discrepancy, including association significance, sample size, ancestral diversity, and study-specific QC criteria. For example, the GBMI summary statistics were generated from multi-ancestry, large-scale meta-analyses of median sample size of 1.4 million individuals across six ancestries, while 63% of the 467 summary statistics from the GWAS Catalog were only in European-ancestry studies and 83% had less than 0.5 million discovery samples. Nonetheless, predicted suspicious loci for fine-mapping were prevalent in both the GWAS Catalog and the GBMI.

Using nonsynonymous (pLoF and missense) and high-PIP (> 0.9) complex trait and cis-eQTL variants, we recapitulated a significant depletion of these likely causal variants in predicted suspicious loci (2.8x, 5.4x, and 5.2x for nonsynonymous, high-PIP complex trait, and high-PIP cis-eQTL variants being a lead PIP variant, respectively; Fisher’s exact P < 6.2 × 10−4), confirming our observation in the GWAS Catalog analysis (Fig. 6a–c).

Fig. 6.
  • Download figure
  • Open in new tab
Fig. 6. Evaluation of SLALOM performance in the GBMI summary statistics.

a–c. Similar to Fig. 4, we evaluated whether (a) nonsynonymous coding variants (pLoF and missense), (b) high-PIP (> 0.9) complex trait variants in biobank fine-mapping, and (c) high-PIP (> 0.9) cis-eQTL variants in GTEx v8 and eQTL Catalogue were lead PIP variants, in 95% CS, or in 99% CS in suspicious vs. non-suspicious loci. Depletion was calculated by relative risk (i.e. a ratio of proportions; Methods). Error bars correspond to 95% confidence intervals using bootstrapping. Significance represents a Fisher’s exact test P-value (*, P < 0.05; **, < 0.01; ***, < 0.001; ***, < 10−4). d. Locuszoom plot of the 1q23.3 locus for COPD. The top panel shows a Manhattan plot, where the lead variant rs2099684 (purple diamond) and a missense variant rs396991 (orange diamond) are highlighted. Color represents r2 values to the lead variant. Horizontal line represents a genome-wide significance threshold (P = 5.0 × 10−8). The middle panel shows PIP from ABF fine-mapping. Color represents whether variants belong to a 95% CS. The bottom panel shows r2 values with the lead variant in gnomAD populations. e. A diagnosis plot showing r2 values to the lead variant vs. marginal χ2. Color represents –log10 PDENTIST-S values. Outlier variants with PDENTIST-S < 10−4 are depicted in red with a diamond shape. Diagonal line represents an expected marginal association. Horizontal line represents a genome-wide significance threshold. f. Z-scores of the lead variant (rs2099684) vs. the missense variant (rs396991) in the constituent cohorts of the meta-analysis. Open and closed circles represent whether both variants exist in a cohort or rs396991 is missing. Circle size corresponds to an effective sample size. Color represents genetic ancestry.

In 15/23 non-suspicious loci harboring a nonsynonymous variant, the nonsynonymous variant had the highest PIP. These included known missense variants such as rs116483731 (p.Arg20Gln) in SPDL1 for idiopathic pulmonary fibrosis (IPF)61,62 and rs28929474 (p.Glu366Lys) in SERPINA1 for chronic obstructive pulmonary disease (COPD)63,64. In addition, we observed successful fine-mapping in 2 novel loci for asthma, i) rs41286560 (p.Pro558Thr) in RTL1, a missense variant known for decreasing height65,66 and ii) rs34187696 (p.Gly337Val) in ZSCAN5A, a known missense variant for increasing monocyte count30.

To characterize fine-mapping failures in suspicious loci, we examined suspicious loci in which a nonsynonymous variant did not achieve the highest PIP. For example, the FCGR2A/FCGR3A (1q23.3) locus for COPD contained a genome-wide significant lead intergenic variant rs2099684 (P = 1.7 × 10−11) which is in LD (r2 = 0.92) with a missense variant rs396991 (p.Phe176Val) of FCGR3A (Fig. 6d). This locus was not previously reported for COPD, but is known for associations with autoimmune diseases (e.g., inflammatory bowel disease44, rheumatoid arthritis7, and systemic lupus erythematosus67) and encodes the low-affinity human FC-gamma receptors that bind to the Fc region of IgG and activate immune responses68. Notably, this locus contains copy number variations that contribute to the disease associations in addition to single-nucleotide variants, which makes genotyping challenging68,69. Despite strong LD with the lead variant, rs396991 did not achieve genome-wide significance (P = 9.1 × 10−3), showing a significant deviation from the expected association (PDENTIST-S = 5.3 × 10−41; Fig. 6e). This is primarily due to missingness of rs396991 in 8 biobanks out of 17 (Neff = 76,790 and 36,781 for rs2099684 and rs396991, respectively; Fig. 6f), which is caused by its absence from major imputation reference panels (e.g., 1000GP49, HRC50, and UK10K70) despite having a high MAF in every population (MAF = 0.24–0.34 in African, admixed American, East Asian, European, and South Asian populations of gnomAD57).

Sample size imbalance across variants was pervasive in the GBMI meta-analyses71, and was especially enriched in predicted suspicious loci—84% of suspicious loci vs. 24% of non-suspicious loci showed a maximum/minimum effective sample size ratio > 2 among variants in LD (r2 > 0.6) with lead variants (a median ratio = 4.2 and 1.2 in suspicious and non-suspicious loci, respectively; Supplementary Fig. 7). These observations are consistent with our simulations, recapitulating that sample size imbalance results in miscalibration for meta-analysis fine-mapping. Notably, we observed a similar issue in other GBMI downstream analyses (e.g., polygenic risk score [PRS]71 and drug discovery72), where predictive performance improved significantly after filtering out variants with maximum Neff < 50%. Although fine-mapping methods cannot simply take this approach because it inevitably reduces calibration and recall by removing true causal variants, other meta-analysis downstream analyses that primarily rely on polygenic signals rather than individual variants should consider this filtering as an extra QC step.

Fig. 7.
  • Download figure
  • Open in new tab
Fig. 7. Fine-mapping improvement and retrogression in the GBMI meta-analyses over individual biobanks.

a– c. Functional enrichment of variants in each functional category based on top PIP rankings in the GBMI and individual biobanks (maximum PIP of BBJ, FinnGen, and UKBB). Shape corresponds to top PIP ranking (top 0.5%, 0.1%, and 0.05%). Enrichment was calculated by a relative risk (i.e. a ratio of proportions; Methods). Error bars correspond to 95% confidence intervals using bootstrapping. d–g. Locuszoom plots for the same non-suspicious locus of asthma in the GBMI meta-analysis and an individual biobank (BBJ, FinnGen, or UKBB Europeans) that showed the highest PIP in our biobank fine-mapping. Colors in the Manhattan panels represent r2 values to the lead variant. In the PIP panels, only fine-mapped variants in the 95% CS are colored, where the same colors are applied between the GBMI meta-analysis and an individual biobank based on merged CS as previously described. Horizontal line represents a genome-wide significance threshold (P = 5.0 × 10−8). d. rs1888909 for asthma in the GBMI and FinnGen. e. rs16903574 for asthma in the GBMI and UKBB Europeans. Nearby rs528167451 was also highlighted, which was in strong LD (r2 = 0.86) and in the same 95% CS in UKBB Europeans, but not in the GBMI (r2 = 0.67). f. rs1295686 for asthma in the GBMI and UKBB Europeans. A nearby missense, rs20541, showed lower PIP than rs1295686 despite having strong LD (r2 = 0.96). g. rs12123821 for asthma in the GBMI and UKBB Europeans. Nearby stop-gained rs61816761 was independent of rs12123821 (r2 = 0.0) and not fine-mapped in the GBMI due to a single causal variant assumption in the ABF fine-mapping.

Comparison of fine-mapping results between the GBMI meta-analyses and individual biobanks

Motivated by successful validation of SLALOM performance, we investigated whether fine-mapping confidence and resolution were improved in the GBMI meta-analyses over individual biobanks. To this end, we used our fine-mapping results16,17 of nine disease endpoints (asthma64, COPD64, gout, heart failure73, IPF62, primary open angle glaucoma74, thyroid cancer, stroke75, and venous thromboembolism76) in BBJ58, FinnGen20, and UKBB19 Europeans that also contributed to the GBMI meta-analyses for the same traits.

To perform an unbiased comparison of PIP between the GBMI meta-analysis and individual biobanks, we investigated functional enrichment of fine-mapped variants based on top PIP rankings in the GBMI and individual biobanks (top 0.5%, 0.1%, and 0.05% PIP variants in the GBMI vs. maximum PIP across BBJ, FinnGen, and UKBB; Methods). Previous studies have shown that high-PIP (> 0.9) complex trait variants are significantly enriched for well-known functional categories, such as coding (pLoF, missense, and synonymou), 5’/3’ UTR, promoter, and cis-regulatory element (CRE) regions (DNase I hypersensitive sites [DHS] and H3K27ac)16,17. Using these functional categories, we found no significant enrichment of variants in the top PIP rankings in the GBMI over individual biobanks (Fisher’s exact P > 0.05; Fig. 7a) except for variants in the promoter region (1.8x; Fisher’s exact P = 3.1 × 10−4 for the top 0.1% PIP variants). We observed similar trends regardless of whether variants were in suspicious or non-suspicious loci (Fig. 7b,c). To examine patterns of increased and decreased PIP for individual variants, we also calculated PIP difference between the GBMI and individual biobanks, defined as ΔPIP = PIP (GBMI) – maximum PIP across BBJ, FinnGen, and UKBB (Supplementary Fig. 8,9). We investigated functional enrichment based on ΔPIP bins and observed inconsistent enrichment results using different ΔPIP thresholds (Supplementary Fig. 10). Finally, to test whether fine-mapping resolution was improved in the GBMI over individual biobanks, we compared the size of 95% CS after restricting them to cases where a GBMI CS overlapped with an individual biobank CS from BBJ, FinnGen, or UKBB (Methods). We observed the median 95% CS size of 2.5 and 2.5 in non-suspicious loci for the GBMI and individual biobanks, respectively, and 5 and 15 in suspicious loci, respectively (Supplementary Fig. 11). The smaller credible set size in suspicious loci in GBMI could be due to improved resolution or to increased miscalibration. These results provide limited evidence of overall fine-mapping improvement in the GBMI meta-analyses over what is achievable by taking the best result from individual biobanks.

Individual examples, however, provide insights into the types of fine-mapping differences that can occur. To characterize the observed differences in fine-mapping confidence and resolution, we further examined non-suspicious loci with ΔPIP > 0.5 in asthma. In some cases, the increased power and/or ancestral diversity of GBMI led to improved fine-mapping: for example, an intergenic variant rs1888909 (∼18 kb upstream of IL33) showed ΔPIP = 0.99 (PIP = 1.0 and 0.008 in GBMI and FinnGen, respectively; Fig. 7d), which was primarily owing to increased association significance in a meta-analysis (P = 3.0 × 10−86, 7.4 × 10−2, 3.6 × 10−16, and 1.9 × 10−53 in GBMI, BBJ, FinnGen, and UKBB Europeans, respectively) as well as a shorter LD length in the African population than in the European population (LD length = 4 kb vs. 41 kb for variants with r2 > 0.6 with rs1888909 in the African and European populations, respectively; Neff = 4,270 for Africans in the GBMI asthma meta-analysis; Supplementary Fig. 12). This variant was also fine-mapped for eosinophil count in UKBB Europeans (PIP = 1.0; P = 1.3 × 10−314)16 and was previously reported to regulate IL33 gene expression in human airway epithelial cells via allele-specific transcription factor binding of OCT-1 (POU2F1)77. Likewise, we observed a missense variant rs16903574 (p.Phe319Leu) in OTULINL showed ΔPIP = 0.79 (PIP = 1.0 and 0.21 in GBMI and UKBB Europeans, respectively; Fig. 7e) owing to improved association significance (P = 7.7 × 10−15 and 4.7 × 10−12 in GBMI and UKBB Europeans, respectively).

However, we also observed very high ΔPIP for variants that are not likely causal. For example, we observed that an intronic variant rs1295686 in IL13 showed ΔPIP = 0.56 (PIP = 0.56 and 0.0002 in GBMI and UKBB Europeans, respectively; Fig. 7f), despite having strong LD with a nearby missense variant rs20541 (p.Gln144Arg; r2 = 0.96 with rs1295686) which only showed ΔPIP = 0.13 (PIP = 0.13 and 0.0001 in GBMI and UKBB Europeans, respectively). The missense variant rs20541 showed PIP = 0.23 and 0.15 for a related allergic disease, atopic dermatitis, in BBJ and FinnGen, respectively17, and was previously shown to induce STAT6 phosphorylation and up-regulate CD23 expression in monocytes, promoting IgE synthesis78. Although the GBMI meta-analysis contributed to prioritizing these two variants (sum of PIP = 0.69 vs. 0.0003 in GBMI and UKBB Europeans, respectively), the observed ΔPIP was higher for rs1295686 than for rs20541.

While increasing sample size in meta-analysis improves association significance, we also found negative ΔPIP due to losing the ability to model multiple causal variants. A stop-gained variant rs61816761 (p.Arg501Ter) in FLG showed ΔPIP = –1.0 (PIP = 6.4 × 10−5 and 1.0 in GBMI and UKBB Europeans, respectively; Fig. 7g), which was primarily owing to a nearby lead variant rs12123821 (∼17 kb downstream of HRNR; r2 = 0.0 with rs61816761). This lead variant rs12123821 showed greater significance than rs61816761 in GBMI (P = 9.3 × 10−16 and 2.0 × 10− 11 for rs12123821 and rs61816761, respectively) as well as in UKBB Europeans (P = 7.1 × 10−26 and 1.5 × 10−18). While our biobank fine-mapping16,17 assigned PIP = 1.0 for both variants based on multiple causal variant fine-mapping (i.e., FINEMAP21 and SuSiE23), our ABF fine-mapping in the GBMI meta-analysis was only able to assign PIP = 0.74 for the lead variant rs12123821 due to a single causal variant assumption. This recapitulates the importance of multiple causal variant fine-mapping in complex trait fine-mapping16,17—however, we note that multiple causal variant fine-mapping with an external LD reference is extremely error-prone as previously reported14–16.

Discussion

In this study, we first demonstrated in simulations that meta-analysis fine-mapping is substantially miscalibrated when constituent cohorts are heterogeneous in phenotyping and imputation. To mitigate this issue, we developed SLALOM, a summary statistics-based QC method for identifying suspicious loci in meta-analysis fine-mapping. Applying SLALOM to 14 disease endpoints from the GBMI meta-analyses10 as well as 467 summary statistics from the GWAS Catalog48, we observed widespread suspicious loci in meta-analysis summary statistics, suggesting that meta-analysis fine-mapping is often miscalibrated in real data too. Indeed, we demonstrated that the predicted suspicious loci were significantly depleted for having likely causal variants as a lead PIP variant, such as nonsynonymous variants, high-PIP (> 0.9) GWAS and cis-eQTL fine-mapped variants from our previous fine-mapping studies16,17. Our method provides better calibration in non-suspicious loci for meta-analysis fine-mapping, generating a more reliable set of variants for further functional characterization.

We have found limited evidence of improved fine-mapping in the GBMI meta-analyses over individual biobanks. A few empirical examples in this study as well as other previous studies7,9,26,27,30 suggested that multi-ancestry, large-scale meta-analysis could have potential to improve fine-mapping confidence and resolution owing to increased statistical power in associations and differential LD pattern across ancestries. However, we have highlighted that the observed improvement in PIP could be due to sample size imbalance in a locus, miscalibration, and technical confoundings too, which further emphasizes the importance of careful investigation of fine-mapped variants identified through meta-analysis fine-mapping.

As high-confidence fine-mapping results in large-scale biobanks and molecular QTLs continue to become available16,17,60, we propose alternative approaches for prioritizing candidate causal variants in a meta-analysis. First, these high-confidence fine-mapped variants have been a valuable resource to conduct a “PheWAS”16 to match with associated variants in a meta-analysis, which provides a narrower list of candidate variants assuming they would equally be functional and causal in related complex traits or tissues/cell-types. Second, a traditional approach based on tagging variants (e.g., r2 > 0.6 with lead variants, or PICS79 fine-mapping approach that only relies on a lead variant and LD) can be still highly effective, especially for known functional variants such as nonsynonymous coding variants. As we highlighted in this and previous39 studies, potentially causal variants in strong LD with lead variants might not achieve genome-wide significance because of missingness and heterogeneity.

While using an external LD reference for fine-mapping has been shown to be extremely error-prone14–16, we find here that it can be useful for flagging suspicious loci, even when it does not perfectly represent the in-sample LD structure of the meta-analyzed individuals. However, our use of external LD reference comes with several limitations. For example, due to the finite sample size of external LD reference, rare or low-frequency variants have larger uncertainties around r2 than common variants. Moreover, our r2 values in a multi-ancestry meta-analysis are currently approximated based on a sample-size-weighted average of r2 across ancestries as previously suggested80, but this can be different from actual r2. These uncertainties around r2 affect SLALOM prediction performance and should be modeled appropriately for further method development. On the other hand, we find it challenging to use a LD reference when true causal variants are located within a complex region (e.g., major histocompatibility complex [MHC]), or are entirely missing from standard LD or imputation reference panels, especially for structural variants. These limitations are not specific to meta-analysis fine-mapping, and separate fine-mapping methods based on bespoke imputation references have been developed (e.g., HLA81, KIR82, and variable numbers of tandem repeats [VNTR]83).

In addition, there are several methodological limitations of SLALOM. First, our simulations only include one causal variant per locus. Although additional independent causal variants would not affect SLALOM precision (but decrease recall), multiple correlated causal variants in a locus would violate SLALOM assumptions and could lead to some DENTIST-S outliers that are not due to heterogeneity or missingness but rather simply a product of tagging multiple causal variants in LD. In fact, our previous studies have illustrated infrequent but non-zero presence of such correlated causal variants in complex traits16,17. Second, SLALOM prediction is not perfect. Although fine-mapping calibration is certainly better in non-suspicious vs. suspicious loci, SLALOM has low precision, and we still observe some miscalibration in non-suspicious loci. Finally, SLALOM is a per-locus QC method and does not calibrate per-variant PIPs. Further methodological development that properly models heterogeneity, missingness, multiple causal variants, and LD uncertainty across multiple cohorts and ancestries is needed to refine per-variant calibration and recall in meta-analysis fine-mapping.

We have found evidence in our simulations and real data of severe miscalibration of fine-mapping results from GWAS meta-analysis; for example, we estimate that the difference between true and reported proportion of causal variants is 20% and 45% for high-PIP (> 0.9) variants in suspicious loci from the simulations and the GWAS Catalog, respectively. Our SLALOM method helps to exclude spurious results from meta-analysis fine-mapping; however, even fine-mapping results in SLALOM-predicted “non-suspicious” loci remain somewhat miscalibrated, showing estimated differences between true and reported proportion of causal variants of 4% and 15% for high-PIP variants in the simulations and the GWAS Catalog, respectively. We thus urge extreme caution when interpreting PIPs computed from meta-analyses until improved methods are available. We recommend that researchers looking to identify likely causal variants employ complete synchronization of study design, case/control ascertainment, genomic profiling, and analytical pipeline, or rely more heavily on functional annotations, biobank fine-mapping, or molecular QTLs.

Data Availability

The GBMI summary statistics for the 14 endpoints are available at https://www.globalbiobankmeta.org/resources and are browserble at the GBMI PheWeb website (http://results.globalbiobankmeta.org/).

https://www.globalbiobankmeta.org/resources

http://results.globalbiobankmeta.org/

Data availability

The GBMI summary statistics for the 14 endpoints are available at https://www.globalbiobankmeta.org/resources and are browserble at the GBMI PheWeb84 website (http://results.globalbiobankmeta.org/).

Code availability

The SLALOM software is available at https://github.com/mkanai/slalom. Custom scripts to perform all the analyses and generate all the figures are available at https://github.com/mkanai/slalom-paper.

Competing interests

M.J.D. is a founder of Maze Therapeutics. All other authors declare no competing interests.

Methods

Meta-analysis fine-mapping simulation

To benchmark fine-mapping performance in meta-analysis, we simulated a large-scale, realistic GWAS meta-analysis and performed fine-mapping under different scenarios. An overview of our simulation pipeline is summarized in Supplementary Fig. 2.

Simulated true genotype

Using HAPGEN285 with the 1000 Genomes Project Phase 3 reference49, we simulated “true” genotypes of chromosome 3 for multiple independent cohorts from African, East Asian, and European ancestries. For each independent cohort from a given ancestry, we simulated 10,000 individuals each using the default parameters, with an ancestry-specific effective population size set to 17,469, 14,269, and 11,418 for Africans, East Asians, and Europeans, respectively, as recommended85. To mimic sample size imbalance of different ancestries in the current meta-analyses, we simulated 10 independent European cohorts, 1 African cohort, and 1 East Asian cohort.

To restrict our analysis to unrelated samples, we computed sample relatedness based on KING kinship coefficients86 using PLINK 2.0 (ref. 87) and removed monozygotic twins, duplicated individuals, or first-degree relatives with the coefficient threshold of 0.177. The detailed sample sizes of unrelated individuals for each cohort is summarized in Supplementary Table 1.

Genotyping and imputation

To simulate realistic genotyping and imputation procedures, we first virtually genotyped each cohort by restricting variants to those that are available on different genotyping arrays. We selected three major genotyping arrays from Illumina, Inc. (Omni2.5, Multi-Ethnic Global Array [MEGA], and Global Screening Array [GSA]) that have different densities of genotyping probes (Supplementary Table 2). For each cohort, we created three virtually genotyped datasets by retaining variants that are genotyped on each array. For the sake of simplicity, we assumed no genotyping errors occurred between true genotypes and virtually genotyped data—however, in practice, genotyping error is one of the major sources of unexpected confounding (e.g., see recent discussions here88,89) and should be treated carefully.

For each pair of cohort and genotyping array, we then imputed missing variants using different imputation reference panels. We used the Michigan Imputation Server (https://imputationserver.sph.umich.edu/)90 and the TOPMed Imputation Server (https://imputation.biodatacatalyst.nhlbi.nih.gov/)51 with the default parameters, using three publicly available reference panels: the 1000 Genomes Project Phase 3 (version 5; n = 2,504; 1000GP3)49, the Haplotype Reference Consortium (version r1.1; n = 32,470; HRC)50, and the TOPMed (version R2; n = 97,256)51. Briefly, for each input, the imputation server created chunks of 20 Mb, applied the standard QC, pre-phased each chunk with Eagle2 (ref. 91), and imputed non-genotyped variants using a specified reference panel with Minimac4 (https://genome.sph.umich.edu/wiki/Minimac4). The detailed documentation of the imputation pipeline is available on the Michigan and TOPMed websites and has been described elsewhere90.

We applied post-imputation QC by only keeping variants with MAF > 0.001 and imputation Rsq > 0.6. Because the TOPMed panel is based on GRCh38 while the 1000GP3 and the HRC panels are on GRCh37, we lifted over TOPMed variants from GRCh38 to GRCh37 to meta-analyze with other cohorts. We excluded any variants which were lifted over to different chromosomes or for which the conversion failed. The number of virtually genotyped and imputed variants for each combination of cohort, genotyping array, and imputation panel is summarized in Supplementary Table 3.

True phenotype

We simulated 300 true phenotypes that resemble observed complex trait genetic architecture and phenotypic heterogeneity across cohorts. Based on previous literature, we set parameters as follows: 1) 50% of 1 Mb loci contain a true causal variant92; 2) probability of being causal is proportional to functional enrichments of variant consequences (pLoF, missense, synonymous, UTR5/3, promoter, cis-regulatory region, and non-genic) for fine-mapped variants as estimated in a previous complex trait fine-mapping study17; 3) per-allele causal effect sizes have a variance proportional to [2p(1 − p)]α where p represents a maximum MAF across the three ancestries (AFR, EAS, and EUR) and α is set to be –0.38 (ref. 52); and 4) total SNP-heritability Embedded Image for chromosome 3 equals 0.03 (ref. 53). For the sake of simplicity, we randomly draw a single true causal variant per locus because ABF assumes a single causal variant31,32. We draw true causal variants from 1,150,893 non-ambiguous single-nucleotide variants in 1000GP3 that showed MAF> 0.01 in at least one of the three ancestries (AFR, EAS, or EUR) and were not located within conversion-unstable positions (CUP)54 between the human genome builds GRCh37 and GRCh38. To mimic phenotypic heterogeneity across cohorts in real-world meta-analysis (due to e.g., different ascertainment, measurement error, or true effect size heterogeneity), we introduced cross-cohort genetic correlation of true effect sizes rg which is set to be one of 1, 0.9, or 0.5. For a true causal variant j, true causal effect sizes βj across cohorts were randomly drawn from βj ∼MVM(0, Σ) where diagonal elements of Σ were set to be Embedded Image and off-diagonal elements of Σ were set to be Embedded Image was determined by Embedded Image. For each cohort, true phenotype y was computed via y = Xβ + ε where X is the above true genotype matrix from HAPGEN2 and Embedded Image i.i.d. We simulated 100 true phenotypes for each of rg= 1, 0.9, and 0.5, respectively.

GWAS

For each combination of phenotype, cohort, genotyping chip, and imputation panel, we conducted GWAS via a standard linear regression as implemented in PLINK 2.0 using imputed dosages. For covariates, we included top 10 principal components that were calculated based on true genotypes after restricting to unrelated samples. We only used LD-pruned variants with MAF > 0.01 for PCA.

Meta-analysis

To simulate meta-analyses that resemble real-world settings, we generated multiple configurations of the above GWAS results to meta-analyze across 10 independent cohorts. Briefly, we chose configurations based on the following settings: 1) 10 EUR cohorts are genotyped and imputed using the same genotyping array (one of GSA, MEGA, or Omni2.5) and the same imputation panel (one of 1000GP3, HRC, TOPMed, or TOPMed-liftover); 2) 10 cohorts consisting of multiple ancestries (9 EUR + 1 AFR/EAS cohorts or 8 EUR + 1 AFR + 1 EAS cohorts), with all cohorts genotyped and imputed using the same array (Omni2.5) and the same panel (1000GP3); 3) 10 EUR or multi-ancestry cohorts are genotyped using the same array (Omni2.5) but imputed using different panels across cohorts; 4) 10 EUR or multi-ancestry cohorts are imputed using the same panel (1000GP3) but genotyped using different arrays across cohorts; 5) 10 EUR or multi-ancestry cohorts are genotyped and imputed using different arrays and panels across cohorts. For settings 3–5, we randomly draw a combination of a genotyping array and an imputation panel for each cohort five times each for 10 EUR and multi-ancestry cohorts. In total, we generated 45 configurations as summarized in Supplementary Table 4.

For each configuration, we conducted a fixed-effect meta-analysis based on inverse-variance weighted betas and standard errors using a modified version of PLINK 1.9 (https://github.com/mkanai/plink-ng/tree/add_se_meta).

Fine-mapping

For each meta-analysis, we defined fine-mapping regions based on a 1 Mb window around each genome-wide significant lead variant and applied ABF31,32 using prior effect size variance of Embedded Image. We set a prior variance of effect size to be 0.04 which was taken from Wakefield et al31 and is commonly used in meta-analysis fine-mapping studies2,7. We computed posterior inclusion probability (PIP) and 95% credible set (CS) for each locus and evaluated whether true causal variants were correctly fine-mapped.

The SLALOM method

SLALOM takes GWAS summary statistics and external LD reference as input and predicts whether a locus is suspicious for fine-mapping. SLALOM consists of the following three steps:

Locus definition

Consistent with common fine-mapping region definition, we defined loci based on a 1 Mb window around each genome-wide significant lead variant and merged them if they overlapped. We excluded the major histocompatibility complex (MHC) region (chr 6: 25-36 Mb) from analysis due to extensive LD structure in the region.

DENTIST-S outlier detection

For each variant in a locus, we computed DENTIST-S statistics using equation (1) based on the assumption of a single causal variant. DENTIST-S P-values (PDENTIST-S) were computed using the χ2 distribution with 1 degree of freedom. We applied ABF31,32 using prior effect size variance of Embedded Image and used the lead PIP variant (the variant with the highest PIP) as an approximation of the causal variant in the locus. To retrieve correlation r among the variants, we used publicly available LD matrices from gnomAD57 v2 as external LD reference for African, Admixed American, East Asian, Finnish, and non-Finnish European populations. When multiple populations exist, we computed a sample-size-weighted average of r2 using per-variant sample sizes for each population as previously suggested80. We excluded variants without r2 available in gnomAD from the analysis. Since gnomAD v2 LD matrices are based on the human genome assembly GRCh37, variants were lifted over to GRCh38 if the input summary statistics were based on GRCh38.

We determined DENTIST-S outlier variants using two thresholds: 1) r2 > ρ to the lead and 2) PDENTIST-S < τ. The thresholds ρ and τ were set to ρ = 0.6 and τ = 1.0 × 10−4 based on the training in simulations as described below.

Suspicious loci prediction

We predicted whether a locus is suspicious or non-suspicious for fine-mapping based on the number of DENTIST-S outlier variants in the locus > κ. To determine the best-performing thresholds (ρ, τ, and κ), we used loci with maximum PIP > 0.9 in the simulations for training. Positive conditions were defined as whether a true causal variant in a locus is 1) a lead PIP variant, 2) in 95% CS, and 3) in 99% CS. We computed AUROC across different thresholds (ρ = 0, 0.1, 0.2, …, 0.9; –log10 τ = 0, 0.5, 1, …, 10; and κ = 0, 1, 2, …) and chose ρ = 0.6, τ = 1.0 × 10−4, and κ = 0 that showed the highest AUROC for all the aforementioned positive conditions. Using all the loci in the simulations, we then evaluated fine-mapping miscalibration (defined as mean PIP – fraction of true causal variants) at different PIP thresholds in suspicious and non-suspicious loci and decided to only apply SLALOM to loci with maximum PIP > 0.1 owing to relatively lower miscalibration and specificity of SLALOM at lower PIP thresholds.

GWAS Catalog analysis

We retrieved full GWAS summary statistics publicly available on the GWAS Catalog48. Out of 33,052 studies from 5,553 publications registered at the GWAS Catalog (as of January 12, 2022), we selected 467 studies from 96 publications that have 1) full harmonized summary statistics preprocessed by the GWAS Catalog with non-missing variant ID, marginal beta, and standard error columns, 2) a discovery sample size of more than 10,000 individuals, 3) African (including African American, Afro-Caribbean, and Sub-Saharan African), admixed American (Hispanic and Latin American), East Asian, or European samples based on their broad ancestral category metadata, 4) at least one genome-wide significant association (P < 5.0 × 10−8), and 5) our manual annotation as a meta-analysis rather than a single-cohort study (Supplementary Table 5). We applied SLALOM to the 467 summary statistics and identified 35,864 genome-wide significant loci (based on 1 Mb window around lead variants), of which 28,925 loci with maximum PIP > 0.1 were further classified into suspicious and non-suspicious loci. Since per-variant sample sizes were not available, we used overall sample sizes of each ancestry (African, Admixed American, East Asian, and European) to calculate the weighted-average of r2. All the variants were harmonized into the human genome assembly GRCh38 by the GWAS Catalog.

GBMI analysis

We used meta-analysis summary statistics of 14 disease endpoints from the GBMI (Supplementary Table 7). These meta-analyses were conducted using up to 1.8 million individuals across 19 biobanks, representing six different genetic ancestry groups (approximately 33,000 African, 18,000 Admixed American, 31,000 Central and South Asian, 341,000 East Asian, 1.8 million European, and 1,600 Middle Eastern individuals). Detailed procedures of the GBMI meta-analyses were described in the GBMI flagship manuscript10.

Across the 14 summary statistics, we defined 503 genome-wide significant loci (P < 5.0 × 10−8) based on a 1 Mb window around each lead variant and merged them if they overlapped. We applied SLALOM to 422 loci with maximum PIP > 0.1 based on the ABF fine-mapping and predicted whether they were suspicious or non-suspicious for fine-mapping. We used per-variant sample sizes of each ancestry (African, Admixed American, East Asian, Finnish, and non-Finnish European) to calculate the weighted-average of r2. Since gnomAD LD matrices were not available for Central and South Asian and Middle Eastern, we did not use their sample sizes for the calculation. All the variants were processed on the human genome assembly GRCh38.

Fine-mapping results of complex traits and cis-eQTL

We retrieved our previous fine-mapping results for 1) complex traits in large-scale biobanks (BBJ58, FinnGen20, and UKBB19 Europeans)16,17 and 2) cis-eQTLs in GTEx59 v8 and eQTL Catalogue60. Briefly, we conducted multiple-causal-variant fine-mapping (FINEMAP21,22 and SuSiE23) of complex trait GWAS (# unique traits = 148) and cis-eQTL gene expression (# unique tissues/cell-types = 69) using summary statistics and in-sample LD. Detailed fine-mapping methods are described elsewhere16,17.

In this study, we collected 1) high-PIP GWAS variants that achieved PIP > 0.9 for any traits in any biobank and 2) high-PIP cis-eQTL variants that acheived PIP > 0.9 for any gene expression in any tissues/cell-types. All the variants were originally processed on the human genome assembly GRCh37 and lifted over to the GRCh38 for comparison.

Additional fine-mapping results

To compare with the GBMI meta-analyses, we additionally conducted multi-causal-variant fine-mapping of four additional endpoints (gout, heart failure, thyroid cancer, and venous thromboembolism) that were not fine-mapped in our previous study16,17. We used exactly the same fine-mapping pipeline (FINEMAP21,22 and SuSiE23) as described previously16,17. For UKBB Europeans, to use the exact same samples that contributed to the GBMI, we used individuals of European ancestry (n = 420,531) as defined in the Pan-UKBB project (https://pan.ukbb.broadinstitute.org), instead of those of “white British ancestry” (n = 361,194) used in our previous study16,17.

Enrichment analysis of likely causal variants

To validate SLALOM performance, we asked whether suspicious and non-suspicious loci were enriched for having likely causal variants as a lead PIP variant, and for containing them in the 95% and 99% CS. We defined likely causal variants using 1) nonsynonymous coding variants, i.e., pLoF and missense variants annotated93 by the Ensembl Variant Effect Predictor (VEP) v101 (using GRCh38 and GENCODE v35), 2) the high-PIP (> 0.9) complex trait fine-mapped variants, and 3) the high-PIP (> 0.9) cis-eQTL fine-mapped variants from our previous studies as described above.

We estimated enrichment for suspicious and non-suspicious loci as a relative risk (i.e., a ratio of proportion of variants) between being in suspicious/non-suspicious loci and having the annotated likely causal variants as a lead PIP variant (or containing them in the 95% or 99% CS). That is, a relative risk = (proportion of non-suspicious loci having the annotated variants as a lead PIP variant) / (proportion of suspicious loci having the annotated variants as a lead PIP variant). We computed 95% confidence intervals using bootstrapping.

Comparison of fine-mapping results between the GBMI and individual biobanks

To directly compare with fine-mapping results from the GBMI meta-analyses, we used our fine-mapping results of nine disease endpoints (asthma64, COPD64, gout, heart failure73, IPF62, primary open angle glaucoma74, thyroid cancer, stroke75, and venous thromboembolism76) in BBJ58, FinnGen20, and UKBB19 Europeans that were also part of the GBMI meta-analyses for the same traits. For comparison, we computed the maximum PIP for each variant and the minimum size of 95% CS across BBJ, FinnGen, and UKBB. We restricted the 95% CS in biobanks to those that contain the lead variants from the GBMI. We defined the PIP difference between the GBMI and individual biobanks as ΔPIP = PIP (GBMI) – the maximum PIP across the biobanks.

We conducted functional enrichment analysis to compare between the GBMI meta-analysis and individual biobanks because unbiased comparison of PIP requires conditioning on likely causal variants independent of the fine-mapping results, and functional annotations have been shown to be enriched for causal variants. Using functional categories (coding [pLoF, missense, and synonymous], 5’/3’ UTR, promoter, and CRE) from our previous study16,17, we estimated functional enrichments of variants in each functional category based on 1) top PIP rankings and 2) ΔPIP bins. Since fine-mapping PIP in the GBMI meta-analysis can be miscalibrated, we performed a comparison based on top PIP rankings to assess whether the ordering given by GBMI PIPs is more informative than the ordering given by the biobanks. For the top PIP rankings, we took the top 0.5%, 0.1%, and 0.05% variants based on the PIP rankings in the GBMI and individual biobanks. We computed enrichment as a relative risk = (proportion of top X% PIP variants in the GBMI that are in the annotation) / (proportion of top X% PIP variants in the individual biobanks that are in the annotation). For ΔPIP bins, we defined three bins using different thresholds (θ = 0.01, 0.05, and 0.1): 1) decreased PIP bin, ΔPIP < –θ, 2) null bin, –θ ≤ ΔPIP ≤ θ, and 3) increased PIP bin, θ < ΔPIP. We computed enrichment as a relative risk = (proportion of variants in the decreased/increased PIP bin that are in the annotation) / (proportion of variants in the null PIP bin). We combined coding, UTR, and promoter categories for this analysis due to the limited number of variants for each bin.

Global Biobank Meta-analysis Initiative

Wei Zhou1,2,3, Masahiro Kanai1,2,3,4,5, Kuan-Han H Wu6, Rasheed Humaira7,8,9, Kristin Tsuo1,2,3, Jibril B Hirbo10,11, Ying Wang1,2,3, Arjun Bhattacharya12, Huiling Zhao13, Shinichi Namba5, Ida Surakka14, Brooke N Wolford6, Valeria Lo Faro15,16,17, Esteban A Lopera-Maya18, Kristi Läll19, Marie-Julie Favé20, Sinéad B Chapman2,3, Juha Karjalainen1,2,3,21, Mitja Kurki1,2,3,21, Maasha Mutaamba1,2,3,21, Ben M Brumpton22, Sameer Chavan23, Tzu-Ting Chen24, Michelle Daya23, Yi Ding12,25, Yen-Chen A Feng26, Christopher R Gignoux23, Sarah E Graham14, Whitney E Hornsby14, Nathan Ingold27, Ruth Johnson12,28, Triin Laisk19, Kuang Lin29, Jun Lv30, Iona Y Millwood29,31, Priit Palta19,21, Anita Pandit32, Michael Preuss33, Unnur Thorsteinsdottir34, Jasmina Uzunovic20, Matthew Zawistowski32, Xue Zhong10,35, Archie Campbell36, Kristy Crooks23, Geertruida h De Bock37, Nicholas J Douville38,39, Sarah Finer40, Lars G Fritsche32, Christopher J Griffiths40, Yu Guo41, Karen A Hunt42, Takahiro Konuma5,43, Riccardo E Marioni36, Jansonius Nomdo15, Snehal Patil32, Nicholas Rafaels23, Anne Richmond44, Jonathan A Shortt23, Peter Straub10,35, Ran Tao35,45, Brett Vanderwerff32, Kathleen C Barnes23, Marike Boezen37, Zhengming Chen29,31, Chia-Yen Chen46, Judy Cho33, George Davey Smith13,47, Hilary K Finucane1,2,3, Lude Franke18, Eric Gamazon35,48, Andrea Ganna1,2,21, Tom R Gaunt13, Tian Ge49,50, Hailiang Huang1,2, Jennifer Huffman51, Clara Lajonchere52,53, Matthew H Law27, Liming Li30, Cecilia M Lindgren54, Ruth JF Loos33, Stuart MacGregor27, Koichi Matsuda55, Catherine M Olsen27, David J Porteous36, Jordan A Shavit56, Harold Snieder37, Richard C Trembath57, Judith M Vonk37, David Whiteman27, Stephen J Wicks23, Cisca Wijmenga18, John Wright58, Jie Zheng13, Xiang Zhou32, Philip Awadalla20,59, Michael Boehnke32, Nancy J Cox10,60, Daniel H Geschwind52,61,62, Caroline Hayward44, Kristian Hveem22, Eimear E Kenny63, Yen-Feng Lin24,64,65, Reedik Mägi19, Hilary C Martin66, Sarah E Medland27, Yukinori Okada5,67,68,69,70, Aarno V Palotie1,2,21, Bogdan Pasaniuc12,25,52,61,71, Serena Sanna18,72, Jordan W Smoller73, Kari Stefansson34, David A van Heel42, Robin G Walters29,31, Sebastian Zoellner32, Biobank Japan, BioMe, BioVU, Canadian Partnership for Tomorrow, China Kadoorie Biobank Collaborative Group, Colorado Center for Personalized Medicine, deCODE Genetics, Estonian Biobank, FinnGen, Generation Scotland, Genes & Health, LifeLines, Mass General Brigham Biobank, Michigan Genomics Initiative, QIMR Berghofer Biobank, Taiwan Biobank, The HUNT Study, UCLA ATLAS Community Health Initiative, UK Biobank, Alicia R Martin1,2,3, Cristen J Willer6,14,74*, Mark J Daly1,2,3,21*, Benjamin M Neale1,2,3*

*These authors jointly supervised the initiative

1Analytic and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA, 2Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA, 3Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA, 4Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA, 5Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita 565-0871, Japan, 6Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA, 7K. G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, Faculty of Medicine and Health, NTNU, Norwegian University of Science and Technology, Trondheim, Norway, 8Division of Medicine and Laboratory Sciences, University of Oslo, Norway, 9MRC Integrative Epidemiology Unit, Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK, 10Department of Medicine, Division of Genetic Medicine, Vanderbilt University Medical Center, Nashville, TN, USA, 11Vanderbilt Genetic Institute, Vanderbilt University Medical Center, Nashville, TN, USA, 12Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA, 13MRC Integrative Epidemiology Unit (IEU), Bristol Medical School, University of Bristol, Oakfield House, Oakfield Grove, Bristol, BS8 2BN, UK, 14Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA, 15University of Groningen, UMCG, Department of Ophthalmology, Groningen, the Netherlands, 16Department of Clinical Genetics, Amsterdam University Medical Center (AMC), Amsterdam, the Netherlands, 17Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden, 18University of Groningen, UMCG, Department of Genetics, Groningen, the Netherlands, 19Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia, 20Ontario Institute for Cancer Research, Toronto, ON, Canada, 21Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland, 22K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, 7030, Norway, 23University of Colorado -Anschutz Medical Campus, Aurora, CO, USA, 24Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan, 25Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA, 26Division of Biostatistics, Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taiwan, 27QIMR Berghofer Medical Research Institute, Brisbane, Australia, 28Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA, 29Nuffield Department of Population Health, University of Oxford, Oxford, UK, 30Department of Epidemiology and Biostatistics, School of Public Health, Peking University Health Science Center, Beijing, China, 31MRC Population Health Research Unit, University of Oxford, Oxford, UK, 32Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA, 33The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA, 34deCODE Genetics/Amgen inc., 101, Reykjavik, Iceland, 35Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashviile, TN, USA, 36Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK, 37University of Groningen, UMCG, Department of Epidemiology, Groningen, the Netherlands, 38Department of Anesthesiology, Michigan Medicine, Ann Arbor, MI, USA, 39Institute of Healthcare Policy & Innovation, University of Michigan, Ann Arbor, MI, USA, 40Wolfson Institute of Population Health, Queen Mary University of London, London, UK, 41Chinese Academy of Medical Sciences, Beijing, China, 42Blizard Institute, Queen Mary University of London, London, UK, 43Central Pharmaceutical Research Institute, JAPAN TOBACCO INC., Takatsuki 569-1125, Japan, 44Medical Research Council Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, UK, 45Department of Biostatistics, Vanderbilt University Medical Center, Nashviile, TN, USA, 46Biogen, Cambridge, MA, USA, 47NIHR Bristol Biomedical Research Centre, Bristol, UK, 48MRC Epidemiology Unit, University of Cambridge, Cambridge, UK, 49Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA, 50Center for Precision Psychiatry, Massachusetts General Hospital, Boston, MA, USA, 51Centre for Population Genomics, VA Boston Healthcare System, Boston, MA, USA, 52Institute of Precision Health, University of California, Los Angeles, Los Angeles, CA, USA, 53Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA, 54Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK, 55Department of Computational Biology and Medical Sciences, Graduate school of Frontier Sciences, The University of Tokyo, Tokyo, Japan, 56University of Michigan, Department of Pediatrics, Ann Arbor MI 48109, 57School of Basic and Medical Biosciences, Faculty of Life Sciences and Medicine, King’s College London, London, UK, 58Bradford Institute for Health Research, Bradford Teaching Hospitals National Health Service (NHS) Foundation Trust, Bradford, UK, 59Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada, 60Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA, 61Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA, 62Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA, 63Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA, 64Department of Public Health & Medical Humanities, School of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan, 65Institute of Behavioral Medicine, College of Medicine, National Cheng Kung University, Tainan, Taiwan, 66Medical and Population Genomics, Wellcome Sanger Institute, Hinxton, UK, 67Center for Infectious Disease Education and Research (CiDER), Osaka University, Suita 565-0871, Japan, 68Laboratory of Statistical Immunology, Immunology Frontier Research Center (WPI-IFReC), Osaka University, Suita 565-0871, Japan, 69Laboratory for Systems Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan, 70Integrated Frontier Research for Medical Science Division, Institute for Open and Transdisciplinary Research Initiatives, Osaka University, Suita 565-0871, Japan, 71Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA, 72Institute for Genetics and Biomedical Research, National Research Council, Cagliari 09100, Italy, 73Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, 74Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA.

Acknowledgements

We acknowledge all the participants of the 19 biobanks that have contributed to the GBMI. Biobank-specific acknowledgements are included in the Supplementary Note. We thank H. Huang, A.R. Martin, B.M. Neale, Y. Okada, K. Tsuo, J.C. Ulirsch, Y. Wang, and all the members of Finucane and Daly labs for their helpful feedback. M.K. was supported by a Nakajima Foundation Fellowship and the Masason Foundation. H.K.F. was funded by NIH grant DP5 OD024582.

References

  1. 1.↵
    Evangelou, E. & Ioannidis, J. P. a. Meta-analysis methods for genome-wide association studies and beyond. Nat. Rev. Genet. 14, 379–389 (2013).
    OpenUrlCrossRefPubMed
  2. 2.↵
    Mahajan, A. et al. Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat. Genet. 50, 1505–1513 (2018).
    OpenUrlCrossRefPubMed
  3. 3.↵
    Spracklen, C. N. et al. Identification of type 2 diabetes loci in 433,540 East Asian individuals. Nature 582, 240–245 (2020).
    OpenUrlPubMed
  4. 4.↵
    Ripke, S. et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427 (2014).
    OpenUrlCrossRefPubMedWeb of Science
  5. 5.↵
    The Schizophrenia Working Group of the Psychiatric Genomics Consortium, Ripke, S., Walters, J. T. R. & O’Donovan, M. C. Mapping genomic loci prioritises genes and implicates synaptic biology in schizophrenia. bioRxiv (2020) doi:10.1101/2020.09.12.20192922.
    OpenUrlCrossRef
  6. 6.↵
    Okada, Y. et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506, 376–381 (2014).
    OpenUrlCrossRefPubMedWeb of Science
  7. 7.↵
    Ishigaki, K. et al. Trans-ancestry genome-wide association study identifies novel genetic mechanisms in rheumatoid arthritis. bioRxiv (2021) doi:10.1101/2021.12.01.21267132.
    OpenUrlCrossRef
  8. 8.↵
    Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197–206 (2015).
    OpenUrlCrossRefPubMed
  9. 9.↵
    Graham, S. E. et al. The power of genetic diversity in genome-wide association studies of lipids. Nature 600, 675–679 (2021).
    OpenUrl
  10. 10.↵
    Zhou, W. et al. Global Biobank Meta-analysis Initiative: powering genetic discovery across human diseases. bioRxiv (2021) doi:10.1101/2021.11.19.21266436.
    OpenUrlCrossRef
  11. 11.↵
    Visscher, P. M. et al. 10 Years of GWAS Discovery: Biology, Function, and Translation. Am. J. Hum. Genet. 101, 5–22 (2017).
    OpenUrlCrossRefPubMed
  12. 12.
    Shendure, J., Findlay, G. M. & Snyder, M. W. Genomic Medicine-Progress, Pitfalls, and Promise. Cell 177, 45–57 (2019).
    OpenUrlCrossRefPubMed
  13. 13.↵
    Schaid, D. J., Chen, W. & Larson, N. B. From genome-wide associations to candidate causal variants by statistical fine-mapping. Nat. Rev. Genet. 19, 491–504 (2018).
    OpenUrlCrossRefPubMed
  14. 14.↵
    Ulirsch, J. C. et al. Interrogation of human hematopoiesis at single-cell and single-variant resolution. Nat. Genet. 51, 683–693 (2019).
    OpenUrlCrossRef
  15. 15.
    Weissbrod, O. et al. Functionally informed fine-mapping and polygenic localization of complex trait heritability. Nat. Genet. 52, 1355–1363 (2020).
    OpenUrl
  16. 16.↵
    Ulirsch, J. C. & Kanai, M. An annotated atlas of causal variants underlying complex traits and gene expression. Under review.
  17. 17.↵
    Kanai, M. et al. Insights from complex trait fine-mapping across diverse populations. medRxiv (2021) doi:10.1101/2021.09.03.21262975.
    OpenUrlAbstract/FREE Full Text
  18. 18.↵
    Nagai, A. et al. Overview of the BioBank Japan Project: Study design and profile. Journal of Epidemiology 27, S2–S8 (2017).
    OpenUrlCrossRefPubMed
  19. 19.↵
    Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
    OpenUrlCrossRefPubMed
  20. 20.↵
    Kurki, M. I. et al. FinnGen: Unique genetic insights from combining isolated population and national health register data. bioRxiv (2022) doi:10.1101/2022.03.03.22271360.
    OpenUrlCrossRef
  21. 21.↵
    Benner, C. et al. FINEMAP: Efficient variable selection using summary data from genome-wide association studies. Bioinformatics 32, 1493–1501 (2016).
    OpenUrlCrossRefPubMed
  22. 22.↵
    Benner, C., Havulinna, A. S., Salomaa, V., Ripatti, S. & Pirinen, M. Refining fine-mapping: effect sizes and regional heritability. bioRxiv 318618 (2018) doi:10.1101/318618.
    OpenUrlAbstract/FREE Full Text
  23. 23.↵
    Wang, G., Sarkar, A., Carbonetto, P. & Stephens, M. A simple new approach to variable selection in regression, with application to genetic fine mapping. J. R. Stat. Soc. Series B Stat. Methodol. 25, 1 (2020).
    OpenUrl
  24. 24.↵
    Onengut-Gumuscu, S. et al. Fine mapping of type 1 diabetes susceptibility loci and evidence for colocalization of causal variants with lymphoid gene enhancers. Nat. Genet. 47, 381–386 (2015).
    OpenUrlCrossRefPubMed
  25. 25.
    Levey, D. F. et al. Bi-ancestral depression GWAS in the Million Veteran Program and meta-analysis in >1.2 million individuals highlight new therapeutic directions. Nat. Neurosci. 24, 954–963 (2021).
    OpenUrl
  26. 26.
    Gharahkhani, P. et al. Genome-wide meta-analysis identifies 127 open-angle glaucoma loci with consistent effect across ancestries. Nat. Commun. 12, 1258 (2021).
    OpenUrl
  27. 27.
    Chen, J. et al. The trans-ancestral genomic architecture of glycemic traits. Nat. Genet. 53, 840–860 (2021).
    OpenUrlCrossRef
  28. 28.
    Zhou, W. et al. GWAS of thyroid stimulating hormone highlights pleiotropic effects and inverse association with thyroid cancer. Nat. Commun. 11, 1–13 (2020).
    OpenUrlCrossRefPubMed
  29. 29.
    Wightman, D. P. et al. A genome-wide association study with 1,126,563 individuals identifies new risk loci for Alzheimer’s disease. Nat. Genet. 53, 1276–1282 (2021).
    OpenUrlCrossRef
  30. 30.↵
    Chen, M.-H. et al. Trans-ethnic and Ancestry-Specific Blood-Cell Genetics in 746,667 Individuals from 5 Global Populations. Cell 182, 1198–1213.e14 (2020).
    OpenUrlCrossRefPubMed
  31. 31.↵
    Wakefield, J. A Bayesian measure of the probability of false discovery in genetic epidemiology studies. Am. J. Hum. Genet. 81, 208–227 (2007).
    OpenUrlCrossRefPubMedWeb of Science
  32. 32.↵
    Wakefield, J. Bayes factors for genome-wide association studies: comparison with P-values. Genet. Epidemiol. 33, 79–86 (2009).
    OpenUrlCrossRefPubMedWeb of Science
  33. 33.↵
    Hormozdiari, F., Kostem, E., Kang, E. Y., Pasaniuc, B. & Eskin, E. Identifying Causal Variants at Loci with Multiple Signals of Association. Genetics 198, 497–508 (2014).
    OpenUrlAbstract/FREE Full Text
  34. 34.↵
    Kichaev, G. et al. Integrating functional data to prioritize causal variants in statistical fine-mapping studies. PLoS Genet. 10, e1004722 (2014).
    OpenUrlCrossRefPubMed
  35. 35.↵
    Kichaev, G. & Pasaniuc, B. Leveraging Functional-Annotation Data in Trans-ethnic Fine-Mapping Studies. Am. J. Hum. Genet. 97, 260–271 (2015).
    OpenUrlCrossRefPubMed
  36. 36.↵
    Li, D., Zhao, H. & Gelernter, J. Strong protective effect of the aldehyde dehydrogenase gene (ALDH2) 504lys (*2) allele against alcoholism and alcohol-induced medical diseases in Asians. Hum. Genet. 131, 725–737 (2012).
    OpenUrlCrossRefPubMed
  37. 37.↵
    Brown, B. C., Ye, C. J., Price, A. L. & Zaitlen, N. Transethnic Genetic-Correlation Estimates from Summary Statistics. Am. J. Hum. Genet. 99, 76–88 (2016).
    OpenUrlCrossRefPubMed
  38. 38.↵
    Shi, H. et al. Population-specific causal disease effect sizes in functionally important regions impacted by selection. bioRxiv 803452 (2020) doi:10.1101/803452.
    OpenUrlAbstract/FREE Full Text
  39. 39.↵
    COVID-19 Host Genetics Initiative. Mapping the human genetic architecture of COVID-19. Nature 600, 472–477 (2021).
    OpenUrl
  40. 40.↵
    Dendrou, C. A. et al. Resolving TYK2 locus genotype-to-phenotype differences in autoimmunity. Sci. Transl. Med. 8, 363ra149 (2016).
    OpenUrlAbstract/FREE Full Text
  41. 41.↵
    Couturier, N. et al. Tyrosine kinase 2 variant influences T lymphocyte polarization and multiple sclerosis susceptibility. Brain 134, 693–703 (2011).
    OpenUrlCrossRefPubMedWeb of Science
  42. 42.↵
    Li, Z. et al. Two rare disease-associated Tyk2 variants are catalytically impaired but signaling competent. J. Immunol. 190, 2335–2344 (2013).
    OpenUrlAbstract/FREE Full Text
  43. 43.↵
    Lam, M. et al. RICOPILI: Rapid Imputation for COnsortias PIpeLIne. Bioinformatics 36, 930–933 (2020).
    OpenUrlPubMed
  44. 44.↵
    Huang, H. et al. Fine-mapping inflammatory bowel disease loci to single-variant resolution. Nature 547, 173–178 (2017).
    OpenUrlCrossRefPubMed
  45. 45.↵
    Winkler, T. W. et al. Quality control and conduct of genome-wide association meta-analyses. Nat. Protoc. 9, 1192–1212 (2014).
    OpenUrlCrossRefPubMed
  46. 46.↵
    Chen, W. et al. Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors. Nat. Commun. 12, 7117 (2021).
    OpenUrl
  47. 47.↵
    Yang, J. et al. Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nat. Genet. 44, 369–75, S1–3 (2012).
    OpenUrlCrossRefPubMed
  48. 48.↵
    Buniello, A. et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005– D1012 (2019).
    OpenUrlCrossRefPubMed
  49. 49.↵
    The 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68–74 (2015).
    OpenUrlCrossRefPubMed
  50. 50.↵
    McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 48, 1279–1283 (2016).
    OpenUrlCrossRefPubMed
  51. 51.↵
    Taliun, D. et al. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature 590, 290–299 (2021).
    OpenUrlCrossRefPubMed
  52. 52.↵
    Schoech, A. P. et al. Quantification of frequency-dependent genetic architectures in 25 UK Biobank traits reveals action of negative selection. Nat. Commun. 10, 790 (2019).
    OpenUrlCrossRef
  53. 53.↵
    Yang, J. et al. Genome partitioning of genetic variation for complex traits using common SNPs. Nat. Genet. 43, 519–525 (2011).
    OpenUrlCrossRefPubMed
  54. 54.↵
    Ormond, C., Ryan, N. M., Corvin, A. & Heron, E. A. Converting single nucleotide variants between genome builds: from cautionary tale to solution. Brief. Bioinform. 22, (2021).
  55. 55.↵
    Asimit, J. L., Hatzikotoulas, K., McCarthy, M., Morris, A. P. & Zeggini, E. Trans-ethnic study design approaches for fine-mapping. Eur. J. Hum. Genet. 24, 1330–1336 (2016).
    OpenUrlCrossRefPubMed
  56. 56.
    Marchini, J. & Howie, B. Genotype imputation for genome-wide association studies. Nat. Rev. Genet. 11, 499–511 (2010).
    OpenUrlCrossRefPubMedWeb of Science
  57. 57.↵
    Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020).
    OpenUrlCrossRefPubMed
  58. 58.↵
    Sakaue, S. et al. A cross-population atlas of genetic associations for 220 human phenotypes. Nat. Genet. 53, 1415–1424 (2021).
    OpenUrl
  59. 59.↵
    The GTEx Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330 (2020).
    OpenUrlAbstract/FREE Full Text
  60. 60.↵
    Kerimov, N. et al. A compendium of uniformly processed human gene expression and splicing quantitative trait loci. Nat. Genet. 53, 1290–1299 (2021).
    OpenUrlCrossRef
  61. 61.↵
    Koskela, J. T. et al. Genetic variant in SPDL1 reveals novel mechanism linking pulmonary fibrosis risk and cancer protection. bioRxiv (2021) doi:10.1101/2021.05.07.21255988.
    OpenUrlCrossRef
  62. 62.↵
    Partanen, J. J. et al. Leveraging global multi-ancestry meta-analysis in the study of Idiopathic Pulmonary Fibrosis genetics. bioRxiv (2021) doi:10.1101/2021.12.29.21268310.
    OpenUrlCrossRef
  63. 63.↵
    Foreman, M. G. et al. Alpha-1 Antitrypsin PiMZ Genotype Is Associated with Chronic Obstructive Pulmonary Disease in Two Racial Groups. Ann. Am. Thorac. Soc. 14, 1280– 1287 (2017).
    OpenUrlCrossRefPubMed
  64. 64.↵
    Tsuo, K. et al. Multi-ancestry meta-analysis of asthma identifies novel associations and highlights the value of increased power and diversity. bioRxiv (2021) doi:10.1101/2021.11.30.21267108.
    OpenUrlCrossRef
  65. 65.↵
    Benonisdottir, S. et al. Epigenetic and genetic components of height regulation. Nat. Commun. 7, 13490 (2016).
    OpenUrlCrossRefPubMed
  66. 66.↵
    Marouli, E. et al. Rare and low-frequency coding variants alter human adult height. Nature 542, 186–190 (2017).
    OpenUrlCrossRefPubMed
  67. 67.↵
    Langefeld, C. D. et al. Transancestral mapping and genetic load in systemic lupus erythematosus. Nat. Commun. 8, 16021 (2017).
    OpenUrlPubMed
  68. 68.↵
    Hargreaves, C. E. et al. Fcγ receptors: genetic variation, function, and disease. Immunol. Rev. 268, 6–24 (2015).
    OpenUrlCrossRefPubMed
  69. 69.↵
    Franke, L. et al. Association analysis of copy numbers of FC-gamma receptor genes for rheumatoid arthritis and other immune-mediated phenotypes. Eur. J. Hum. Genet. 24, 263– 270 (2016).
    OpenUrlPubMed
  70. 70.↵
    UK10K Consortium et al. The UK10K project identifies rare variants in health and disease. Nature 526, 82–90 (2015).
    OpenUrlCrossRefPubMed
  71. 71.↵
    Wang, Y. et al. Global biobank analyses provide lessons for computing polygenic risk scores across diverse cohorts. medRxiv 2021.11.18.21266545 (2021).
  72. 72.↵
    Namba, S. et al. A practical guideline of genomics-driven drug discovery in the era of global biobank meta-analysis. bioRxiv (2021) doi:10.1101/2021.12.03.21267280.
    OpenUrlCrossRef
  73. 73.↵
    Wu, K.-H. H. et al. Polygenic risk score from a multi-ancestry GWAS uncovers susceptibility of heart failure. bioRxiv (2021) doi:10.1101/2021.12.06.21267389.
    OpenUrlCrossRef
  74. 74.↵
    Lo Faro, V. et al. Genome-wide association meta-analysis identifies novel ancestry-specific primary open-angle glaucoma loci and shared biology with vascular mechanisms and cell proliferation. bioRxiv (2021) doi:10.1101/2021.12.16.21267891.
    OpenUrlCrossRef
  75. 75.↵
    Surakka, I. et al. Multi-ancestry meta-analysis identifies 2 novel loci associated with ischemic stroke and reveals heterogeneity of effects between sexes and ancestries. bioRxiv (2022) doi:10.1101/2022.02.28.22271647.
    OpenUrlCrossRef
  76. 76.↵
    Wolford, B. et al. Multi-ancestry GWAS for venous thromboembolism identifies novel loci followed by experimental validation. In preparation.
  77. 77.↵
    Aneas, I. et al. Asthma-associated genetic variants induce IL33 differential expression through an enhancer-blocking regulatory region. Nat. Commun. 12, 6115 (2021).
    OpenUrl
  78. 78.↵
    Vladich, F. D. et al. IL-13 R130Q, a common variant associated with allergy and asthma, enhances effector mechanisms essential for human allergic inflammation. J. Clin. Invest. 115, 747–754 (2005).
    OpenUrlCrossRefPubMedWeb of Science
  79. 79.↵
    Farh, K. K.-H. et al. Genetic and epigenetic fine mapping of causal autoimmune disease variants. Nature 518, 337–343 (2015).
    OpenUrlCrossRefPubMed
  80. 80.↵
    Wojcik, G. L. et al. Genetic analyses of diverse populations improves discovery for complex traits. Nature 570, 514–518 (2019).
    OpenUrlCrossRefPubMed
  81. 81.↵
    Luo, Y. et al. A high-resolution HLA reference panel capturing global population diversity enables multi-ancestry fine-mapping in HIV host response. Nat. Genet. 53, 1504–1516 (2021).
    OpenUrl
  82. 82.↵
    Sakaue, S. et al. Decoding the diversity of killer immunoglobulin-like receptors by deep sequencing and a high-resolution imputation method. Cell Genomics 2, (2022).
  83. 83.↵
    Mukamel, R. E. et al. Protein-coding repeat polymorphisms strongly shape diverse human phenotypes. Science 373, 1499–1505 (2021).
    OpenUrlCrossRef
  84. 84.↵
    Gagliano Taliun, S. A. et al. Exploring and visualizing large-scale genetic associations by using PheWeb. Nat. Genet. 52, 550–552 (2020).
    OpenUrlCrossRef
  85. 85.↵
    Su, Z., Marchini, J. & Donnelly, P. HAPGEN2: Simulation of multiple disease SNPs. Bioinformatics 27, 2304–2305 (2011).
    OpenUrlCrossRefPubMedWeb of Science
  86. 86.↵
    Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867–2873 (2010).
    OpenUrlCrossRefPubMedWeb of Science
  87. 87.↵
    Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 1–16 (2015).
    OpenUrlCrossRefPubMed
  88. 88.↵
    Wei, X. & Nielsen, R. CCR5-Δ32 is deleterious in the homozygous state in humans. Nat. Med. 25, 909–910 (2019).
    OpenUrlCrossRef
  89. 89.↵
    Maier, R. et al. No statistical evidence for an effect of CCR5-Δ32 on lifespan in the UK Biobank cohort. Nat. Med. 26, 178–180 (2020).
    OpenUrl
  90. 90.↵
    Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287 (2016).
    OpenUrlCrossRefPubMed
  91. 91.↵
    Loh, P.-R. et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 48, 1443–1448 (2016).
    OpenUrlCrossRefPubMed
  92. 92.↵
    Loh, P.-R. et al. Contrasting genetic architectures of schizophrenia and other complex diseases using fast variance-components analysis. Nat. Genet. 47, 1385–1392 (2015).
    OpenUrlCrossRefPubMed
  93. 93.↵
    McLaren, W. et al. The Ensembl Variant Effect Predictor. Genome Biol. 17, 122 (2016).
    OpenUrlCrossRefPubMed
Back to top
PreviousNext
Posted March 20, 2022.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Meta-analysis fine-mapping is often miscalibrated at single-variant resolution
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Meta-analysis fine-mapping is often miscalibrated at single-variant resolution
Masahiro Kanai, Roy Elzur, Wei Zhou, Global Biobank Meta-analysis Initiative, Mark J Daly, Hilary K Finucane
medRxiv 2022.03.16.22272457; doi: https://doi.org/10.1101/2022.03.16.22272457
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Meta-analysis fine-mapping is often miscalibrated at single-variant resolution
Masahiro Kanai, Roy Elzur, Wei Zhou, Global Biobank Meta-analysis Initiative, Mark J Daly, Hilary K Finucane
medRxiv 2022.03.16.22272457; doi: https://doi.org/10.1101/2022.03.16.22272457

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetic and Genomic Medicine
Subject Areas
All Articles
  • Addiction Medicine (174)
  • Allergy and Immunology (420)
  • Anesthesia (97)
  • Cardiovascular Medicine (898)
  • Dentistry and Oral Medicine (169)
  • Dermatology (102)
  • Emergency Medicine (257)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (405)
  • Epidemiology (8774)
  • Forensic Medicine (4)
  • Gastroenterology (404)
  • Genetic and Genomic Medicine (1858)
  • Geriatric Medicine (178)
  • Health Economics (387)
  • Health Informatics (1288)
  • Health Policy (644)
  • Health Systems and Quality Improvement (491)
  • Hematology (206)
  • HIV/AIDS (392)
  • Infectious Diseases (except HIV/AIDS) (10550)
  • Intensive Care and Critical Care Medicine (564)
  • Medical Education (193)
  • Medical Ethics (52)
  • Nephrology (218)
  • Neurology (1750)
  • Nursing (102)
  • Nutrition (265)
  • Obstetrics and Gynecology (342)
  • Occupational and Environmental Health (461)
  • Oncology (963)
  • Ophthalmology (282)
  • Orthopedics (107)
  • Otolaryngology (176)
  • Pain Medicine (117)
  • Palliative Medicine (41)
  • Pathology (263)
  • Pediatrics (556)
  • Pharmacology and Therapeutics (264)
  • Primary Care Research (218)
  • Psychiatry and Clinical Psychology (1842)
  • Public and Global Health (3979)
  • Radiology and Imaging (653)
  • Rehabilitation Medicine and Physical Therapy (343)
  • Respiratory Medicine (534)
  • Rheumatology (215)
  • Sexual and Reproductive Health (178)
  • Sports Medicine (166)
  • Surgery (196)
  • Toxicology (37)
  • Transplantation (106)
  • Urology (80)