Multi-trait genetic analysis identifies novel pleiotropic loci for depression and schizophrenia in East Asians
==============================================================================================================

* Yingchao Song
* Linzehao Li
* Yue Jiang
* Bichen Peng
* Hengxuan Jiang
* Zhen Chao
* Xiao Chang

## Abstract

While genetic correlations, pleiotropic loci, and shared genetic mechanisms of psychiatric disorders have been extensively studied in European populations, the investigation of these factors in East Asian populations has been relatively limited. To identify novel pleiotropic risk loci for depression and schizophrenia (SCZ) in East Asians, we harnessed the most comprehensive dataset available for this population and quantified the genetic overlap between depression, SCZ, and their related traits via LD Score regression (LDSC) analyses. Besides the correlation between depression and SCZ, our analysis revealed significant genetic correlations between depression and obesity-related traits, such as weight, BMI, T2D, and HDL. For SCZ, significant correlations were detected with HDL, heart diseases and the use of various medications. Conventional meta-analysis of depression and SCZ identified a novel locus at 1q25.2 in East Asians. This locus was also verified by multi-trait analysis of GWAS (MTAG), which can improve the statistical power of single-trait GWAS by incorporating information from effect estimates across genetically correlated traits. Furthermore, multi-trait analysis of depression, SCZ and related traits identified ten novel pleiotropic loci for depression and four for SCZ. Our findings demonstrate shared genetic underpinnings between depression and SCZ in East Asians, as well as with their associated traits, providing novel candidate genes for the identification and prioritization of therapeutic targets specific to this population.
## Introduction

Psychiatric disorders comprise a range of severe illnesses that result in emotional, cognitive, or behavioral disturbances in individuals, making them a significant contributor to the global burden of disability [1]. Empirical studies indicate that psychiatric disorders result from the intricate interplay of genetic and environmental risk factors [2]. Heritability estimates, as determined by twin-based and genome-wide association studies (GWAS), emphasize the substantial role of genetic factors within the spectrum of psychiatric disorders [3–6]. Specifically, the heritability estimates for major depression are approximately 30%–40% [7], while for schizophrenia (SCZ), these estimates range from 65% to 80% [8–10].

Depression and SCZ both exhibit a substantial degree of polygenicity, with numerous risk loci identified through GWAS in individuals of European ancestry [6, 11]. In a recent study, the largest GWAS of depression, involving more than 1.3 million individuals (including 371,184 with depression), identified a total of 243 risk loci. This study also revealed a critical role for neurodevelopmental pathways associated with prenatal GABAergic neurons, astrocytes and oligodendrocyte lineages [12]. For SCZ, a total of 287 significant loci have been identified in the largest GWAS, encompassing 76,755 cases and 243,649 controls. These associations predominantly cluster within genes that are actively expressed in both excitatory and inhibitory neurons of the central nervous system [13]. However, the majority of previous genetic studies have been performed in European ancestry cohorts, leaving a conspicuous research gap in Asian populations. The largest GWAS of depression in East Asians (15,771 cases and 178,777 controls) identified only two genetic loci [14].
Similarly, the largest GWAS conducted on SCZ among individuals of East Asian origin (with 22,778 cases and 35,362 controls) identified 19 genetic loci [15], which is substantially fewer than those observed in European studies.

Recently, the emergence of multi-trait analysis of GWAS (MTAG) has brought forth a powerful approach to explore the genetic overlap and shared causality among complex traits [16]. By conducting a meta-analysis of GWAS summary statistics for different genetically correlated traits, multi-trait analysis can enhance statistical power and pinpoint pleiotropic genetic variants that influence interrelated traits. Moreover, it reveals the intricate genetic interplay between these phenotypes and the common biological pathways that underpin their development. Such cross-trait analysis methods have been successfully applied to the study of psychiatric disorders. For example, a multi-trait meta-analysis across eight psychiatric disorders identified 109 pleiotropic loci linked to at least two disorders, revealing the common genetic structures shared among different psychiatric disorders [6]. In addition, a multi-trait meta-analysis revealed a substantial number of pleiotropic genetic loci between psychiatric disorders and gastrointestinal tract diseases, emphasizing shared biological mechanisms involving immune response, synaptic structure and function, and potentially the gut microbiome in these conditions [17]. To the best of our knowledge, multi-trait analysis of GWAS has predominantly centered on European populations, and no such analysis has been undertaken for psychiatric disorders within East Asian populations so far.

In this study, we conducted a comprehensive multi-trait analysis using GWAS data of depression and SCZ in cohorts of East Asian descent. We also investigated deep-phenotype GWAS data encompassing a range of health aspects such as diseases, biomarkers and medication usage sourced from BioBank Japan (BBJ) [18].
Moreover, we performed analyses to quantify both overall and local genetic correlations, further identifying novel pleiotropic loci for depression and SCZ in East Asians.

## Methods

### GWAS data

The largest GWAS meta-analysis of depression in individuals of East Asian ancestry to date included 194,548 individuals (15,771 cases and 178,777 controls), combining data from the China, Oxford, and Virginia Commonwealth University Experimental Research on Genetic Epidemiology (CONVERGE) consortium [19], the China Kadoorie Biobank (CKB), and the Taiwan-Major Depressive Disorder (MDD) study, as well as studies conducted in the US and UK that included participants of East Asian ancestry [14]. The largest GWAS of SCZ in East Asians compiled 22,778 SCZ cases and 35,362 controls from 20 sample collections across East Asia [15]. GWAS summary statistics of 96 phenotypes from East Asians, including diseases, biomarkers and medication usage, were downloaded from the GWAS Catalog [20]. Information on all included GWAS data from East Asians is summarized in Table S1.

### Meta-analysis between depression and SCZ

Inverse-variance weighted meta-analysis was performed on depression and SCZ using the basic meta-analysis function in PLINK (v1.90b7), and the fixed-effect meta-analysis *P* values and fixed-effect ORs were estimated. We prioritized significant SNPs that reached genome-wide significance (*P* < 5×10⁻⁸) in the meta-analysis and suggestive significance (*P* < 0.01) in the original single-trait GWAS.

### Global genetic correlation analysis

The genetic correlation (rg) between depression and SCZ was estimated by linkage disequilibrium (LD) score regression (LDSC) using GWAS summary statistics [21].
The LDSC method is described by the following equation:

$$
E[\beta_j \gamma_j] = \frac{\sqrt{N_1 N_2}\,\rho_g}{M}\, l_j + \frac{N_s\, r}{\sqrt{N_1 N_2}},
$$

where $\beta_j$ and $\gamma_j$ denote the effect sizes of SNP $j$ on the two tested traits, $\rho_g$ is the genetic covariance between the traits (from which rg is obtained by normalizing by the SNP heritabilities), $M$ is the number of SNPs, $N_1$ and $N_2$ are the sample sizes of the two tested traits, $N_s$ is the number of overlapping samples between the two tested traits, $r$ is the phenotypic correlation in overlapping samples and $l_j$ is the LD score. Pre-computed LD scores for HapMap3 SNPs, calculated from East-Asian-ancestry data from the 1000 Genomes Project, were used in the analysis.

### Local genetic correlation analysis

Because the genetic correlation estimated by LDSC aggregates information across all variants in the genome, we further estimated the pairwise local genetic correlation using ρ-HESS (heritability estimation from summary statistics) [22]. ρ-HESS is designed to quantify the local genetic correlation between pairs of traits at each of 1,703 prespecified LD-independent segments with an average length of 1.6 Mb. A Bonferroni-corrected *P* value below 0.05/1703 was considered statistically significant.

### Multi-trait GWAS analysis

In addition to the traditional meta-analysis method described above, we applied the Multi-Trait Analysis of GWAS (MTAG) framework, a generalized meta-analysis method that outputs trait-specific SNP associations [16]. MTAG can increase the power to detect loci from correlated traits by analyzing GWAS summary statistics jointly. Compared with conventional inverse-variance weighted meta-analysis, it also accounts for sample overlap and incomplete genetic correlation. The first step of MTAG is to filter variants by removing non-common SNPs, duplicated SNPs, and SNPs with strand ambiguity. MTAG then estimates the pairwise genetic correlation between traits using LDSC [23] and uses these estimates to calibrate the variance-covariance matrix of the random-effect component. MTAG next performs a random-effect meta-analysis to generate the SNP-level summary statistics.
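The variant-filtering step described above can be illustrated with a minimal sketch. This is not the MTAG implementation: the function name `filter_variants`, the record layout, and the minor-allele-frequency cutoff of 0.01 are assumptions made for illustration, and the sketch drops every copy of a duplicated SNP identifier, which may differ from how a given tool handles duplicates.

```python
from collections import Counter

# Allele pairs that read the same on both strands, so strand cannot be inferred.
AMBIGUOUS_PAIRS = {("A", "T"), ("T", "A"), ("C", "G"), ("G", "C")}


def filter_variants(records, maf_min=0.01):
    """Keep common, unique, strand-unambiguous SNPs.

    `records` is a list of dicts with keys 'snp', 'a1', 'a2', 'freq'
    (a hypothetical layout). `maf_min` is an assumed frequency cutoff.
    """
    counts = Counter(r["snp"] for r in records)
    kept = []
    for r in records:
        maf = min(r["freq"], 1.0 - r["freq"])
        if maf < maf_min:
            continue  # non-common SNP
        if counts[r["snp"]] > 1:
            continue  # duplicated SNP identifier (all copies dropped here)
        if (r["a1"].upper(), r["a2"].upper()) in AMBIGUOUS_PAIRS:
            continue  # strand-ambiguous A/T or C/G SNP
        kept.append(r)
    return kept


# Toy input: identifiers and frequencies are made up for illustration.
example = [
    {"snp": "rs1", "a1": "A", "a2": "G", "freq": 0.30},   # passes all filters
    {"snp": "rs2", "a1": "A", "a2": "T", "freq": 0.40},   # strand-ambiguous
    {"snp": "rs3", "a1": "C", "a2": "T", "freq": 0.002},  # too rare
    {"snp": "rs4", "a1": "G", "a2": "A", "freq": 0.20},   # duplicated
    {"snp": "rs4", "a1": "G", "a2": "A", "freq": 0.20},   # duplicated
]
kept_ids = [r["snp"] for r in filter_variants(example)]
print(kept_ids)
```

Only variants that survive all three checks in every input GWAS would be carried forward to the genetic correlation and meta-analysis steps.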
We prioritized significant pleiotropic SNPs that reached genome-wide significance (*P* < 5×10⁻⁸) in the multi-trait analysis and suggestive significance (*P* < 0.01) in the original single-trait GWAS.

### Genetic correlation analysis between psychiatric disorders and deep phenotypes

To investigate the potential risk factors for psychiatric disorders in East Asian populations, we included 96 GWAS summary statistics of deep phenotypes from BioBank Japan (BBJ), including diseases, biomarkers and medication usage. The genetic correlation (rg) between depression or SCZ and each phenotype was estimated by LDSC using GWAS summary statistics [21].

### Multi-trait GWAS analysis of psychiatric disorders and risk factors

Multi-trait GWAS meta-analysis of psychiatric disorders and the corresponding genetically correlated deep phenotypes was performed using the MTAG framework. We prioritized significant pleiotropic SNPs that reached genome-wide significance (*P* < 5×10⁻⁸) in the multi-trait analysis and suggestive significance (*P* < 0.01) in the original single-trait GWAS.

## Results

### Meta-analysis between depression and schizophrenia

We conducted a comprehensive meta-analysis by combining summary statistics derived from the largest GWAS of depression and SCZ within East Asian populations. Our analysis yielded three pleiotropic loci surpassing the significance threshold of *P* < 5×10⁻⁸. Among them, the 2p16.1 (rs13016665) and 18q23 (rs58736086) loci were previously reported in the study of SCZ among East Asian populations [15], while none of them was identified in the GWAS of depression (Table 1, Figure S1). The lead variant of the only novel locus, rs12031894, is located in an intergenic region near the gene *SOAT1*.
Notably, this locus reached genome-wide significance in the meta-analysis of SCZ data including both European and East Asian samples, suggesting its potential relevance to psychiatric disorders across diverse populations [15]. Additionally, the 1q25.2 locus (rs12031894) was identified as an expression quantitative trait locus (eQTL) for neighboring genes across multiple tissues (Table S2), particularly in brain tissues for the genes *ABL2*, *FAM20B*, and *TDRD5* (Figure S2).

[Table 1.](http://medrxiv.org/content/early/2024/03/01/2024.01.30.24301991/T1) Meta-analysis between depression and schizophrenia in East Asians using PLINK.

### Genetic correlation between depression and schizophrenia

To further investigate the shared genetic mechanism between depression and SCZ in East Asian populations, we estimated the genetic correlation between the two psychiatric disorders using LDSC. Consistent with findings in European populations [5, 15, 24], we observed a significant positive genetic correlation between depression and SCZ in East Asian populations (rg = 0.46, *P* = 1.48 × 10⁻⁶). In addition, the heritability estimate on the observed scale using GWAS summary statistics of East Asians was 47.05% for SCZ, which is comparable to the estimates calculated from Europeans. For depression, however, the heritability estimate among East Asians was notably lower at 0.78%, likely attributable to the relatively limited number of cases compared with European studies.

### Multi-trait analysis between depression and schizophrenia

We next applied MTAG to detect potential pleiotropic loci shared between depression and SCZ in East Asians. MTAG differs from conventional meta-analysis in its capacity to enhance the statistical power of GWAS by incorporating information from effect estimates across genetically correlated traits, and to generate trait-specific associations for each single nucleotide polymorphism (SNP).
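The gain in power from MTAG is commonly summarized as a GWAS-equivalent sample size, computed from the ratio of mean χ² statistics as described in the MTAG paper [16]. The sketch below is a minimal illustration of that calculation rather than the authors' code; the function name `gwas_equivalent_n` is hypothetical, and the inputs are the rounded mean χ² values and original sample sizes reported in this section.

```python
def gwas_equivalent_n(n_gwas, mean_chi2_gwas, mean_chi2_mtag):
    """GWAS-equivalent sample size implied by an increase in mean chi-square.

    N_equiv = N_gwas * (mean_chi2_mtag - 1) / (mean_chi2_gwas - 1)
    """
    if mean_chi2_gwas <= 1.0:
        raise ValueError("mean chi-square of the original GWAS must exceed 1")
    return n_gwas * (mean_chi2_mtag - 1.0) / (mean_chi2_gwas - 1.0)


# Inputs: original sample sizes and rounded mean chi-square values from the text.
n_dep = gwas_equivalent_n(194_548, 1.027, 1.052)  # depression
n_scz = gwas_equivalent_n(58_140, 1.265, 1.267)   # schizophrenia
print(round(n_dep), round(n_scz))
```

Because the mean χ² inputs are rounded to three decimals, the output only approximates the effective sample sizes reported for MTAG-DEP and MTAG-SCZ.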
To assess the gain in detection power, we compared the mean χ² test statistic for depression and SCZ derived from the multi-trait GWAS with that from the initial GWAS conducted in East Asian populations. The mean χ² statistics were 1.027 (GWAS-DEP) and 1.265 (GWAS-SCZ) for the initial GWAS, and 1.052 (MTAG-DEP) and 1.267 (MTAG-SCZ) for the MTAG results. We observed varying degrees of increase in the effective sample size for both depression and SCZ. Specifically, the effective sample size for SCZ (MTAG-SCZ) increased slightly from 58,140 to 58,759, while the effective sample size for depression (MTAG-DEP) showed a substantial estimated increase of 94.61%, expanding from 194,548 to 378,607. In line with the conventional meta-analysis conducted using PLINK, the newly identified signal at 1q25.2 was observed in both the MTAG-DEP and MTAG-SCZ results (Table S3, Figure S1). We also detected two additional loci at 2q33.1 (rs17590956) and 18q23 (rs28735056), which were previously reported in East Asians (Table S3, Figure S1).

### Genetic correlations between psychiatric disorders and deep phenotypes

To further elucidate the genetic correlations between psychiatric disorders and potential risk factors in East Asians, we analyzed GWAS data of 96 phenotypes sourced from BBJ, spanning diseases, biomarkers and medication usage. Our analysis uncovered significant positive genetic correlations between depression and obesity-related traits. These included weight (rg = 0.16, *P* = 0.009), body mass index (BMI, rg = 0.24, *P* = 8.0 × 10⁻⁴), type 2 diabetes (T2D, rg = 0.24, *P* = 0.003), and the use of medications associated with diabetes (rg = 0.19, *P* = 0.022). In addition, serum alkaline phosphatase levels exhibited a significant positive genetic correlation with depression.
Conversely, our analysis revealed significant negative genetic correlations between depression and two blood biomarkers: high-density lipoprotein (HDL) cholesterol and total bilirubin levels (Table S4, Figure 1A).

Likewise, we detected significant negative correlations between SCZ and certain blood lipid markers, particularly HDL (rg = -0.13, *P* = 0.006) and total cholesterol levels (TCL, rg = -0.12, *P* = 0.018). Significant negative genetic correlations were also detected between SCZ and chronic diseases, including chronic hepatitis C infection and chronic sinusitis. In contrast, SCZ was significantly correlated with heart diseases such as angina pectoris. We also found significant genetic correlations between SCZ and the use of various medications, including those for peptic ulcer and gastro-esophageal reflux disease (GORD) (rg = 0.34, *P* = 0.003) and non-steroidal anti-inflammatory and antirheumatic products (rg = 0.29, *P* = 0.012), among others (Table S4, Figure 1B).

![Figure 1.](https://www.medrxiv.org/content/medrxiv/early/2024/03/01/2024.01.30.24301991/F1.medium.gif)

[Figure 1.](http://medrxiv.org/content/early/2024/03/01/2024.01.30.24301991/F1) Genetic correlation network between psychiatric disorders and risk factors. (A) Genetic correlation network of depression. (B) Genetic correlation network of SCZ. Genetically correlated traits and diseases are clustered close together. Each circle represents a trait, and each edge represents a significant genetic correlation (PlJ