Genetic heterogeneity and subtypes of major depression ====================================================== * Thuy-Dung Nguyen * Arvid Harder * Ying Xiong * Kaarina Kowalec * Sara Hägg * Na Cai * Ralf Kuja-Halkola * Christina Dalman * Patrick F Sullivan * Yi Lu ## ABSTRACT Major depression (MD) is a heterogeneous disorder; however, the extent to which genetic factors distinguish MD patient subgroups (genetic heterogeneity) remains uncertain. This study sought evidence for genetic heterogeneity in MD. Using UK Biobank cohort, the authors defined 16 MD subtypes within eight comparison groups (vegetative symptoms, symptom severity, comorbid anxiety disorder, age at onset, recurrence, suicidality, impairment and postpartum depression; N∼3 000-47 000). To compare genetic component of these subtypes, subtype-specific genome-wide association studies were performed to estimate SNP-heritability, and genetic correlations within subtype comparison and with other related disorders or traits. The findings indicated that MD subtypes were divergent in their SNP-heritability, and genetic correlations both within subtype comparisons and with other related disorders/traits. Three subtype comparisons (vegetative symptoms, age at onset, and impairment) showed significant differences in SNP-heritability; while genetic correlations within subtype comparisons ranged from 0.55 to 0.86, suggesting genetic profiles are only partially shared among MD subtypes. Furthermore, subtypes that are more clinically challenging, e.g., early-onset, recurrent, suicidal, more severely impaired, had stronger genetic correlations with other psychiatric disorders. MD with atypical-like features showed a positive genetic correlation (*+*0.40) with BMI while a negative correlation (−0.09) was found in those without atypical-like features. Novel genomic loci with subtype-specific effects were identified. These results provide the most comprehensive evidence to date for genetic heterogeneity within MD, and suggest that the phenotypic complexity of MD can be effectively reduced by studying the subtypes which share partially distinct etiologies. Keywords * major depression * subtypes * genetic components * heterogeneity ## INTRODUCTION Major depression (MD) is a common psychiatric disorder that affects 15% of the population during lifetime.1 Individuals with MD vary considerably in symptoms, severity, course, treatment response, and neurobiology.2 MD heterogeneity is a major research and clinical challenge.3 Despite major efforts in epidemiological, clinical, and biological psychiatry, this decades-long challenge remains largely unresolved.4-6 MD subtypes have been proposed within five major categories that focused on: symptoms (typical versus atypical which characterized by improved mood in response to positive events, weight gain, increased appetite, and hypersomnia; with or without concomitant anxiety, etc.), etiology (with or without trauma or postpartum exposure), time of onset/time course (early-versus late-onset, recurrent), sex, and treatment outcome (treatment responsive versus resistant).6 Many of these subtypes, however, exhibit unclear distinctions in underlying biology, psychosocial factors, and treatment efficacy.6 One of the key biological component is genetics—the extent to which genetic factors distinguish these MD subtypes (i.e. genetic heterogeneity) is largely unknown. Given its relatively low heritability (30-40%)7, 8, identifying MD subtypes that are more heritable is of particular importance. Among the proposed subtypes, the sex difference in heritability is the most intensively studied, and current findings support that MD is more heritable in women than in men.9 Early-onset, recurrent MD, and postpartum depression have been suggested to confer higher genetic liability from family-based studies, which was subsequently confirmed using polygenic risk scores (PRS) in recent MD genome-wide association studies (GWAS).9-13 Comparisons of MD subtypes between early-versus late-onset, atypical versus non-atypical, with or without adversity have yielded interesting findings (e.g., the genetic overlap with metabolic traits was only found in MD with atypical features subtype, but not among those with non-atypical symptoms).14 The studies to-date that have used genetic approaches to index the heterogeneity of MD subtypes are encouraging (summarized in *Supplementary Table S1*) but overall impeded by a paucity of large cohorts with similar ascertainment, phenotyping, and genotyping.5 As a result, a systematic comparison across the MD subtypes is lacking and overall evidence for genetic heterogeneity within MD is inconclusive. The goal of this study was to investigate genetic heterogeneity in clinically-informed MD subtypes. To accomplish this, we systematically evaluated 16 subtypes in the unique UK Biobank (UKB) cohort with large-scale genomic data and a wide array of phenotypic measures uniformly assessed. In particular, we compared genetic components among subtypes by quantifying differences in heritability (*i*.*e*., measuring the relative importance of genetic effects on phenotypic variance) and estimating genetic correlations (*i*.*e*., to determine if underlying genetic risk factors are identical) within subtype comparisons and with other traits. ## MATERIALS and METHODS To identify MD subtypes and compare their genetic components, we carefully selected phenotypes and large-scale genotype data from the UKB. The full protocol and scripts are available via Github. ### Participant and phenotype definitions UKB is a population-based cohort of over 500 000 adults (age 37-73) from across the United Kingdom.15 UKB has phenotypic data from questionnaires, health records, biological sampling, and physical measurements. Information about mental health including MD was collected using various sources, including touchscreen questionnaires, nurse interviews, hospital admission records, and web-based mental health questionnaires (MHQ) follow-up. The UKB data profile were available elsewhere15 and briefly described in *Supplementary Methods S1*.*1*. #### MD case definition Cases were identified using five MD definitions, including (i) lifetime MD based on the Composite International Diagnostic Interview (CIDI) Short Form; (ii) ICD-coded MD based on linked hospital admission records; (iii) Probable MD based on Smith et al.16; (iv) Self-reported MD as part of past and current medical conditions; and (v) MD cardinal symptoms of anhedonia and dysphoria (*Supplementary Table S2)*. These MD definitions have been used in previous studies.17-19 Because some definitions were available only for parts of the UKB samples, to maximize sample size for MD subtypes, we included individuals who met criteria for at least one of the five MD definitions as cases. MD subtypes were all nested in the broad MD group but coming from different MD definitions (*Supplementary Table S3)*. #### MD subtypes According to major clinical features in MD, we defined 16 MD subtypes within eight comparison dimensions including (i) MD with versus without atypical-like features based on vegetative symptoms of hypersomnia and weight gain; (ii) severe versus mild/moderate MD based on symptom severity defined in Smith et al.16 or ICD codes; (iii) MD with or without comorbid anxiety disorder either self-reported or based on ICD codes; (iv) early- (≤30 years old) versus late-onset (≥44 years old) MD based on age at which first experienced a ≥2-week episode of cardinal symptoms; (v) recurrent MD vs single-episode MD based on the number of episodes self-reported or ICD codes; (vi) MD with or without suicidal thoughts or self-harm either experienced recently or during the worst episode; (vii) MD with mild, moderate, severe impairment on normal roles; and (viii) postpartum depression (PPD), either self-reported or based on ICD codes (***Table 1***; *Supplementary Methods S1*.*1, Table S4*). The majority of these subtypes are included in the five major categories proposed in the previous meta-review; while the subtypes on suicidality and on impairment are considered as outcome-based subtypes (*Supplementary Table S1)*.6 View this table: [Table 1.](http://medrxiv.org/content/early/2021/12/11/2021.03.05.21252911/T1) Table 1. MD subtypes and sample sizes #### Control group We used a common control group without lifetime history of MD to compare with all but the subtypes of comorbid anxiety disorder and PPD. From the entire UKB population, we excluded those with any indications of MD using five MD case criteria described above, and two additional exclusion criteria, help-seeking MD and antidepressant use (medication list in *Supplementary Table S5*). We further excluded those with ICD-diagnoses of anxiety disorders from the controls for the MD subtype with or without comorbid anxiety disorder. For PPD, we restricted controls to women who reported giving at least one live birth. (*Supplementary Table S2)* #### Exclusionary criteria for cases and controls We excluded any case or control who met lifetime criteria for schizophrenia, schizoaffective disorder, and bipolar disorder I (including unipolar mania) *(Supplementary Table S2)*. Thus, anyone who had ICD-diagnosis of schizophrenia/psychosis, bipolar disorder, mania or reported any use of antipsychotics or lithium for psychiatric symptoms (*Supplementary Table S5*) were excluded from analyses. Application of these criteria removed 2 385 MD cases and 231 controls (*Supplementary Figure S1)*. ### Genotyping, quality control, imputation Genotype data were available for 488 363 UKB participants, after a stringent quality control procedure and imputation using combined reference panels of Haplotype Reference Consortium (HRC) and UK10K merged with 1000 Genomes phase 3.15 459 590 individuals remained after the exclusion of subjects with low-quality genotype data, without both genotype and phenotype data, consent withdrawal, and non-European ancestry. Ancestry outliers were determined based on Price et al. (2006)20 with a threshold of three standard deviation from the mean. (*Supplementary Figure S1)*. ### Statistical analysis #### Genome-wide association studies (GWAS) We generated GWAS summary statistics for MD subtypes to estimate SNP-heritability (*h**2**SNP*) and genetic correlations for computational efficiency. In the UKB, about 30% of the participants were found to be related to at least one other person in the cohort up to the 3rd degree.15 Cryptic relatedness within sample could bias results in GWAS, while restricting to the unrelated individuals would cause a major loss of statistical power. We therefore performed the mixed linear model-based GWAS analysis (fastGWA) to retain related individuals in the UKB.21 We first constructed a sparse genetic relationship matrix (GRM) for all included individuals of European ancestries, and then conducted case-control GWAS for each subtype using fastGWA module in GCTA21, adjusting for sex, age, and the first 10 PCs (*Supplementary Methods S1*.*2)*. For subtype-specific GWAS with genome-wide significant SNPs (p≤5×10−8), we identified independent genomic loci using SNP2GENE module in FUMA22 (details setting in *Supplementary Methods S1*.*2*); then compared our loci with the latest published MD GWAS results which consisted of samples from the Psychiatric Genomics Consortium (PGC), UKB, and 23andMe.19 #### SNP-Heritability We estimated SNP-heritability (*h**2**SNP*) for each MD subtype using linkage disequilibrium score regression (LDSC).23 LDSC estimates *h**2**SNP* by regressing GWAS summary statistics on LD scores estimated from a reference population (1000 genomes European samples). We report the *h**2**SNP* estimates on the observed scale assuming 50:50 case-control ascertainment24 (***Figure 1a***; *Supplementary Methods S1*.*2*). For comparison, we also converted the estimates to the liability scale using two formulas: the standard conversion based on Lee et al. (2011)25, and the Yap et al. (2018)26 conversion which takes into account extreme phenotype selection (***Figure 1b***; *Supplementary Table S6, Figure S3)*. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/12/11/2021.03.05.21252911/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2021/12/11/2021.03.05.21252911/F1) Figure 1. SNP-heritability and pair-wise genetic correlation for MD subtypes. **(a)** *h**2**SNP* on observed scale assuming 50:50 case-control ascertainment for each MD subtype. The bars show point estimates. The error bar shows 95% CI. Same color coding is used for subtypes in the same comparison group. The horizontal line shows SNP-heritability for the broad MD phenotype (*h**2**SNP*=0.062). **b)** *h**2**SNP* of MD subtypes on the liability-scale using Yap et al. (2018) for a range of population case prevalence. Each panel shows one comparison group. Shaded areas show 95% CI for *h**2**SNP* on liability scale. Population control prevalence is fixed for each subtype as in *Supplementary Table S6*. **(c)** Pair-wise genetic correlation between subtypes within comparison groups. Error bars show 95% CI. The horizontal line shows the expected genetic correlation between subtypes under the null hypothesis (*H***: *r**g* = 1). Result from simulations where MD cases were randomly split into two halves (with 100 replicates) showed that the expected value of *r**g* was not significantly different from the null (mean=1.04, 95% CI= 0.98-1.10). Co-anxiety: MD with comorbid anxiety; Non-co. anxiety: MD without comorbid anxiety. Colors indicate the same comparison group as in (a). When comparing *h**2**SNP* estimates within subtype comparisons, because common controls were used, we primarily considered that estimates are significantly different when non-overlapping confidence intervals are presented. We further performed statistical tests to confirm significance by splitting controls into random subsets (*Supplementary Methods S1*.*2, Table S9*). To ensure that the potential disproportionate power gain across subtype by modelling relatedness in fastGWA did not affect our *h**2**SNP* comparisons, we also estimated *h**2**SNP* based on unrelated samples (*Supplementary Methods S1*.*2, Figure S4*). #### Genetic correlation Genetic correlations (*r**g*) were estimated using High-Definition Likelihood (HDL) method which yields more precise estimates of genetic correlations than LDSC *(Supplementary Methods S1*.*2)*.27 We estimated *r**g* within subtype comparisons using the LD reference computed from 336 000 Genomic British individuals in the UKB.27 To benchmark the expected *r**g* under the null hypothesis (H: *r**g* = 1) in this population, we conducted a simulation analysis where MD cases were randomly split into two halves and estimated *r**g* between those two dummy-subtypes. We repeated the analysis 100 times, and calculated the mean (*Figure legend 1*). To examine whether the subtypes differ in their genetic overlap with other psychiatric disorders and traits, we also estimated genetic correlations between these MD subtypes and 11 traits, six psychiatric disorders, neuroticism, self-reported well-being, body mass index, and two cognitive traits (***Figure 2***) and compared results within subtype comparisons. These disorders and traits were chosen given the strong evidence for their genetic correlations with MD, or in some cases, for their causal effects on MD.13, 19 We have used the summary statistics from the latest published GWAS for the calculations of *r**g* using HDL.19, 28-37 ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2021/12/11/2021.03.05.21252911/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2021/12/11/2021.03.05.21252911/F2) Figure 2. Genetic correlations (*r**g*) between MD subtypes with other psychiatric disorders and related traits. Each panel shows *r**g* with other traits for each subtype comparison. Last panel shows the comparison between postpartum depression and broad MD. *r**g* with other traits for each subtype are in different colors. Error bars show 95% CI. Vertical dash lines in each panel at *r**g=*0. Horizontal dash line separates psychiatric and other traits. *Co-anxiety*: MD with comorbid anxiety; *Non-co. anxiety*: MD without comorbid anxiety. #### Sensitivity analyses To examine whether our broad MD definition that included less strictly defined cases may bias results, we further restricted the analyses to the CIDI-based definition—previously suggested as the closest to the gold standard for diagnosing MD in the UKB17, 38—and performed similar analyses for all subtypes except impairment (*Supplementary Methods S1*.*2*). ## RESULTS Of 459 590 individuals included in this study (54% females, mean age at recruitment 57 (SD 8.00)), 126 506 (27.53%) met at least one of the five definitions for MD (*i*.*e*., broad MD phenotype). After applying exclusion criteria, we retained 124 121 cases and 250 229 controls. Compared with controls, MD cases had more females (64% vs 47%), higher Townsend deprivation index which measures material deprivation with a higher score implies a greater degree of deprivation (mean −1.33, SD 3.02 vs −x1.63, SD 2.90), more lifetime smokers (57% vs 52%), but did not differ in mean BMI (mean 27.3, SD 4.6 vs 27.3, SD 5.0). The estimates of *h**2**SNP* varied across the five MD case definitions, and for the broad MD phenotype it was 6.18% (95% CI= 5.65-6.71%) on the observed scale assuming 50:50 case-control ascertainment. ### Differences in genetic components reflect subtype heterogeneity Overall, *h**2**SNP* estimates tended to be higher in MD subtypes with more severe manifestation (e.g., MD with atypical-like features, recurrent, PPD, severe impairment and severe symptoms subtypes) (***Figure 1a***). All of the subtype comparisons had higher *h**2**SNP* estimates for the more severe manifestation, and three (vegetative symptoms, age at onset, and impairment) showed significant differences in *h**2**SNP* estimates (***Figure 1a-b***, *Supplementary Table S9*). All examined genetic correlations within comparisons were significantly less than one and the estimates ranged between 0.55-0.86 (***Figure 1c***; *pairwise phenotypic and genetic correlation in Supplementary Figure S5*). The *h**2**SNP* estimate for MD with atypical-like features was the highest among all subtypes, and it was more than twice as high as the estimate for MD without atypical-like features with a non-overlapping 95% CI (19.04%, CI= 10.89-27.19% and 7.53%, CI=6.63-8.43%). The genetic correlation between MD subtypes with and without atypical-like features was the lowest among all comparisons (*r**g*=0.55, CI=0.43-0.67) (***Figure 1c***). The two subtypes did not significantly differ in their genetic correlations with PGC MD *(****Figure 2****)*; instead major differences were found in their correlations with anorexia nervosa and ADHD. Consistent with previous findings14, 39, 40, MD with atypical-like features showed a strong positive *r**g* with BMI (0.40, CI=0.34-0.46) while non-atypical-like features MD showed a small negative *r**g* instead (*r**g*=-0.09, CI=-0.13 to −0.06). Furthermore, positive correlations with cognitive traits were observed in non-atypical-like features MD (*r**g*=0.36 and 0.35 with educational attainment and intelligence) which were not found in MD with atypical-like features (corresponding *r**g*= 0.04 and 0.07). The MD subtype with severe symptoms had slightly higher *h**2**SNP* estimate than the one with mild/moderate symptoms, although the two estimates were not significantly different. The *r**g* within comparison was significantly lower than 1 (0.80, CI=0.68-0.92). However, the two subtypes did not differ in their correlations with other traits except for a stronger *r**g* with schizophrenia found in the subtype with severe symptoms *(****Figure 1-2****)*. Assuming the proportions of MD cases with and without comorbid anxiety disorder at 55% and 45%, respectively41, the former subtype was more heritable than the latter (*h**2**SNP*=12.73%, CI= 11.32-14.14%, for MD with comorbid anxiety disorder, compared with 11.52%, CI=9.64-13.40%, for MD without anxiety disorder). The *r**g* within this comparison was 0.80 (CI=0.72-0.88) (***Figure 1***). Furthermore, the subtype with comorbid anxiety disorder showed higher genetic correlations with MD, schizophrenia and neuroticism, as well as lower correlations with cognitive traits, when compared with the subtype without anxiety disorder (***Figure 2***). The *h**2**SNP* of early-onset MD was three times higher than that of the late-onset subtype (13.04%, CI=11.65-14.43% compared with 4.26%, CI=3.22-5.30%). The *r**g* within comparison was 0.76 (CI=0.68-0.84). (***Figure 1***). Significant differences in their *r**g* with other traits were observed, including higher genetic correlations in early-onset MD with PGC MD, schizophrenia, anorexia nervosa, and autism spectrum disorder, than in late-onset MD (***Figure 2***). Recurrent showed a higher *h**2**SNP* estimates than single-episode MD, 10.67% (CI=9.38-11.96%) vs 8.22% (CI=6.59-9.85%). Their *r**g* was significantly lower than one (0.83, CI=0.73-0.93) *(Figure 1)*. Compared with single-episode cases, recurrent MD had stronger positive correlations with schizophrenia, bipolar disorder, anorexia nervosa, while lower genetic correlation with BMI (***Figure 2***). The MD subtype with suicidal thoughts was slightly more heritable than the subtype without albeit the CI largely overlapped (8.79%, CI=7.75-9.83% and 7.98%, CI=6.98-8.98%). The *r**g* within this comparison was 0.79 (CI=0.73-0.85). The two subtypes in this comparison significantly differed in their genetic correlations with the majority of the other traits considered. Compared with the subtype without suicidal thoughts, the suicidal subtype showed substantially higher positive *r**g* with PGC MD, schizophrenia, neuroticism, and negative *r**g* with well-being; while its *r**g* with cognitive traits was much weaker *(****Figure 1-2****)*. For subtypes based on impairment, the *h**2**SNP* estimates increased with the degree of impairment, roughly in a dose-response relationship, *i*.*e*., mild impairment had the lowest *h**2**SNP* (6.00%, CI=4.98-7.02%), followed by moderate (9.08%, CI=7.79-10.37%) and severe impairment (11.27%, CI=9.84-12.70%). This dose-response relationship was also reflected in the pair-wise genetic correlation estimates, with the *r**g* comparing mild and severe impairment (0.65, CI=0.59-0.71) markedly lower than the other two correlations (***Figure 1***). We observed a clear trend, that is, the more severe impairment in the subtype, the stronger genetic correlation it had with other psychiatric disorders and neuroticism (positive *r**g*), and with self-reported well-being (negative *r**g*), while less severe impairment was more strongly associated with cognitive traits (positive *r**g*) and with BMI (negative *r**g*) *(****Figure 2****)*. The *h**2**SNP* of PPD was estimated at 10.73% (CI=6.28-15.18%) which was higher compared with *h**2**SNP* of broad MD phenotype. PPD showed significant positive *r**g* with other psychiatric disorders, with the strongest *r**g* observed in PGC MD as expected (0.61, CI=0.53-0.69), and with neuroticism (*r**g*=0.34) and cognitive traits (*r**g*=0.35 and 0.41 with educational attainment and intelligence), and a negative *r**g* with well-being (*r**g**=*-0.39) *(****Figure 2****)*. The broad MD definition was used above to allow sufficient statistical power in analyzing each subtype. We further assessed the impact of MD definition by performing a sensitivity analysis based on more strictly defined MD cases. The *h**2**SNP* of the CIDI-based definition was 13.12%, CI=11.12-15.12% (*Supplementary Figure S2*). Restricting the analyses to the CIDI-based cases, the results were highly similar, except for the comparisons of symptom severity and recurrence, where the CIs of the *r**g* estimates now included one due to markedly reduced sample sizes in these subtypes (*Supplementary Table S8*). ### Stratified GWAS reveal novel subtype-specific loci Over all 16 subtype-specific GWAS, we identified 47 genome-wide significant loci (45 non-overlapping) associated with nine subtypes. Less than half (22 loci) were significant in our largest GWAS of broad MD. Comparing with the latest published MD GWAS19, we found 14 loci that have not been reported on MD, with 3 for early-onset, 3 for recurrent, 3 for suicidal MD, 2 for non-suicidal, 1 for non-atypical-like features, 1 for moderate impairment and 1 for PPD (***Table 2***; full results on the 45 loci in *Supplementary Table S7*). The majority (64%) of these 14 loci showed no statistically significant association with the other subtype in comparison (*P*>0.05; *Supplementary Table S7*), suggesting subtype-specific effects. The chromosome 2 locus for recurrent MD, with the leading SNP rs6431690, was significant even after the stringent Bonferroni correction (*P*<3.125*10−9). View this table: [Table 2.](http://medrxiv.org/content/early/2021/12/11/2021.03.05.21252911/T2) Table 2. 14 genome-wide significant loci from MD subtype-specific GWAS, undetected in the Howard *et al*. 2019 ## DISCUSSION In this comprehensive report using the large-scale UKB data, we compared the genetic components of 16 MD subtypes and demonstrated that these subtypes were divergent in their *h**2**SNP* and genetic correlations both within subtype comparisons and with other related disorders/traits. Our results provide convincing evidence for genetic heterogeneity within MD, as indexed by its clinical subtypes. These findings suggest that the complexity in the phenotype of MD can be effectively reduced by studying the subtypes which share partially distinct etiologies. In particular, we note the following key findings: First, clinically-informed subtypes are, in general, genetically more homogeneous than considering all types of MD together. Accurately identifying more homogenous forms is the first step to reduce heterogeneity in MD. The majority of the subtypes showed higher estimates of *h**2**SNP* compared with MD of all forms. Our results corroborated previous findings from family-based studies that early-onset, recurrent MD and PPD represent more heritable MD subtypes.10, 12 We further extended the list to include almost all subtypes based on our eight investigated clinical indices. Among those, MD with atypical-like features, severe episode, MD with or without comorbid anxiety disorder, and with severe impairment showed considerably higher heritability. In contrast, subtypes with lower heritability than all-form MD are those with mild impairment or with late onset. Second, we demonstrated subtype heterogeneity in both *h**2**SNP* and genetic correlations. All subtype comparisons showed non-identical genetic sharing (*i*.*e*., *r**g* between subtypes significantly differ from unity) and some had heritability differences (*i*.*e*., *h**2**SNP* significantly differ between subtypes). Interestingly, the subtype comparisons on vegetative symptoms, age at onset, and impairment showed the strongest evidence for genetic heterogeneity, meaning these clinical features characterize major etiological differences within MD. However, the observed genetic correlations across subtype comparisons were moderate to high, 0.55-0.86, revealing substantial genetic overlaps between subtypes. The level of genetic correlation can be translated into the proportion of genetic variance in one trait attributable to that of another (*r**g*2).17 Thus, it would suggest about 30-70% of genetic variances are shared within subtype comparisons. In line with previous estimates of genetic correlations between male versus female MD9 and across MD symptoms42, our findings confirm that the genetic profiles of MD subtypes are only partially distinct. The estimates of genetic correlations between subtypes need to be benchmarked against genetic correlations between different psychiatric disorders (e.g., schizophrenia and bipolar disorder, two clinically distinct psychiatric disorders, had a *r**g* of ∼0.7031), between different datasets but with same phenotype (e.g., mean *r**g*∼0.76 across the seven cohorts at PGC MD13), and between different populations (e.g., *r**g*∼1 between schizophrenia samples of East Asian and European ancestries43). Genetic correlations can be found lower than one due to differences in phenotype definitions, populations, or technical factors44. In this study, we minimized these potential differences by using the single large sample from UKB. We also restricted the estimation of genetic correlations to within subtype comparisons instead of pair-wise comparisons across all subtypes, to limit the impact of phenotypic differences between subtypes (*e*.*g*., we found mean *r**g* across all subtypes was indeed lower than that within comparison groups). Our genetic correlation estimates are thus reliable for quantifying overall genetic sharing between MD subtypes. Third, the MD subtypes preserve the overall pattern of genetic sharing found between MD (of all forms) and other psychiatric disorders, but differ in the relationships with other traits. MD was shown to be positively correlated with many psychiatric disorders (e.g., *r**g*∼0.3 with schizophrenia and bipolar disorders) and with BMI (*r**g*=0.09), and negatively correlated with educational attainment (*r**g*=-0.13).13, 19 A similar level of genetic correlations was found between MD subtypes and other psychiatric disorders; notably, we found stronger correlations in the MD subtypes that are more clinically challenging, especially early-onset, recurrent, suicidal, more severely impaired. Regarding their relationships with other traits, MD subtypes showed some differences compared with the broad MD phenotype. The positive correlation found between MD and BMI was only detected in MD with atypical-like features, but with a markedly higher estimate (*r**g*∼0.4) than the estimate based on broad MD phenotype. This result concurred with previous findings mainly using PRS or other samples.14, 39, 40, 45 In contrast with the negative value found in the broad MD, we found positive correlations with educational attainment in many MD subtypes. However, this finding might be specific to the UKB cohort as previous research have shown that participation in mental health survey and other optional components is genetically correlated with higher education and intelligence.46 Taken together, our findings provide an improved understanding on heritable MD subtypes and overall genetic sharing between subtypes. These results have strong implications in the gene mapping strategies for MD. Current efforts predominantly aim to maximize samples size. The alternative strategy—to reduce phenotypic heterogeneity through more homogeneous phenotype—has not been fully evaluated, potentially due to theoretical and methodological challenges.47 This strategy relies on the premise that “clinical heterogeneity in MD emerges from an aggregation of different underlying liabilities expressed through partially distinct biological pathways”47 which, to the best knowledge, was not proven. Limited by a lack of large-scale dataset with deep phenotyping, prior studies were only able to focus on a few key subtypes.5, 47 Our comprehensive report, by contrast, convincingly demonstrated genetic heterogeneity in MD, and thus forms a strong theoretical basis for this strategy. We further illustrated the potential of such strategy by performing stratified GWAS on each subtype. This yielded the identification of 47 independent genomic loci, a third of which were undetected in the latest MD GWAS with about 5- to 10-fold more cases than in our subtype-specific analyses. These results warrant further replications in large biobanks with consistent genotyping and phenotyping. Future data collections in MD may benefit from assessing key clinical characteristics and utilizing them to reduce MD heterogeneity. Here we used the UKB data which provide the unique opportunity to evaluate multiple subtypes with sufficient statistical power. We, however, note the following limitations in the context of interpreting the results. First, we were unable to study all MD subtypes, especially rare subtypes like psychotic, seasonal, treatment-related subtypes, as more refined clinical and treatment data would be required. We also acknowledge that the quality of phenotypic definitions varied across the subtypes studied, with those relying on self-reported and retrospective recalls of symptoms more compromised than the others. Together with the varying prevalence and underlying genetic architectures, statistical power varied across subtypes and the power gain using fastGWA may not be proportional to subtypes. “Healthy volunteer bias” was known for UKB48 and likely to contribute to part of our results. Finally, we used theory-driven subtyping approach in this study. New methods using data-driven approaches might hold great promises for novel subtype identification and validation. Etiological heterogeneity hinders treatment efficacy. Our finding of ubiquitous subtype heterogeneity within MD underscores the potential of drug development and treatment optimization for patient subgroups to achieve precision psychiatry. ## Supporting information Supplementary methods, tables and figures [[supplements/252911_file03.pdf]](pending:yes) ## Data Availability Data used for this manuscript is not available for public access. Please contact the corresponding author for questions regarding data. Code that was used to generate the results is available on GitHub link below. [https://github.com/Thuy-Dung-Nguyen/MD-subtypes](https://github.com/Thuy-Dung-Nguyen/MD-subtypes) ## URLs Full protocol and scripts available via Github: [https://github.com/Thuy-Dung-Nguyen/MD-subtypes](https://github.com/Thuy-Dung-Nguyen/MD-subtypes); UK Biobank Showcase User Guide (2017): [http://biobank.ctsu.ox.ac.uk/crystal/crystal/exinfo/ShowcaseUserGuide.pdf](http://biobank.ctsu.ox.ac.uk/crystal/crystal/exinfo/ShowcaseUserGuide.pdf); UK Biobank-Mental health web-based questionnaire (2017): [http://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/mental\_health\_online.pdf](http://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/mental_health_online.pdf); GCTA-fastGWA: [https://cnsgenomics.com/software/gcta/#fastGWA](https://cnsgenomics.com/software/gcta/#fastGWA); FUMA: [https://fuma.ctglab.nl](https://fuma.ctglab.nl); LDSC: [https://github.com/bulik/ldsc](https://github.com/bulik/ldsc); HDL: [https://github.com/zhenin/HDL](https://github.com/zhenin/HDL); Howard et al. 2019 MD GWAS summary results: [https://datashare.is.ed.ac.uk/handle/10283/3203](https://datashare.is.ed.ac.uk/handle/10283/3203). ## Competing interests The authors declare no competing interests. Dr. Sullivan reports the following potentially competing financial interests: current: Lundbeck (advisory committee, grant recipient), RBNC Therapeutics (advisory committee, stock ownership); past three years: Pfizer (scientific advisory board), Element Genomics (consultation fee), and Roche (speaker reimbursement). All other authors report no biomedical financial interests or potential conflicts of interest related to this work. ## Contributions YL conceived the idea, designed and supervised the implementation of the study, interpreted the results, revised the manuscript. TDN conducted the analyses, interpreted the results, drafted, and revised the manuscript. AH, YX contributed to the analyses, and solving technical issues. KK, NC, RKH, CD, SH contributed to the study design, discussion on methodological issues and interpretation of the results. SH provided access to the data. PFS contributed to the discussions on methods, interpretation of the results and revising the manuscript. All authors discussed and commented on the manuscript. ## Acknowledgement This research has been conducted using the UK Biobank Resource under Application Number 22224. This study was funded by the US NIMH grant (R01 MH123724) and the European Union’s Horizon 2020 research and innovation program under grant agreement No 847776. KK was supported by the University of Manitoba and US NIMH (R01 MH123724). PFS was supported by the Swedish Research Council (Vetenskapsrådet, award D0886501), the Horizon 2020 Program of the European Union (COSYN, RIA grant agreement n° 610307), and US NIMH (U01 MH109528 and R01 MH077139). YL is in part supported by a 2018 NARSAD Young Investigator Grant from the Brain & Behaviour Research Foundation and US NIMH (R01 MH123724). The computations were enabled by resources provided by the Swedish National Infrastructure for Computing (SNIC) at UPPMAX server partially funded by the Swedish Research Council through grant agreement no. 2018-05973. * Received March 5, 2021. * Revision received December 10, 2021. * Accepted December 11, 2021. * © 2021, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## References 1. 1.World Health O. Depression and other common mental disorders: global health estimates. Geneva: World Health Organization; 2017 2017. 2. 2.Fried EI, Nesse RM. Depression is not a consistent syndrome: An investigation of unique symptom patterns in the STAR*D study. J Affect Disord 2015; 172: 96–102. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jad.2014.10.010&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25451401&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 3. 3.Flint J, Kendler KS. The genetics of major depression. Neuron 2014; 81(3): 484–503. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuron.2014.01.027&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24507187&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 4. 4.Beijers L, Wardenaar KJ, van Loo HM, Schoevers RA. Data-driven biological subtypes of depression: systematic review of biological approaches to depression subtyping. Molecular Psychiatry 2019; 24(6): 888–900. 5. 5.Cai N, Choi KW, Fried EI. Reviewing the genetics of heterogeneity in depression: operationalizations, manifestations and etiologies. Human Molecular Genetics 2020; 29(R1): R10–R18. 6. 6.Harald B, Gordon P. Meta-review of depressive subtyping models. J Affect Disord 2012; 139(2): 126–140. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jad.2011.07.015&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21885128&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 7. 7.Polderman TJC, Benyamin B, de Leeuw CA, Sullivan PF, van Bochoven A, Visscher PM et al. Meta-analysis of the heritability of human traits based on fifty years of twin studies. Nature Genetics 2015; 47(7): 702–709. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3285&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25985137&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 8. 8.Sullivan PF, Neale MC, Kendler KS. Genetic epidemiology of major depression: review and meta-analysis. Am J Psychiatry 2000; 157(10): 1552–1562. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1176/appi.ajp.157.10.1552&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11007705&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000089633900002&link_type=ISI) 9. 9.Kendler KS, Ohlsson H, Lichtenstein P, Sundquist J, Sundquist K. The Genetic Epidemiology of Treated Major Depression in Sweden. American Journal of Psychiatry 2018; 175(11): 1137–1144. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1176/appi.ajp.2018.17111251&link_type=DOI) 10. 10.Fernandez-Pujals AM, Adams MJ, Thomson P, McKechanie AG, Blackwood DHR, Smith BH et al. Epidemiology and Heritability of Major Depressive Disorder, Stratified by Age of Onset, Sex, and Illness Course in Generation Scotland: Scottish Family Health Study (GS:SFHS). PloS one 2015; 10(11): e0142197–e0142197. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0142197&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26571028&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 11. 11.Power RA, Tansey KE, Buttenschøn HN, Cohen-Woods S, Bigdeli T, Hall LS et al. Genome-wide Association for Major Depression Through Age at Onset Stratification: Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium. Biological Psychiatry 2017; 81(4): 325–335. 12. 12.Viktorin A, Meltzer-Brody S, Kuja-Halkola R, Sullivan PF, Landén M, Lichtenstein P et al. Heritability of Perinatal Depression and Genetic Overlap With Nonperinatal Depression. Am J Psychiatry 2016; 173(2): 158–165. 13. 13.Wray NR, Ripke S, Mattheisen M, Trzaskowski M, Byrne EM, Abdellaoui A et al. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nature Genetics 2018; 50(5): 668–681. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0090-3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29700475&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 14. 14.Milaneschi Y, Lamers F, Peyrot WJ, Abdellaoui A, Willemsen G, Hottenga JJ et al. Polygenic dissection of major depression clinical heterogeneity. Mol Psychiatry 2016; 21(4): 516–522. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/mp.2015.86&link_type=DOI) 15. 15.Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 2018; 562(7726): 203–209. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-018-0579-z&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30305743&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 16. 16.Smith DJ, Nicholl BI, Cullen B, Martin D, Ul-Haq Z, Evans J et al. Prevalence and Characteristics of Probable Major Depression and Bipolar Disorder within UK Biobank: Cross-Sectional Study of 172,751 Participants. PLOS ONE 2013; 8(11): e75362. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0075362&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24282498&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 17. 17.Cai N, Revez JA, Adams MJ, Andlauer TFM, Breen G, Byrne EM et al. Minimal phenotyping yields genome-wide association signals of low specificity for major depression. Nature Genetics 2020; 52(4): 437–447. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-020-0594-5&link_type=DOI) 18. 18.Hall LS, Adams MJ, Arnau-Soler A, Clarke TK, Howard DM, Zeng Y et al. Genome-wide meta-analyses of stratified depression in Generation Scotland and UK Biobank. Transl Psychiatry 2018; 8(1): 9. 19. 19.Howard DM, Adams MJ, Clarke T-K, Hafferty JD, Gibson J, Shirali M et al. Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions. Nature Neuroscience 2019; 22(3): 343–352. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 20. 20.Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 2006; 38(8): 904–909. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng1847&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16862161&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000239325700019&link_type=ISI) 21. 21.Jiang L, Zheng Z, Qi T, Kemper KE, Wray NR, Visscher PM et al. A resource-efficient tool for mixed model association analysis of large-scale data. Nature Genetics 2019; 51(12): 1749–1755. 22. 22.Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nature Communications 2017; 8(1): 1826. 23. 23.Bulik-Sullivan BK, Loh P-R, Finucane HK, Ripke S, Yang J, Patterson N et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature Genetics 2015; 47(3): 291–295. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3211&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25642630&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 24. 24.Peyrot WJ, Price AL. Identifying loci with different allele frequencies among cases of eight psychiatric disorders using CC-GWAS. Nature Genetics 2021; 53(4): 445–454. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-021-00787-1&link_type=DOI) 25. 25.Lee SH, Wray NR, Goddard ME, Visscher PM. Estimating missing heritability for disease from genome-wide association studies. American journal of human genetics 2011; 88(3): 294–305. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2011.02.002&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21376301&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000288589000007&link_type=ISI) 26. 26.Yap CX, Sidorenko J, Marioni RE, Yengo L, Wray NR, Visscher PM. Misestimation of heritability and prediction accuracy of male-pattern baldness. Nat Commun 2018; 9(1): 2537. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41467-018-04807-3&link_type=DOI) 27. 27.Ning Z, Pawitan Y, Shen X. High-definition likelihood inference of genetic correlations across human complex traits. Nature Genetics 2020; 52(8): 859–864. 28. 28.Demontis D, Walters RK, Martin J, Mattheisen M, Als TD, Agerbo E et al. Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nat Genet 2019; 51(1): 63–75. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0269-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30478444&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 29. 29.Grove J, Ripke S, Als TD, Mattheisen M, Walters RK, Won H et al. Identification of common genetic risk variants for autism spectrum disorder. Nat Genet 2019; 51(3): 431–444. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-019-0344-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30804558&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 30. 30.Watson HJ, Yilmaz Z, Thornton LM, Hübel C, Coleman JRI, Gaspar HA et al. Genome-wide association study identifies eight risk loci and implicates metabo-psychiatric origins for anorexia nervosa. Nat Genet 2019; 51(8): 1207–1214. 31. 31.Stahl EA, Breen G, Forstner AJ, McQuillin A, Ripke S, Trubetskoy V et al. Genome-wide association study identifies 30 loci associated with bipolar disorder. Nat Genet 2019; 51(5): 793–803. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-019-0397-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31043756&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 32. 32.Lee JJ, Wedow R, Okbay A, Kong E, Maghzian O, Zacher M et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat Genet 2018; 50(8): 1112–1121. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0147-3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30038396&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 33. 33.Savage JE, Jansen PR, Stringer S, Watanabe K, Bryois J, de Leeuw CA et al. Genome-wide association meta-analysis in 269,867 individuals identifies new genetic and functional links to intelligence. Nat Genet 2018; 50(7): 912–919. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41588-018-0152-6&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29942086&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 34. 34.Ripke S, Walters JTR, Donovan MC. Mapping genomic loci prioritises genes and implicates synaptic biology in schizophrenia. medRxiv 2020: 2020.2009.2012.20192922. 35. 35.Pulit SL, Stoneman C, Morris AP, Wood AR, Glastonbury CA, Tyrrell J et al. Meta-analysis of genome-wide association studies for body fat distribution in 694 649 individuals of European ancestry. Hum Mol Genet 2019; 28(1): 166–174. 36. 36.Okbay A, Baselmans BM, De Neve JE, Turley P, Nivard MG, Fontana MA et al. Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses. Nat Genet 2016; 48(6): 624–633. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/ng.3552&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27089181&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 37. 37.Nagel M, Jansen PR, Stringer S, Watanabe K, de Leeuw CA, Bryois J et al. Meta-analysis of genome-wide association studies for neuroticism in 449,484 individuals identifies novel genetic loci and pathways. Nat Genet 2018; 50(7): 920–927. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 38. 38.Glanville KP, Coleman JRI, Howard DM, Pain O, Hanscombe KB, Jermy B et al. Multiple measures of depression to enhance validity of major depressive disorder in the UK Biobank. BJPsych Open 2021; 7(2): e44. 39. 39.Badini I, Coleman JRI, Hagenaars SP, Hotopf M, Breen G, Lewis CM et al. Depression with atypical neurovegetative symptoms shares genetic predisposition with immuno-metabolic traits and alcohol consumption. Psychol Med 2020: 1–11. 40. 40.Milaneschi Y, Lamers F, Penninx BWJH. Dissecting Depression Biological and Clinical Heterogeneity—The Importance of Symptom Assessment Resolution. JAMA Psychiatry 2021; 78(3): 341–341. 41. 41.Kaufman J, Charney D. Comorbidity of mood and anxiety disorders. Depress Anxiety 2000; 12 Suppl 1: 69–76. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/1520-6394(2000)12:1+<69::AID-DA9>3.0.CO;2-K&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11098417&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000165269900009&link_type=ISI) 42. 42.Thorp JG, Marees AT, Ong J-S, An J, MacGregor S, Derks EM. Genetic heterogeneity in self-reported depressive symptoms identified through genetic analyses of the PHQ-9. Psychological Medicine 2020; 50(14): 2385–2396. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/S0033291719002526&link_type=DOI) 43. 43.Lam M, Chen CY, Li Z, Martin AR, Bryois J, Ma X et al. Comparative genetic architectures of schizophrenia in East Asian and European populations. Nat Genet 2019; 51(12): 1670–1678. 44. 44.Baselmans BML, Yengo L, van Rheenen W, Wray NR. Risk in Relatives, Heritability, SNP-Based Heritability, and Genetic Correlations in Psychiatric Disorders: A Review. Biological Psychiatry 2021; 89(1): 11–19. 45. 45.Milaneschi Y, Lamers F, Peyrot WJ, Baune BT, Breen G, Dehghan A et al. Genetic Association of Major Depression With Atypical Features and Obesity-Related Immunometabolic Dysregulations. JAMA Psychiatry 2017; 74(12): 1214–1225. 46. 46.Tyrrell J, Zheng J, Beaumont R, Hinton K, Richardson TG, Wood AR et al. Genetic predictors of participation in optional components of UK Biobank. Nature Communications 2021; 12(1): 886. 47. 47.Schwabe I, Milaneschi Y, Gerring Z, Sullivan PF, Schulte E, Suppli NP et al. Unraveling the genetic architecture of major depressive disorder: merits and pitfalls of the approaches used in genome-wide association studies. Psychol Med 2019; 49(16): 2646–2656. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/s0033291719002502&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom) 48. 48.Fry A, Littlejohns TJ, Sudlow C, Doherty N, Adamska L, Sprosen T et al. Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants With Those of the General Population. American Journal of Epidemiology 2017; 186(9): 1026–1034. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/aje/kwx246&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28641372&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2021%2F12%2F11%2F2021.03.05.21252911.atom)