Exome sequencing in bipolar disorder reveals shared risk gene AKAP11 with schizophrenia

Here we report results from the Bipolar Exome (BipEx) collaboration analysis of whole exome sequencing of 13,933 individuals diagnosed with bipolar disorder (BD), matched with 14,422 controls. We find an excess of ultra-rare protein-truncating variants (PTVs) in BD patients among genes under strong evolutionary constraint, a signal evident in both major BD subtypes, bipolar I disorder (BD1) and bipolar II disorder (BD2). We also find an excess of ultra-rare PTVs within genes implicated by a recent schizophrenia exome meta-analysis (SCHEMA; 24,248 SCZ cases and 97,322 controls) and among binding targets of CHD8. Genes implicated by GWAS of BD, however, are not significantly enriched for ultra-rare PTVs. Combining BD gene-level results with SCHEMA, AKAP11 emerges as a definitive risk gene (ultra-rare PTVs seen in 33 cases and 13 controls, OR = 7.06, P = 2.83 × 10^-9). At the protein level, AKAP-11 is known to interact with GSK3B, the hypothesized target of lithium, one of the few treatments for BD. Overall, our results lend further support to the polygenic basis of BD and demonstrate a role for rare coding variation as a significant risk factor in BD onset.


Introduction
Bipolar disorder (BD) is a heritable neuropsychiatric disorder characterized by episodes of (hypo-)mania and episodes of depression. Across the lifetime, BD has a prevalence of 1-2% of the population, often with onset in early adulthood. Bipolar disorder is a chronic condition that affects individuals across their lifespan and is a significant source of disease burden worldwide (1). Meta-analysis of 24 twin studies estimated the broad heritability of BD at around 67% (2), while recent molecular genetic analyses estimated the additive heritable component from common SNPs (MAF > 1%) at between 17 and 23% (3). This difference between twin heritability estimates of BD and additive heritability from common SNPs indicates that a large fraction of genetic risk is still undiscovered. One potential source is rare, often deleterious, genetic variation of more recent origin. Rare variation, particularly copy number variants, has been shown to influence risk for BD, albeit to a weaker degree than in other neuropsychiatric illnesses such as schizophrenia and autism spectrum disorders (ASDs) (4). Similarly, previous studies showed some evidence for the role of rare PTVs in BD risk, but with a more modest effect size compared to ASDs and schizophrenia (5). The extent to which rare variation plays a role in BD susceptibility can be inferred by measuring the degree of natural selection acting on individuals with BD. Specifically, negative selection on BD causes alleles with high penetrance for BD risk to be kept at low frequency in the population (6,7). Evidence for negative selection on BD can be seen in the significantly lower reproductive rate of both males (0.75 to 1) and females (0.85 to 1) with BD compared to their unaffected siblings in a large Swedish birth cohort (8). The reproductive rate observed in BD, however, is substantially higher than that of individuals with schizophrenia (0.23 for males, 0.47 for females) or autism (0.25 for males, 0.48 for females), suggesting that the role of rare variation is likely to be smaller in magnitude, as selection is not acting as strongly on BD in aggregate.
Nevertheless, the interrogation of rare variation in BD patients will be pivotal in the discovery of variants with high penetrance for BD risk.
Within BD, two clinical subtype classifications are recognized: bipolar I disorder (BD1) and bipolar II disorder (BD2; APA DSM-IV (9); WHO ICD-10 (10)). A BD1 diagnosis includes recurring manic and depressive episodes, with manic episodes often including psychotic symptoms. In contrast, a BD2 diagnosis requires at least one depressive episode and one hypomanic (but not manic) episode across the lifetime. In addition to the primary BD1/BD2 criteria, individuals presenting with both mood symptoms of BD and psychotic symptoms similar to schizophrenia outside of mood episodes are often diagnosed with schizoaffective disorder (SAD)-bipolar subtype. Despite the distinct diagnostic categories, genetic susceptibility for BD from common SNPs has shown strong overlap with schizophrenia (genetic correlation r_g = 0.70) and major depressive disorder (MDD) (r_g = 0.35), with BD1 showing preferential overlap with schizophrenia and BD2 with MDD, reflecting a broad continuum of genetic influence on psychosis and mood disturbance (3).
To date, GWAS meta-analyses of common SNPs have identified 64 independent loci that contribute to BD susceptibility, implicating genes encoding ion channels, neurotransmitter transporters, and synaptic and calcium signalling pathways (3,11). Evidence for an effect of rare variation on BD risk, however, remains inconclusive, as sample sizes are substantially smaller than those of GWAS. Analysis of large rare copy number variants (MAF < 1%) in 6,353 BD cases found CNV enrichment among SAD over both controls and other BD diagnoses, suggesting that increased risk among detectable rare CNVs is restricted to individuals with psychotic symptoms (4). Analyses of whole exome and genome sequencing in both pedigree and case-control samples have to date shown only nominal enrichment among individual genes and candidate gene sets (12-15), with none surpassing exome-wide significance.
Here, we report results from the Bipolar Exome (BipEx) collaboration, the largest whole-exome study of BD to date, comprising 13,933 BD cases and 14,422 controls following aggregation, sequencing, and quality control.

Description of exome sequencing data generation, sample cohorts and quality control
We combined bipolar case-control whole exome sequencing data from 13 sample collections in 6 countries. The aggregated dataset consists of 33,699 individuals, 16,486 of whom have been diagnosed with bipolar disorder and 17,213 of whom have no known psychiatric diagnosis (see Table S1 and supplementary materials: sample collections, for a full breakdown by cohort and subtype, and subtype definitions). All of the sample collections have been previously genotyped for common variant analyses (3). However, this is the first time that exome sequencing has been performed and jointly analysed. All exome sequence data was generated using the same library preparation, sequencing platform, and joint calling pipeline: exome sequencing of the full sample set was performed between July 2017 and September 2018 using Illumina Nextera sample preparation and HiSeqX sequencing. Samples were then jointly processed and run through variant calling using the Genome Analysis ToolKit (GATK; supplementary materials: sequence data production). Following sequencing and joint calling, we ran a series of quality control steps to filter out low quality variants (Table S2) and samples (Table S3), restricting to unrelated individuals of broad continental European ancestry (supplementary materials: exome quality control, Figures S1-5). The analysis-ready high-quality dataset, consisting of 13,933 bipolar cases and 14,422 controls, is summarised in Table S4. Breaking down by subtype, the curated dataset consists of 8,238 BD1, 3,446 BD2, 1,288 BDNOS, 961 BD without a finer diagnosis (together encompassing the 13,933 BD), and 277 SAD. Throughout our analyses, we exclude individuals diagnosed with SAD in order to obtain a more BD-specific collection of results and to guard against signals more attributable to schizophrenia being pulled into any reported associations.

Significant contribution of rare damaging protein truncating variation to bipolar risk
To test whether bipolar cases carry an excess of damaging coding variants, we analyzed exome-wide burden relative to controls using a logistic regression model controlling for principal components, sex, and overall coding burden (supplementary materials: exome-wide burden analyses). Drawing from previous exome sequencing studies of psychiatric disease (14,16,17), we restricted our attention to variants with minor allele count (MAC) ≤ 5 across the entirety of the dataset, corresponding to MAF ≤ 0.01%. We annotated variants using the Ensembl Variant Effect Predictor (VEP) (18) version 95 with the loftee plugin, and assigned variants to classes of variation. We defined three putatively damaging classes of coding variation: protein-truncating variants (PTVs), missense variants with MPC > 2, and damaging missense variants (missense variants annotated as 'probably damaging' in PolyPhen and 'deleterious' in SIFT). We further defined two annotations which we hypothesised to be likely benign: other missense (the remaining missense variants), and synonymous variants (see supplementary materials: variant annotation and Table S5 for full details). Following this initial restriction, we observed nominally significant enrichment of damaging missense variation in BD cases and BD2 cases over controls (OR = 1.01, P = 0.024 and OR = 1.02, P = 0.0086 respectively; Figure 1B,C), but not for the other a priori damaging classes of variation (missense MPC > 2, and PTV). However, stepwise filtering of rare PTVs, first to those absent from the non-psychiatric portion of the Genome Aggregation Database (gnomAD), hereafter referred to as 'ultra-rare variants', and then to those in constrained genes (defined as pLI ≥ 0.9), shows that case-control PTV enrichment is present once we filter to high pLI genes, a finding in line with schizophrenia exomes (19) (Figure 1B,C). This enrichment is consistent across both BD1 and BD2 subtypes (Figure 1A). While the magnitude of PTV enrichment in BD (OR = 1.11, P = 5.0 × 10^-5) is considerably lower than the latest PTV enrichment in schizophrenia (OR = 1.26; (19)), this difference is in line with the weaker selective pressure implied by the higher reproductive rates of individuals with BD, relative to their siblings, compared with those of individuals with schizophrenia (8).
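As a quick check of the stated correspondence between MAC ≤ 5 and MAF ≤ 0.01%, the following minimal Python sketch converts the allele count ceiling into a frequency. It assumes diploid autosomal sites with complete genotyping and uses the analysis-ready sample size (13,933 cases plus 14,422 controls); the function name `max_maf` is illustrative and not part of the study's pipeline.

```python
# Analysis-ready sample counts quoted in the text.
N_CASES = 13_933
N_CONTROLS = 14_422
MAX_MAC = 5  # minor allele count ceiling used to define rare variants


def max_maf(max_mac: int, n_individuals: int) -> float:
    """Largest minor allele frequency compatible with the MAC ceiling.

    Assumption: diploid autosomal sites with every individual genotyped,
    so the total allele number is 2 * n_individuals.
    """
    return max_mac / (2 * n_individuals)


maf = max_maf(MAX_MAC, N_CASES + N_CONTROLS)
print(f"MAF ceiling: {maf:.2e} ({maf * 100:.4f}%)")
```

Under these assumptions the ceiling comes out just under 0.01%, in line with the rounded figure in the text; using the pre-QC sample size instead would give a slightly lower value.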
Given the association observed between BD case status and ultra-rare PTV burden, we sought to dissect this signal. We evaluated age at first impairment for a subset of 3,134 cases (supplementary materials: age of onset definitions, Table S6), but found no difference in the distribution of PTV burden or carrier status between earlier onset cases and older onset cases (minimum P-value across 50 tests using the Kolmogorov-Smirnov (KS) test was 0.40; minimum P-value across 50 tests using Fisher's exact test was 0.067; supplementary materials: testing for relationship between age of onset and rare variant burden).
We also assessed whether the presence or absence of psychosis stratified risk in a subset of 8,017 case samples: 4,214 with psychosis (comprising 3,152 BD1, 661 BD2, 352 BDNOS, and 49 BD without a fine subclassification) and 3,803 without psychosis (comprising 1,423 BD1, 1,845 BD2, 505 BDNOS, and 30 BD without a fine subclassification) (Table S7, supplementary materials: psychosis definitions). Both case subsets displayed significant enrichment of ultra-rare PTV burden in constrained genes (OR = 1.12, P = 0.0018; OR = 1.16, P = 6.6 × 10^-5 for cases with and without psychosis respectively). However, there was no significant difference in excess burden between the two subcategories when including psychosis status as a covariate in the regression of ultra-rare PTV burden in constrained genes on case status (P = 0.42).
Turning to missense variation, we do not observe a significant signal of enrichment of highly putatively damaging missense (MPC > 2) variation in bipolar disorder cases, in contrast to schizophrenia (19) (Figure 1B,C). However, outside constrained genes, we observe significant enrichment of ultra-rare damaging missense variation across both BD subtypes, with BD2 showing the strongest enrichment (Figure 1B,C; BD: OR = 1.02, P = 0.014; BD1: OR = 1.02, P = 0.014; BD2: OR = 1.03, P = 0.0036).
Figure 1: Panel A displays enrichment in cases over controls in case subsets, according to the legend. In panels B and C, we display case-control enrichment and excess case rare variant burden in increasingly a priori damaging variant subsets, using logistic and linear regression respectively. Consequence categories are stratified by rarity: moving from left to right, the putatively damaging nature of the variants reduces from dark red to pink according to the legend, and the rarity reduces from a variant with MAC ≤ 5 in a pLI ≥ 0.9 gene and not in the non-psych portion of gnomAD (Not in gnomAD pLI ≥ 0.9), to a variant with MAC ≤ 5 (All), according to the x-axis labelling. Bars in panels B and C represent the 95% confidence intervals on the logistic and linear regression estimates, respectively, of the enrichment of the class of variation labelled on the x-axis. Regressions are run as described in supplementary materials: exome-wide burden analyses, and include sex, 10 PCs and total MAC ≤ 5 coding burden as covariates. Nominally significant enrichments or excess variants in cases are labelled with the associated P-value.
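The logistic burden regression described above can be written out schematically as follows. This is a sketch based only on the covariates named in the text (sex, 10 PCs, total MAC ≤ 5 coding burden); the symbols b_i, s_i, t_i and the coefficient names are notation introduced here for illustration, and the exact variable coding is given in the supplementary materials rather than in this section.

```latex
% b_i : count of variants of the tested class carried by individual i
% s_i : sex;  PC_{ik} : k-th ancestry principal component
% t_i : total MAC <= 5 coding burden of individual i
\operatorname{logit} \Pr(\text{case}_i = 1)
  = \beta_0
  + \beta_{\text{burden}}\, b_i
  + \beta_{\text{sex}}\, s_i
  + \sum_{k=1}^{10} \gamma_k\, \mathrm{PC}_{ik}
  + \beta_{\text{total}}\, t_i ,
\qquad
\mathrm{OR} = \exp\!\left(\beta_{\text{burden}}\right)
```

Read this way, and assuming b_i enters as a count, an odds ratio such as 1.11 would be interpreted as the multiplicative change in the odds of being a case per additional variant of the tested class, holding the covariates fixed.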

Candidate gene-set and tissue enrichment
Beyond exome-wide and constrained gene burden, biologically and empirically informed gene sets can refine our understanding of how ultra-rare PTVs confer risk for BD and generate potential biological hypotheses for follow-up analyses. Using the Genotype-Tissue Expression portal (20), we find weak evidence for enrichment of ultra-rare PTVs in bipolar cases in 13,372 genes expressed in brain tissues (OR = 1.01, P = 0.032), compared to 23,450 genes expressed in non-brain tissues (OR = 1.00, P = 0.15). More broadly, we tested for enrichment of ultra-rare PTVs in 43 GTEx tissues (21; Table S8), with gene sets defined as those having the strongest tissue-specific expression (Figure 2A, Figure S6). The pattern of enrichment for damaging ultra-rare variation resides predominantly in brain tissues, with the strongest association seen in the amygdala (OR = 1.03, P = 3.9 × 10^-5), a brain region previously found to be reduced in size in BD1 cases (22).
We then considered 68 candidate gene-sets either generated or implicated in previous genetic studies of psychiatric disorders (supplementary materials: gene-set enrichment analysis, Figure 2B, Figure S7), and a more strictly defined collection of genes that are brain-enriched in GTEx: average expression over two-fold higher in brain tissues than the average across all tissues in GTEx (23). With this more stringent brain-enrichment definition (6,630 genes), we saw stronger ultra-rare PTV enrichment (OR = 1.04, P = 2.49 × 10^-3). Among the 68 candidate gene sets, we observe significant enrichment (multiple test correction set at P < 7.35 × 10^-4) of ultra-rare variation in four gene sets. For ultra-rare PTVs, we see significant enrichment in SCHEMA genes at FDR < 5% (19) (34 genes, OR = 1.89, P = 4.81 × 10^-5) and in CHD8 binding targets in human brain (24) (2,517 genes, OR = 1.09, P = 5.18 × 10^-5). For ultra-rare damaging missense variants, we see significant enrichment in genes targeted by RBFOX (25) (948 genes, OR = 1.07, P = 3.70 × 10^-4) and in ASD genes at FDR < 10% (26) (66 genes, OR = 1.24, P = 7.25 × 10^-4). The enrichment of ultra-rare PTVs in SCHEMA genes and of damaging missense variants in ASD genes provides further evidence of convergence of shared signal in the ultra-rare end of the allele frequency spectrum, mirroring the overlapping genetic risk for schizophrenia and BD observed in common variation (27), and for schizophrenia and ASD in rare variation (19). Notably, we did not observe a rare-variant enrichment of damaging variation in gene sets generated from GWAS of BD (3). To investigate the rare-variant signal shared with schizophrenia further, we considered four distinct gene-sets of size 50, ordered by P-value in SCHEMA (19). Stronger PTV enrichment was observed in the top 50 genes than in the FDR < 5% set (OR = 2.02, P = 8.14 × 10^-7), but this significant enrichment was not observed as we moved down through the genes displaying ultra-rare damaging case-control enrichment in SCHEMA (genes 51-100: OR = 0.936, P = 0.680; genes 101-150: OR = 1.03, P = 0.794; genes 151-200: OR = 1.07, P = 0.686). We also did not observe enrichment of ultra-rare PTVs in the recently fine-mapped schizophrenia genes published by the PGC (11): OR = 1.10, P = 0.178.
Along with a candidate gene-set enrichment analysis approach, we considered a broad-based enrichment analysis using gene-sets derived from large pathway databases (including Gene Ontology (GO), REACTOME and KEGG; a total of 1,697 gene-sets; Figure S8). By analysing excess rare variant burden in such a large collection of gene-lists, we sought to elucidate pathways enriched for damaging variation associated with bipolar disorder in an agnostic manner. We observed significant enrichment of one gene-set after correction for multiple tests: genes involved in the G1/S transition of mitotic cell cycle (OR = 1.46, P = 1.36 × 10^-5).

Gene-based analysis approach
To boost power for gene discovery, we again restricted to ultra-rare variants (MAC ≤ 5 and not present in the non-psych portion of gnomAD (28)), and further enriched for pathogenic variants by restricting our analysis to those that are either PTVs or damaging missense variants (supplementary materials: gene-based analysis approach; Table S5). Throughout, we use Fisher's exact tests in each gene to test for case-control enrichment (supplementary materials: gene-based analysis approach, Figures S9-13).
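To make the per-gene test concrete, here is a minimal, standard-library Python sketch of a two-sided Fisher's exact test on a 2×2 table of carriers versus non-carriers in cases and controls. It is an illustration, not the authors' implementation: treating each variant count as one carrier is an assumption, the function names are hypothetical, and the example counts (12 versus 1) are invented for demonstration.

```python
from math import exp, lgamma


def log_comb(n: int, k: int) -> float:
    """Natural log of the binomial coefficient C(n, k)."""
    return lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)


def fisher_exact_two_sided(case_carriers: int, n_cases: int,
                           control_carriers: int, n_controls: int) -> float:
    """Two-sided Fisher's exact P-value for carrier status by case status.

    Sums the hypergeometric probabilities of all tables (with the same
    margins) that are no more likely than the observed table.
    """
    total = n_cases + n_controls
    carriers = case_carriers + control_carriers
    log_denom = log_comb(total, carriers)

    def prob(k: int) -> float:
        # Probability that exactly k of the carriers are cases.
        return exp(log_comb(n_cases, k)
                   + log_comb(n_controls, carriers - k)
                   - log_denom)

    k_min = max(0, carriers - n_controls)
    k_max = min(carriers, n_cases)
    p_obs = prob(case_carriers)
    # Small relative tolerance so that ties with the observed table count.
    p = sum(pk for pk in (prob(k) for k in range(k_min, k_max + 1))
            if pk <= p_obs * (1 + 1e-7))
    return min(1.0, p)


# Hypothetical gene: 12 carriers among cases, 1 among controls.
p_example = fisher_exact_two_sided(12, 13_933, 1, 14_422)
print(f"P = {p_example:.3g}")
```

Working in log space with `lgamma` avoids overflow from the very large binomial coefficients that arise with tens of thousands of samples. Because the counts per gene are small, the test has limited power when the case group is small, which is relevant to the subtype analyses discussed below.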

AKAP11 implicated by ultra-rare protein truncating variants
In our primary analysis, no gene surpassed exome-wide significance (set at P < 2.14 × 10^-6 for 23,321 tests; dotted line in Figure 3), with the strongest case-control enrichment observed in AKAP11 (P = 1.15 × 10^-5 in BD, P = 5.30 × 10^-6 in BD1). We do, however, begin to observe deviation from the null in the collection of tests of ultra-rare PTV enrichment in bipolar cases, particularly in BD1 (Figure S14). This deviation was not observed for BD2 (Figure S15) despite the exome-wide enrichment of the PTV signal (Figure 1B,C), which is likely due to the reduced power of Fisher's exact tests at the smaller BD2 case count (n = 3,446).
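The significance thresholds quoted in the text are consistent with a Bonferroni correction of a 0.05 family-wise error rate by the number of tests. The short Python sketch below reproduces them; the 0.05 level is an assumption inferred from the reported thresholds, and the dictionary keys are labels chosen here for illustration.

```python
ALPHA = 0.05  # assumed family-wise error rate

# Number of tests in each analysis, as reported in the text.
n_tests = {
    "candidate gene-sets": 68,
    "pathway gene-sets": 1_697,
    "genes": 23_321,
}

thresholds = {name: ALPHA / n for name, n in n_tests.items()}

for name, threshold in thresholds.items():
    print(f"{name}: P < {threshold:.2e}")

# Reported P-values compared against the corresponding thresholds.
g1s_pathway_significant = 1.36e-5 < thresholds["pathway gene-sets"]
akap11_bd_significant = 1.15e-5 < thresholds["genes"]
print(g1s_pathway_significant, akap11_bd_significant)
```

Under this assumption the G1/S transition gene-set passes its threshold, while the AKAP11 result in BD alone falls short of the gene-level threshold, matching the statements in the text.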
Given the strong overlap in common variant risk between BD and schizophrenia, we sought to determine whether there is evidence of a shared signal of enrichment of ultra-rare PTVs in BD and schizophrenia cases. Due to overlap in controls between SCHEMA and BipEx, we analysed an ultra-rare variant count dataset which excluded these controls, and meta-analysed the data (supplementary materials: combining SCHEMA and BipEx data in meta-analysis). To avoid the schizophrenia ultra-rare PTV case-control enrichment signal overwhelming the BD signal when presenting results, we first sort on P-value in the primary gene-based BD analysis and display the top 10 P-values before and after meta-analysis with SCHEMA counts (Table 1 and Table S9). The combined analysis in BD and schizophrenia cases reveals one exome-wide significant gene, AKAP11 (P = 2.83 × 10^-9), and one suggestive gene, ATP9A (P = 5.36 × 10^-6).
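One common way to combine per-study P-values while weighting by effective sample size is a weighted Z-score (Stouffer) meta-analysis. The Python sketch below illustrates that idea under stated assumptions: the effective sample size is taken as 4 / (1/N_cases + 1/N_controls), P-values are treated as one-sided in the direction of case enrichment, and the two input P-values are hypothetical. The study's actual procedure is described in its supplementary materials and may differ.

```python
from math import sqrt
from statistics import NormalDist


def effective_n(n_cases: int, n_controls: int) -> float:
    """Effective sample size of a case-control study (assumed definition)."""
    return 4.0 / (1.0 / n_cases + 1.0 / n_controls)


def weighted_z_meta(p_values, eff_ns) -> float:
    """Combine one-sided P-values with weights sqrt(effective N)."""
    nd = NormalDist()
    z_scores = [-nd.inv_cdf(p) for p in p_values]  # small P -> large Z
    weights = [sqrt(n) for n in eff_ns]
    z_combined = (sum(w * z for w, z in zip(weights, z_scores))
                  / sqrt(sum(w * w for w in weights)))
    return nd.cdf(-z_combined)


# Sample sizes from the text; the P-values are hypothetical placeholders.
n_eff_bipex = effective_n(13_933, 14_422)
n_eff_schema = effective_n(24_248, 91_960)
p_combined = weighted_z_meta([1e-3, 1e-4], [n_eff_bipex, n_eff_schema])
print(f"combined P = {p_combined:.3g}")
```

With this weighting, a study contributes in proportion to the square root of its effective size, so two concordant moderate signals can combine into a much stronger one, which is the behaviour the joint BD and schizophrenia analysis relies on.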
The top gene hit, AKAP11 (the gene encoding A-Kinase Anchoring Protein 11, also known as AKAP220), contains only a single isoform, is under evolutionary constraint (LOEUF = 0.3, pLI = 0.98), and is highly expressed in the brain (cerebellar hemisphere: 38.54 median TPM; frontal cortex (BA9): 31.52 median TPM; (20)). AKAP-11 has also been shown to interact with GSK3B, the hypothesized target of lithium therapy (30-32). We therefore gathered all available lithium response information for carriers of AKAP11 PTVs among the BD cases (supplementary materials: lithium response). Of the eleven cases for which lithium response information was available, seven reported a good response (of which five were in the SWEBIC cohort and reported 'complete response, recovered', and two were in the Cardiff collection and reported that lithium helped stabilise their moods), and four did not respond well to lithium. Of the poor responders, three were in the London cohort, and one was in the SWEBIC cohort. While the percentage of good responders among AKAP11 PTV carriers (63.6%) is marginally elevated relative to the background response rate in available BD cases (52%), the sample size is far too small to draw any robust conclusions from the data.
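To illustrate why eleven carriers are too few to support a conclusion, the following Python sketch computes a one-sided exact binomial tail probability for observing at least seven good responders out of eleven when the background response rate is 52%. The choice of a binomial test is an assumption made here for illustration; it is not an analysis reported by the authors.

```python
from math import comb


def binom_upper_tail(k: int, n: int, p: float) -> float:
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))


good_responders = 7      # AKAP11 PTV carriers with a good lithium response
carriers_with_data = 11  # carriers with lithium response information
background_rate = 0.52   # background response rate in available BD cases

observed_rate = good_responders / carriers_with_data
p_value = binom_upper_tail(good_responders, carriers_with_data, background_rate)
print(f"observed rate = {observed_rate:.1%}, one-sided P = {p_value:.2f}")
```

The resulting tail probability is far above conventional significance levels, so the observed 63.6% is entirely compatible with the background rate at this sample size.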
The rare damaging association appears to be specific to BD and schizophrenia. To our knowledge, there is no signal of enrichment in AKAP11 in other neurodevelopmental disorders. AKAP11 is not present in the curated collection of genes associated with developmental disorders (33; https://decipher.sanger.ac.uk/ddd/ddgenes), nor is it implicated in the Autism Sequencing Consortium (ASC) analysis (34) or the Epi25 study (35). Furthermore, expression of AKAP11 tends to occur later in development (Figure S16).
We also examined ultra-rare PTV counts in the Bipolar Sequencing Consortium (BSC) (14) exome sequence data (supplementary materials: external validation with the BSC exome data, Table S10). Non-zero count data were available for seven of the top ten genes exhibiting differences in ultra-rare PTV counts between BD cases and controls, as measured by P-value in the BipEx dataset. Of these, one (FREM2) was enriched for ultra-rare PTVs in controls in BipEx, and did not display control enrichment in the BSC data. The remaining six displayed case enrichment in BipEx. In four out of these six genes (including AKAP11 and ATP9A), we observed further case enrichment in the BSC data (Table S11).
Table 1: BipEx and SCHEMA case-control counts of the top ten most significant genes in the BipEx gene-based analysis. Case and control columns denote the count of ultra-rare PTVs in the gene in the respective dataset. P-values are determined using Fisher's exact and CMH tests for BipEx and SCHEMA respectively (supplementary materials: gene-based analysis approach), and meta-analysed weighting by effective sample size. BipEx: BD case count 13,933, control count 14,422. SCHEMA: schizophrenia case count 24,248, control count 91,960. The SCHEMA OR is the estimated OR averaged over strata, whereas the combined OR is the simple OR calculated by combining the BipEx and SCHEMA cases and controls. Note that differential coverage across exome sequencing platforms and whole genome sequencing means that case/control counts differ across genes.
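The 'simple OR' for AKAP11 can be approximated from the headline numbers with a few lines of Python. This sketch assumes that each ultra-rare PTV corresponds to one carrier and that the full case and control counts listed in the Table 1 caption apply to AKAP11; since per-gene sample counts differ with coverage, the result should only be expected to land close to the reported value of 7.06, not to match it exactly. The function name is illustrative.

```python
def simple_odds_ratio(case_count: int, n_cases: int,
                      control_count: int, n_controls: int) -> float:
    """Odds ratio from pooled carrier counts (no stratification)."""
    case_odds = case_count / (n_cases - case_count)
    control_odds = control_count / (n_controls - control_count)
    return case_odds / control_odds


# Pooled sample sizes from the Table 1 caption (BipEx + SCHEMA).
pooled_cases = 13_933 + 24_248
pooled_controls = 14_422 + 91_960

# AKAP11: ultra-rare PTVs seen in 33 cases and 13 controls.
or_akap11 = simple_odds_ratio(33, pooled_cases, 13, pooled_controls)
print(f"simple OR = {or_akap11:.2f}")
```

Pooling in this way ignores study and platform strata, which is why the caption distinguishes the combined simple OR from the stratum-averaged SCHEMA OR.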

Discussion
In the largest bipolar disorder exome study to date, ultra-rare PTVs in constrained genes are significantly enriched in BD cases. In fact, enrichment in constrained genes remains significant even after excluding the top 20 BD-risk associated genes with pLI ≥ 0.9 (OR = 1.07, P = 0.00313; Table S12), reflecting the highly polygenic genetic architecture of bipolar disorder, a property shared with schizophrenia (19), and suggesting that the majority of genes involved in bipolar disorder risk will require larger sample sizes to be discovered. Furthermore, ultra-rare PTVs are significantly enriched in schizophrenia risk genes identified by the SCHEMA consortium, suggesting that rare variation in these genes is not specific to schizophrenia pathophysiology, and that overlap in risk for schizophrenia and BD is now evident in both rare and common variation. Finally, combining our results with data from SCHEMA reveals strong evidence that haploinsufficiency in AKAP11 confers risk for both BD and schizophrenia, but not for early-onset neurodevelopmental disorders.
AKAP11 codes for the AKAP-11 protein (also known as AKAP220), one of a family of scaffolding proteins that bind to the regulatory subunit of protein kinase A (PKA). These anchoring proteins confine PKA to discrete locations in the cell to target specific substrates for phosphorylation and dephosphorylation. In particular, GSK3B is bound by AKAP-11. GSK3B is hypothesized to be the target of lithium, the primary treatment for bipolar disorder (36). By binding to GSK3B, AKAP-11 mediates PKA-dependent inhibition of GSK3B: PKA inhibits the activity of GSK3B bound to AKAP-11 more strongly than that of GSK3B in general, and thus modifications to AKAP-11 have the potential to affect downstream pathways. GSK3B is one of two paralogous genes (GSK3 alpha and GSK3 beta) that encode a serine/threonine protein kinase, glycogen synthase kinase 3, whose primary known function is the phosphorylation of more than one hundred different substrates, thus affecting a myriad of signalling pathways (4,36,37). With this in mind, we looked to determine the efficacy of lithium treatment in BD cases harboring an ultra-rare PTV in AKAP11. Of the eleven individuals with treatment data available, seven displayed a good response to lithium treatment, in line with the notion that the effects of disrupting AKAP-11 may be partially rescued by lithium therapy. However, the number of ultra-rare PTV carriers is currently too low to draw robust conclusions regarding lithium treatment response.
Beyond PTV enrichment in constrained genes, we see early evidence of enrichment in ultra-rare damaging missense variation, particularly within BD2. This enrichment is evident outside of missense constrained regions (as defined by MPC > 2), which is perhaps surprising given that the signal of association seen in this slice of rare variation in schizophrenia cases is mainly within constrained missense regions. Because BD2 displays a stronger correlation of common variant effects with major depression, with BD1 more correlated with schizophrenia, there is a chance that this missense signal is capturing something distinct to mood disorders relative to psychotic disorders. However, we should be cautious not to read too much into differences in ultra-rare damaging missense enrichment across the BD subtypes; the number of BD2 samples (n = 3,446) in the BipEx dataset is less than half that of BD1 (n = 8,238), and confidence intervals around the damaging missense enrichment overlap (Figure 1). Furthermore, attempts to refine this exome-wide signal to individual genes or targeted gene sets did not result in any significant signals of association after correcting for multiple testing (Figure S15, Figure S7). As with PTV enrichment, we expect to see a refinement of the putatively damaging missense signal as sample sizes increase.
Despite sequencing 13,933 BD cases, we did not observe any BD-specific risk genes surpassing exome-wide significance. In contrast, the 24,248 schizophrenia cases analyzed in SCHEMA have yielded 10 significant risk genes. When we compare the observed ultra-rare PTV enrichment among constrained genes in our current sample (OR = 1.11) to SCHEMA (OR = 1.26), we currently estimate that about double the case sample size of schizophrenia is needed in BD to achieve comparable statistical power to discover individual risk genes. Moreover, schizophrenia has now seen meaningful convergence of gene overlap in the common and rare ends of the allele frequency spectrum, in large part through larger exome sample sizes as well as fine-mapping of GWAS loci (19). The overlap in BD, however, remains uncertain. The BSC exome dataset examined 3,987 BD cases (14), finding suggestive enrichment in 165 genes implicated in BD GWAS (OR = 1.9, P = 6.0 × 10^-4), but this finding did not replicate in the current sample (OR = 0.9, P = 0.40). Given that prior evidence of common and rare gene overlap in schizophrenia was quite modest (16,17,29), we expect to see a slow but steady convergence in much the same manner as sample sizes increase for both common and rare variation analyses in BD.
In summary, ultra-rare PTVs in constrained genes are significantly enriched in BD patients over controls, a result already firmly established in schizophrenia and other early-onset neurodevelopmental disorders. We are beginning to see promising signals among individual genes, despite none surpassing exome-wide significance for BD alone. We observe that shared risk for bipolar disorder and schizophrenia is present in both common and damaging ultra-rare variation. Our top gene, AKAP11, shows shared evidence of risk for BD and schizophrenia, increasing our confidence that we are discovering true risk factors underlying psychiatric disease. Overall, the current evidence suggests BD is on a similar trajectory to schizophrenia, where increasing sample sizes and further collaborative efforts are expected to lead to biologically meaningful risk genes and pathways underlying BD risk.

Data availability
We display all of our results, at the variant and gene level, in a browser available at https://bipex.broadinstitute.org. A detailed summary of phenotype curation and QC, including additional plots, is available at https://astheeggeggs.github.io/BipEx/. Whole exome sequence data generated under this study are hosted on and shared with the collaborating study groups via the controlled access Terra platform (https://app.terra.bio). The Terra environment, created by the Broad Institute, contains a rich system of workspace functionalities centered on data sharing and analysis. Requests for access to the controlled datasets are managed by data custodians at the Broad Institute and sent to sample contributing investigators for approval.

Figure 1: Case-control enrichment of ultra-rare variants, split by case status and consequence category.

Figure 2: Biological insights from bipolar case-control whole-exome sequencing data. A. Enrichment of ultra-rare PTVs in BD cases over controls in tissue-specific expression gene-sets. Gene-sets are defined in detail in (21). Bars are ordered first by whether they are a brain tissue, and then by P-value. B. Enrichment of ultra-rare variants in 68 targeted gene-sets taken from the literature (19, 29). Top PTV and damaging missense gene-sets are labelled, and annotated with the number of genes in each gene-set. Classes of variants tested in each gene-set are coloured according to the legend. Gene-sets surpassing Bonferroni test correction are labelled with an asterisk.

Figure 3: Results of the analysis of ultra-rare PTVs in 13,933 cases and 14,422 controls. Gene-based Manhattan and QQ plot for bipolar disorder (comprising BD1, BD2 and BDNOS). -log10 P-values obtained via Fisher's exact tests are plotted against genetic position for each of the analysed genes. In the QQ plots, observed -log10 P-values are plotted against permutation P-values according to the procedure described in the supplementary materials: gene-based analysis approach. Points are coloured according to the discrete scale displayed in the legend. In the Manhattan plot and QQ plot, the gene symbols of the top 20 and top 10 genes by P-value are labelled, respectively. Points in the Manhattan plot are sized according to P-value as displayed in the legend.