Genome-wide association study of REM sleep behavior disorder identifies novel loci with distinct polygenic and brain expression effects

REM sleep behavior disorder (RBD) represents both an early marker and key symptom of synucleinopathies, mainly Parkinson disease (PD) and Lewy body dementia (LBD), and a strong opportunity for early clinical intervention for these conditions. Yet, the genetics of RBD are currently not well characterized. In this study, we perform the first genome-wide analyses on RBD, including genome-wide association study, pathway analysis, heritability, genetic correlation, and mendelian randomization. We report two novel RBD risk loci and replicate three previously reported, and identify key genetic similarities and differences between RBD and other synucleinopathies. These results contribute to overall genetic understanding of RBD, and thus a step closer to genetic prediction and targeted treatment of synucleinopathies.


INTRODUCTION
Rapid eye movement (REM) sleep behavior disorder (RBD), defined as loss of muscle atonia and dream enactment during REM sleep, is one of the most unique conditions in neurology 1 . Isolated RBD (iRBD), defined as having RBD without other significant clinical neurological signs, is the only early highly predictive clinical marker for some neurodegenerative diseases. Over 80% of iRBD patients will convert to overt neurodegeneration within 10-15 years on average, most commonly to Parkinson's disease (PD) or dementia with Lewy bodies (DLB), or in rare cases, multiple system atrophy (MSA) or another neurodegenerative disorder. 2,3 Although they demonstrate phenotypic differences, PD, DLB, and MSA are all alpha-synucleinopathies: disorders characterized by accumulation of the protein alpha-synuclein in the brain. Therefore, iRBD is considered a prodromal alpha-synucleinopathy which offers a unique opportunity to identify these conditions at a much earlier stage. 4 There is strong evidence that iRBD also represents distinct, more severe subtypes of alphasynucleinopathies. Approximately 30-60% of PD patients have RBD, including both iRBD and RBD as a symptom occurring after PD diagnosis (symptomatic RBD, sRBD). 3 In this manuscript, we will use "RBD" to refer to all instances of RBD regardless of at which stage symptoms present, and iRBD or sRBD to specify before or after overt neurodegeneration diagnosis, respectively. The presence of RBD is currently the strongest predictor for the development of dementia in PD 5 and is associated with more rapid progression of non-motor symptoms. 3 RBD is more frequent in DLB, found in approximately 50-80% of all cases. While the phenotypic differences in DLB with or without RBD are less pronounced, those diagnosed with iRBD before developing DLB tend to have increased severity of DLB symptoms and more rapid deterioration. 6 MSA patients also have a high prevalence of RBD, estimated at 75-95%, 40% of which have iRBD prior to the onset of MSA symptoms. Those with iRBD preceding MSA symptoms may have more frequent autonomic onset of MSA, less frequent parkinsonism at MSA onset, and a more severe disease course. 7 Overall, RBD, and specifically iRBD, appears to represent a more malignant subtype of alpha-synucleinopathies.
Thus far, the genetics of RBD has only been studied through the candidate gene approach. To better understand RBD and early alpha-synucleinopathy genetics and potential mechanisms, we performed a genome-wide association study (GWAS) on 2,843 cases and 139,636 controls. We further examined the biological implications of the nominated risk loci through pathway analysis, investigated variant effects on gene expression, and assessed the cumulative risk using polygenic risk score (PRS).
Using the GWAS summary statistics, we studied the genetic relationship between RBD and the synucleinopathies to which it progresses, as well as conditions and exposures of interest using linkage disequilibrium (LD) score regression and Mendelian randomization (MR).
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021

Genome-wide association study identifies five RBD loci
To identify genetic risk loci across the genome associated with RBD, we performed a case-control GWAS of iRBD (N cases=1,061, N controls=8,386) and a case-control GWAS from 23andMe, Inc. using PD patients with probable RBD (pRBD) and controls (PD+pRBD, diagnosed by a validated questionnaire, N cases=1,782, N controls=131,250), meta-analyzed for a total of 2,843 cases and 139,636 controls. After filtration, imputation, and meta-analysis, a total of 11,536,573 variants were examined. We tested for systemic biases using the genomic inflation factor (lambda) and LD-score regression, with satisfactory results (lambda=1.06, lambda1000=1.01, LD intercept= -0.01). With LD-score regression, the liabilityscale narrow-sense heritability of iRBD based on common variants is calculated at 12.3% (standard error=0.07), similar to the recently reported 10.8% heritability for DLB. 8 We identified novel RBD-associated loci near SCARB2 and INPP5F, and replicated known RBD associations near SNCA, 9 GBA, 10,11 and TMEM175 12 (Table 1, Figure 1). Conditional-joint analysis identified a secondary associated single nucleotide polymorphism (SNP) in GBA, confirming RBD risk association with known PD nonsynonymous variants p.Glu326Lys and p.Asn370Ser. These five loci have also been implicated in PD, 13 however the RBD-associated SNPs in SNCA and SCARB2 are not in LD with the top PD-associated SNPs in these loci, and are thus considered independent. Additionally, SNPs associated with PD or DLB-associated SNPs in notable GWAS loci, such as MAPT, LRRK2, and APOE 8 are not associated with RBD despite sufficient power to detect them, suggesting that RBD has a distinct and only partially overlapping genetic background with PD and DLB.

Differential gene expression in different brain regions may drive the independent associations of SNCA and SCARB2 in RBD and PD
We found that although the SNCA and SCARB2 loci are important in both RBD and PD, the associations with both conditions in these loci are driven by independent variants. In the SNCA locus, rs3756059 (in the 5' region of the gene) was associated with RBD in the current GWAS and with DLB. 8 In contrast, rs356182 (in the 3' region), which is not in LD with rs3756059 (R 2 =0.17, D'=0.56), is the most significant of all association signals in GWAS of PD risk, 13 yet showed no association with iRBD. Similarly, in the SCARB2 locus, rs7697073 is associated with RBD and rs6825004 is associated with PD, and the two SNPs are not in LD (R 2 =0.06, D'=0.26). We therefore hypothesized that the different SNPs in these loci associated with RBD and PD may be associated with differential expression patterns of their respective genes in different brain regions. To examine this hypothesis, we used the publicly available Genotype-Tissue Expression (GTEx) consortium v8 online portal (https://www.gtexportal.org/home/) 14 to investigate how each variant affects gene expression in specific brain tissues. We examined the effects of these variants on the expression of SCARB2, SNCA and SNCA antisense-1 (SNCA-AS1), since variants in SNCA have previously been linked to SNCA-AS1 expression. 8 The expression quantitative trait loci . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint 5 (eQTL) effect size (ES) reported here is the slope of the linear regression of normalized expression data versus the genotype status using single-tissue eQTL analysis, performed by the GTEx consortium.
Comparative results are visualized in Figure 2.

Colocalization analyses demonstrate tissue and cell-specific differential effects of RBDassociated variants
We further investigated RBD loci effects on gene expression using colocalization analysis to determine whether association signals for RBD risk and regulation of gene expression in the whole brain or blood are driven by a shared causal variant. eQTLs were obtained from PsychENCODE 15 and eQTLGen, 16 large human brain and blood eQTL datasets, respectively. In brain, we found one RBD locus with strong evidence of colocalization with eQTLs regulating SNCA-AS1 expression (posterior probability of hypothesis 4 [one shared variant between RBD GWAS and eQTL, PPH4] = 0.89; Figure 3A; Supplementary Table 1). Using beta coefficients derived from the RBD GWAS and PsychENCODE eQTLs, we found that SNPs in the region surrounding SNCA-AS1 tended to show an inverse relationship between RBD risk and SNCA-AS1 expression, suggesting that reduced RBD risk is associated with increased SNCA-AS1 expression ( Supplementary Fig. 4), which in turn may be associated with reduced alpha-synuclein protein level.
In blood, we found evidence of colocalizations between the same RBD locus and eQTLs regulating MMRN1 expression (MMRN1, PPH4 = 0.86; Figure 3B; Supplementary Table 1). Sensitivity analyses confirmed that SNCA-AS1 and MMRN1 colocalizations were robust to changes in the prior probability of a variant associating with both traits (i.e., p 12 prior). We did not find evidence of colocalization between the RBD SCARB2 locus and eQTLs in brain or blood (Supplementary Figures 5-7).
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint 6 As both the SNCA-AS1 and MMRN1 colocalizations were observed at the same RBD risk locus on chromosome 4, but in different tissues, we hypothesized that this may be due to tissue-specific regulation of SNCA-AS1 and MMRN1 expression. Using specificity, a measure of the proportion of a gene's total expression attributable to one tissue or cell type, we explored the tissue-and cell-typespecific patterns of SNCA-AS1 and MMRN1 expression in (i) human bulk-tissue RNA-sequencing from GTEx consortium 17 and (ii) human single-nucleus RNA-sequencing of the medial temporal gyrus from the Allen Institute for Brain Science (AIBS; 7 cell types). 18 At the tissue level, SNCA-AS1 expression was predominantly brain-specific, while MMRN1 expression was most specific to thyroid, adipose and lung tissues and least specific to brain tissues ( Figure 3C; Supplementary Table 2). At the cellular level, SNCA-AS1 demonstrated neuronal specificity, while MMRN1 was specific to microglia ( Figure 3D; Supplementary Table 2). Together, these findings support the hypothesis that SNCA-AS1 and MMRN1 expression is regulated differently across cell types and tissues, and in turn, suggest that the RBD risk locus at chromosome 4 may operate differently depending on the cell type or tissue investigated.

Polygenic risk scores distinguish iRBD and PD with RBD from PD without RBD
Next, we examined whether RBD-specific PRS can distinctly identify RBD as opposed to PD without RBD. We withheld a testing set of iRBD cases (N=212) and controls (N=1,265), re-performed the GWAS and meta-analysis with the remaining samples, then created a polygenic risk profile for RBD using all independent variants (R2>0.1) which passed GWAS FDR correction (p<1E-05, 47 variants, Supplementary Table 3). We calculated PRS in the withheld testing set of iRBD cases and controls, as well as independent cohorts of PD+pRBD and PD without pRBD (PD-pRBD) patients and controls. In iRBD, the PRS can differentiate between iRBD cases and controls with an area under the curve (AUC) of 0.61 (95% CI 0.58-0.65, Figure 1B), on par with recent PD PRS performance. 19 In PD+pRBD, predictive performance was similar to iRBD with an AUC of 0.60 (95% CI 0.56-0.63), compared to decreased predictive power in PD-pRBD with an AUC=0.55 (95% CI=0.51-0.59, Figure 1B).
We then divided the RBD PRS into quartiles in the iRBD, PD+pRBD, and PD-pRBD cohorts, and performed logistic regression against the phenotype ( Figure 1C). In iRBD, those in the top quartile for RBD PRS were 2.9 times more likely to have RBD than those in the bottom quartile (95% CI is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint 7 p=0.053, Figure 1C). The lack of RBD PRS predictive power in PD-pRBD suggests that we are likely tagging RBD-specific loci.
We tested whether the RBD PRS is associated with changes in age at onset (AAO) or rate of conversion from RBD to overt neurodegeneration, but found no statistically significant relationship at this sample size.

Pathway analysis reveals potential role for the autophagy-lysosomal pathway in RBD pathogenesis
To examine whether specific pathways are enriched according to the RBD GWAS results, we performed pathway enrichment analysis with WebGestalt using the GWAS-nominated genes for cellular components and biological processes ( Figure 1D, Supplementary Table 4). The nominated cellular components include whole membrane (FDR corrected p=0.004), lysosome (FDR p=0.004), and vacuole (FDR p=0.005). Biological processes include positive regulation of receptor recycling (FDR p=0.037), vacuole organization (FDR p=0.037), and endocytosis (FDR p=0.039). All these nominations suggest involvement of the autophagy lysosomal pathway (ALP), a key mechanism for clearing alpha-synuclein whose disruption can lead to the alpha-synuclein aggregation seen in alpha-synucleinopathies. 20,21

Comparison to PD GWAS loci reveals additional loci with potential distinct effects in RBD
PD is currently the best genetically-characterized alpha-synucleinopathy due to the large sample sizes available for GWAS. We aimed to further examine how PD GWAS loci behave in RBD, given the differential effects observed in the SNCA and SCARB2 loci. We compared the effect size and direction of effect of the latest PD GWAS-significant variants (N=90) 13 to the effect sizes and direction of effect of the same variants in the summary statistics of this RBD GWAS (N=90 variants were available for comparison) and the most recent DLB GWAS 8 (N=72). Figure 4A-C shows the similarities and differences of effect size and direction at these loci across RBD, PD and DLB. Notably, iRBD shows marked differences at some key PD loci ( Figure 4A) including SNCA, CYLD (encoding a deubiquitinating enzyme), and FYN (encoding a membrane-associated tyrosine kinase). In these loci, the direction of effect in iRBD is opposite of that seen in PD, with nominal significance in iRBD (none of these pass the GWAS significance threshold for iRBD). The PD+pRBD cohort significantly deviates from this pattern with 100% of loci showing the same direction of effect with PD ( Figure 4B). However, strong PD signals, such as SNCA 3' variants and LRRK2, are not significant in the PD+pRBD cohort despite sufficient power. Similarly, all DLB nominally significant loci share the same direction of effect with PD ( Figure 4C). Yet, DLB also deviates from PD, as the MAPT, LRRK2, and SNCA 3' loci are not statistically significant despite sufficient power. All statistics are detailed in Supplementary   Table 5. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint 8

LD score regression further supports distinct genetic background for iRBD
We used LD-score regression to examine the genetic correlation between RBD and relevant traits and exposures ( Figure 5A-C). Although iRBD and PD+pRBD are positively correlated with nominal significance (rg=0.56, se=0.24, p=0.02), the two cohorts behave differently when it comes to other alphasynucleinopathies. PD+pRBD is highly and significantly correlated with the latest PD GWAS (rg=0.76, se=0.13, p=1.2E-09). On the contrary, iRBD is not correlated with PD (rg=0.17, se=0.14, p=0.23).
PD+pRBD and iRBD behave more similarly when discussing the following traits, so we will simply refer to the meta-analysis results. RBD shows similarities to PD with potential genetic correlations with less smoking, more education, and excessive daytime sleepiness, without significance after multiple testing correction (Supplementary Table 6). We found no genetic correlation with other sleep traits such as insomnia or sleep duration.

Mendelian randomization suggests shared causal associations in RBD and PD
Mendelian randomization (MR) allows for the assessment of causality between an exposure and an outcome. In this case, we aimed to analyze the causal effects of traits and exposures on RBD using the RBD GWAS meta-analysis and vice-versa. Because this GWAS is not strongly powered for MR, we focused only on exposures previously linked to PD 22 (Table 2) and variables investigated in the genetic correlation analyses (Supplementary Table 7). Similar to PD, body mass index (BMI) measurements such as arm, leg, and trunk measurements showed a potential inverse causal association (smaller measurements being potentially causal for RBD); however, significance was lost after multiple testing correction ( Table 2). Other PD associations and traits from genetic correlation (e.g., Type II Diabetes and smoking) showed no causal association with RBD.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint 9

DISCUSSION
In this first genome-wide study of RBD, we identified five RBD-associated loci: two novel loci near SCARB2 and INPP5F and three previously reported loci near SNCA, GBA and TMEM175. Two of the loci, SNCA and SCARB2, have different and independent variants associated with RBD than those associated with PD. Colocalization and eQTL analyses suggest that these variants are associated with differential expression of SNCA-AS1 and SCARB2 in different brain regions, tissues, and cell types. We further show that RBD-specific PRS has different effects in individuals with iRBD or PD+pRBD compared to those with PD-pRBD. A distinct genetic background for iRBD is also supported by the lack of genetic correlation between iRBD and the most recent PD GWAS, as well as the lack of RBD association with known PD loci LRRK2 and MAPT, despite sufficient power to detect them.
The differential association at the SNCA locus, when comparing PD and RBD, may provide a mechanistic hypothesis for gene expression-dependent regional vulnerability of different brain areas. The eQTL analysis shows that the top variant associated with RBD in the current GWAS, rs3756059, is strongly associated with reduced expression of SNCA-AS1 in several cortical regions ( Figure 2). The same risk variant has also been implicated in DLB 8 and is in LD with a secondary PD hit in this locus which is linked to PD with dementia. 23 When comparing PD+pRBD patients to PD-pRBD, frequency of this variant is significantly elevated in PD+pRBD. 9 In contrast, the top variant in the recent PD GWAS, rs356182, is not associated with RBD. Additionally, we report high meta-analysis heterogeneity at this locus, which could implicate iRBD as the driver of this association. The PD+pRBD cohort, which is a mixture of PD patients who presented with RBD before PD diagnosis (iRBD) and those with symptomatic RBD later in PD progression, has a lower effect size (beta=0.15) than the iRBD cohort alone (beta=0.40).
Since SNCA-AS1 is transcribed as an antisense RNA molecule, it may lead to reduced alpha-synuclein protein expression. If this hypothesis is correct, then levels of alpha-synuclein protein may be increased in the cortical regions associated with reduced SNCA-AS1 expression (Frontal Cortex), which could make these regions more vulnerable for neurodegeneration in carriers of the RBD-associated variant. This could explain the strong association of RBD with more rapid and more severe cognitive decline in PD, 24,25 since the cortical brain regions associated with cognition may be more susceptible for neurodegeneration by this mechanism. Interestingly, PD patients without RBD are similar to controls when assessing cognition, 24,26 and this variant is not a strong risk locus in this population. 9 This hypothesized role of SNCA-AS1 as an important, neuronally specific regulator of alpha-synuclein protein expression as a determinant of risk of alpha-synucleinopathies should be further studied.
We found a similar phenomenon in the SCARB2 locus; rs7697073 is associated with RBD in the current GWAS, whereas in the recent PD GWAS there is an independent association at rs6825004. The PD-associated variant is possibly associated with SCARB2 expression in the substantia nigra, while the RBD-associated variant, rs7697073, is not. This difference in SCARB2 expression may cause an earlier degeneration of the nigrostriatal fibers in the PD cohort compared to the RBD cohort, thus explaining the earlier manifestation of motor symptoms in the former. In RBD, the top associated variant in this locus . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint (rs7697073), like the SNCA locus variant, is associated more with expression in cortical brain regions, providing additional support for our hypothesis. SCARB2, encoding the Scavenger Receptor Class B Member 2, is the transporter of glucocerebrosidase (encoded by GBA) from the endoplasmic reticulum to the lysosome. 27 It is possible that in PD this transport is affected by the variant associated with SCARB2 expression in the substantia nigra. In RBD, this transport may be more affected in cortical regions. This could lead to specific vulnerabilities of specific brain regions in PD and RBD. For example, PD patients with RBD show significant cortical thinning when compared both to controls and to PD without RBD. 28 Despite the inclusion of PD patients with probable RBD in the current meta-analysis, notable PD and DLB GWAS loci are absent, including LRRK2, MAPT, BIN1, and APOE. Our sample size has >80% power to detect at GWAS level significance (p<5x10-8) the associations reported in PD and DLB GWASs in these loci (supplementary figure 8). The apparent lack of association with RBD in these three important regions, which we have previously reported in candidate gene studies in smaller cohorts, [29][30][31][32] further supports RBD as a distinct subtype, genetically and clinically. These findings suggest that PD and DLB likely include different subgroups, some of which are associated with variants in LRRK2, MAPT and APOE, while the subgroup defined by having iRBD prior to the onset of PD or DLB is not. Previous studies have suggested that the genetic background of iRBD may affect the rate of conversion from iRBD to overt alpha-synucleinopathies. Variants in SNCA and GBA have been associated with more rapid conversion rates. 9,10,33 However, these results were based on a relatively small portion of the iRBD cohort with available conversion data. Therefore, in the current study we opted not to analyze individual loci due to insufficient power. When examining the effect of PRS on iRBD conversion, we did not detect a statistically significant association. Yet, it is possible that we were also underpowered, and improved PRS in future studies combined with larger numbers of iRBD patients with conversion data will yield different results.
RBD represents a distinct clinical subtype in synucleinopathies, associated with a more severe disease course in PD, DLB, and MSA, with more rapid deterioration and severe symptoms. 1,6,[34][35][36] Additionally, RBD is the most robust predictor of dementia in PD 25,36 and is associated with less frequent parkinsonism at MSA onset in patients who present with RBD first. 7 Overall, it appears that RBD is linked more closely to cognitive decline than motor symptoms in alpha-synucleinopathies. These clinical observations, combined with our genetic findings, support the presence of distinct subgroups within PD, DLB and MSA and suggest a genetic component to these distinctions. Better identification of these subgroups will be essential towards better stratification in clinical trials, and will require comprehensive genetic, biomarker, and clinical marker studies.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint When considering the five loci associated with RBD in the current study, the autophagy lysosomal pathway (ALP) seems to have a major role in RBD, similar to PD and DLB. However, uniquely to RBD, four of the five nominated genes in this GWAS directly converge to the glucocerebrosidase (GBA) pathway. On top of the GBA locus itself, alpha-synuclein has been shown to directly interact with glucocerebrosidase products and inhibit the transport of glucocerebrosidase to the lysosome. 37 Taken together, these previous and current findings strongly highlight the role of the glucocerebrosidase pathway in RBD. Since iRBD may appear years before the onset of overt neurodegeneration, this population is especially attractive for clinical trials aimed to prevent neurodegeneration in alphasynucleinopathies. GBA targeting therapies could also be tested in this population, especially those who carry GBA variants.
There are several limitations to this study. First, despite being the largest RBD cohort analyzed to date, this is still a relatively small GWAS, and future RBD GWASs will likely yield additional associations.
Second, the meta-analysis is PD-skewed since over half of the GWAS case cohort consists of PD+pRBD patients, some of which had iRBD prior to PD and some had PD symptoms first and RBD developed later.
It is therefore possible that some signals identified by the meta-analysis are PD-enriched, and that iRBD specific signals are diluted. However, the PD+pRBD cohort notably behaves differently (e.g., the LRRK2 and MAPT loci) compared to the published PD GWAS, 13 which includes both PD with and without RBD, suggesting the subset of PD patients who have RBD may be genetically distinct. Finally, this study only includes participants of European ancestry. Future studies in other populations are vital to characterize RBD genetic risk across all ancestries, so any clinical applications can be applied universally.
In this study, we have identified novel RBD genetic loci and presented functional genomic evidence supporting regional brain vulnerability, which is dependent on differential expression of SNCA-AS1 and SCARB2 in different brain regions in RBD and PD. The results show that the genetic background of RBD, PD and DLB only partially overlap, and larger RBD studies will be required to better elucidate the genetic background of RBD. The present study suggests that the lysosomal pathway, and more specifically the GBA pathway, could be a crucial target for therapeutic development targeting RBD and aimed to prevent neurodegeneration in this population.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint

Population
We used two cohorts for the RBD GWAS meta-analysis. The first is an iRBD cohort (N cases=1,061, N controls=8,386). The second is a cohort of PD patients with probable RBD (pRBD, N cases=1,782 and N controls=131,250), genotyped and analyzed by 23andMe. The meta-analysis combines the two for a total of 2,843 cases and 139,636 controls. We did not select individuals with pRBD but without PD from 23andMe, since a very small percentage of them will actually have iRBD; the questionnaire is not reliable for the general population but much more reliable for PD patients, albeit with false positives and false negatives. iRBD cases were collected by the International RBD Study Group and were genotyped and The cases were aged 68 +/-9 years (standard deviation) on average and were 81% male, and the controls were aged 58.5 +/-9 years on average, 68% male. 23andMe cases and controls were age-and . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Power calculation
To examine the power of our cohort to detect associations at GWAS significance level (p<5E-8), we used the Genetic Association Study (GAS) Power Calculator (http://csg.sph.umich.edu/abecasis/cats/gas_power_calculator/) 49 with the following parameters: patients n=2,800, controls n=100,000, disease prevalence 0.01. We calculated the power to detect association of variants in all frequencies, with Ors of 1.2, 1.25, 1.3, 1.4 and 1.5 (Supplementary figure 8). With our sample size, we had less than 80% power to detect associations of variants at any frequency with an OR below 1.2.

Genome-wide association study
23andMe 23andMe cohorts were collected, genotyped, and filtered as previously described. 9 All other cohorts All other cohorts were prepared and analyzed as follows. The different cohorts were genotyped on the OmniExpress GWAS chips (Illumina Inc.). We performed quality control according to a standardized GWAS pipeline created at the National Institutes of Health (NIH, https://github.com/neurogenetics/GWASpipeline) as previously reported. To merge HYPERGENES, Wellcome Trust, and LNG controls with the McGill iRBD genotypes, we performed quality control on each cohort separately, then merged the cohorts using only variants common in all data sets. Following quality control, genotypes were filtered for minor allele frequency (MAF) > 0.01 to reduce imputation errors and imputed using Michigan Imputation Server ). Coloc was run using default p 1 =10 -4 and p 2 =10 -4 priors (p 1 and p 2 are the prior probability that any random SNP in the region is associated with trait 1 and 2, respectively). The p 12 prior (the prior probability that any random SNP in the region is associated with both traits) was altered to , which has been shown to be a more robust choice than the default p 12 =10 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Cell-type and tissue specificity measures
Specificity represents the proportion of a gene's total expression attributable to one cell type/tissue, with a value of 0 meaning a gene is not expressed in that cell type/tissue and a value of 1 meaning that a gene is only expressed in that cell type/tissue. To determine specificity of a gene to a tissue or cell-type, specificity values from two independent gene expression datasets were generated, as described in Chia et al. 8 . Briefly, these datasets included 1) bulk-tissue RNA-sequencing of 53 human tissues from the Genotype-Tissue Expression consortium (GTEx; version 8) 17 and 2) human single-nucleus RNAsequencing of the middle temporal gyrus from the Allen Institute for Brain Science (AIBS; https://portal.brain-map.org/atlases-and-data/rnaseq/human-mtg-smart-seq). 18 Specificity values for GTEx were generated using modified code from a previous publication (https://github.com/jbryois/scRNA_disease), 57 and modified to reduce redundancy among brain regions and to include protein-and non-protein-coding genes. Expression of tissues was averaged by organ (except in the case of brain). Thus, specificity values were generated for a total of 35 tissues. Specificity values for the AIBS-derived dataset were generated using gene-level exonic reads and the 'generate.celltype.data' function of the EWCE package (https://github.com/NathanSkene/EWCE). 58 Specificity values for both datasets and the code used to generate these values are openly available at: https://github.com/RHReynolds/MarkerGenes.

Pathway Analysis
Pathway analysis was performed using functional enrichment analysis, specifically gene set enrichment, using the publicly available online tool WebGestalt (http://www.webgestalt.org/). 59 The gene set enrichment analysis examines the enrichment of a provided set of genes in predetermined lists of genes involved in various functional pathways, detailed in supplementary table 3. An enrichment score is calculated, representing the level to which these genes are over-expressed in the various pathways, and then statistical significance is calculated using permutation testing. RBD genes included were those closest to the most significant GWAS SNP. Multiple hypothesis adjustment is applied in accordance with the false-discovery rate (FDR).

Heritability & Genetic Correlation
The genetic contribution to RBD, known as heritability, was calculated in clinically confirmed cases of iRBD using linkage-disequilibrium (LD) score regression on the publicly available platform LD Hub (http://ldsc.broadinstitute.org/ldhub/). 60 Shared heritability across traits, also known as genetic correlation, was calculated using the same method and platform. 61 Traits for shared heritability tests were chosen based on previous association to a synucleinopathy (e.g. smoking behaviors, education levels) and relevance to RBD (e.g. sleep disorders). Summary statistics for the compared traits were accessed . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint through the LDHub platform or downloaded from publicly available sources, then formatted and analyzed using LDHub python v2.7 scripts. Bonferroni correction was calculated based on the number of traits tested (N=27, p<0.0019). LD score regression code is available on https://github.com/lynnekrohn/RBD_GWAS/blob/main/4_LD-regression.md.

Mendelian randomization
To assess causality, we examined traits or exposures that were nominally associated with RBD through LD-score regression (before FDR correction), as well as traits previously associated with synucleinopathies, using Mendelian Randomization (MR). In brief, MR examines the combined effect of GWAS-significant genetic loci from a trait or exposure in the summary statistics of the outcome (in this case, RBD). This approach mimics a randomized control trial since genetics are randomly assigned at birth and are largely unaffected by environmental factors. We conducted MR analyses, including sensitivity tests, heterogeneity and pleiotropy assessments, and the inverse variance weighted (IVW) method, using the TwoSampleMR R package (https://mrcieu.github.io/TwoSampleMR/) 62 in R version 4.0.1 according to protocol previously established. 22 Summary statistics for exposures were either downloaded from the MRBase GWAS catalogue (http://www.mrbase.org/) or extracted from published reports. P values were adjusted for multiple testing using FDR. MR code is openly available on https://github.com/lynnekrohn/RBD_GWAS/blob/main/5_MR.md.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint

IRB STATEMENT
Participants provided informed consent and participated in the research online, under a protocol approved by the external AAHRPP-accredited IRB, Ethical & Independent Review Services (E&I Review).
Participants were included in the analysis on the basis of consent status as checked at the time data analyses were initiated.

DATA AVAILABILITY
Summary statistics for the iRBD data set are available for download at www.tinyurl.com/iRBDsumStats. The full GWAS summary statistics for the 23andMe discovery data set will be made available through 23andMe to qualified researchers under an agreement with 23andMe that protects the privacy of the 23andMe participants. Please visit https://research.23andme.com/collaborate/#dataset-access/ for more information and to apply to access the data. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint

ACKNOWEDGEMENTS
The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint  biological processes (red). All pictured pathways are significant after FDR multiple testing correction.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

25
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint  In (a) and (b), the x-axis denotes chromosomal position in hg19, and the y-axis indicates association pvalues on a -log 10 scale. Plot of SNCA-AS1 and MMRN1 specificity in (c) 35 human tissues (GTEx dataset) and (d) 7 broad categories of cell type derived from human middle temporal gyrus (AIBS dataset). Specificity represents the proportion of a gene's total expression attributable to one cell type/tissue, with a value of 0 meaning a gene is not expressed in that cell type/tissue and a value of 1 meaning that a gene is only expressed in that cell type/tissue. In (c) tissues are colored by whether they belong to the brain. In (c) and (d), tissues and cell types have been ordered by specificity from high to low.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint FIGURE 6. Key GWAS significant loci across three synucleinopathies. It has been shown that the genetic risk for PD and DLB do not overlap completely, and we show that the same is true for RBD and the other two synucleinopathies. Here we demonstrate key genetic risk loci for the three synucleinopathies. Only GBA and TMEM175 are shared between all three, both of which play a role in the autophagy lysosomal pathway. SNCA plays a role in PD, DLB, and RBD risk, yet the strongest risk locus for PD is at the 3' end of the gene while RBD and DLB share a risk locus at the 5' end. Similarly, SCARB2 is a risk factor for PD as well as RBD, however the RBD locus is independent of the variant identified for PD risk (as indicated by the asterisk in the figure).
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint  heterogeneity. We identified 6 independent variants significantly associated with RBD according to the standard GWAS multiple testing correction (p<5E-08). The high heterogeneity found in the SNCA locus could be attributed to the stronger effect size in iRBD (beta=0.40) compared to PD+pRBD (beta=0.15).
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted September 13, 2021. ; https://doi.org/10.1101/2021.09.08.21254232 doi: medRxiv preprint