A genome wide association study of frozen shoulder identifies a common variant of WNT7B and diabetes as causal risk factors

Frozen shoulder is a painful condition that often requires surgery and affects up to 5% of individuals aged 40-60 years. Little is known about the causes of the condition, but diabetes is a strong risk factor. To begin to understand the biological mechanisms involved, we aimed to identify genetic variants associated with frozen shoulder and to use Mendelian randomization to test the causal role of diabetes. We performed a genome wide association study (GWAS) of frozen shoulder in the UK Biobank using data from 2064 cases identified from ICD-10 codes. We used data from FinnGen for replication. We used one-sample and two-sample Mendelian randomization approaches to test for a causal association of diabetes with frozen shoulder. We identified a single genome-wide significant locus (lead SNP rs62228062; OR=1.34 [1.28-1.41], p=2x10-16) that contained WNT7B. A recent transcriptome study identified WNT7B as amongst the most enriched transcripts in anterior capsule tissue in patients undergoing arthroscopic capsulotomy surgery for frozen shoulder suggesting WNT7B as a potential causal gene at the locus. The lead SNP was also strongly associated with Dupuytren's contracture (OR=2.61 [2.50, 2.72], p<1x10-100). The Mendelian randomization results provided evidence that type 1 diabetes is a causal risk factor for frozen shoulder (OR=1.04 [1.02-1.07], p=6x10-5). There was no evidence that obesity was causally associated with frozen shoulder, suggesting that diabetes influences risk of the condition through glycemic rather than mechanical effects. We have identified the first genetic variant associated with frozen shoulder. WNT7B is a potential causal gene at the locus. Diabetes is a likely causal risk factor. Our results provide evidence of biological mechanisms involved in this common painful condition.

approaches to test for a causal association of diabetes with frozen shoulder.
We identified a single genome-wide significant locus (lead SNP rs62228062; OR=1.34 [1.28-1.41], p=2x10 -16 ) that contained WNT7B. A recent transcriptome study identified WNT7B as amongst the most enriched transcripts in anterior capsule tissue in patients undergoing arthroscopic capsulotomy surgery for frozen shoulder suggesting WNT7B as a potential causal gene at the locus. The lead SNP was also strongly associated with Dupuytren's contracture (OR=2.61 [2.50, 2.72], p<1x10 -100 ). The Mendelian randomization results provided evidence that type 1 diabetes is a causal risk factor for frozen shoulder (OR=1.04 [1.02-1.07], p=6x10 -5 ). There was no evidence that obesity was causally associated with . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 16, 2020. ; https://doi.org/10.1101/2020.11.13.20224360 doi: medRxiv preprint NOTE: This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice.
2 frozen shoulder, suggesting that diabetes influences risk of the condition through glycemic rather than mechanical effects.
We have identified the first genetic variant associated with frozen shoulder. WNT7B is a potential causal gene at the locus. Diabetes is a likely causal risk factor. Our results provide evidence of biological mechanisms involved in this common painful condition.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 16, 2020. ; https://doi.org/10.1101/2020.11.13.20224360 doi: medRxiv preprint 3 Background Frozen shoulder, also known as adhesive capsulitis, affects 2-5% of the population at some point in their lives [1,2]. It is characterised by initial shoulder pain followed by a gradual reduction in range of movement or "freezing". Pain subsides but joint stiffness can continue for years, causing significant disability [3]. The onset of frozen shoulder is usually between 40 and 60 years [4].
Little is known about the causes of frozen shoulder. Genome-wide association studies can provide new insights into underlying biological mechanism and possible drug targets. For example, a recent GWAS of osteoarthritis has been used to identify new therapeutic targets for the condition [5]. To date, there have been no published genome wide association studies (GWAS) of frozen shoulder.
Diabetes is the strongest known risk factor for diabetes. Individuals with diabetes have a greatly increased lifetime risk, with a hazard ratio of 1.33 [6]. Whether diabetes causes frozen shoulder is unclear because the association may reflect residual confounding by other risk factors such as age, obesity [7] and Dupuytren's disease [8].
Mendelian randomization is a statistical method that can be used to infer causal relationships between an exposure and an outcome by using genetic variants associated with the exposure [9]. The exposure associated variants can be used as an unconfounded proxy for the exposure, as their inheritance is random at conception. This method is now extensively used to infer causal associations and proof of principle examples include evidence that increased BMI causes diabetes [10] and increased LDL cholesterol causes coronary artery disease [11].
Using the UK Biobank, a deeply phenotyped longitudinal study of 500,000 individuals from the UK, we performed a genome-wide association study (GWAS) to identify genetic variants associated with frozen shoulder. We used publicly available summary statistics from 135,658 individuals from the FinnGen study for replication [12]. We then used Mendelian randomization to test for causal associations of diabetes and obesity with frozen shoulder.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 16, 2020. ; https://doi.org/10.1101/2020.11.13.20224360 doi: medRxiv preprint 4

Frozen shoulder cases in the UK Biobank
The primary GWAS was performed on cases of frozen shoulder identified from the ICD-10 code M750 in the UK Biobank. For validation and sensitivity analyses in the UK Biobank primary care data we used read codes (N210, XE1FL and XE1Hm) and classified as cases any individual with at least one entry of these codes. Controls were defined as individuals without a record of one of these codes.

Other phenotypes
ICD-10 codes were used for related disorders (Dupuytren's disease (M720), any fibroblastic disease (M72), rotator cuff (M751), and calcific tendinitis of shoulder (M753)). Type 1 diabetes was classified favouring specificity over sensitivity, defined by diagnosis aged ≤ 20, on insulin within 1 year of diagnosis and at time of recruitment, not using oral antihyperglycemic agents, and not self-reporting type 2 diabetes, as described in [13]. Type 2 diabetes was defined by participants answering yes to the question 'Has a doctor ever told you that you have diabetes?' excluding those who reported using insulin within one year of diagnosis, were diagnosed under the age of 35, or were diagnosed within the past year [14].
We used UK Biobank variable 21001 for BMI measurement. HbA1c in mmol/mol is defined using variable n_30750_0_0 in UK Biobank. Diabetic eye disease was defined using selfreport code 1276.

Genome-wide Association Study
We performed a case-control genome-wide association study using BOLT-LMM [15], which applies a linear mixed model to test for an association between each SNP and the outcome trait, including population structure as a part of the model. Age, sex, study centre and genotyping chip were also included as covariates in the genome-wide association study. We were used in this analysis. We used LocusZoom to plot the resulting significant locus [16] and GTEx to explore the possibility of functional variants [17].
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 16, 2020. ; https://doi.org/10.1101/2020.11.13.20224360 doi: medRxiv preprint

Replication analysis
We used summary statistics from freeze 3 of the FinnGen publicly available resource [12] to replicate the discovery GWAS findings. Cases of frozen shoulder were also defined using the ICD-10 code M750.

Observational Analyses
We tested for associations with demographic and clinical features using logistic regression models, firstly using a univariable logistic regression model on each variable of interest and secondly using all variables in a multivariable logistic regression model to identify independent associations. We analysed Type 1 and Type 2 diabetes separately.

Mendelian randomization
We applied two different methods. The one sample Mendelian randomization (MR) results were performed in two stages: first, the association between each exposure and a genetic risk score was used to derive a genetically predicted exposure value, and second, these predicted values were used in a logistic regression model of frozen shoulder, adjusting for age, sex, ancestral principal components, assessment centre and genotyping platform.
Second, we used two-sample methods, which involve regressing the effect sizes of variantoutcome associations against the effect sizes of the variant-risk factor associations for a set of SNPs associated with the exposure. We performed inverse variance weighted (IVW) instrumental variable analysis and two further methods that are more robust to potential violations of the standard MR assumptions (MR-Egger [18] and weighted-median MR [19]).
We used effect sizes and SNPs for type 1 diabetes reported in [20] using the Wellcome Trust Case Control Consortium, and a type 2 diabetes genetic risk score using variants identified in [21], [22]. The T1D-GRS uses 2 SNPs, rs2187668 and rs7454108, to tag and categorize the high risk DR3/DR4 haplotypes [20], this requires access to individual level data which we did not have for the FinnGen study. We therefore performed two analyses: one where we used the 2 SNPs to impute DR3/DR4 status and another where we used the SNPs as standard risk SNPs.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 16, 2020. ; https://doi.org/10.1101/2020.11.13.20224360 doi: medRxiv preprint

Diabetes is strongly associated with frozen shoulder in the UK Biobank
To identify associated risk factors for frozen shoulder we tested several measures of diabetes and its related traits, in univariable and, to assess their independent contributions, multivariable analyses. There were 2,064 cases of frozen shoulder in the UK Biobank based on ICD-10 codes and 7,913 based on GP records, of which 323 were common to both.
Demographic and clinical associations with Frozen shoulder can be found in Table 1. Those with frozen shoulder were more likely female, had lower Townsend deprivation index, more likely obese (higher BMI and waist-hip-ratio) and more likely to have either type 1 or type 2 diabetes, with type 1 giving the strongest association. In a multivariable logistic regression model, only the type 1 and type 2 diabetes showed strong association. Variants in WNT7B are strongly associated with frozen shoulder  Figure 2 shows the Locuszoom [16] plot of the associated locus.

Replication and sensitivity analyses of the WNT7B association signal
We sought replication of the association signal in the FinnGen study. FinnGen have recently released GWAS summary statistics for 1801 diseases and traits, including frozen shoulder [12]. The lead SNP from the UK Biobank, rs62228062, was associated with frozen shoulder  Table 2 shows the association of rs62228062 with related disorders in UK Biobank.

Mendelian randomization provides evidence that diabetes is causal to frozen shoulder
We used Mendelian randomization to explore whether the associations with Type 1 and Type 2 diabetes reported in Table 2 were causal. Using 1-sample Mendelian randomization methods, genetic data provided evidence that type 1 diabetes causes frozen shoulder: OR The association with type 1 diabetes was replicated using more robust, two-sample MR methods IVW (p=6x10 -4 ) and MR-Egger (p=2x10 -3 ). There was no evidence from the twosample methods that type 2 diabetes causes frozen shoulder (IVW: p=0.17, MR-Egger: p=0.45). The supplementary tables contain the full results from our Mendelian randomization . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review) preprint
The copyright holder for this this version posted November 16, 2020. ; https://doi.org/10.1101/2020.11.13.20224360 doi: medRxiv preprint 8 analyses, including T1D-GRS results that exclude HLA variants which have greater potential for pleiotropy (Supplementary Table 1 shows the two stage regression, Supplementary Table 2 shows the IVW results and Supplementary Table 3 shows the MR-Egger results).

Figure 3 shows a plot showing the lines of best fit for MR-Egger, IVW, Median IV and
Penalised Median IV methods for type 1 and type 2 diabetes defined by ICD-10 codes, demonstrating a) for type 1 diabetes SNPs, a strong correlation between the betas for type 1 diabetes and frozen shoulder and b) no association for type 2 diabetes SNPs.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Discussion
We have identified the first robust genetic variant to be associated with frozen shoulder.
Frozen shoulder is a condition which affects up to 5% of the population around between the ages of 40 to 60, but the causes of the disease, and particularly the transient nature of the condition are unknown. The genome-wide association study and Mendelian randomization analyses we report here provide new insights into the underlying causes of the condition.
WNT7B is a strong candidate gene at the locus. As with most GWAS studies, further work is needed to identify the causal variant at the locus. GTEX analyses did not identify any strong eQTL's with the lead SNP at the associated locus. However, a recent study performed RNAseq on anterior capsule tissue from 22 patients undergoing arthroscopic capsulotomy surgery for adhesive capsulitis and compared to 26 undergoing arthroscopic stabilization surgery for a different condition [24]. WNT7B was the second most differentially expressed transcript genome-wide with a log fold change of 7 (p=1x10 -16 ), although it was noted that the expression levels of WNT7B was relatively low. The WNT signalling pathway has been highlighted by previous GWAS of related fibroblastic diseases [27].
The lead variant also strongly associates with Dupuytren's disease. Dupuytren's disease is a common condition which is characterised by a hand deformity where fingers cannot be elongated fully. A previous GWAS of Dupuytren's identified 26 associated loci with the condition [25]. One of the more strongly associated loci also contained the WNT7B locus.
However, the lead SNP at that locus has not been associated with frozen shoulder and does not associate with frozen shoulder in the UK Biobank or FinnGen. The odds ratio for the lead SNP in our study has a significantly stronger odds ratio for Dupuytren's than the strongest association signal from the Dupuytren's GWAS. The explanation may be that older versions of SNP chips or imputation panels were used in the previous study which did not capture the lead SNP from our current study of frozen shoulder. It is also unclear why a variant that causes a chronic condition of Dupuytren's disease would cause the late onset and transient condition of frozen shoulder.
Our analyses demonstrate that diabetes is a causal risk factor for frozen shoulder. Diabetes is known to be a strong observational risk factor for frozen shoulder, but the causal nature of the association was unclear. Our Mendelian randomization analyses in UK Biobank provide evidence that Type 1 diabetes is causal for the condition. The weaker association with Type 2 diabetes is likely due to differences in duration of diabetes as Type 1 diabetes is generally diagnosed earlier (half before the age of 30 years) whereas the diagnosis of Type 2 diabetes generally occurs later in life (after 50 years). It is likely all individuals with diabetes would be . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 16, 2020. ; https://doi.org/10.1101/2020.11.13.20224360 doi: medRxiv preprint an increased risk of frozen shoulder, with those with longer duration (like those individuals with Type 1 diabetes) and worse glycemic control at increased risk.
There are limitations to our study. The UK Biobank ICD-10 data is reliant on patients having a coded diagnosis at hospital, which is reliant on accuracy of hospital coding, and also reliant on the condition being severe enough for the patient to go to hospital, which may result in less serious cases being classified as controls. However, frozen shoulder can be difficult to accurately diagnose in primary care. Consistent with this the effect size for frozen shoulder was smaller when cases were based on presence in primary care records rather than inpatient ICD-10 codes. This study was performed using only white Europeans, and further study is needed to determine if results replicate for other ethnic groups. The UK Biobank only includes patients between the ages of 40 and 69 at recruitment, although this covers the most common incidence age range for frozen shoulder.
In conclusion, we have identified the first genetic variant associated with frozen shoulder.
WNT7B is the likely causal gene at the locus. We demonstrate that diabetes is a causal risk factor for frozen shoulder. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 16, 2020. ; https://doi.org/10.1101/2020.11.13.20224360 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 16, 2020. ; https://doi.org/10.1101/2020.11.13.20224360 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint    . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 16, 2020. ; https://doi.org/10.1101/2020.11.13.20224360 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 16, 2020. ; https://doi.org/10.1101/2020.11.13.20224360 doi: medRxiv preprint a b 18 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 16, 2020. ; https://doi.org/10.1101/2020.11.13.20224360 doi: medRxiv preprint