Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

The Genetic Architecture of Alzheimer’s Disease Risk: A Genomic Structural Equation Modelling Study

View ORCID ProfileIsabelle F Foote, View ORCID ProfileBenjamin M Jacobs, View ORCID ProfileGeorgina Mathlin, View ORCID ProfileCameron J Watson, Phazha LK Bothongo, Sheena Waters, View ORCID ProfileRuth Dobson, View ORCID ProfileAlastair J Noyce, View ORCID ProfileKamaldeep S Bhui, Ania Korszun, View ORCID ProfileCharles R Marshall
doi: https://doi.org/10.1101/2021.02.23.21252211
Isabelle F Foote
1Centre for Psychiatry, Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, UK
2Preventive Neurology Unit, Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Isabelle F Foote
Benjamin M Jacobs
2Preventive Neurology Unit, Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Benjamin M Jacobs
Georgina Mathlin
1Centre for Psychiatry, Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Georgina Mathlin
Cameron J Watson
2Preventive Neurology Unit, Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Cameron J Watson
Phazha LK Bothongo
2Preventive Neurology Unit, Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Sheena Waters
2Preventive Neurology Unit, Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ruth Dobson
2Preventive Neurology Unit, Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ruth Dobson
Alastair J Noyce
2Preventive Neurology Unit, Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Alastair J Noyce
Kamaldeep S Bhui
1Centre for Psychiatry, Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, UK
3Department of Psychiatry & Nuffield Department of Primary Care Health Sciences, Medical Sciences Division, University of Oxford, UK
4Director, WPA Collaborating Centre, Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Kamaldeep S Bhui
Ania Korszun
1Centre for Psychiatry, Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Charles R Marshall
2Preventive Neurology Unit, Wolfson Institute of Preventive Medicine, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Charles R Marshall
  • For correspondence: charles.marshall@qmul.ac.uk
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Background Targeting modifiable risk factors may have a role in the prevention of Alzheimer’s disease. However, the mechanisms by which these risk factors influence Alzheimer’s risk remain incompletely understood. Genomic structural equation modelling can reveal patterns of shared genetic architecture that provide insight into the pathophysiology of complex traits.

Methods We identified genome-wide association studies for Alzheimer’s disease and its major modifiable risk factors: less education, hearing loss, hypertension, high alcohol intake, obesity, smoking, depression, social isolation, physical inactivity, type 2 diabetes, sleep disturbance and socioeconomic deprivation. We performed linkage disequilibrium score regression among these traits, followed by exploratory factor analysis, confirmatory factor analysis and structural equation modelling.

Results We identified complex networks of linkage disequilibrium among Alzheimer’s disease risk factors. The data were best explained by a bi-factor model, incorporating a Common Factor for Alzheimer’s risk, and three orthogonal sub-clusters of risk factors, which were validated across the two halves of the autosome. The first sub-cluster was characterised by risk factors related to sedentary lifestyle behaviours, the second by traits associated with reduced life expectancy and the third by traits that are possible prodromes of Alzheimer’s disease. Alzheimer’s disease was more genetically distinct and displayed minimal shared genetic architecture with its risk factors, which was robust to the exclusion of APOE.

Conclusion Shared genetic architecture may contribute to epidemiological associations between Alzheimer’s disease and its risk factors. Understanding the biology reflected by this communality may provide novel mechanistic insights that could help to prioritise targets for dementia prevention.

Introduction

The rising prevalence of Alzheimer’s disease (AD) is a growing public health concern. However, estimates in several high-income countries show a decrease in age-specific incidence of dementia in recent birth cohorts [1-3]. This change has been attributed to improved access to education, reduction in the prevalence of cardiovascular disease and the uptake of healthy lifestyle behaviours, such as increased physical activity [1, 3, 4]. There is increasing interest in targeting dementia risk factors with a view to preventing or delaying the onset of dementia.

The recent Lancet Commission report on Dementia Prevention, Intervention and Care highlighted 12 potentially modifiable risk factors for dementia and estimated that elimination of these factors at key stages of life could prevent up to 40% of all-cause dementia cases [5]. However, the mechanisms by which these factors influence dementia risk, and whether they are truly causal, remain incompletely understood. Some factors such as sleep disturbance, social isolation, depression and hearing difficulty could, at least partly, serve as prodromal risk markers rather than causal risk factors [6-9]. An improved understanding of the causal pathways to clinical dementia development is urgently needed to support prevention efforts through the prioritisation of targets for intervention studies.

Advances in large, well-powered genome-wide association studies (GWASs) have shed light on the shared genetic architecture of complex traits and global pleiotropy. Significant bivariate genetic correlations measured by linkage disequilibrium (LD) score regression have been demonstrated between many behavioural and disease traits [10-14], indicating that extensive overlap in genetic pathways is present between seemingly distinct phenotypes. In the field of psychiatric genetics, genomic structural equation modelling (SEM) studies have identified patterns of shared genetic architecture between multiple psychiatric disorders, suggesting that these disorders have shared pathophysiological mechanisms driven by common pleiotropic risk variants in addition to disease-specific mechanisms [15, 16].

In this study, we aimed to apply these approaches to AD and its major risk factors within a multivariate model. We hypothesised there would be complex patterns of shared genetic architecture between AD and its risk factors, and that a factor analysis approach would suggest distinct aetiological pathways. This novel application could help to uncover shared or moderating risk factor pathways and will inform future work to better understand potential causal biological mechanisms that link modifiable risk factors to AD.

Methods and Materials

Trait selection and data formatting

In addition to Alzheimer’s disease itself, we selected AD risk factors that were identified by the Lancet Commission on Dementia Prevention, Intervention and Care [5]. The report identified 12 major modifiable risk factors, 10 of which were included in this analysis (less education, hearing loss, hypertension, high alcohol intake, obesity, smoking, depression, social isolation, physical inactivity and type 2 diabetes). Air pollution exposure was excluded as heritability estimates are low (all air pollution exposures had heritability estimates <0.015 in UK Biobank (https://nealelab.github.io/UKBB_ldsc/index.html)). We also excluded traumatic brain injury as there was not an appropriate GWAS available for this phenotype or a relevant proxy. We additionally included traits for sleep disturbance and socioeconomic deprivation. These were highlighted by the Lancet Commission as having strong associations with dementia risk, but with insufficient evidence for causality for them to be included in the final list of major modifiable risk factors [5]. We included two traits to measure social isolation (feelings of loneliness and less social activity) because existing literature indicates that these components of social isolation may have differential effects on dementia risk [7, 17].

GWAS summary statistics for each of the 14 included traits were identified (Table 1 and Supplementary Methods) and formatted according to the pre-established requirements for LD score regression and genomic SEM [12, 15, 18]. To minimise bias from rare or poorly imputed alleles, we carried out uniform quality control according to the genomic SEM protocol [15] (Supplementary Methods). All summary statistics were derived from GWAS samples of unrelated individuals of European ancestry.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1: Sample size and population prevalence for each of the included traits.

The sample sizes may differ from the numbers reported in the original GWAS because we only used publicly available and European data. Population prevalence figures are current estimates for age-specific European ancestry populations (see Supplementary Methods for further details).

SNP-based heritability and genetic correlation estimation

We initially checked the univariate SNP-based heritability of each trait to ensure that they all had heritability Z scores >4 [12]. As heritability Z scores are influenced by the magnitude and precision of the heritability estimate, as well as GWAS sample size and the proportion of causal variants [19], using this value as a cut-off was preferable to basing inclusion on the heritability estimate alone. Heritability estimates were expressed on the liability scale for binary traits and the observed scale for continuous traits. Population prevalence for binary traits was defined based on recent estimates in European ancestry cohorts (Table 1 and Supplementary Methods).

We then measured the genetic correlations between all pairs of traits to ensure that there was enough intercorrelation for genomic SEM to be appropriate, but that there were no correlations above 90% [20]. This was to ensure that we did not include highly multicollinear traits within our model, which may have induced bias towards a certain model specification due to near-complete genetic overlap between two traits [21]. We used a Bonferroni corrected significance threshold for 105 pairwise tests (p < 4.76E-4).

Both these steps were performed using LDSC software following pre-established protocols [12, 18] and calculated using publicly available LD scores and weights computed from 1000 Genomes European data restricted to Hapmap 3 SNPs with the major histocompatibility complex region removed.

Genomic factor analysis

Genomic factor analysis was performed in R version 3.6.3. Because the underlying latent structure of genetic covariance between the included traits was unknown, we initially undertook a genomic exploratory factor analysis (EFA) to identify how many factors best explain the common genetic variance. We then used this to guide the specification of the parameters of a follow-up genomic confirmatory factor analysis (CFA).

Owing to the unavailability of suitable replication GWAS datasets for all of our traits, we ran the EFA in the odd autosomes and the CFA in the even autosomes as a cross-validation to minimise model overfitting. This approach has been utilised in a recent genomic SEM study [22]. It was felt to be appropriate in this context as all the traits included were polygenic, thus similar effects should be spread across the autosomes.

We undertook multivariable LD score regression to calculate the underlying genetic covariance matrix and its associated sampling covariance matrix in odd autosomal data to use as the input for EFA [15]. Using EFA we tested different numbers of factors to identify which model performed best using pre-defined criteria (Supplementary Methods). We used oblique (promax) rotation, which assumed correlation between factors, because the results of pairwise LD score regression indicated that there were high levels of genetic correlation between most traits, so it was unlikely that the latent constructs would be uncorrelated.

To test model fit we performed CFA using the output from an additional multivariable LD score regression in the even autosomes. We specified the parameters of the CFA using the results of the best performing model from EFA (i.e. the 3-factor solution). We examined multiple models within these parameters by comparing how inclusion of cross-loadings, negative loadings and higher loading cut-off levels influenced model fit. We assessed model fit by comparing recommended test results and cut-off points [15] (Supplementary Methods). From these results, we took the best performing model and using data from all autosomes we measured its overall genome-wide model fit.

We additionally tested the performance of a common factor model, a second-order model and a bi-factor model (Supplementary Methods). We ran the CFA and SEM models using diagonally weighted least squares (DWLS) estimation as this method accounts for differences in GWAS sample size and is more accurate for modelling binary traits [15, 23].

As AD susceptibility is highly influenced by the APOE gene, we also tested our model in all autosomes except chromosome 19 to assess how far the covariance in the model is driven by this area of the genome. Although SNPs with a χ2>30 are removed from LD score regression analysis [12], there is dense LD present between genes within this region so some APOE-related variance may still be captured via SNPs below this threshold [24, 25]. Therefore, we chose to take a conservative approach to measure non-APOE-related genetic architecture by excluding variants from chromosome 19 [26].

Post-hoc sensitivity analyses

We noticed that the loadings for AD varied substantially between the EFA in odd autosomes, the CFA in even autosomes and the genome-wide SEM. We hypothesised that this may be because AD is more oligogenic than the other traits and so may show differential genetic correlations between chromosomes compared to highly polygenic traits which remain stable [26]. We therefore performed post-hoc sensitivity analyses to compare heritability and pairwise correlation estimates across individual chromosomes, compared EFA loading estimates between different chromosomal groupings and reran our models with AD excluded (Supplementary Methods).

Results

Heritability estimation and genetic correlations

All 14 traits displayed heritability Z scores >4, indicating that the traits were sufficiently heritable to be modelled using genomic SEM (Table 2).

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2: Results of the univariate LD heritability estimation for the 14 traits.

SE = standard error.

We then measured SNP-based genetic correlations between all pairs of traits using LD score regression. Each trait shared at least one significant genetic correlation with another trait after Bonferroni correction, with most traits displaying positive correlations with multiple other traits, indicating a complex network of LD suitable for factor analysis (Figure 1, Figure 2 and Table S1).

Figure 1:
  • Download figure
  • Open in new tab
Figure 1: A heatmap of pairwise genome-wide genetic correlations between all 14 traits based on LD score regression.

The upper triangle of the matrix displays the strength of correlation (size of each square) and significant associations passing Bonferroni correction (p < 4.76E-4) are marked with an asterisk. The lower triangle shows the correlation coefficient values. In both triangles, the shade of each square denotes a positive (blue) or negative (red) correlation, varying in shade by magnitude of correlation.

AD Alzheimer’s disease; MDD major depressive disorder; INS insomnia; LON loneliness; LSA less social activity; HD hearing difficulty; LED less education; LPA physical inactivity; SMK smoking; ALC alcohol intake frequency; BMI body mass index; DEP deprivation status; SBP systolic blood pressure; T2DM type 2 diabetes mellitus.

Figure 2:
  • Download figure
  • Open in new tab
Figure 2: A weighted undirected graph showing the structure of the significant genetic correlations that are present between the 14 traits in our analysis.

Each trait forms an individual node (depicted by a circle) and the lines joining them are edges which represent pairwise correlation coefficients between traits. Only Bonferroni significant relationships are included. The stronger the correlation, the shorter and wider the edge and the darker the colour. As all the significant correlations were positive, all edges are blue.

AD Alzheimer’s disease; MDD major depressive disorder; INS insomnia; LON loneliness; LSA less social activity; HD hearing difficulty; LED less education; LPA physical inactivity; SMK smoking; ALC alcohol intake frequency; BMI body mass index; DEP deprivation status; SBP systolic blood pressure; T2DM type 2 diabetes mellitus.

Factor analysis

EFA indicated that a 3-factor model fitted the data best (Figure 3, Tables S2-S4 and Supplementary Results). Using the parameters of the 3-factor EFA, we ran CFA in the even chromosomes to establish model fit. As systolic blood pressure did not display any positive loadings above our pre-defined cut-off it was omitted from our subsequent models. The best fitting CFA included positive loadings ≥0.20, cross-loadings and between-factor correlations but excluded negative loadings (Table 3). Model fit improved when tested in all autosomes and after excluding chromosome 19 that contains APOE (Figure 4a and Supplementary Results).

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 3: Model fit statistics for each of the genomic SEM models performed in even autosomes, all autosomes, all autosomes except chromosome 19 and all autosomes with Alzheimer’s disease (AD) excluded.

None of these models included systolic blood pressure since it produced low loading estimates during exploratory factor analysis.

Figure 3:
  • Download figure
  • Open in new tab
Figure 3: Standardised factor loadings for the 3-factor genomic exploratory factor analysis model based on data from the odd autosomes.

Each bar on the plot denotes the factor loading of each trait on each of the 3 factors. Each vertical rectangle represents a factor. These factors are ordered by the amount of total variance they explain (left to right). The vertical dashed line represents the chosen cut-off threshold (0.20). The colour of the bar represents the factor loading strength. Dark blue denotes highly stable positive loadings (>0.40), light blue represents positive loadings between 0.20 to 0.40, green denotes positive loadings not meeting our cut-off threshold (between 0 and 0.20) and red represents negative loadings.

Figure 4:
  • Download figure
  • Open in new tab
Figure 4:
  • Download figure
  • Open in new tab
Figure 4: Path diagrams of the standardised solutions for the (a) 3-factor CFA, (b) common factor model and (c) bi-factor model conducted across all autosomes.

The dark grey ovals represent latent unobserved constructs (factors), the pale grey circles are the observed traits made up of the measured genetic variance of each included trait, the white circles represent the unique variance of each trait not explained by loadings into factors. Unidirectional arrows depict regression coefficients from the independent variable to the dependent variable, curved two-headed arrows between factors represent correlations between the factors and two-headed arrows connecting a variable to itself highlights the variance of a variable with itself. Numbers in brackets represent the standard error of each measured value. Due to its lack of positive loading onto any factor during EFA, systolic blood pressure was excluded from these models.

Figure A displays the path diagram for the 3-factor CFA which shows the factor loadings for traits that displayed a loading ≥ .20 at EFA. These three latent factors show moderate levels of inter-factor correlation.

Figure B displays the path diagram for the common factor model where there is only one factor that depicts the overarching common variance between all included traits.

Figure C displays the path diagram for the bi-factor model where there is a Common Factor of shared variance between all of the traits as well as 3 uncorrelated latent factors of sub-clusters of covariance that were specified by EFA in the same way as our CFA.

AD Alzheimer’s disease; MDD major depressive disorder; INS insomnia; LON loneliness; LSA less social activity; HD hearing difficulty; LED less education; LPA physical inactivity; SMK smoking; ALC alcohol intake frequency; BMI body mass index; DEP deprivation status; T2DM type 2 diabetes mellitus.

Comparison of common factor, second-order factor and bi-factor models revealed that the bi-factor model provided the best model fit both in the even autosomes and across all autosomes (Table 3, Figure 4 and Figures S1-S2). In the genome-wide bi-factor model, the Common Factor was influenced by all of the included traits except for hearing difficulty and the factors from the CFA formed orthogonal sub-clusters of variance between specific sets of AD risk factors that were independent of AD (Figure 4c). The model fit remained good after excluding chromosome 19 (Table 3 and Supplementary Results).

Post-hoc sensitivity analyses

Our sensitivity analyses found that in comparison to the risk factor traits, AD displayed substantial differences in SNP-based heritability estimates, pairwise genetic correlations with its risk factors and EFA loading estimates between the odd and the even chromosomes (Tables S5-S6 and Supplementary Results). However, follow-up genomic factor analysis of data from all autosomes where AD and systolic blood pressure were excluded produced similar results to our main model, indicating that basing our model on data from the odd autosomes was appropriate for the AD risk factors and that the latent constructs are sufficiently stable to draw conclusions from (Table S7 and Supplementary Results).

Discussion

In this study, we modelled the shared genetic architecture between AD and its potentially modifiable risk factors. LD score regression highlighted pervasive genetic overlap between traits, though AD was more genetically unique than its risk factors. We built on this by using factor analysis and genomic SEM to model the pattern of this shared genetic architecture. The best fitting bi-factor model demonstrated the presence of a common genetic liability between AD and its risk factors, while the remaining genetic covariance was explained by three orthogonal clusters of traits. The findings demonstrate a richly interconnected network of genetic communality which may, in part, explain established epidemiological associations and suggest new avenues for both understanding the mechanisms by which modifiable risk factors influence AD, and aid the discovery of the components of missing heritability.

Pairwise bivariate LD score regression revealed many significant genetic correlations between the traits, indicating a substantial shared genetic component between AD risk factors. However, Alzheimer’s disease, systolic blood pressure and hearing difficulty had more limited and weaker genetic correlation with the other traits, suggesting that they are weaker genetic instruments and/or are more distinct in their genetic architecture. Our findings here support previous work that found AD to have a greater amount of unique genetic variance compared to psychiatric, behavioural or cardiometabolic traits [11, 12]. The strongest bivariate correlation for AD was with less education, which is consistent with previous Mendelian randomisation results, suggesting a causal effect of less education on risk of AD [27-29]. Our study goes beyond such bivariate approaches by identifying clusters of shared variance between traits that might not meet the stringent power requirements of Mendelian randomisation [30, 31], and highlighting smaller, yet potentially meaningful, overlap in genetic pathways.

Factor analysis identified an optimal 3-factor model, in which each factor contained a distinct cluster of traits, allowing us to generate some hypotheses about the mechanisms that might be reflected in these factors. Factor 1 had positive loadings for AD and a majority of the risk factors, suggesting that it might represent a common genetic pathway to AD across metabolic, psychiatric and lifestyle traits. Factor 2 had highly stable positive loadings for risk factors that are strongly associated with reduced life expectancy (smoking, deprivation and physical inactivity) [32-34] together with negative loading for AD. This factor might reflect only an apparent protective effect of these factors on AD due to premature mortality and survival bias [35-37], or could simply reflect a clustering of these traits independent of AD. Factor 3 included positive loadings for AD and a subset of risk factors that have complex relationships with AD in that they have all been hypothesised to be disease prodromes as well as causal risk factors [6-9, 38]. Moreover, systolic blood pressure negatively loaded onto this factor, and whereas risk of dementia is associated with higher blood pressure in mid-life, blood pressure declines during the period prior to cognitive symptoms and dementia diagnosis [39-42]. It is therefore possible that Factor 3 reflects AD prodromes, and hence reverse causality between AD and these traits.

A bi-factor model provided the best model fit in SEM. This model highlighted a Common Factor of shared genetic liability between the included traits in addition to the 3 distinct orthogonal clusters of covariance identified in the factor analysis. The only trait that did not pass inclusion into this model was systolic blood pressure due to its lack of communality with the other traits. This may partly be due to the complex relationship with blood pressure at the different stages of AD [39-42]. AD loaded positively onto the Common Factor but did not display positive loadings for any of the 3 sub-factors after the Common Factor was added to the model. This indicates that there is a common genetic liability between AD and its risk factors, but there is also a high degree of genetic overlap between different sets of AD risk factors independent of AD genetic pathways. The Common Factor had particularly high associations with being less educated, less physically active, more deprived and more socially isolated, indicating that these risk factors are those which have the highest degree of shared genetic liability with AD. However, the level of directly shared genetic architecture between risk factors and AD is likely to be low since 96% of the variance associated with AD was unique. The only risk factor that did not display a substantial loading onto the Common Factor was hearing difficulty. This may reflect that this risk factor has a limited role in determining AD risk, but rather is associated by reverse causation [43, 44].

The three orthogonal clusters identified in the factor analysis were largely unchanged in the SEM model, except that the AD loadings were attenuated because the shared variance between AD and its risk factors became entirely accounted for by the Common Factor. Additionally, Factor 1 only retained high loadings with traits associated with sedentary lifestyle behaviours (type 2 diabetes, higher alcohol intake, physical inactivity and obesity).

Despite their lack of direct shared genetic liability with AD, the likely relevance of these 3 sub-clusters is two-fold. Firstly, multimorbidity has been shown to increase an individual’s risk of developing AD, although the mechanisms for this remain unknown [45]. The high levels of shared genetic architecture between sets of known AD risk factors may represent mutual genetic liabilities between groups of traits that increase an individual’s likelihood of developing multiple conditions, or they might be associated with more generic ‘unhealthy’ behaviour and social circumstance. Secondly, although these clusters display no direct shared genetic pathway with AD, these shared pathways may exert pathophysiological effects that create a microenvironment within the brain that leads to an increased propensity to develop AD indirectly. In cases where an individual carries risk variants from multiple latent clusters, this may exert synergistic effects on AD development via non-genetic pathways, such as reducing cognitive reserve or via gene-environment interactions.

While the clustering of traits here may reflect causal genetic pathways between AD and certain risk factors, this analysis does not allow us to fully disentangle the nature of the observed pleiotropy, as we are unable to distinguish between causal, mediating and shared pathways. For example, since having less education is not only significantly genetically correlated with AD, but also all other risk factors (except for hearing difficulty), and it is one of the highest loading traits for the Common Factor, it seems probable that the common liability between less education may be a mediating or moderating driver of the shared relationship with AD and its other risk factors via extensive shared pleiotropy. Previous studies measuring bivariate associations have highlighted a causal genetic link between less education and AD [27-29] and its early life occurrence makes it a stronger candidate for being on the causal pathway to AD than other proposed risk factors that seem to exert risk later in life so are more likely to be linked by reverse causation [5]. Loneliness may also have a direct shared component with AD, but further work to identify the underlying SNPs in this model are needed to establish this. However, its high loading onto the Common Factor could moderate some of the shared liability seen for depression in this factor. Previous studies of the direct shared genetic architecture between depression and AD have been inconclusive [11, 46-48], so it is possible that loading of depression onto this factor is via its strong shared component with loneliness rather than by AD itself. Therefore, although we provide preliminary evidence of a potential shared liability between AD and risk factors, analysis of the SNP-based effects within this construct will enable a better understanding of how these traits are truly related on a genetic level and whether this is due to causal effects or shared risk variants.

There are several other important limitations to this work. Chief among these are the potential limitations of the genetic instruments, especially for AD. There was a high fluctuation of AD loading estimates between the CFA conducted in the even autosomes compared to the genome-wide estimates. Our sensitivity analyses indicated that this was driven by substantially different heritability and genetic correlation estimates between individual chromosomes. This may be due to the relatively oligogenic nature of AD compared to its risk factors [26], so shared genetic architecture may be confined to specific regions of the genome. This oligogenicity may not be appropriately modelled by methods designed for polygenic trait analysis, such as LD score regression, which expect risk variants to be spread evenly across the genome. Since LD score regression omits SNPs with a χ2>30, this could make AD a weaker genetic instrument by down-weighting the most important SNPs. This should be taken into consideration for genomic SEM analysis since the method bases its estimates on LD score regression and for certain traits, such as AD, it may not be suitable to split the chromosomes for factor analysis. However, post-hoc factor analysis of the more stable risk factor traits showed that the three latent constructs of shared genetic architecture between AD risk factors themselves are robust, so our conclusions relating to these constructs remain valid.

Furthermore, AD had the lowest heritability Z score of any of the traits that we measured. The precision of genetic correlations measured using LD score regression is influenced by sample size, whereas the magnitude of the correlation is not [12, 16], so the small sample size for AD probably contributed to the low Z score estimate. Heritability may also be adversely influenced by the heterogeneous nature of AD outcomes in the original GWAS [49]. The use of endophenotypes has been shown to produce more homogenous genetic phenotypes. However the current GWAS data for molecular biomarkers of AD pathology have poor heritability estimates and sample sizes too small to be used reliably for genomic SEM [50].

Another important limitation is that we have only used samples from European ancestry within our current analysis owing to the confounding effects of ancestral variation in LD score regression. Although sampling efforts in more diverse ancestral groups have started to increase, there is still a significant lack of data for many phenotypes for non-European groups and we were thus unable to perform an analysis that would include all our traits of interest [51]. Therefore, it is possible that these findings cannot be generalised to other populations. There is increasing evidence that AD risk factors exert differential effects on AD onset across diverse ethnicities [52, 53] and so, as sufficient data become available, it will be crucial to perform similar analyses in a range of ethnic groups to support targeted prevention strategies globally.

Lastly, our current model only measures shared genetic architecture, but does not identify shared environmental pathways or gene-environment interactions. There is evidence to suggest that an individual’s genetic profile can influence the magnitude of the effect that modifiable risk factors have on AD risk [54, 55]. Furthermore, increasing age remains the largest risk factor for AD and various epigenetic changes have been associated with both AD and ageing [56-58], so gene-environment interactions may be especially important to explore. However, such studies require a good understanding of the underlying genetic architecture, so our current study provides a useful and novel foundation for future work in this area.

This work provides the basis for further study to identify the SNPs and downstream pathways that are shared within the latent constructs identified here and those that are unique to AD and specific risk factors. GWAS of the factors within the model here may also yield new genetic associations with AD risk. Understanding these biological pathways that link AD and its risk factors could help to prioritise interventions that might prevent AD. The methods used here are also transferable to other disorders and risk factors so may provide further insight into a variety of diseases. Genomic SEM has proven successful in the field of psychiatric and behavioural genetics but has not been as widely adopted within other health research specialities and we hope this study will help show its potential merits. More broadly, future work in AD genetics should focus on phenotype optimisation and larger sample sizes of clinical AD cases that may lead to improved GWAS discovery and more links to risk factors may be unearthed. Larger GWASs of AD biomarkers, such as CSF amyloid-β and tau, will enable exploration of how AD risk factors share genetics with these specific pathologies, providing a deeper understanding of whether risk factors influence AD genetically via shared architecture with pathological processes or by reducing cognitive reserve. Lastly, analytical methods particularly suited to measuring oligogenic conditions will help to find the optimal way of modelling a relatively oligogenic disorder alongside highly polygenic ones.

Taken together, these results demonstrate a complex pattern of shared genetic architecture between AD and its risk factors, including a Common Factor for AD risk that may link pathophysiological mechanisms across metabolic, psychiatric and lifestyle traits. Understanding the biology underpinning this extensive communality could yield novel preventive strategies to mitigate the growing global challenge of dementia.

Key Resources Table

View this table:
  • View inline
  • View popup
  • Download powerpoint

Data Availability

The data used in the study are all publicly available for download (GWAS summary statistics, LD scores and weights, and the reference genome file). See the Key Resources Table in the main manuscript for details.

http://web.pasteur-lille.fr/en/recherche/u744/igap/igap_download.php

https://www.med.unc.edu/pgc/download-results/mdd/

https://ctg.cncr.nl/software/summary_statistics

http://diagram-consortium.org/downloads.html

http://www.nealelab.is/uk-biobank

https://alkesgroup.broadinstitute.org/LDSCORE/

https://utexas.app.box.com/s/vkd36n197m8klbaio3yzoxsee6sxo11v

Author contributions

IFF, CRM, AK and KSB created the study concept and design. IFF and CRM drafted the manuscript. IFF performed the data curation and analysis. AJN and BMJ provided technical advice on data analysis and genetic methods. GM guided aspects relating to factor analysis. RD provided critique and refinement on the methods. CJW, PLKB and SW provided input and theoretical knowledge with interpretation of results. All authors contributed to the final version of this manuscript. CRM, AK and KSB supervised this project.

Disclosures

IFF is funded by the George Henry Woolfe Legacy Fund (WOL1000B) provided by the Wolfson Institute of Preventive Medicine (Queen Mary University of London). CRM is funded by a grant from Bart’s Charity (MGU0366) and receives personal fees from GE Healthcare, outside the current study. RD reports personal fees from Biogen Idec, Merck, Celgene, Janssen, Novartis, and Genzyme outside of the current study. AJN reports personal fees from Britannia Pharmaceuticals, Profile, Neurology Academy, Roche, Biogen, LEK consulting, AbbVie, BIAL, and the Movement Disorders Society, outside of the current study. The other authors declare no competing interests.

Acknowledgements

We would like to thank the participants in the included data cohorts that we used in this study and the researchers who conducted the original GWAS studies for making their data publicly accessible, thereby making our work possible. We also thank the developers of all the software we used for making this open source and special thanks to Andrew Grotzinger (University of Texas at Austin) for providing valuable technical support with the GenomicSEM software. We are grateful to the late Dr Robert Keers for his insightful comments on the analysis plan. This project was supported by the George Henry Woolfe Legacy Fund (WOL1000B) and a Bart’s Charity grant (MGU0366).

Footnotes

  • ↵* These authors jointly supervised this work

References

  1. 1.↵
    Ahmadi-Abhari, S., et al., Temporal trend in dementia incidence since 2002 and projections for prevalence in England and Wales to 2040: modelling study. BMJ, 2017. 358: p. j2856.
    OpenUrlAbstract/FREE Full Text
  2. 2.
    James, S.L., et al., Global, regional, and national incidence, prevalence, and years lived with disability for 354 diseases and injuries for 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. The Lancet, 2018. 392(10159): p. 1789–1858.
    OpenUrlCrossRefPubMed
  3. 3.↵
    Wu, Y.T., et al., The changing prevalence and incidence of dementia over time - current evidence. Nat Rev Neurol, 2017. 13(6): p. 327–339.
    OpenUrlCrossRefPubMed
  4. 4.↵
    Langa, K.M., et al., A Comparison of the Prevalence of Dementia in the United States in 2000 and 2012. JAMA Internal Medicine, 2017. 177(1): p. 51–58.
    OpenUrl
  5. 5.↵
    Livingston, G., et al., Dementia prevention, intervention, and care: 2020 report of the Lancet Commission. The Lancet, 2020. 396(10248): p. 413–446.
    OpenUrl
  6. 6.↵
    Leng, Y., et al., Genetic Risk of Alzheimer’s Disease and Sleep Duration in Non-Demented Elders. Annals of Neurology, 2021. 89(1): p. 177–181.
    OpenUrl
  7. 7.↵
    Holwerda, T.J., et al., Feelings of loneliness, but not social isolation, predict dementia onset: results from the Amsterdam Study of the Elderly (AMSTEL). J Neurol Neurosurg Psychiatry, 2014. 85(2): p. 135–42.
    OpenUrlAbstract/FREE Full Text
  8. 8.
    Brommelhoff, J.A., et al., Depression as a risk factor or prodromal feature for dementia? Findings in a population-based sample of Swedish twins. Psychology and aging, 2009. 24(2): p. 373–384.
    OpenUrlCrossRefPubMedWeb of Science
  9. 9.↵
    Hardy, C.J.D., et al., Hearing and dementia. Journal of neurology, 2016. 263(11): p. 2339–2354.
    OpenUrlPubMed
  10. 10.↵
    Tylee, D.S., et al., Genetic correlations among psychiatric and immune-related phenotypes based on genome-wide association data. Am J Med Genet B Neuropsychiatr Genet, 2018. 177(7): p. 641–657.
    OpenUrlCrossRef
  11. 11.↵
    Brainstorm Consortium, et al., Analysis of shared heritability in common disorders of the brain. Science, 2018. 360(6395).
  12. 12.↵
    Bulik-Sullivan, B., et al., An atlas of genetic correlations across human diseases and traits. Nature Genetics, 2015. 47(11): p. 1236–1241.
    OpenUrlCrossRefPubMed
  13. 13.
    Hagenaars, S.P., et al., Genetic comorbidity between major depression and cardio-metabolic traits, stratified by age at onset of major depression. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics, 2020. 183(6): p. 309–330.
    OpenUrl
  14. 14.↵
    Hill, W.D., et al., Molecular Genetic Contributions to Social Deprivation and Household Income in UK Biobank. Current Biology, 2016. 26(22): p. 3083–3089.
    OpenUrlCrossRefPubMed
  15. 15.↵
    Grotzinger, A.D., et al., Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits. Nat Hum Behav, 2019. 3(5): p. 513–525.
    OpenUrl
  16. 16.↵
    Lee, P.H., et al., Genomic Relationships, Novel Loci, and Pleiotropic Mechanisms across Eight Psychiatric Disorders. Cell, 2019. 179(7): p. 1469-1482.e11.
    OpenUrlCrossRef
  17. 17.↵
    Wilson, R.S., et al., Loneliness and Risk of Alzheimer Disease. Archives of General Psychiatry, 2007. 64(2): p. 234–240.
    OpenUrlCrossRefPubMedWeb of Science
  18. 18.↵
    Zheng, J., et al., LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis. Bioinformatics, 2016. 33(2): p. 272–279.
    OpenUrl
  19. 19.↵
    Hill, W.D., et al., Molecular genetic aetiology of general cognitive function is enriched in evolutionarily conserved regions. Translational Psychiatry, 2016. 6(12): p. e980–e980.
    OpenUrl
  20. 20.↵
    Yoo, W., et al., A Study of Effects of MultiCollinearity in the Multivariable Analysis. International journal of applied science and technology, 2014. 4(5): p. 9–19.
    OpenUrl
  21. 21.↵
    Tarka, P., An overview of structural equation modeling: its beginnings, historical development, usefulness and controversies in the social sciences. Quality & quantity, 2018. 52(1): p. 313–354.
    OpenUrl
  22. 22.↵
    Grotzinger, A.D., et al., Genetic Architecture of 11 Major Psychiatric Disorders at Biobehavioral, Functional Genomic, and Molecular Genetic Levels of Analysis. medRxiv, 2020: p. 2020.09.22.20196089.
  23. 23.↵
    Beauducel, A. and P.Y. Herzberg, On the Performance of Maximum Likelihood Versus Means and Variance Adjusted Weighted Least Squares Estimation in CFA. Structural Equation Modeling: A Multidisciplinary Journal, 2006. 13(2): p. 186–203.
    OpenUrl
  24. 24.↵
    Ware, E.B., et al., Considering the APOE locus in Alzheimer’s disease polygenic scores in the Health and Retirement Study: a longitudinal panel study. BMC Medical Genomics, 2020. 13(1): p. 164.
    OpenUrl
  25. 25.↵
    Kulminski, A.M., et al., Genetic and regulatory architecture of Alzheimer’s disease in the APOE region. Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring, 2020. 12(1): p. e12008.
    OpenUrl
  26. 26.↵
    Zhang, Q., et al., Risk prediction of late-onset Alzheimer’s disease implies an oligogenic architecture. Nature Communications, 2020. 11(1): p. 4799.
    OpenUrl
  27. 27.↵
    Anderson, E.L., et al., Education, intelligence and Alzheimer’s disease: evidence from a multivariable two-sample Mendelian randomization study. Int J Epidemiol, 2020. 49(4): p. 1163–1172.
    OpenUrl
  28. 28.
    Andrews, S.J., et al., Causal Associations Between Modifiable Risk Factors and the Alzheimer’s Phenome. Ann Neurol, 2021. 89(1): p. 54–65.
    OpenUrl
  29. 29.↵
    Larsson, S.C., et al., Modifiable pathways in Alzheimer’s disease: Mendelian randomisation analysis. BMJ, 2017. 359: p. j5375.
    OpenUrlAbstract/FREE Full Text
  30. 30.↵
    Davies, N.M., M.V. Holmes, and G. Davey Smith, Reading Mendelian randomisation studies: a guide, glossary, and checklist for clinicians. BMJ, 2018. 362: p. k601.
    OpenUrlFREE Full Text
  31. 31.↵
    Burgess, S., A.S. Butterworth, and J.R. Thompson, Beyond Mendelian randomization: how to interpret evidence of shared genetic predictors. Journal of Clinical Epidemiology, 2016. 69: p. 208–216.
    OpenUrlCrossRefPubMed
  32. 32.↵
    Holford, T.R., et al., Tobacco control and the reduction in smoking-related premature deaths in the United States, 1964-2012. JAMA, 2014. 311(2): p. 164–171.
    OpenUrlCrossRefPubMedWeb of Science
  33. 33.
    Lewer, D., et al., Premature mortality attributable to socioeconomic inequality in England between 2003 and 2018: an observational study. The Lancet Public Health, 2020. 5(1): p. e33–e41.
    OpenUrl
  34. 34.↵
    Warburton, D.E.R., et al., A systematic review of the evidence for Canada’s Physical Activity Guidelines for Adults. International Journal of Behavioral Nutrition and Physical Activity, 2010. 7(1): p. 39.
    OpenUrl
  35. 35.↵
    Hernán, M.A., A. Alonso, and G. Logroscino, Cigarette Smoking and Dementia: Potential Selection Bias in the Elderly. Epidemiology, 2008. 19(3).
  36. 36.
    Weuve, J., et al., Guidelines for reporting methodological challenges and evaluating potential bias in dementia research. Alzheimer’s & dementia : the journal of the Alzheimer’s Association, 2015. 11(9): p. 1098–1109.
    OpenUrl
  37. 37.↵
    Kukull, W.A., The association between smoking and Alzheimer’s disease: effects of study design and bias. Biological Psychiatry, 2001. 49(3): p. 194–199.
    OpenUrlCrossRefPubMedWeb of Science
  38. 38.↵
    Johnson, J.C.S., et al., Hearing and dementia: from ears to brain. Brain, 2020.
  39. 39.↵
    Ma, Y., et al., Variation in blood pressure and long-term risk of dementia: A population-based cohort study. PLOS Medicine, 2019. 16(11): p. e1002933.
    OpenUrl
  40. 40.
    Perera, G., et al., Vascular and metabolic risk factor differences prior to dementia diagnosis: a multidatabase case–control study using European electronic health records. BMJ Open, 2020. 10(11): p. e038753.
    OpenUrlAbstract/FREE Full Text
  41. 41.
    Lane, C.A., et al., Associations between blood pressure across adulthood and late-life brain structure and pathology in the neuroscience substudy of the 1946 British birth cohort (Insight 46): an epidemiological study. The Lancet Neurology, 2019. 18(10): p. 942–952.
    OpenUrl
  42. 42.↵
    Sproviero, W., et al., High Blood Pressure and Risk of Dementia: a Two-Sample Mendelian Randomization study in the UK Biobank. Biological Psychiatry, 2020.
  43. 43.↵
    Brenowitz, W.D., et al., Clinician-judged hearing impairment and associations with neuropathologic burden. Neurology, 2020. 95(12): p. e1640.
    OpenUrl
  44. 44.↵
    Brenowitz, W.D., et al., Association of genetic risk for Alzheimer disease and hearing impairment. Neurology, 2020. 95(16): p. e2225–e2234.
    OpenUrl
  45. 45.↵
    Peters, R., et al., Combining modifiable risk factors and risk of dementia: a systematic review and meta-analysis. BMJ Open, 2019. 9(1): p. e022846.
    OpenUrlAbstract/FREE Full Text
  46. 46.↵
    Gibson, J., et al., Assessing the presence of shared genetic architecture between Alzheimer’s disease and major depressive disorder using genome-wide association data. Transl Psychiatry, 2017. 7(4): p. e1094.
    OpenUrl
  47. 47.
    Lutz, M.W., et al., Shared genetic etiology underlying Alzheimer’s disease and major depressive disorder. Translational Psychiatry, 2020. 10(1): p. 88.
    OpenUrl
  48. 48.↵
    Ni, H., et al., The GWAS Risk Genes for Depression May Be Actively Involved in Alzheimer’s Disease. J Alzheimers Dis, 2018. 64(4): p. 1149–1161.
    OpenUrl
  49. 49.↵
    Ryan, J., et al., Phenotypic Heterogeneity in Dementia: A Challenge for Epidemiology and Biomarker Studies. Front Public Health, 2018. 6: p. 181.
    OpenUrl
  50. 50.↵
    Deming, Y., et al., Genome-wide association study identifies four novel loci associated with Alzheimer’s endophenotypes and disease modifiers. Acta Neuropathologica, 2017. 133(5): p. 839–856.
    OpenUrlCrossRefPubMed
  51. 51.↵
    Martin, A.R., et al., Clinical use of current polygenic risk scores may exacerbate health disparities. Nature Genetics, 2019. 51(4): p. 584–591.
    OpenUrlCrossRefPubMed
  52. 52.↵
    Babulal, G.M., et al., Perspectives on ethnic and racial disparities in Alzheimer’s disease and related dementias: Update and areas of immediate need. Alzheimer’s & Dementia, 2019. 15(2): p. 292–312.
    OpenUrl
  53. 53.↵
    Mukadam, N., et al., Population attributable fractions for risk factors for dementia in low-income and middle-income countries: an analysis using cross-sectional survey data. The Lancet Global Health, 2019. 7(5): p. e596–e603.
    OpenUrl
  54. 54.↵
    Eid, A., I. Mhatre, and J.R. Richardson, Gene-environment interactions in Alzheimer’s disease: A potential path to precision medicine. Pharmacology & Therapeutics, 2019. 199: p. 173–187.
    OpenUrl
  55. 55.↵
    Dunn, A.R.K.M. S. O’Connell, and C.C. Kaczorowski, Gene-by-environment interactions in Alzheimer’s disease and Parkinson’s disease. Neuroscience & Biobehavioral Reviews, 2019. 103: p. 73–80.
    OpenUrl
  56. 56.↵
    Li, P., et al., Epigenetic dysregulation of enhancers in neurons is associated with Alzheimer’s disease pathology and cognitive symptoms. Nature Communications, 2019. 10(1): p. 2246.
    OpenUrl
  57. 57.
    Nativio, R., et al., Dysregulation of the epigenetic landscape of normal aging in Alzheimer’s disease. Nature Neuroscience, 2018. 21(4): p. 497–505.
    OpenUrlCrossRefPubMed
  58. 58.↵
    Reynolds, C.A., et al., A decade of epigenetic change in aging twins: Genetic and environmental contributions to longitudinal DNA methylation. Aging Cell, 2020. 19(8): p. e13197.
    OpenUrl
  59. 59.
    Lambert, J.C., et al., Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat Genet, 2013. 45(12): p. 1452–8.
    OpenUrlCrossRefPubMed
  60. 60.
    Wray, N.R., et al., Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nat Genet, 2018. 50(5): p. 668–681.
    OpenUrlCrossRefPubMed
  61. 61.
    Jansen, P.R., et al., Genome-wide analysis of insomnia in 1,331,010 individuals identifies new risk loci and functional pathways. Nat Genet, 2019. 51(3): p. 394–403.
    OpenUrlCrossRefPubMed
  62. 62.
    Scott, R.A., et al., An Expanded Genome-Wide Association Study of Type 2 Diabetes in Europeans. Diabetes, 2017. 66(11): p. 2888.
    OpenUrlAbstract/FREE Full Text
Back to top
PreviousNext
Posted March 02, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
The Genetic Architecture of Alzheimer’s Disease Risk: A Genomic Structural Equation Modelling Study
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
The Genetic Architecture of Alzheimer’s Disease Risk: A Genomic Structural Equation Modelling Study
Isabelle F Foote, Benjamin M Jacobs, Georgina Mathlin, Cameron J Watson, Phazha LK Bothongo, Sheena Waters, Ruth Dobson, Alastair J Noyce, Kamaldeep S Bhui, Ania Korszun, Charles R Marshall
medRxiv 2021.02.23.21252211; doi: https://doi.org/10.1101/2021.02.23.21252211
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
The Genetic Architecture of Alzheimer’s Disease Risk: A Genomic Structural Equation Modelling Study
Isabelle F Foote, Benjamin M Jacobs, Georgina Mathlin, Cameron J Watson, Phazha LK Bothongo, Sheena Waters, Ruth Dobson, Alastair J Noyce, Kamaldeep S Bhui, Ania Korszun, Charles R Marshall
medRxiv 2021.02.23.21252211; doi: https://doi.org/10.1101/2021.02.23.21252211

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Neurology
Subject Areas
All Articles
  • Addiction Medicine (160)
  • Allergy and Immunology (412)
  • Anesthesia (90)
  • Cardiovascular Medicine (855)
  • Dentistry and Oral Medicine (157)
  • Dermatology (97)
  • Emergency Medicine (247)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (392)
  • Epidemiology (8542)
  • Forensic Medicine (4)
  • Gastroenterology (383)
  • Genetic and Genomic Medicine (1741)
  • Geriatric Medicine (167)
  • Health Economics (371)
  • Health Informatics (1235)
  • Health Policy (618)
  • Health Systems and Quality Improvement (467)
  • Hematology (196)
  • HIV/AIDS (372)
  • Infectious Diseases (except HIV/AIDS) (10274)
  • Intensive Care and Critical Care Medicine (552)
  • Medical Education (192)
  • Medical Ethics (51)
  • Nephrology (210)
  • Neurology (1668)
  • Nursing (97)
  • Nutrition (248)
  • Obstetrics and Gynecology (325)
  • Occupational and Environmental Health (450)
  • Oncology (925)
  • Ophthalmology (263)
  • Orthopedics (100)
  • Otolaryngology (172)
  • Pain Medicine (111)
  • Palliative Medicine (40)
  • Pathology (252)
  • Pediatrics (534)
  • Pharmacology and Therapeutics (246)
  • Primary Care Research (207)
  • Psychiatry and Clinical Psychology (1760)
  • Public and Global Health (3831)
  • Radiology and Imaging (622)
  • Rehabilitation Medicine and Physical Therapy (318)
  • Respiratory Medicine (518)
  • Rheumatology (207)
  • Sexual and Reproductive Health (165)
  • Sports Medicine (157)
  • Surgery (190)
  • Toxicology (36)
  • Transplantation (100)
  • Urology (74)