Mapping the aetiological foundations of the heart failure spectrum using human genetics

Heart failure (HF), a syndrome of symptomatic fluid overload due to cardiac dysfunction, is the most rapidly growing cardiovascular disorder. Despite recent advances, mortality and morbidity remain high and treatment innovation is challenged by limited understanding of aetiology in relation to disease subtypes. Here we harness the de-confounding properties of genetic variation to map causal biology underlying the HF phenotypic spectrum, to inform the development of more effective treatments. We report a genetic association analysis in 1.9 million ancestrally diverse individuals, including 153,174 cases of HF; 44,012 of non-ischaemic HF; 5,406 cases of non-ischaemic HF with reduced ejection fraction (HFrEF); and 3,841 cases of non-ischaemic HF with preserved ejection fraction (HFpEF). We identify 66 genetic susceptibility loci across HF subtypes, 37 of which have not previously been reported. We map the aetiologic contribution of risk factor traits and diseases as well as newly identified effector genes for HF, demonstrating differential risk factor effects on disease subtypes. Our findings highlight the importance of extra-cardiac tissues in HF, particularly the kidney and the vasculature in HFpEF. Pathways of cellular senescence and proteostasis are notably uncovered, including IGFBP7 as an effector gene for HFpEF. Using population approaches causally anchored in human genetics, we provide fundamental new insights into the aetiology of heart failure subtypes that may inform new approaches to prevention and treatment.


Summary paragraph
Heart failure (HF), a syndrome of symptomatic fluid overload due to cardiac 264 dysfunction, is the most rapidly growing cardiovascular disorder. Despite recent 265 advances, mortality and morbidity remain high and treatment innovation is challenged 266 by limited understanding of aetiology in relation to disease subtypes. Here we harness 267 the de-confounding properties of genetic variation to map causal biology underlying 268 the HF phenotypic spectrum, to inform the development of more effective treatments. 269 We report a genetic association analysis in 1.9 million ancestrally diverse individuals, 270 including 153,174 cases of HF; 44,012 of non-ischaemic HF; 5,406 cases of non-271 ischaemic HF with reduced ejection fraction (HFrEF); and 3,841 cases of non-272 ischaemic HF with preserved ejection fraction (HFpEF). We identify 66 genetic 273 susceptibility loci across HF subtypes, 37 of which have not previously been reported. 274 We map the aetiologic contribution of risk factor traits and diseases as well as newly 275 identified effector genes for HF, demonstrating differential risk factor effects on 276 disease subtypes. Our findings highlight the importance of extra-cardiac tissues in HF, 277 particularly the kidney and the vasculature in HFpEF. Pathways of cellular senescence 278 and proteostasis are notably uncovered, including IGFBP7 as an effector gene for 279 HFpEF. Using population approaches causally anchored in human genetics, we 280 provide fundamental new insights into the aetiology of heart failure subtypes that may 281 inform new approaches to prevention and treatment. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023.

301
Multi-ancestry genetic association analysis highlights novel heart failure loci 302 We performed a meta-analysis of case-control GWAS across 42 studies to investigate 303 the association of up to 10,199,961 common genetic variants (minor allele frequency 304 . CC-BY 4.0 International license It is made available under a perpetuity. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023. 10 Figure 2). We identified 59 conditionally independent (sentinel) genetic variants at 56 310 non-overlapping genomic loci (distance > 500 kilobase pairs) associated with HF at a 311 genome-wide significance (P < 5 x 10 -8 ) (Figure 1, Supplementary Table 1). Sentinel  Table 2). GWAS of ni-HF, ni-HFrEF, and ni-HFpEF subtypes highlighted 10 additional 315 sentinel variants that were not identified in the HF GWAS. In total, 66 independent 316 genomic risk loci were identified across HF phenotypes 1,4,5 among which 37 have not 317 previously been reported in GWAS of HF [1][2][3][4][5] . Of the 66 loci, 46 (70%) were associated 318 with at least one of the non-ischaemic HF phenotypes at P < 0.05 / 66. These loci were 319 then classified as non-ischaemic HF loci, and the remaining 20 (30%) HF loci were 320 classified as secondary HF (s-HF) loci. Amongst the non-ischaemic loci, 17 were 321 associated with ni-HFrEF and 3 were associated with ni-HFpEF at P < 0.05 / 66. In  Table 3). is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023. 10.01.23296379 doi: medRxiv preprint The genetic architecture of HF was found to be highly polygenic, evidenced by an  Figures 5-6), and there was an exponential relationship between allele frequency and 331 effect size for associated variants (Supplementary Figure 7). The estimated 332 proportion of variance in disease liability explained by common genetic variants, i.e.

333
SNP-based heritability (h 2 g), was 5.4 ± 0.2% for HF; 6.1 ± 0.5% for non-ischaemic HF; 334 11.8 ± 2.6% for non-ischaemic HFrEF, and 1.8 ± 1.3% for non-ischaemic HFpEF  To explore the potential utility of SNP-based heritability for prediction, we derived a is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. 353 To identify HF effector genes, we characterised the functional properties of variants 354 and genes within each identified GWAS locus using a range of orthogonal approaches.  Table 4). 362 We then prioritised putative causal genes among 758 protein-coding genes 363 overlapping HF loci by triangulating evidence from three predictors of gene relevance: 364 1) variant-to-gene (V2G) score derived from functional properties of fine-mapped and 365 sentinel variants 12 , 2) polygenic priority score (PoPS) calculated from enrichment of 366 gene features 13 , and 3) predicted gene expression levels across tissues derived from 367 multi-tissue transcriptome-wide association study (TWAS) 14 . 142 genes were ranked 368 by at least one predictor (labelled as candidate genes), of which 71 were further 369 prioritised based on: ranking highest on a combined predictor-score; highest ranking 370 with at least two predictors; colocalisation with gene transcript expression in a relevant 371 tissue 15 ; or association with a phenotypically relevant Mendelian disorder (labelled as 372 prioritised genes) 16 (Figure 2a- is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint and HSPB7, [9][10][11]23 , CAMKD2 (linked to cardiac hypertrophy 24 ), STRN (linked to canine 379 DCM 25 ), as well as PITX2, KLF12, and ATP1B1 which have been associated with 380 cardiac arrhythmia [26][27][28] . Other notable findings include CAND2, a muscle-specific 381 gene of which upregulation has been linked to pathological cardiac remodelling 29 and 382 NKX2-5, a core cardiac transcription factor associated with congenital heart disease 383 and cardiomyopathy [30][31][32] .

384
To facilitate genomic appraisal at each identified locus, we constructed an online 385 dashboard visualisation containing regional genetic association, gene prioritisation,  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023.10.01.23296379 doi: medRxiv preprint that is associated with impaired cardiac function 33,34 . Genes relating to the formation 397 of aggresomes, cellular organelles store misfolded proteins for subsequent disposal 398 throughautophagy 35 , were enriched, including HSBP7 and BAG3. BAG3-mediated 399 sarcomeric protein turnover is implicated as a mechanism underlying heart failure 36 .

400
Pathways regulating cell division and senescence were identified, implicating tissue 401 homeostasis and renewal in predisposition to age-associated organ dysfunction, 402 Notably, these included the senescence-associated secretory phenotype, a distinct  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint followed by musculoskeletal / connective tissues (12 enrichments), and nervous 423 system tissues (8 enrichments). The most highly enriched tissues for HF, ni-HF, and 424 ni-HFrEF, were cardiac whilst the kidney and pancreas were the most highly enriched 425 for ni-HFpEF, highlighting the relative importance of these organs for specific HF is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint binding motif has been identified as a mechanism for LMNA cardiomyopathy 42 . In 447 contrast to prior reports, we did not observe differential expression of IGFBP7 in 448 cardiomyocytes from failing hearts 37 . is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023.10.01.23296379 doi: medRxiv preprint and cancer, while Cluster 5 comprised genes associated with non-ischaemic HF and 466 ni-HFpEF, and disease including adiposity, diabetes, and carpal tunnel syndrome.

467
Given the extensive pleiotropic effects of HF susceptibility loci, we further investigated  Table 12). All tested traits were associated with at least one HF locus at FDR <1%, is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023.10.01.23296379 doi: medRxiv preprint subunit 1 (PFDN1) an important molecular chaperone for protein folding associated 490 with mortality and cardiovascular phenotypes in mouse knockouts 47,48 .

492
Genetic appraisal of risk factors across the heart failure spectrum 493 Next, we sought to explore evidence for causal relationships between risk phenotypes 494 associated with HF in the pleiotropy analysis, using genetic correlation and Mendelian is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023.  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023.  Second, our findings highlight the importance of cardiac contractility traits in HF. Non-572 ischaemic HFrEF was the most heritable subtype (h 2 SNP = 11.8% ± 2.6%) and was 573 highly correlated with DCM (rg = 0.57 ± 0.12), as expected. We demonstrate opposing 574 associations between contractility-related traits and ni-HFrEF and ni-HFpEF risk: 575 higher baseline contractility decreased the risk of HFrEF, but likely increased the risk 576 of HFpEF. Our findings do not provide positive evidence to support the widely held 577 hypothesis that diastolic function is the major cause of HFpEF. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023.10.01.23296379 doi: medRxiv preprint Third, our heritability enrichment analysis suggests that kidney, vascular, and 579 metabolic tissues play a central role in the aetiology of HFpEF. The kidney is the 580 primary organ for managing body fluid, and renal impairment is common in heart 581 failure patients across the phenotypic spectrum. CKD, however, co-occurs more 582 frequently with HFpEF than HF with mid-range or reduced ejection fraction 56    In summary, by harnessing the de-confounding qualities of human germline variation, 600 we map the aetiology of the heart failure spectrum, from organs to molecules. Our is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023.   is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023.  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023.  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023.10.01.23296379 doi: medRxiv preprint Cross-ancestry allelic effect heterogeneity assessment 673 To account for a possible bias due to heterogeneity of allelic effect across ancestries, 674 we performed a sensitivity analysis using a meta-regression technique to model   is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023.  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023.10.01.23296379 doi: medRxiv preprint Genetic architecture assessment 718 To assess genetic architecture and polygenicity across HF phenotypes, we compared 719 quantiles of the expected and observed genome-wide genetic association P-values 720 using quantile-quantile (QQ plot) and calculated genomic control coefficient ( !" ). An  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023.  Genes with highest PoPS, highest V2G score, or lowest TWAS P value within a locus 803 were considered as candidate effector genes for HF. In step 2, we prioritised these is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023.   is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023.  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023.10.01.23296379 doi: medRxiv preprint Single-nucleus differential gene expression in failing vs. non-failing heart 856 To assess the extent to which transcriptional pattern of risk genes for HF changes in 857 failing heart, we performed single-nucleus differential gene expression analysis of Benjamini-Hochberg correction was applied for multiple testing correction. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  Ontology (GO) 95,96 . For presentation (Figure 3a), we excluded terms with more than 894 2,000 genes (representing ~10% of protein coding genes in the human genome) and 895 included terms with adjusted P < 0.05 following g:Profiler correction for multiple testing.  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. and then mapped to the node size so that a more influential node appears larger.

933
Further, we performed a data-driven community detection analysis on the graph using  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  Figure 15), effect estimates were converted to Z-scores derived from the regression  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023.10.01.23296379 doi: medRxiv preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023.10.01.23296379 doi: medRxiv preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023.10.01.23296379 doi: medRxiv preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023.10.01.23296379 doi: medRxiv preprint Figure 6. Bivariate genetic correlation (rg) and Mendelian randomisation (MR) estimates across 24 traits and 4 HF phenotypes. Asterisks (*) indicate binary traits. MR effects estimates are reported as odds ratio (ORMR ) per doubling prevalence for binary traits; or per standard deviation increase for quantitative traits. Estimates which were robust to multiple testing adjustment and sensitivity analyses were indicated by light blue shade (for rg < 0 and ORMR < 1) or light red shade (for rg > 0 and ORMR > 1). The heat maps represent P-values for different MR models, colour-coded with direction of MR estimates and strength of associations.
. CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023.10.01.23296379 doi: medRxiv preprint Data availability GWAS summary statistics from the meta-analysis will be made available on the Cardiovascular Disease Knowledge Portal (https://cvd.hugeamp.org/) and GWAS Catalog (https://www.ebi.ac.uk/gwas/summary-statistics) upon publication.

Code availability
A sample code to define heart failure phenotypes in UK Biobank is available on: https://github.com/ihi-comp-med/ukb-hf-phenotyping. Other codes to perform key analyses presented in this work will be made available on https://github.com/ihi-comp-med/hermes2gwas . CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023.  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted October 3, 2023. ; https://doi.org/10.1101/2023.10.01.23296379 doi: medRxiv preprint