Abstract
Genome-wide association studies (GWAS) have linked hundreds of loci to cardiac diseases. However, in most loci the causal variants and their target genes remain unknown. We developed a combined experimental and analytical approach that integrates single cell epigenomics with GWAS to prioritize risk variants and genes. We profiled accessible chromatin in single cells obtained from human hearts and leveraged the data to study genetics of Atrial Fibrillation (AF), the most common cardiac arrhythmia. Enrichment analysis of AF risk variants using cell-type-resolved open chromatin regions (OCRs) implicated cardiomyocytes as the main mediator of AF risk. We then performed statistical fine-mapping, leveraging the information in OCRs, and identified putative causal variants in 122 AF-associated loci. Taking advantage of the fine-mapping results, our novel statistical procedure for gene discovery prioritized 46 high-confidence risk genes, highlighting transcription factors and signal transduction pathways important for heart development. In summary, our analysis provides a comprehensive map of AF risk variants and genes, and a general framework to integrate single-cell genomics with genetic studies of complex traits.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was funded by National Institutes of Health (NIH) grants, R01MH110531 and R01HG010773 (to X.H.), R01HL163523 (to X.H., S.P., and I.P.M.), and R21 AI144417-02 (to O.B). This project has been made possible in part by grant number CZF2019-002431 from the Chan Zuckerberg Initiative DAF, an advised fund of Silicon Valley Community Foundation.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Heart tissue samples were obtained from National Disease Research Interchange (NDRI) without identifying information. The work with these samples was determined to be Non-Human subject research and approved by the IRB committee of the University of Chicago (IRB19-1429).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
We performed a benchmarking study of our gene discovery tool, Mapgen. We provided additional background on the statistical fine-mapping technique in the Results section and also added a new section in Methods with details of how fine-mapping was performed in our study. We significantly extended the discussion of related work that aimed to identify risk variants and genes underlying genetics of atrial fibrillation. We now include a comparison of our data with published single-cell dataset in the heart, validating our cell type assignments. We significantly revised the introduction of the eQTL section in the Results part.
Data Availability
Our snRNA-seq and scATAC-seq data has been deposited in GEO repository: accession GSE224997.
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE224997