Abstract
Genome-wide association studies (GWAS) have identified hundreds of genetic risk loci for coronary artery disease (CAD). However, non-European populations are underrepresented in GWAS and the causal gene-regulatory mechanisms of these risk loci during atherosclerosis remain unclear. We incorporated local ancestry and haplotype information to identify quantitative trait loci (QTL) for gene expression and splicing in coronary arteries obtained from 138 ancestrally diverse Americans. Of 2,132 eQTL-associated genes (eGenes), 47% were previously unreported in coronary arteries and 19% exhibited cell-type-specific expression. Colocalization analysis with GWAS identified subgroups of eGenes unique to CAD and blood pressure. Fine-mapping highlighted additional eGenes of interest, including TBX20 and IL5. Splicing (s)QTLs for 1,690 genes were also identified, among which TOR1AIP1 and ULK3 sQTLs demonstrated the importance of evaluating splicing events to accurately identify disease-relevant gene expression. Our work provides the first human coronary artery eQTL resource from a patient sample and exemplifies the necessity of diverse study populations and multi-omic approaches to characterize gene regulation in critical disease processes.
Competing Interest Statement
J.L.M.B. is a shareholder in Clinical Gene Network AB and has a vested interest in STARNET. J.C.K. is the recipient of an Agilent Thought Leader Award, which includes funding for research that is unrelated to the current manuscript. S.W.vdL. has received Roche funding for unrelated work. C.L.M. has received AstraZeneca funding for unrelated work. All other authors declare that they have no competing interests relevant to the contents of this paper.
Funding Statement
This work was supported by grants from the National Institutes of Health (grant numbers R01HL148239 and R01HL164577 to C.L.M.; T32HL007284 to C.J.H.; R01HL125863 to J.L.M.B.; and R01HL130423, R01HL135093, and R01HL148167 to J.C.K.); the American Heart Association (grant numbers 20POST35120545 to A.W.T.; AHA909150 to J.V.M.; and A14SFRN20840000 to J.L.M.B.); the Swedish Research Council and Heart Lung Foundation (grant numbers 2018-02529 and 20170265 to J.L.M.B.); the Fondation Leducq (grant number 18CVD02 to C.L.M. and J.L.M.B.); and the Single-Cell Data Insights award from the Chan Zuckerberg Initiative, LLC and Silicon Valley Community Foundation (to C.L.M.). This work was also supported by fellowship grants from the Bench to Bassinet Pediatric Cardiac Genomics Consortium (PCGC) & Cardiovascular Development Data Resource Center (CDDRC) (to C.J.H.).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The Institutional Review Boards (IRBs) of Stanford University (protocols #4237 and #11925) and the University of Virginia (protocol #20008) gave ethical approval for the procurement and use of human tissues and information.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Data Availability
All raw and processed bulk RNA-sequencing data will be made available on the Gene Expression Omnibus (GEO) database (accession number:). Low-pass whole-genome sequencing-based genotyping data are available on dbGaP (accession code phs002855.v1.p1). The full summary statistics for the mixQTL eQTL analyses, as well as for the local ancestry eQTL and sQTL analyses, are available at https://doi.org/10.5281/zenodo.7581778. The single-cell RNA-seq datasets from coronary and carotid artery were re-analyzed and integrated from the original datasets available through GEO (accession numbers: GSE131778 (Wirka, et al.), GSE155512 (Pan, et al.), and GSE159677 (Alsaigh, et al.)) and from https://doi.org/10.5281/zenodo.6032099 (Hu, et al.). The raw and processed single-nucleus ATAC-seq datasets are available through GEO (GSE175621 and GSE188422). The reprocessed and analyzed human scRNA-seq datasets are also available on PlaqView (https://plaqview.com). GTEx gene expression and eQTL data were obtained from the v8 portal website (https://gtexportal.org). STARNET gene expression, eQTL, and clinical trait enrichment data were obtained from dbGaP (accession number: phs001203.v2.p1) and are also available at http://starnet.mssm.edu. The HCASMC ATAC-seq and H3K27ac HiChIP data used to calculate ABC scores are available through GEO (accession numbers: GSE113348 and GSE101498). All custom scripts used to generate the results are available on GitHub (https://github.com/MillerLab-CPHG/CAD_QTL). Detailed parameters for published software tools are also included in the Methods.
Abbreviations
- General
  - FDR: False discovery rate
  - GWAS: Genome-wide association study (or studies)
  - HCASMC: Human coronary artery smooth muscle cells
  - LA: Local-ancestry adjusted QTL analysis
  - QTL: Quantitative trait locus
  - SMC: Smooth muscle cell
  - SNP: Single nucleotide polymorphism
- Diseases and traits
  - BP: Blood pressure
  - CAD: Coronary artery disease
  - DBP: Diastolic BP
  - HDL: High-density lipoproteins
  - LDL: Low-density lipoproteins
  - logTG: Log10 of triglycerides
  - MI: Myocardial infarction
  - PP: Pulse pressure
  - SBP: Systolic BP
  - TC: Total cholesterol
- Study-specific
  - GTEx: Genotype-Tissue Expression Project
  - STARNET: Stockholm-Tartu Atherosclerosis Reverse Network Engineering Task
  - AOR: Aorta
  - COR: Coronary artery
  - MAM: Mammary artery
  - TIB: Tibial artery
  - 1000G: 1000 Genomes Project
  - AFR, AMR, EAS, EUR, SAS: 1000G superpopulation abbreviations for participating individuals from populations in Africa, the Americas, East Asia, Europe, and South Asia, respectively.
- Gene names
  - AKT3: AKT Serine/Threonine Kinase 3
  - ARHGAP42: Rho GTPase Activating Protein 42 (protein AKA GRAF3)
  - HPSE2: Heparanase 2
  - IL5: Interleukin 5
  - LIPG: Lipase G, endothelial type
  - TARID: TCF21 Antisense RNA Inducing Promoter Demethylation
  - TBX20: T-Box 20
  - TCF21: Transcription factor 21
  - TOR1AIP1: Torsin 1A Interacting Protein 1
  - ULK3: Unc-51 Like Kinase 3
  - YY1AP1: YY1-associated protein 1