Abstract
Interpreting the functional significance of non-coding GWAS variants remains challenging. While co-localizing variants with cell-type specific cis-regulatory elements (CREs) has improved our understanding, many variants remain unassociated. In this study, we propose GEM-Finder (Genomic Element Mapping for Fine Discovery of Promoter-Linked Variants), a novel analytical framework that integrates transcriptomic, epigenomic (H3K27ac ChIP-seq), and chromatin interaction data. GEM-Finder utilizes long-range chromatin interactions to identify CREs that connect differentially expressed genes of specific cell types. When we apply GEM-Finder to endothelial differentiation, unlike conventional methods primarily focused on cell-type specific CREs, GEM-Finder identifies 7.6 times more disease/trait associations. Specifically, by integrating transcriptome, epigenome (particularly H3K27ac ChIP-seq), and long-range chromatin interactions during endothelial differentiation, we identified CREs linked to differentiation-specific genes. Our enrichment analyses revealed both shared and unique associations for 53 human diseases/traits. Notably, the majority of these (68%) exhibited unique associations in a differentiation-specific manner. Hematological traits and neuropsychiatric disorders were primarily linked to the final stage of endothelial differentiation, while several complex diseases, such as colorectal cancer (CRC), were unexpectedly associated with the late stage. Our findings underscore the importance of leveraging long-range chromatin interactions to accurately identify disease-associated CREs in the functional characterization of non-coding GWAS variants.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
The Ministry of Science and ICT supported this study through the National Research Foundation in the Republic of Korea (2022R1A5A1026413, 2023R1A2C3002773, RS-2023-00223069, RS-2023-00262527).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
All raw sequencing data used in this study have been obtained from the GEO repository (GSE201709).
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Footnotes
Enhance the clarity of the Material and Methods.
Data Availability
All raw sequencing data used in this study have been obtained from the GEO repository (GSE201709).