Abstract
Background Drugs targeting triglyceride (TG)-associated genes have the potential to improve cardiovascular outcomes for patients with elevated TG levels. However, we know little regarding the potential additional benefits or deleterious effects of such targeting, particularly among individuals of African ancestry (AA). Mendelian randomization and PheWAS approaches offer the opportunity to examine such primary and secondary effects.
Methods We examined 12 variants reported previously in Mendelian randomization studies from 5 genes that have been identified as TG-lowering targets (APOA5, LPL, APOC3, ANGPTL3, and ANGPTL4); for those variants associated with measured TG levels, we tested selected phenotypes, including lipid, cardiovascular, and other potential effects reported in previous studies, using PheWAS in separate cohorts of European ancestry (EA) patients and AA patients in BioVU. We also tested unspecified other phenotypes (i.e., without previously reported associations with TGs) for additional effects. We then replicated results in All of Us (AoU). As a secondary analysis, we tested the genetically predicted expression of these TG-lowering target genes for their association with the selected phenotypes in EA BioVU patients.
Results Among BioVU EA patients (n=63,094), 11 previously reported SNPs were associated with measured TGs; of these, 9 SNPs were associated with lipid and cardiovascular phenotypes. Results were largely consistent in AoU EA participants (n=97,532). Among AA patients in BioVU (n=12,515) and AoU (n=31,710), results were more limited; only 6 of the 12 reported SNPs were associated with measured TGs in BioVU AA patients. While 4 of these validated 6 SNPs were associated with a lipid or cardiovascular phenotype in either BioVU or AoU, none were consistent across both cohorts. Additionally, we detected few secondary effects in either EA or AA BioVU patients, and none were replicated. In the secondary analysis assessing predicted gene expression, results were largely consistent with the primary analysis for EA BioVU patients.
Conclusions These results suggest that beyond cardiovascular benefits there may be limited additional benefits, but few deleterious effects, from targeting known TG-associated genes for individuals of EA. However, we found limited information supporting the efficacy or safety of these targets for mitigating cardiovascular risk among AA individuals.
BACKGROUND
Elevated triglyceride (TG) levels are associated with an increased risk of coronary heart disease (CHD), the leading cause of death globally.1,2 In the United States, 31% of adults have triglyceride (TG) levels ≥150 mg/dL (i.e., mildly elevated and above),2 with studies demonstrating that the risks for CHD increase 14% for men and 37% for women for each mmol (88 mg/dL) rise in TG levels.3 Indeed, targeting TG can have benefits for reducing CHD risk, even among patients already receiving standard lipid-lowering treatments (e.g., statins, fibrates, omega 3 fatty acids, and niacin).4–6
As such, identifying effective therapeutic targets associated with TG levels is an important goal for the reduction of CHD risk; however, successful targeting requires not only a decrease in TG levels, but also an associated secondary effect of improved cardiovascular outcomes. The degree to which and the mechanisms whereby TGs are associated with CHD has been the subject of debate, particularly given the potential for confounding.2 One relatively recent methodological approach has offered a means to identify the causal role of TGs in CHD more accurately—namely, Mendelian randomization (MR). MR mirrors clinical trials, but rather than random assignment, randomization is achieved through the arbitrary assignment of gene variants passed from parents to offspring; beneficially, this approach offers the ability to assess causality between exposures and outcomes while limiting unmeasured confounding, since genetic information is fixed at conception.7–12 Indeed, applying such an approach, researchers have used functional variations in genes with robust effects on TG levels (e.g., LPL, APOA5, and APOC3) to establish the causal relationship between TGs and CHD.11,13–18 These insights have spurred new drug development, targeting specific TG-associated genes for more effective CHD prevention.19 In particular, current efforts have focused on increasing lipoprotein lipase (LPL) activity; both the LPL gene and genes in the LPL pathways (e.g., ANGPTL3, ANGPTL4, and APOC3) have been considered as potential drug targets.13,20–26
In addition to establishing these primary causal relationships, MR approaches can be used to identify other secondary effects of targeting TG-associated genes, including adverse effects. During the approval process, clinical trials are designed to test not just the efficacy of new treatments, but also to identify potential side effects (or lack thereof). However, not all phenotypes (effects) can be tested in clinical trials given their relatively limited power and the short periods of follow-up that can obscure long-term effects; these effects typically are only later detected by case reports and retrospective cohort studies of patients that have experienced those effects (e.g., the discovery of cardiovascular risk associated with rofecoxib27,28). MR has been deployed to detect these types of long-term effects. For example, statins are among the most extensively studied drugs in history and millions of patients take them; however, their increased risk of diabetes was only identified after statins had been on the market for 21 years, and was later confirmed by MR.8 Beneficially, MR also offers the advantage of pre-emptively detecting potential long-term effects of such therapies, particularly when deployed in conjunction with a phenome wide association study (PheWAS) approach.29 As such, MR presents researchers with the opportunity to identify other potential beneficial or deleterious effects of TG-lowering treatments under development.30 Given that TG-associated genes (particularly LPL) are implicated in numerous pathological conditions (e.g., atherosclerosis, diabetes, pancreatitis, and Alzheimer’s disease and related dementias [ADRD]),31–34 targeting such genes with TG-lowering treatments may have additional effects on these and other conditions.
Despite this promise, studies of non-CHD effects associated with either TG levels or candidate drugs for TG-lowering have remained limited and inconclusive.35 In particular, the research to date has focused almost exclusively on patients of European ancestry (EA); this narrow emphasis has potentially hindered our ability to detect effective TG-related targets and predict associated secondary effects, given that lipid profiles can vary significantly by ancestry. Notably, average triglyceride levels are lower in populations of African ancestry (AA) than those of EA,36 including the lower proportion of individuals with very high TG levels.36–42 Although genetic studies on inter-individual TG variability have begun to incorporate ancestries beyond those of EA patients,42–44 relatively little is known about the relationship between TG-associated genes and secondary effects in individuals of AA. Accordingly, it is important to determine whether associations validated among patients of EA also apply to patients of AA as well determining whether there are ancestry-specific secondary effects (either beneficial or deleterious).
As such, to better define potential benefits and risks for TG-lowering medications currently in development, we deployed MR with PheWAS to assess the potential pleiotropic effects of genetic predictors of TG levels in ancestry-specific cohorts of EA and AA patients at Vanderbilt University Medical Center (VUMC), with replication among All of Us (AoU) participants. First, we validated associations between 12 genetic variants reported in previous MR studies from 5 genes identified as TG-lowering targets (i.e., APOA5, LPL, APOC3, ANGPTL3, and ANGPTL4)45,46 and measured TG levels.
Then, among the validated variants, we conducted PheWAS to assess whether these variants were associated with selected prespecified 1) lipid phenotypes, 2) cardiovascular phenotypes, and 3) other phenotypes previously associated with TGs (including diabetes, pancreatitis, and ADRD), as well as 4) unspecified other phenotypes (i.e., without previously reported associations with TGs). As a secondary analysis in EA patients, we tested the associations between the prespecified phenotypes and the predicted expression of the 5 TG-associated genes.
METHODS
The primary analysis was conducted in cohorts of EA and AA patients from BioVU, a biobank at VUMC. BioVU offers genome-wide genotyping data for ∼90,000 patients on the Illumina Infinium Expanded Multi-Ethnic Genotyping Array plus custom content platform (MEGA), which are linked to de-identified copies of the patients’ electronic health records (EHRs). These EHRs include demographics, clinical notes, lab results, medications, and diagnostic and procedure codes (e.g., International Classification of Diseases, Ninth Revision, Clinical Modification [ICD-9-CM] codes and Tenth Revision [ICD-10-CM] codes).47 Replication analysis was conducted among comparable cohorts of EA and AA participants in AoU (Supplementary Methods §1). The study was approved by the VUMC Institutional Review Board (IRB# 220725).
Cohort Eligibility
Primary cohort eligibility criteria included BioVU patients with available MEGA genotyping data that passed standard quality controls (see below; Supplementary Methods §2), and at least 2 ICD codes recorded in their EHR through study end date (January 11, 2023). We restricted the study to patients with genetically-determined predominantly EA or AA).48 Although socially identified EHR-reported race is largely consistent with genetic ancestry in BioVU,49–51 to minimize the conflation of genetic and social factors, we further restricted the cohorts to patients with consistent EHR-reported race (i.e., White race for EA patients and Black race for AA patients); individuals with more than one reported race were excluded. We further included adjustment for three principal components (PCs) to help account for intra-ancestral variability.
Clinical characteristics
Patient characteristics were extracted from the EHR, including age (at most recent EHR visit), sex, length of EHR, lipid levels (i.e., TG, HDL-C, and LDL-C), and whether patients were ever prescribed lipid-lowering medications (statins, niacin, or fibrates; Supplementary Table 1); median lipid values were calculated using all measurements after quality control (excluding those that were biologically implausible, e.g. negative values). We adjusted median TG levels for use of lipid-lowering medications for each individual.42,52
Genotyping and Imputation
Genotyping was performed on the MEGA platform with quality control as performed previously (Supplementary Methods §2).53–55 Genetically-determined ancestry and PCs for ancestry were determined following previous reports.48
Genetic variants in TG gene targets
Primary analysis
We assessed 12 variants from 5 genes that have been identified previously by MR as TG-lowering targets: APOA5, LPL, APOC3, ANGPTL3, and ANGPTL4 (Supplementary Table 2).45,46 To validate the suitability of these candidates for our EA and AA cohorts, we compared minor allele frequencies (MAFs) to patients of EA and AA, respectively, in the 1000 Genome project.56 For SNPs with consistent MAFs, we then assessed whether the genotypes were associated with adjusted measured TG levels using linear associations adjusted for age and sex in the EA and AA BioVU cohorts; further analyses testing associations with phenotypes (see below) were restricted to these variants significantly associated with measured TG levels (p<0.05).
Predicted gene expression
We conducted a secondary analysis testing the association between the prespecified phenotypes (see below) and the predicted gene expression of the 5 TG-lowering genes among EA patients (Supplementary Methods §3).
Phenotypes
Phecodes
We established the presence or absence of clinical phenotypes using phecodes, a system based on clinical diagnosis codes (i.e., ICD-9-CM and ICD-10-CM).57,58 Cases were individuals with 2 or more occurrences of the phecode; controls had zero occurrences; and individuals with 1 mention of the code or related codes were excluded from the analysis to limit misclassification. We required ≥50 cases to include a phecode for analysis.
Phenotype categories
We separated the qualifying phecodes (i.e., ≥50 cases) into 4 categories for analysis in both cohorts, based on previous evidence of association with measured TGs or genetically-determined TG levels (Supplementary Table 3): (1) 4 prespecified lipid phenotypes [i.e., hyperlipidemia, hypercholesterolemia, hyperglyceridemia, and mixed hyperlipidemia], to confirm the efficacy of the target; (2) 4 prespecified cardiovascular phenotypes [i.e., essential hypertension, angina pectoris, coronary atherosclerosis, and other chronic ischemic heart disease, unspecified], to confirm the desired secondary cardiovascular benefits; (3) 13 prespecified other phenotypes previously associated with TGs [i.e., type 2 diabetes, gout, dementias, Alzheimer’s disease, vascular dementia, delirium due to conditions classified elsewhere, headache, migraine, dizziness, diseases of pancreas, acute pancreatitis, chronic pancreatitis, and urinary tract infection], to determine if reduced lipid or cardiovascular risk could coincide with elevated risk for known potential secondary outcomes; and (4) all remaining qualifying phecodes in a global PheWAS, to assess whether there are unknown outcomes associated with targeting TG-lowering genes.
Statistical Analysis
For the EA and AA cohorts in BioVU and AoU, we used logistic regression to test the associations between phecodes and previously reported variants from TG-lowering gene targets, with adjustment for age at most recent clinical visit, sex, length of EHR, and 3 PCs for ancestry. In BioVU, for associations with the prespecified clinical phenotypes (i.e., categories 1-3; n=21), we considered p<0.05 as significant, and for the remaining phenotypes in BioVU we considered p<3.13×10−5 (0.05/1600 unspecified phecodes tested) and p<4.69×10−5 (0.05/1065 unspecified phecodes tested) as significant in EA and AA patients, respectively. For the AoU replication cohorts, p<0.05 was considered significant. In the secondary analysis, we used the Generalized Berk-Jones (GBJ) test59 to summarize associations across different tissues, accounting for potential expression correlation;60,61 p<0.05 was considered significant for these tests. We present categorical variables as numbers and percentages and continuous variables as means and standard deviation (SD). All analyses were conducted using R version 4.1.0 and PLINK version 2.0.
RESULTS
Cohort characteristics
The EA cohort from BioVU included 63,094 individuals; 35,320 (56.0%) were female, the mean age at the most recent visit was 54.4 ± 21.8, and the mean length of EHR was 12.0 ± 7.9 years (Table 1). The AA cohort from BioVU included 12,515 individuals; 7,737 (61.8%) were female, the mean age was 45.2 ± 20.5, and the mean length of EHR was 11.8 ± 8.7 years (Table 1). The adjusted TG levels for the EA and AA cohorts were 176.5 ± 118.3 mg/dL and 125.5 ± 88.9 mg/dL, respectively. Characteristics for the replication EA (n=97,532) and AA (n=31,710) cohorts in AoU are available in Supplementary Table 4.
European ancestry
The MAF of all 12 previously reported variants from TG-associated genes in the BioVU cohort of EA patients were consistent with ancestral population-wide MAF available from public databases (Supplementary Table 2). We then tested their association with adjusted median measured TGs to validate their use as instrumental variables. Among these 12 variants, 11 variants were significantly associated with measured TGs in the EA cohort (Supplementary Table 5); there was no minor allele carriage for the remaining SNP (rs147210663) to test. We restricted subsequent analyses in the BioVU EA cohort and replication AoU cohort to those significantly associated 11 SNPs.
Association between validated variants and clinical phenotypes
For the primary analysis, we first examined potential associations between the 11 validated variants and 21 prespecified clinical phenotypes. We found 47 significant associations in BioVU EA patients (Figure 1; Supplementary Table 6a). The most significant associations were lipid and cardiovascular phenotypes; 10 of 11 variants were significantly associated with at least one of these phenotypes in the expected direction. Among the potential prespecified secondary effects, we identified associations between the variants and altered risk of urinary tract infection (rs3135506, p=0.019), gout (rs1801177, p=0.026), disease of the pancreas (rs1801177, p=0.038), chronic pancreatitis (rs1801177, p=0.025), and acute pancreatitis (rs268, p=0.029; Figure 1). For the unspecified phenotypes in the global PheWAS (n=1600), there were additional associations with cataplexy and narcolepsy (rs118204057, p=3.42×10−6), glaucoma (rs138326449, p=1.5×10−5), atherosclerosis of the renal artery (rs1801177, p=1.29×10−5), and complication of colostomy or enterostomy (rs1801177, p=1.12×10−5; Supplementary Table 7a; Supplementary File 2).
○ = replicated in All of Us. blue = significant decreased risk; lower p-values represented by darker hues. red = significant increased risk; lower p-values represented by darker hues.
Replication
In the AoU EA cohort, the validated variants (n=11) had MAF that were consistent with ancestral population wide MAF (Supplementary Table 8). For the 21 prespecified phenotypes, we replicated 27 associations (p<0.05) with these variants (Figure 1; Supplementary Table 9a). Consistent with the BioVU cohort of EA patients, the AoU cohort had significant associations between prespecified lipid phenotypes and genetic variants (i.e., rs268, rs328, rs619054, rs651821, rs3135506, rs11207977, rs115849089, and rs116843064). Additionally, we replicated associations with prespecified cardiovascular phenotypes, including angina pectoris (rs328 and rs115849089), coronary atherosclerosis (rs115849089), and other chronic ischemic heart disease, unspecified (rs115849089). No other prespecified or unspecified phenotypes identified in the BioVU EA cohort were replicated in the AoU EA cohort.
Predicted gene expression
In the secondary analysis, we found that 4 TG-associated genes had sufficient tissue samples to conduct cross-tissue analysis (APOA5 only had a single qualifying tissue). Of these, both LPL and ANGLPTL3 yielded significant associations with prespecified lipid phenotypes; LPL also had significant associations with prespecified cardiovascular phenotypes (Table 2; Supplementary Table 10). Among the prespecified other phenotypes, ANGLPTL3 had significant associations with gout and migraine, while APOC3 had a significant association with dementias.
African ancestry
The MAF of all 12 previously reported variants from candidate TG genes in the BioVU cohorts of AA patients were consistent with ancestral population-wide MAF available from public databases (Supplementary Table 2). However, only 6 of these variants were significantly associated with measured TGs; 5 variants were not associated, and there was no minor allele carriage for the remaining SNP (Supplementary Table 5).
Association between validated variants and clinical phenotypes
In the AA BioVU cohort, there were fewer significant associations among known TG-associated variants. We identified 2 significant associations between the 6 validated variants and the prespecified lipid or cardiovascular clinical phenotypes: mixed hyperlipidemia (rs11207977, p=4.04×10−3) and angina pectoris (rs1801177, p=0.048; Figure 2; Supplementary Table 6b). Additionally, among prespecified other phenotypes, we found associations with headache (rs11207977, p=0.041; and rs651821, p=0.031), dizziness (rs11207977, p=0.023), and Alzheimer’s disease (rs651821, p=0.029). For the 1065 unspecified phenotypes in the global PheWAS, there were significant associations with heart valve replacement (rs651821, p=1.04×10−5), hemorrhagic disorder due to intrinsic circulating anticoagulants (rs651821, 4.70×10−6), decreased libido (rs1801177, p=2.20×10−6), and stricture of artery (rs147210663, p=2.00×10−5)(Figure 2; Supplementary Tables 7b and 9b; Supplementary File 2).
○ = replicated in All of Us. blue = significant decreased risk; lower p-values represented by darker hues. red = significant increased risk; lower p-values represented by darker hues.
DISCUSSION
Major findings
To the best of our knowledge, this is the first systematic application of MR and PheWAS approaches, as deployed previously to interrogate drug efficacy and adverse effects7,57,58,62,63 to assess both deleterious and beneficial effects of targeting TG-associated genes; we assessed effects in both EA and AA populations using two extensive EHR databases: BioVU and AoU. We found that for EA individuals, the lipid and cardiovascular benefits were largely confirmed, while we detected few secondary effects (either previously identified or unknown) that might pose increased risk by targeting these TG-associated genes. Among AA individuals, however, we found only limited evidence for lipid and cardiovascular benefits; as with EA individuals, there were few secondary effects associated with these genes.
Benefits of targeting TG-associated genes for hyperlipidemia and cardiovascular phenotypes
Our results support current efforts to target TG-associated genes65,66 for the prevention and treatment of heart disease, particularly among EA individuals. Indeed, 10 variants were significantly associated with hyperlipidemia in the expected direction in the BioVU cohort, and all of these had at least one hyperlipidemia phenotype validated in the AoU cohort. Moreover, 7 of the validated variants were also associated with heart disease-related phenotypes in at least one cohort in the expected direction; 5 variants had significant results in both cohorts (although the phenotypes did not replicate in all instances). The results were relatively consistent for LPL, the primary target for drug development among TG-associated genes, in the secondary analysis using predicted gene expression. In contrast, for AA individuals, the associations between genetic variants and hyperlipidemia-related phenotypes were more limited, consistent with previous findings.42,64 Only 6 out of 12 previously reported variants showed significant associations with adjusted measured TG levels; of these, only 3 variants were significantly associated with hyperlipidemia (phecode 272.*) in either the BioVU or AoU cohort, none replicated, and none of these SNPs were also associated with other cardiovascular phenotypes. Given that AA individuals tend to have lower average TG levels and lower frequencies of severe or moderate hypertriglyceridemia compared to EA individuals,42 it remains unclear whether the same variants have comparable effect sizes on the cardiovascular outcomes in AA and EA populations.
Potential secondary effects from targeting TG-associated genes
Along with the consideration of efficacy for cardiovascular benefits, it was also important to consider the other potential long-term benefits and risks of these drugs, particularly because there has been little evidence regarding the impact of TG-lowering therapies on non-cardiovascular phenotypes associated with TG candidate genes and most studies have focused on EA populations.
Pancreatitis, diabetes, and dementia/Alzheimer’s disease were among the most serious conditions previously associated with TGs and TG-related genes. For EA individuals, our analysis of pancreatitis phenotypes yielded results consistent with previous findings, indicating increased risk of pancreatitis with increased risk of high triglycerides.45,67–72 However, results regarding potential benefits for diabetes prevention were not conclusive, despite previous research indicating this possibility.34,73–77 Likewise, despite prior findings indicating potential linkages between TG-associated genes and ADRD, we found limited evidence of robust associations (i.e., a secondary analysis that found an association between dementias and APOC3 in EA patients) which supports ideas that TG-lowering treatment could reduce the risk of dementia or Alzheimer’s diseases.78–82 The lack of consistent results parallel research assessing the relationship between LDL-C lowering treatments (i.e., statins) and dementias; indeed, the association between lipid management and the benefit for Alzheimer’s disease remains controversial.83–88 We found no consistent evidence to support these potential secondary beneficial effects among AA individuals.
Strengths and limitations
Our study featured several notable strengths. First, the combined use of MR and PheWAS enabled an unbiased investigation of over 1,000 clinical phenotypes. Second, leveraging longitudinal EHR data from BioVU and AoU allowed us to evaluate long-term effects in both EA and AA individuals. This is particularly significant as the AA population has historically been underrepresented in genetic and pharmacogenetic research. Moreover, the use of two large cohorts offered more robust results. Finally, by focusing on previously reported variants, our study contributed to a more nuanced understanding of the long-term effects—both beneficial and deleterious—of TG-lowering medications.
However, there were limitations. First, despite conducting analyses in two large biobanks, the AA population remained relatively small, which limits the statistical power compared to EA individuals, especially for low-frequency variants. Future validation using larger, multi-ancestry cohorts is needed. As AoU continues to recruit participants, this analysis may be more informed with an updated dataset. Second, this study focused exclusively on previously reported variants within candidate TG-lowering targets. There may be other unidentified functional variants whose impact on the association results remains unknown, particularly in the AA population. Given the historical underrepresentation of the AA population in genetic studies, it is possible that some important ancestry-specific TG-related variants remain unidentified. With the expansion of multi-ancestry biobanks and whole-genome sequencing data, future studies could aim to include more diverse ancestry groups and investigate low-frequency genetic variants. Similarly, larger populations of elderly patients could more readily reveal associations between these TG-related variants and certain age-related outcomes, including Alzheimer’s and diabetes. As the AoU cohort ages, we anticipate the ability to more effectively conduct such research with more available long-term outcomes. Fourth, additional factors (e.g., comorbidities and social determinants of health) could affect these outcomes; moreover, these factors could help explain why lipid-lowering effects do not correspond automatically to reduced cardiovascular risk. Fifth, we were unable to complete analysis of APOA5 in the secondary analysis given the limited availability of tissues, nor were we able to complete secondary analysis among patients of AA. Finally, detecting protective effects can be more challenging than demonstrating adverse effects related to known risk alleles (e.g., lowered diabetes risk). It is possible that there will be protective benefits from drugs targeting TG-lowering genes, but we do not yet have sufficient power to detect these effects.
CONCLUSION
By leveraging two large EHR-based biobanks (BioVU and AoU), we applied genetic approaches to evaluate the associations between clinical phenotypes and previously reported TG variants in both EA and AA populations. We confirmed the TG-lowering effect for most of the candidate genes in EA individuals, which suggest a beneficial effect for preventing cardiovascular disease. Our results also suggest the potential for a few additional beneficial effects, with few deleterious effects, for EA individuals using TG-lowering treatments; however, these results do not provide comparable evidence to assess the efficacy and safety of these treatments for AA individuals. As datasets continue to grow in breadth (e.g., whole genome sequencing and survey data regarding social drivers of health) and length of follow-up, testing in populations other than those of EA is a priority.
Data Availability
All data produced in the present study are available upon reasonable request to the authors.
DECLARATIONS
Ethics approval and consent to participate
The study was approved by the VUMC Institutional Review Board (IRB# 220725).
Consent for publication
Not applicable.
Availability of data and materials
BioVU data and code
Statistical code: Available at https://github.com/FengLabVUMC/Triglyceride-PheWAS. Data set: Full summary statistics for validated variants and qualifying phenotypes are available in Supplementary File 2. For individual level data, VUMC EHR data are de-identified using Safe-harbor methods. BioVU genomic data are linked to de-identified records and are further protected by BioVU data use agreements ensuring that researchers will not attempt re-identification. BioVU participant consenting procedures limit access to individual level data. Limited primary cohort data are available by request to Dr. Feng (e-mail, Qiping.feng@vumc.org), pending BioVU approval and a data use agreement.
All of Us data and code
In line with the privacy standards set by the All of Us Research Program, data and code used for these cohorts are available to approved researchers who register for access to the Researcher Workbench platform at https://workbench.researchallofus.org/login. This analysis was run on the All of Us Research Program Controlled Tier Dataset version 7, production release C2022Q4R9.
Competing interests
The authors declared no competing interests for this work.
Funding
R01HL163854 (Q.F.), R01GM120523 (Q.F.), R01HL133786 (W.Q.W.), R01HL171809 (Q.F, W.Q.W) and Vanderbilt Faculty Research Scholar Fund (Q.F.).
The primary dataset(s) were obtained from Vanderbilt University Medical Center’s BioVU, which is supported by institutional funding, 1S10RR025141-01, and CTSA grants UL1TR002243, UL1TR000445, and UL1RR024975. Additional funding provided by the NIH through grants P50GM115305 and U19HL065962. The authors wish to acknowledge the expert technical support of the VANTAGE and VANGARD core facilities, supported in part by the Vanderbilt-Ingram Cancer Center (P30 CA068485) and Vanderbilt Vision Center (P30 EY08126).
The validation dataset(s) were obtained form the All of Us Research Program. This program would not be possible without the partnership of its participants. Additionally, the All of Us Research Program is supported by the National Institutes of Health, Office of the Director: Regional Medical Centers: 1 OT2 OD026549; 1 OT2 OD026554; 1 OT2 OD026557; 1 OT2 OD026556; 1 OT2 OD026550; 1 OT2 OD 026552; 1 OT2 OD026553; 1 OT2 OD026548; 1 OT2 OD026551; 1 OT2 OD026555; IAA #: AOD 16037; Federally Qualified Health Centers: HHSN 263201600085U; Data and Research Center: 5 U2C OD023196; Biobank: 1 U24 OD023121; The Participant Center: U24 OD023176; Participant Technology Systems Center: 1 U24 OD023163; Communications and Engagement: 3 OT2 OD023205; 3 OT2 OD023206; and Community Partners: 1 OT2 OD025277; 3 OT2 OD025315; 1 OT2 OD025337; 1 OT2 OD025276.
Author contributions
QF, CMS, WQW, and NJC conceived and designed the research; EO, YX, LJ, GK, SK, XZ, and QF acquired, analyzed, and interpreted data; QF, ALD, and CMS drafted the manuscript; all authors have approved the submitted version of this manuscript.
Acknowledgements
The first and corresponding authors had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis.
Footnotes
Role of the Funder/Sponsor: The funders had no role in design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.
List of abbreviations
- AA
- African ancestry
- ADRD
- Alzheimer’s disease and related dementias
- AoU
- All of Us Research Program
- CHD
- coronary heart disease
- EA
- European ancestry
- EHR
- electronic health record
- GBJ
- generalized Berk-Jones
- HDL-C
- high-density lipoprotein cholesterol
- ICD-9-CM
- International Classification of Diseases, Ninth Revision, Clinical Modification
- ICD-10-CM
- International Classification of Diseases, Tenth Revision, Clinical Modification
- IRB
- Institutional Review Board
- LDL-C
- low density lipoprotein cholesterol
- MAF
- minor allele frequency
- MEGA
- Illumina Infinium Expanded Multi-Ethnic Genotyping Array
- MR
- Mendelian randomization
- PC
- principal component
- PheWAS
- phenome wide association study
- SD
- standard deviation
- TG
- triglyceride
- VUMC
- Vanderbilt University Medical Center