Clotting factor genes are associated with preeclampsia in high altitude pregnant women in the Peruvian Andes

Keyla M. Badillo Rivera, Maria A. Nieves-Colón, Karla Sandoval Mendoza, Vanessa Villanueva Dávalos, Luis E. Enriquez Lencinas, Jessica W. Chen, Elisa T. Zhang, Alexandra Sockell, Patricia Ortiz Tello, Gloria Malena Hurtado, Ramiro Condori Salas, Ricardo Cebrecos, José C. Manzaneda Choque, Franz P. Manzaneda Choque, Germán P. Yábar Pilco, Erin Rawls, Celeste Eng, Scott Huntsman, Esteban González Burchard, Giovanni Poletti, Carla Gallo, Carlos D. Bustamante, Julie C. Baker, Christopher R. Gignoux, Genevieve L. Wojcik, Andrés Moreno-Estrada
doi: https://doi.org/10.1101/2021.05.20.21257549
Keyla M. Badillo Rivera
1Department of Genetics, Stanford School of Medicine, Stanford, CA, United States, 94305
Maria A. Nieves-Colón
2Department of Anthropology, University of Minnesota Twin Cities, Minneapolis, MN, United States, 55455
3Laboratorio Nacional de Genómica para la Biodiversidad (UGA-LANGEBIO), CINVESTAV, Irapuato, GTO, Mexico, 36821
4School of Human Evolution and Social Change, Arizona State University, Tempe, AZ, United States, 85281
Karla Sandoval Mendoza
3Laboratorio Nacional de Genómica para la Biodiversidad (UGA-LANGEBIO), CINVESTAV, Irapuato, GTO, Mexico, 36821
Vanessa Villanueva Dávalos
5Departamento de Gineco-Obstetricia, Hospital Regional Manuel Nuñez Butrón, Puno, Peru, 21002
Luis E. Enriquez Lencinas
5Departamento de Gineco-Obstetricia, Hospital Regional Manuel Nuñez Butrón, Puno, Peru, 21002
Jessica W. Chen
1Department of Genetics, Stanford School of Medicine, Stanford, CA, United States, 94305
Elisa T. Zhang
1Department of Genetics, Stanford School of Medicine, Stanford, CA, United States, 94305
Alexandra Sockell
1Department of Genetics, Stanford School of Medicine, Stanford, CA, United States, 94305
Patricia Ortiz Tello
1Department of Genetics, Stanford School of Medicine, Stanford, CA, United States, 94305
Gloria Malena Hurtado
6Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, Peru, 15102
Ramiro Condori Salas
6Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, Peru, 15102
Ricardo Cebrecos
6Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, Peru, 15102
José C. Manzaneda Choque
7Universidad Nacional del Altiplano, Puno, Peru, 21002
Franz P. Manzaneda Choque
7Universidad Nacional del Altiplano, Puno, Peru, 21002
Germán P. Yábar Pilco
7Universidad Nacional del Altiplano, Puno, Peru, 21002
Erin Rawls
4School of Human Evolution and Social Change, Arizona State University, Tempe, AZ, United States, 85281
Celeste Eng
8Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, United States, 94143
Scott Huntsman
8Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, United States, 94143
Esteban González Burchard
8Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, United States, 94143
Giovanni Poletti
6Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, Peru, 15102
Carla Gallo
6Laboratorios de Investigación y Desarrollo, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, Peru, 15102
Carlos D. Bustamante
1Department of Genetics, Stanford School of Medicine, Stanford, CA, United States, 94305
11Department of Biomedical Data Science, Stanford School of Medicine, Stanford, CA, United States, 94305
Julie C. Baker
1Department of Genetics, Stanford School of Medicine, Stanford, CA, United States, 94305
Christopher R. Gignoux
9University of Colorado Anschutz Medical Campus, Aurora, CO, United States, 80045
Genevieve L. Wojcik
10Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, United States, 21205
Andrés Moreno-Estrada
3Laboratorio Nacional de Genómica para la Biodiversidad (UGA-LANGEBIO), CINVESTAV, Irapuato, GTO, Mexico, 36821
  • For correspondence: andres.moreno@civestav.mx

Abstract

Study question What is the genetic basis of preeclampsia in Andean families residing at high altitudes?

Summary answer A top candidate region associated with preeclampsia containing clotting factor genes PROZ, F7 and F10 was found on chromosome 13 of the fetal genome in affected Andean families.

What is known already Preeclampsia, a multi-organ complication of pregnancy, is a leading cause of maternal morbidity and mortality worldwide. Diagnosed by the onset of maternal hypertension and proteinuria after 20 weeks of gestation, this disorder is a common cause of preterm delivery and affects approximately 5-7% of global pregnancies. The heterogeneity of preeclampsia has posed a challenge in understanding its etiology and molecular basis. However, risk for the condition is known to increase in high altitude regions such as the Peruvian Andes.

Study design, size, duration To investigate the genetic basis of preeclampsia in a high-altitude resident population, we characterized genetic diversity in a cohort of Andean families (N=883) from Puno, Peru, a high-altitude city above 3,500 meters. Our study collected DNA samples and medical records from case-control trios and duos between 2011-2016, thus allowing for measurement of maternal, paternal, and fetal genetic factors influencing preeclampsia risk.

Participants/materials, setting, methods We generated high-density genotype data for 439,314 positions across the genome, determined ancestry patterns and mapped associations between genetic variants and preeclampsia phenotype. We also conducted fine mapping of potential causal variants in a subset of family participants and tested ProZ protein levels in post-partum maternal and cord blood plasma by ELISA.

Main results and the role of chance A transmission disequilibrium test (TDT) revealed variants near genes of biological importance in pregnancy physiology for placental and blood vessel function. The most significant SNP in this cluster, rs5960 (p<6×10⁻⁶), is a synonymous variant in the clotting factor gene F10. Two other members of the coagulation cascade, F7 and PROZ, are also in the top associated region. However, we detected no difference in PROZ levels in maternal or umbilical cord plasma.

Limitations, reasons for caution Our genome-wide association analysis (GWAS) was limited by a small sample size and a lack of functional follow-up. Our ELISA was limited to post-natal blood sampling (only samples collected immediately after birth). Nevertheless, despite the small sample size, our family-based GWAS design permits identification of novel significant and suggestive associations with preeclampsia. Further longitudinal studies could analyze clotting factor levels and activity in other pregnant cohorts in Peru to assess the impact of thrombosis on preeclampsia risk among Andean highlanders.

Wider implications of the findings These findings support previous evidence suggesting that coagulation plays an important role in the pathology of preeclampsia and potentially underlies susceptibility to other pregnancy disorders exacerbated at high altitudes. This discovery of a novel association related to a functional pathway relevant to pregnancy biology in an understudied population of Native American origin demonstrates the increased power of family-based study design and underscores the importance of conducting genetic research in diverse populations.

Study funding/competing interest(s) This work was supported in part by the National Science Foundation (NSF) Graduate Research Fellowship Program Grant No. DGE–1147470 awarded to K.M.B.R. (fellow no. 2014187481); NSF SBE Postdoctoral Research Fellowship Award No. 1711982 awarded to M.N.C.; an A.P. Giannini Foundation postdoctoral fellowship, a Stanford Child Health Research Institute postdoctoral award, and a Stanford Dean’s Postdoctoral Fellowship awarded to E.T.Z.; the Chan Zuckerberg Biohub Investigator Award to C.D.B.; a Burroughs Wellcome Prematurity Initiative Award to J.C.B.; the George Rosenkranz Prize for Health Care Research in Developing Countries, and the International Center for Genetic Engineering and Biotechnology (ICGEB, Italy) grant CRP/ MEX15-04_EC, and Mexico’s CONACYT grant FONCICYT/50/2016, each awarded to A.M.E. Further funding was provided by the Sandler Family Foundation, the American Asthma Foundation, the RWJF Amos Medical Faculty Development Program, Harry Wm. and Diana V. Hind Distinguished Professor in Pharmaceutical Sciences II, National Institutes of Health, National Heart, Lung, and Blood Institute Awards R01HL117004, R01HL128439, R01HL135156, R01HL141992, National Institute of Health and Environmental Health Sciences Awards R01ES015794, R21ES24844, the National Institute on Minority Health and Health Disparities Awards R01MD010443, and R56MD013312, and the National Human Genome Research Institute Award U01HG009080, each awarded to E.G.B. Author J.W.C. is currently a full-time employee at Genentech, Inc. and holds stock in Roche Holding AG. Author E.G.B. reports grants from the National Institute of Health, Lung, Blood Institute, the National Institute of Health, General Medical Sciences, the National Institute on Minority Health and Health Disparities, the Tobacco-Related Disease Research Program, the Food and Drug Administration, and the Sandler Family Foundation, during the conduct of the study.

Trial registration number N/A

*for MESH terms see PubMed at http://www.ncbi.nlm.nih.gov/pubmed/

Introduction

Preeclampsia is a hypertensive disorder of pregnancy that is a leading cause of morbidity and mortality for mothers and infants worldwide. The disorder complicates 5-7% of global pregnancies, causes nearly 40% of all premature births, and is associated with 10-15% of all maternal deaths (Duley, 2009, Rana et al., 2019, Valenzuela et al., 2012). This morbidity is even higher in developing countries and among communities with limited access to healthcare (Osungbade and Ige, 2011). Despite posing a significant global disease burden, the heterogeneity of preeclampsia has posed a major challenge for understanding its etiology and genetic basis (Phipps et al., 2019, Valenzuela et al., 2012).

Clinical and pathological research suggests a major role for the placenta in preeclampsia, where shallow invasion of fetal cells into the maternal endometrium results in insufficient remodeling of the maternal vasculature (Yong et al., 2018). Although it originates in early placental development, preeclampsia is usually not detected until the second half of pregnancy (>20 weeks gestation), when it is identified by a sudden onset of hypertension and signs of organ damage, typically proteinuria (excess protein in the urine). The severity of preeclampsia is determined by gestational age at onset, as well as the magnitude of hypertension and organ damage (American College of Obstetricians and Gynecologists, 2013). The disorder is known to be heritable with multicomponent risk determined by maternal, fetal, and paternal factors (McGinnis et al., 2017, Pappa et al., 2011, Phipps et al., 2019, Valenzuela et al., 2012). Other risk factors include family history (Boyd et al., 2013, Cincotta and Brennecke, 1998), socioeconomic status (Silva et al., 2008) and chronic hypertension or diabetes (Rana, et al., 2019). Residence at high altitudes above 2,500 meters (m) also contributes considerably to risk of developing preeclampsia (Zamudio, 2007).

Residence at high altitudes increases the risk for preeclampsia and other hypertensive pregnancy disorders at least two to threefold (Moore et al., 2011). For example, Bolivian communities living at 3,500 m altitude have an incidence of preeclampsia of up to 20% (Keyes et al., 2003), about three times higher than the world average (Abalos et al., 2013). In neighboring Peru, preeclampsia complicates up to 22% of all pregnancies and is the second leading cause of maternal deaths (Gil Cipirán, 2017, Guevara Ríos and Meza Santibáñez, 2014). Due to this high incidence, highland pregnancy studies have been proposed as a natural experiment to elucidate genetic factors involved in preeclampsia and other hypertensive pregnancy complications (Moore et al., 1982, Moore et al., 2004, Palmer et al., 1999, Tissot van Patot et al., 2009, Zamudio, 2007). Native Andean populations are of particular interest for this research due to their unique physiological adaptations to chronic high-altitude hypoxia, such as enhanced pulmonary volumes and elevated blood hemoglobin concentrations (Bigham et al., 2013). Candidate genes involved in these adaptations include EGLN1, NOS2 and the hypoxia-inducible factor 1 (HIF1) pathway, among others (Beall, 2014, Bigham, et al., 2013).

Previous research has found that Highland Andean ancestry and long term, multi-generational residence at altitude are associated with lower rates of hypoxia induced pregnancy complications among high altitude resident women (Julian et al., 2009, Moore, et al., 2011, Moore, et al., 2004). Because preeclampsia risk increases with altitude (Palmer, et al., 1999), these findings suggest that Andeans with Native American ancestry may carry rare adaptive variants or a unique repertoire of genetic risk factors for preeclampsia—distinct from other populations previously studied (Michita et al., 2018). Characterizing fine-scale ancestry and genetic structure patterns in native Andeans may uncover preeclampsia relevant genetic variation found at higher frequencies due to selection for altitude adaptation (Bigham and Lee, 2014, Tishkoff, 2015).

To this end, here we analyze genotype data from a large cohort of preeclamptic Andean families from Puno, Peru (Figure 1A). This city, located at 3,830 m altitude, has a population with one of the highest incidences of preeclampsia and associated maternal mortality in the world (Bristol, 2009, Gil Cipirán, 2017). Our work takes a comprehensive approach to the genetic study of preeclampsia in a population adapted to high-altitude by employing a family-study design within a case-control cohort. This enables identification of genetic regions that influence preeclampsia considering each of the family members that affect disease risk—mothers, fathers, and offspring—unlike most genome-wide studies focused on pregnancy disorders which tend to solely include maternal or fetal genomes (Williams and Broughton Pipkin, 2011). We also aim to understand the role of ancestry-related susceptibility in this disorder by characterizing genetic diversity and admixture patterns in the Puno cohort. Additionally, because preeclampsia presents in a spectrum of severity based on gestational age, organ damage, and hypertension, we take advantage of extensive cohort phenotyping to study associations of genetic variants with disease severity. Our findings have implications for general understanding of preeclampsia, and human pregnancy hypertensive disorders more broadly, while also shedding light on the genetic factors that underlie human adaptations for successful reproduction at high altitudes.

Figure 1. Location and population structure of the Puno preeclampsia cohort.

A) Approximate location of Puno, Peru. B) Principal components analysis including PRE cases, PUN and UNA controls, and five continental reference populations from the 1000 Genomes. C) ADMIXTURE analysis results showing unsupervised clustering models assuming K=4 and K=6. At K=6 a Puno-specific sub-continental ancestry component not shared with 1000 Genomes Peruvians from Lima appears in the Puno cohort (shown in light blue).

Materials & Methods

Puno cohort

Preeclamptic families (PRE) were recruited between 2011 and 2016 in the Puno regional hospital (Hospital Regional Manuel Nuñez Butrón) after their preeclampsia diagnosis. Expecting parents (mothers and fathers) had to be at least 18 years of age and report at least two generations of parents from Puno or nearby Andean regions. Recruited families and subjects included 136 trios (mother, father, and fetal umbilical cord), 197 duos (190 mother and fetal umbilical cord duos, and 7 mother and father pairs), and 14 singletons (mother or umbilical only). 100 healthy same-population control families from Puno (PUN) were also recruited at the hospital at their time of admission for labor. These included 4 trios and 96 duos (mother and fetal umbilical cord). Lastly, 110 unrelated population controls were recruited at the local university, Universidad Nacional del Altiplano (UNA) in Puno. In total, 1,129 samples were collected, including 815 PRE cases, 204 PUN and 110 UNA controls (Supplementary Table 1).

Ethical approval

All participants were recruited with informed consent and with approval by the Stanford University Institutional Review Board eProtocols 20782 (Investigating the Genetic Basis of Preeclampsia in Populations Adapted to High Altitude) and 20839 (Population and Functional Genomics of the Americas). Local IRB approvals were provided by the ethics committee at the Manuel Nuñez Butrón Regional Hospital (01541-11-UADI-HR“MNB”-RED-PUNO) and the Peruvian National Institute of Health (213-2011-CIEI/INS).

Phenotypic data

Preeclampsia was defined as new-onset hypertension with proteinuria after 20 weeks of gestation. Hypertension was defined as a systolic blood pressure at least 30 mmHg above the basal level and a diastolic blood pressure at least 15 mmHg above the basal level. If no prior blood pressure measurements were available, average basal levels (85/55 mmHg) were used as the reference. Note that measured basal arterial pressure levels in pregnant women in Puno are around 80/50 – 90/60 mmHg (systolic/diastolic), much lower than the U.S. standards, possibly due to altitude adaptation (Segura-Vega, 2019). Proteinuria was confirmed to be at least 30 mg/dL by dipstick in two tests 24 hours apart. Severity of preeclampsia was defined by the attending physician and categorized as mild or severe. Gestational time was self-reported by the mother (from the date of the last menstrual period, LMP) or determined by the neonatal Capurro test.
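As an illustration only, the blood-pressure criterion described above can be written as a small Python function. The function name, argument names, and the reading of both cut-offs as "at least" are assumptions made for this sketch; it is not the study's own code. The default basal values (85/55 mmHg) are the ones stated in the text.

```python
# Hypothetical helper; names are illustrative, not taken from the study.
DEFAULT_BASAL = (85.0, 55.0)  # systolic/diastolic prior stated in the text, mmHg


def is_hypertensive(systolic, diastolic, basal=None):
    """Return True when both rises over the basal level meet the cut-offs.

    Assumes the criterion requires BOTH a systolic rise of >= 30 mmHg and a
    diastolic rise of >= 15 mmHg, as the wording in the text suggests.
    """
    basal_sys, basal_dia = basal if basal is not None else DEFAULT_BASAL
    return (systolic - basal_sys) >= 30 and (diastolic - basal_dia) >= 15
```

For example, with no prior measurements a reading of 115/70 mmHg sits exactly at both thresholds and would be flagged, whereas 114/70 mmHg would not.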

Blood and tissue collection

Whole blood from the mothers was collected within a few hours post-partum by venipuncture into EDTA tubes and frozen at −20 °C. Umbilical cord blood was collected by venipuncture following clamping of the cord immediately after delivery. Paternal blood, and blood from UNA controls, was obtained upon consent. For plasma, EDTA tubes were spun within 60 min of collection at 1,200 × g for 10 min in a tabletop centrifuge. Separated plasma was transferred to Eppendorf tubes, spun again under the same conditions for better purity, then stored at −20 °C in cryovials.

Genotypic data

DNA was obtained from whole blood with the Promega (USA) Wizard® Genomic DNA Purification Kit following the manufacturer’s instructions. DNA extracts were initially quantified with the Nanodrop. DNA content and quality were further assessed through quantification with the Qubit® Broad Range Assay and by visualizing on a 1% agarose gel, respectively. Samples that had both >10 ng/µL of DNA concentration and visible bands on the gel were selected for genotyping. Genotype data at over 800,000 sites across the genome were generated with the Affymetrix (USA) Axiom Genome-wide LAT 1 array for 950 samples in two batches. Batch 1 was genotyped in February 2014 at the University of California San Francisco, Gladstone Genomics Core in Mission Bay, San Francisco, CA. This batch included 360 PRE, 10 PUN and 110 UNA individuals (n=480). A total of 813,366 variants were successfully genotyped with Batch 1. Batch 2 was genotyped in November 2018 at Affymetrix Research Services Laboratories, Thermo Fisher Scientific in Santa Clara, CA. This batch included 324 PRE and 146 PUN individuals (n=470), as well as 10 controls added by the genotyping facility. Three samples failed the genotyping facility filtering metrics, therefore a total of 477 samples and 818,154 variants were successfully genotyped with Batch 2.

Quality control

Batch 1 data

The genotyping facility performed a first round of QC restricting the raw dataset to 713,709 recommended SNPs that passed filtering thresholds for heterozygous strength offset, cluster resolution, off-target variants, call rate and genotype quality. We further removed 42 variants with duplicate marker names and flipped 21 SNPs to the forward strand using snpflip (https://github.com/biocore-ntnu/snpflip) and Plink v1.9 (Chang et al., 2015). We verified that all variants had physical positions in the NCBI Build GRCh37 human reference (hg19 assembly). After QC, the Batch 1 dataset included 713,667 biallelic SNPs and 480 individuals.

Batch 2 data

We removed 214 variants with duplicate marker names, 4,233 structural variants and 540 variants with no physical position in the NCBI Build GRCh37 human reference. 64 SNPs were flipped to the forward strand as above. Additionally, we followed the genotyping facility recommendations to restrict this dataset to 777,946 recommended SNPs that passed filtering thresholds for cluster resolution, off-target variants, call rate and genotype quality. The 10 genotyping controls were also removed. After QC, Batch 2 dataset included 777,946 biallelic SNPs and 467 individuals.

Batch 1 and 2 merge

We intersected Batch 1 and 2 datasets at overlapping sites using Plink v1.9. The merged dataset contained 689,528 SNPs and 947 individuals. Using Plink, we removed 1,438 SNPs with genotype missing call frequency >5% (flag: --geno 0.05) and 183,054 SNPs with minor allele frequency (MAF) <0.5% (flag: --maf 0.005). We also excluded two individuals with missing call frequency >10% (flag: --mind 0.1). 561 SNPs failing Hardy-Weinberg equilibrium at 10e-10 were also excluded. We next filtered our dataset for families with excess Mendelian errors, cryptic relatedness, and duplicate samples (see Supplementary Table 2 for list of individuals assigned as unrelated after pedigree revision). 31 individuals were removed, and 56 pedigrees were updated. Chromosomal sex was estimated and sex misassignments were corrected for 176 individuals whose biological sex was either not recorded or incorrectly recorded during data collection. After QC, the merged Batch 1 + 2 dataset included 504,475 genome wide SNPs and 914 individuals (Supplementary Figure 1).
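The filtering counts reported in this paragraph can be cross-checked with simple arithmetic. The short Python sketch below tallies them; the variable names are illustrative, and it assumes that each filter removed a disjoint set of SNPs or individuals, which is what the reported totals imply.

```python
# Counts are taken from the text; names are illustrative only.
merged_snps, merged_samples = 689_528, 947

snp_removals = {
    "missing call rate > 5% (--geno 0.05)": 1_438,
    "MAF < 0.5% (--maf 0.005)": 183_054,
    "Hardy-Weinberg failure": 561,
}
sample_removals = {
    "per-individual missingness (--mind 0.1)": 2,
    "Mendelian errors, relatedness, duplicates": 31,
}


def remaining(start, removals):
    """Subtract every removal count from the starting total."""
    return start - sum(removals.values())


snps_after_qc = remaining(merged_snps, snp_removals)
samples_after_qc = remaining(merged_samples, sample_removals)
print(snps_after_qc, samples_after_qc)  # 504475 914
```

Both results match the post-QC totals stated for the merged dataset.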

Batch effect correction

We tested for batch effects by running principal components analysis in Plink after filtering the dataset for linkage disequilibrium and removing related offspring (flags: --indep-pairwise 100 10 0.1, --pca). We initially identified a strong batch effect with the top principal components statistically significantly associated with batch (P<0.05) (Supplementary Figure 2). To correct this effect, we conducted an additional round of site and sample-specific filtering. We removed strand-ambiguous (symmetrical) SNPs (A/T, C/G), excluded all sites not included in the “Best and Recommended” list provided by Affymetrix for this array, and retained only sites with genotype missingness <5% and MAF >0.5%. Additionally, we removed individuals with excess heterozygosity (outliers >4SD), duplicate individuals and individuals with cryptic or unexpected relatedness. In total, 65,161 SNPs and 31 individuals were removed. We repeated the principal components calculation as above on the filtered dataset and found no statistically significant association between batch and the top principal components (Supplementary Figure 2). The final dataset after batch effect correction included 439,314 genome wide SNPs and 883 individuals.
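One of the site filters above removes SNPs whose two alleles are complementary bases (A/T or C/G), because their strand cannot be inferred from the alleles alone. A minimal Python sketch of that check follows; the function names and the tuple layout are assumptions for illustration, and the SNP identifiers in the usage note are hypothetical.

```python
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def is_strand_ambiguous(allele1, allele2):
    """True when the two alleles are complementary bases (A/T or C/G)."""
    return COMPLEMENT.get(allele1.upper()) == allele2.upper()


def drop_ambiguous(variants):
    """Keep only variants that are not strand-ambiguous.

    variants: iterable of (snp_id, allele1, allele2) tuples.
    """
    return [v for v in variants if not is_strand_ambiguous(v[1], v[2])]
```

Given the made-up input `[("snpA", "A", "T"), ("snpB", "A", "G")]`, only `("snpB", "A", "G")` is retained.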

Population structure

We intersected our dataset with reference panels including five populations from 1000 Genomes (1KG) Phase 3: Yoruba from Ibadan, Nigeria (YRI), Utah residents with Northern and Western European ancestry (CEU), Han Chinese from Beijing, China (CHB), Mexican Americans from Los Angeles, USA (MXL) and Peruvians from Lima, Peru (PEL). After merging, we removed offspring and related individuals, restricted to autosomes and re-applied quality filters. The filtered, merged dataset consisted of 422,224 variants and 1,057 individuals. The unsupervised clustering algorithm ADMIXTURE (Alexander et al., 2009) was run on this dataset to explore global patterns of population structure. As recommended by the ADMIXTURE manual, the input data were LD-pruned using Plink (flag: --indep-pairwise 50 10 0.1). After LD pruning, 45,496 variants remained for analysis. Models assuming K=2 through K=10 ancestral clusters were tested and the best fit model was selected after examining cross-validation errors. To account for possible convergence variation, we performed 10 additional runs using different random seeds per run and estimated parameter standard errors using 200 bootstrap replicates per run. ADMIXTURE results were plotted with the R pophelper package (Francis, 2017). Principal components analysis (PCA) was applied to the LD-pruned dataset using EIGENSOFT v7.2.1 (Patterson et al., 2006) and plots were generated using the ggplot2 package in R v4.0.3 (R Core Team, 2018, Wickham, 2016).
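Model selection by cross-validation error can be sketched as follows. This Python example assumes that the ADMIXTURE logs contain lines of the form `CV error (K=3): 0.52`, and it applies the common rule of picking the K with the lowest error; the text only says the errors were "examined", so the exact rule used in the study is an assumption here. The error values in the usage note are invented for illustration.

```python
import re

# Assumed log line format: "CV error (K=3): 0.52"
CV_LINE = re.compile(r"CV error \(K=(\d+)\):\s*([0-9.]+)")


def parse_cv_errors(log_text):
    """Map each K found in the log text to its cross-validation error."""
    return {int(k): float(err) for k, err in CV_LINE.findall(log_text)}


def best_k(cv_errors):
    """Return the K with the smallest cross-validation error."""
    return min(cv_errors, key=cv_errors.get)
```

With the made-up log text `"CV error (K=2): 0.60\nCV error (K=3): 0.55\nCV error (K=4): 0.57"`, `best_k(parse_cv_errors(...))` selects K=3.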

Phasing and local ancestry estimation

We used RFMix v1.5.4 (Maples et al., 2013) to determine genome wide local ancestry proportions for the Puno cohort founders, assuming a model of K=3 ancestral populations. The choice of K=3 reference populations was informed by the ADMIXTURE results. The reference panel included 108 YRI and 94 CEU individuals from 1000 Genomes Phase 3, and 94 native individuals from Mexico (30 Mixe, 15 Zapotec, 49 Nahua) genotyped as part of the GALA II study (Galanter et al., 2014). These reference samples were used as proxies for African, European, and Native American ancestral source populations, respectively. After merging, the analysis-ready dataset consisted of 420,105 overlapping variants and 899 individuals. The data were phased with SHAPEIT2 (O’Connell et al., 2014). RFMix was run with default parameters and EM=2 iterations. Ancestry call cutoffs were determined with a 0.9 posterior probability threshold as recommended by Kidd et al. (2012).
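The 0.9 posterior probability cutoff can be illustrated with a small Python sketch that assigns an ancestry to a genomic window only when the most likely ancestry reaches the threshold. This does not reproduce the RFMix output format; the function name, the dictionary input, the ancestry labels, and the use of "greater than or equal to" are all assumptions made for illustration.

```python
def call_ancestry(posteriors, threshold=0.9):
    """Return the most probable ancestry label, or None if below threshold.

    posteriors: dict mapping an ancestry label to its posterior probability
    for one haplotype window (hypothetical input layout).
    """
    label, prob = max(posteriors.items(), key=lambda item: item[1])
    return label if prob >= threshold else None
```

A window with posteriors `{"NAT": 0.95, "EUR": 0.04, "AFR": 0.01}` would be called `"NAT"`, while `{"NAT": 0.60, "EUR": 0.39, "AFR": 0.01}` would be left uncalled.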

Ancestry proportions analysis

We tested for significant differences in proportions of Native American, European, and African ancestry components between PRE cases, PUN and UNA controls. We applied the Wilcoxon rank-sum test in R v3.5.1 (pairwise.wilcox.test function) with Bonferroni correction for multiple testing. This non-parametric test assesses whether significant differences exist between two distributions (Moore et al., 2009). Our null hypothesis was that the distribution of each ancestry proportion was identical between PRE cases, PUN and UNA controls.
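The multiple-testing correction named above is simple enough to show directly. The Python sketch below multiplies each p-value by the number of comparisons and caps the result at 1; with three groups there are three pairwise comparisons per ancestry component. The p-values in the usage note are invented for illustration and are not results from the study.

```python
def bonferroni(p_values):
    """Bonferroni-adjust a list of p-values: min(1, p * number of tests)."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]
```

For three hypothetical comparisons (PRE vs PUN, PRE vs UNA, PUN vs UNA) with raw p-values 0.01, 0.04 and 0.5, the adjusted values are approximately 0.03, 0.12 and 1.0.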

Statistical analysis of clinical phenotypes

We assessed batch bias of clinical phenotypes and correlation with each other by statistical analysis in R v3.4.0 (R Core Team, 2018). The following dichotomous phenotypes were tested for batch association with a chi squared test: severity of diagnosis (mild or severe), proteinuria (+/++ or +++), parity (nulliparous or more than one birth), sex of newborn and mode of delivery (vaginal or C-section). The following continuous phenotypes were tested for batch association by t-test: gestational time measured by the mother (date of last menstrual period, or LMP) and by the fetus (Capurro test), neonate weight, systolic and diastolic blood pressure measurements, and maternal age.

Transmission-disequilibrium test (TDT) and parent of origin (POO)

Leveraging the trio family structure, we applied the transmission disequilibrium test (TDT) and the parent-of-origin (TDT-POO) test to all 87 parent-offspring case trios (preeclamptic families with offspring) in Plink v1.9 using the --tdt flag, with and without the ‘poo’ modifier. Variants were then filtered by MAF > 0.05 within the analyzed cohort. The TDT assumes Mendelian transmission of alleles and tests whether the queried allele is transmitted from parents to affected offspring more or less often than expected (Purcell et al., 2007, Purcell et al., 2005). The POO analysis is part of the TDT and queries transmission from each parent separately to assess paternal- or maternal-specific transmission. This test self-corrects for covariate effects by treating each trio as a separate unit.
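For readers unfamiliar with the TDT, the classical McNemar-type statistic can be sketched in a few lines of Python. Among heterozygous parents, let b be the number of times the tested allele was transmitted to an affected child and c the number of times it was not; the statistic (b − c)² / (b + c) is compared with a chi-square distribution with one degree of freedom. This is a textbook form offered as an illustration, not a reproduction of Plink's implementation, and the counts in the usage note are hypothetical.

```python
import math


def tdt_statistic(transmitted, untransmitted):
    """McNemar-type TDT chi-square (1 df) from heterozygous-parent counts."""
    total = transmitted + untransmitted
    if total == 0:
        raise ValueError("no informative transmissions")
    return (transmitted - untransmitted) ** 2 / total


def chi2_1df_pvalue(stat):
    """Upper-tail p-value of a chi-square statistic with 1 degree of freedom.

    Uses P(X > x) = erfc(sqrt(x / 2)), which holds for 1 df.
    """
    return math.erfc(math.sqrt(stat / 2.0))
```

With hypothetical counts of 30 transmissions and 10 non-transmissions, `tdt_statistic(30, 10)` gives 10.0, and `chi2_1df_pvalue(10.0)` is roughly 0.0016.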

GWAS for case-control association

Puno cohort individuals were divided into offspring and mothers for two separate case-control GWAS analyses using logistic regression in Plink (flag: --logistic) with the first 3 PCs and sequencing batch as covariates. The analysis on the mothers includes 254 PRE and 70 PUN controls. The offspring analysis includes 225 PRE cases and 60 PUN controls. These analyses included individuals in trios, duos, and singletons. Variants were filtered by MAF > 0.05 within the analyzed cohort.

GWAS in additional phenotypes

Multiple phenotypes measured and captured in the recruited patients’ medical histories allow for testing of additional genetic associations. We performed additional genome-wide association analyses of endophenotypes in the PRE mothers (N=254) and offspring (N=225), separately. These analyses included individuals in trios, duos, and singletons. The endophenotypes tested for each were: (1) gestational age, maternal measurement; (2) gestational age, fetal measurement; (3) diastolic blood pressure at diagnosis of preeclampsia; (4) systolic blood pressure at diagnosis of preeclampsia; (5) proteinuria at diagnosis and (6) severity of diagnosis. The first four were treated as continuous variables and analyzed by linear regression in Plink (flag: --linear). Proteinuria and severity of diagnosis were dichotomous variables analyzed in Plink by logistic regression (flag: --logistic), with proteinuria reduced to + and ++ vs. +++. Genotyping batch was included as a discrete covariate and the first 3 PCs as continuous covariates. Several of these analyses included fewer individuals due to missing data. Specifically, GWAS with systolic and diastolic blood pressure included 253 PRE mothers and 224 PRE offspring, and GWAS with maternal measurement of gestational age included 252 PRE mothers and 223 PRE offspring.

GWAS data visualization

All genome-wide analyses were filtered by MAF >= 0.05 within the analyzed cohorts and visualized by Manhattan plots using the qqman R package v0.1.4 (Turner, 2017). QQ plots were generated with the same package to confirm no effects from population structure or other confounders. Regions of interest were selected if they met two criteria: (1) p-value (p<10E-4 in most cases—unless specified in the results section) and (2) the presence of nearby associated SNPs forming a skyscraper-like structure in the Manhattan plot. Top SNPs in these regions were selected, and their genomic regions plotted using LocusZoom (Pruim et al., 2010). Maps displaying the geographic distribution of candidate associated variants were produced using the Geography of Genetic Variants (GGV) browser (Marcus and Novembre, 2017).
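
The two selection criteria can be approximated programmatically, as in the sketch below. The "skyscraper" criterion in the text is a visual judgment; here it is replaced by a hypothetical proxy that counts nearby SNPs below a looser p-value cutoff. The window size, support cutoff, minimum support count, and SNP records are all assumptions for illustration, and the lead p-value threshold is passed in explicitly because the text allows it to vary.

```python
def regions_of_interest(snps, p_cut, maf_min=0.05, window=250_000,
                        support_p=1e-2, min_support=2):
    """Return IDs of lead SNPs that pass the p-value cutoff and have nearby support.

    snps: list of dicts with keys id, chrom, pos, p, maf.
    window, support_p and min_support are hypothetical stand-ins for the
    visual "skyscraper" criterion.
    """
    kept = [s for s in snps if s["maf"] >= maf_min]   # MAF filter first
    leads = []
    for lead in kept:
        if lead["p"] >= p_cut:
            continue
        support = [
            s for s in kept
            if s is not lead
            and s["chrom"] == lead["chrom"]
            and abs(s["pos"] - lead["pos"]) <= window
            and s["p"] < support_p
        ]
        if len(support) >= min_support:
            leads.append(lead["id"])
    return leads

# Hypothetical summary statistics (not study data).
demo = [
    {"id": "snpA", "chrom": 13, "pos": 1_000_000, "p": 5e-6, "maf": 0.30},
    {"id": "snpB", "chrom": 13, "pos": 1_020_000, "p": 4e-4, "maf": 0.25},
    {"id": "snpC", "chrom": 13, "pos": 1_050_000, "p": 9e-4, "maf": 0.40},
    {"id": "snpD", "chrom": 4, "pos": 5_000_000, "p": 2e-6, "maf": 0.20},   # isolated hit
    {"id": "snpE", "chrom": 13, "pos": 1_010_000, "p": 1e-7, "maf": 0.01},  # fails MAF filter
]
selected = regions_of_interest(demo, p_cut=1e-4)
```

In this example only snpA is retained: snpD passes the p-value cutoff but has no neighbouring support, and snpE is removed by the MAF filter.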

Capture sequencing

We conducted fine mapping of potential causal variants in a subset of families genotyped in Batch 1 prior to Batch 2 genotyping. Preliminary data obtained from Batch 1 genotypes were analyzed using standard family-based TDT in Plink for preeclampsia associations (as above), and regression analysis on secondary phenotypes was conducted using linear mixed models in GCTA (Yang et al., 2011) (flag: --mlma-loco). Based on these preliminary results, we designed a target capture assay including windows around top hits for preeclampsia and secondary phenotypes, as well as several genes previously suggested to be associated with preeclampsia in the GWAS catalog (release 2.0.5) (Buniello et al., 2019). The total capture size was approximately 10Mb (Supplementary File 1).

We next selected families from Batch 1 with the strongest associations on the preliminary TDT analysis (n=86 individuals, Supplementary Table 1). Genomic DNA from 86 individuals (Supplementary Figure 3) was fragmented by mechanical shearing (Covaris) and prepared using the KAPA Hyperprep library preparation kit (Kapa Biosystems, now part of Roche, Switzerland). DNA capture was performed on the libraries using the Agilent (USA) SureSelect platform following manufacturer’s instructions. Paired-end sequencing of captured libraries was performed on the Illumina NextSeq. Sequence data were analyzed through a standard FASTQC-BWA-GATK pipeline following guidelines as described in (Koboldt, 2020). We then performed the same GWAS analyses listed above (TDT test for the preeclampsia phenotype and linear regressions for continuous phenotypes) in the captured regions in a limited set of individuals: 25 trios, 4 duos (3 mother-offspring, 1 father-offspring) and 3 singletons (1 offspring and 2 mothers). Candidate loci identified in these analyses were individually merged and annotated with ANNOVAR (Yang and Wang, 2015) and overlapped with GTEx single-tissue cis-eQTL data (version V6p) from the online database (https://gtexportal.org/home/datasets) to find relevant GTEx annotations in our data set (Carithers et al., 2015, Carithers and Moore, 2015).

ProZ ELISA

ProZ levels in post-partum maternal and cord blood plasma were assayed using the human-ProZ ELISA kit from MyBioSource (USA, Cat. No. MBS765710), following manufacturer instructions. Maternal and fetal plasma samples were diluted at 1:400 in sample diluent and all washes were performed manually with a multichannel pipet. Final optical density absorbance at 450 nm was read using the Bio-Rad (USA) iMark™ Microplate Absorbance reader. A 4-Parameter curve fit was applied to the standards, and the resulting equation was used to calculate concentration in the experimental samples. Boxplots and t-tests were done in R v3.4.0 (R Core Team, 2018).
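
A four-parameter logistic (4PL) standard curve and its inversion can be sketched as follows. The curve parameters are hypothetical placeholders for values that would come from fitting the kit standards (the fitting step is omitted), and multiplying by the 1:400 dilution factor to recover the undiluted concentration is an assumption about how the final values were computed.

```python
def four_pl(conc, a, b, c, d):
    """4PL curve: optical density as a function of concentration.

    a: response at zero concentration, d: response at saturation,
    c: inflection point, b: slope factor.
    """
    return d + (a - d) / (1.0 + (conc / c) ** b)

def inverse_four_pl(od, a, b, c, d):
    """Solve the 4PL equation for concentration given an optical density."""
    return c * ((a - d) / (od - d) - 1.0) ** (1.0 / b)

def sample_concentration(od, params, dilution=400):
    """Concentration in the undiluted sample (assumes a simple dilution factor)."""
    return dilution * inverse_four_pl(od, *params)

# Hypothetical fitted parameters (a, b, c, d); not from the study.
params = (0.05, 1.2, 50.0, 2.5)
od_reading = four_pl(20.0, *params)          # simulate a reading at 20 units
recovered = inverse_four_pl(od_reading, *params)
undiluted = sample_concentration(od_reading, params)
```

The inversion is only defined for readings strictly between the lower and upper asymptotes, so readings outside the range of the standards cannot be converted.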

Results

We obtained blood samples and maternal clinical records from consented families at the Hospital Regional Manuel Nuñez Butrón, and blood alone from individuals recruited at the Universidad Nacional del Altiplano. At the time of recruitment, mothers from case families (labeled PRE throughout this study) were at hospital experiencing pregnancy with a preeclampsia diagnosis, defined as hypertension and proteinuria after 20 weeks of gestation. It is important to note that basal blood pressure in this population is lower than in the U.S., and hypertensive levels can be as low as 110/65 mmHg, compared to 140/90mmHg in U.S. guidelines. Rather than based on a cutoff, hypertension was defined as a systolic measurement 30 mmHg higher than basal and diastolic at least 15 mmHg higher than basal for each individual (see Materials & Methods for more details). For consistency, and to control for other hypertensive complications of pregnancy, we included proteinuria in the diagnosis, despite this factor not being currently required in many diagnostic guidelines (American College of Obstetricians and Gynecologists, 2020).
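
The relative definition of hypertension described above can be expressed as a simple rule. This sketch assumes that both thresholds are inclusive ("at least") and that both must be met; the function names and the example basal values are hypothetical.

```python
def is_hypertensive(systolic, diastolic, basal_systolic, basal_diastolic):
    """Hypertension relative to the individual's own basal blood pressure (mmHg)."""
    return (systolic - basal_systolic) >= 30 and (diastolic - basal_diastolic) >= 15

def meets_case_definition(hypertensive, proteinuria, gestational_weeks):
    """Case definition used here: hypertension and proteinuria after 20 weeks."""
    return hypertensive and proteinuria and gestational_weeks > 20

# A reading of 110/65 qualifies when basal pressure is 80/50 (hypothetical values).
low_basal = is_hypertensive(110, 65, 80, 50)
# A reading of 140/90 does not qualify when basal pressure is 120/80.
high_basal = is_hypertensive(140, 90, 120, 80)
```

The second example shows why a fixed cutoff and the relative definition can disagree: a reading at the 140/90 mmHg cutoff does not count as hypertensive under the relative rule if the individual's basal pressure is already high.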

Mothers from control families (labeled PUN) were experiencing a pregnancy without complications at time of hospital recruitment. 88 PRE families and two PUN families were collected as complete trios, including both biological parents and offspring; the rest are duos (one parent and offspring) and single individuals (mothers) (Table I). Overall, the Puno cohort collected for this study includes 815 individuals from the PRE group, 204 from the hospital control group (PUN), and 110 from the university (UNA) as ‘population controls’. We extracted DNA from blood and genotyped PRE, PUN and UNA individuals in two batches on the Affymetrix Axiom LAT array. Our final dataset after quality filtering included 439,314 genome wide SNPs and 883 individuals (see Table I and Supplementary Table 1 for breakdown of PRE, PUN and UNA).

Table I. All individuals genotyped by group (case/control) and batch after QC filtering.

Puno individuals have high proportions of Native American ancestry

We sought to understand the demographic history of our test population by characterizing patterns of genetic diversity and population structure in the Puno study cohort. To this end we intersected the entirety of the Puno cohort dataset (883 individuals) with a reference panel including five continental populations from the 1000 Genomes (1KG) Project Phase 3 panel: Yorubans (YRI), Europeans (CEU), Mexicans (MXL), Han Chinese (CHB) and Peruvians from Lima (PEL). Using principal component (PC) analysis, we find that individuals from Puno (either PRE, PUN, UNA) cluster together in PC space, and are distributed in a clinal pattern alongside Peruvians from Lima who have high proportions of Native American ancestry (Figure 1B, Supplementary Figure 4).

We next investigated admixture patterns in the Puno population with the goal of characterizing proportions of Native versus non-Native genomic ancestry. Using the clustering algorithm ADMIXTURE (Alexander, et al., 2009), we explored unsupervised models assuming K=2 through K=10 ancestral clusters (Supplementary Figure 5). Cross-validation errors for each K cluster are shown in Supplementary Figure 6. At K=4, we observe a clear separation of continental-scale ancestry components. We find that Puno individuals have large proportions of Native American ancestry and small proportions of European ancestry, represented by blue and red in Figure 1C, respectively. At the best fit model of K=6, ADMIXTURE analysis finds substructure within the Native American ancestry component of the Puno cohort. Specifically, we observe a Puno-specific ancestry component (shown in light blue in Figure 1C) which is not present within the Native American ancestry components of 1KG Mexican and Peruvian individuals. This substructure may derive from an Andean specific ancestry component that has been previously identified among Indigenous and mestizo communities from the Andean Highlands (Barbieri et al., 2019, Harris et al., 2018). Overall, we find that individuals in the Puno cohort are predominantly of Native American ancestry (95.7% on average) and have low levels of non-Native American admixture (approximately 4.2% on average; Supplementary Table 3). We further find that the Puno population carries a Highland-specific Native American sub-continental ancestry component, as noted in previous work (Barbieri, et al., 2019, Harris, et al., 2018).

Finally, we tested for significant differences in ancestry proportions between cases (PRE) and controls (PUN, UNA) in the Puno cohort. Guided by the findings of the ADMIXTURE analysis, we used RFMix to determine local ancestry proportions in the Puno cohort assuming a model of K=3 ancestral components. We next extrapolated average ancestry proportions per individual from the RFMix local ancestry calls (Supplementary Tables 4-5). The results of this estimation further confirm the predominantly Native American ancestry background and highlight the small proportion of European admixture present in our sample. We next performed a Wilcoxon rank test to contrast ancestry proportions between PRE, PUN and UNA. This test identified a small but significant difference in European ancestry proportions between PRE and UNA but found no significant differences in Native American or African ancestry proportions (Supplementary Figure 7, Supplementary Table 6). Overall, UNA individuals have slightly higher proportions of European ancestry than PRE and PUN individuals. However, proportions of Native American ancestry are not significantly different between cases (PRE) and controls (PUN, UNA). These findings support the results of the ADMIXTURE analysis and further underscore the primarily Native American ancestry background of the Puno cohort.

Family-based analysis reveals association of a cluster of clotting factor genes (PROZ, F7, F10) with preeclampsia

Next, we sought to identify genetic loci associated with the risk of preeclampsia in this highly susceptible population adapted to the hypoxic conditions of the Andean Highlands. As decades of genetic research have shown a role for maternal, paternal and offspring genomes on preeclampsia risk (Galaviz-Hernandez et al., 2018, Gray et al., 2018, Phipps et al., 2019), we collected family trios from 88 cases, as well as duos when trio sampling was not possible (either for lack of consent or due to samples failing genotyping QC), enabling all three genomes to be evaluated. Since preeclampsia is a complex disease with wide ranging phenotypes, we provide summaries of relevant phenotypic data for all case pregnancies organized by batch and in trio cases only (Table II). By statistical comparison, we find that there is moderate batch bias in approximately half of the measured phenotypes (e.g., Batch 2 had significantly more vaginal deliveries than C-sections, when compared to Batch 1, p<0.04), but none likely to influence the analysis when supported by batch correction. In addition to the data shown in Table II, most mothers (>98%) had no history of chronic hypertension or diabetes and all were non-smokers.

Table II. Phenotypic characteristics of analyzed case families with preeclampsia (duos and trios).

The sum of batches 1 and 2 corresponds to the total in the first column. “Trios only” identifies the subset from the total that are in whole trio units (the rest are mother-offspring duos). The last column represents chi-squared or t-test p-values for each phenotype between batches. Significant tests with p<0.05 are identified with an asterisk (*).

To find genetic linkage between genomic loci and preeclampsia, we first performed a parent-offspring trio GWAS analysis, or transmission-disequilibrium test (TDT), in the 88 affected (PRE) trios. The TDT offers a robust association test of genotype to phenotype in affected families by measuring over-transmission of alleles from heterozygous parents to the offspring. With this analysis, we identified a group of SNPs in linkage disequilibrium (LD) over a cluster of blood clotting factor genes with a high odds ratio for preeclampsia (Figure 2; Table III; Supplementary Figure 8). The most significant SNP in this cluster, rs5960 (OR 3.05, 95% CI 1.841-5.054, p<6×10⁻⁶; 1000G MAF 0.623), is a synonymous variant in the clotting factor F10. Two other members of the coagulation cascade, F7 and PROZ, are also in this region. Another top hit in the TDT, SNP rs553316 (OR 0.339, 95% CI 0.2041-0.5629, p=1.15E-05; 1000G MAF 0.408), is in high LD with rs5960 in 1KG Peruvian populations (R²=0.7476) (Machiela and Chanock, 2015). Additionally, rs553316 is annotated in GTEx as an eQTL for PROZ in mammary tissue (note that, as of our analysis, no placental or pregnancy blood data were available on GTEx). The global distributions of allele frequencies for rs5960 and rs553316 in 1KG reference populations are shown in Supplementary Figure 9 and noted in Supplementary Table 7.

Figure 2. Top associations from trio analyses by TDT and TDT-POO.

A) Manhattan plot showing top association with preeclampsia in the offspring genome: SNP rs5960 on chromosome 13 at p<10e-5 suggestive of significance (shown in red). B) Locus Zoom plot depicting the top associated SNP cluster from the TDT on chromosome 13. C) Locus Zoom plot depicting the top paternal region from TDT-POO analysis on chromosome 13.

Table III. GWAS statistics and genomic annotations of top hits (P<5×10⁻⁴) from the TDT.

Given the importance of clotting genes in pregnancy, we sought to complement the genotype analysis by performing deep sequencing of targeted genomic regions surrounding rs5960 in a subset of cohort participants (Supplementary Table 8, Supplementary Figure 3). To fine-map potential causal variants, we repeated the same TDT analysis described above in the fine-mapped individuals and cross-referenced with the GTEx database for expression phenotypes in relevant tissues. This analysis found a strong association of preeclampsia with several eQTLs for PROZ (Supplementary Table 9). Other top hits from the genotype TDT that were recapitulated in this analysis include variants in the SLC46A3 and CUL4A genes, also located on chromosome 13 (Supplementary Table 9). Both genes have been previously associated with preeclampsia risk in clinical studies (McGinnis, et al., 2017, Tan et al. 2017). These data suggest that clotting factors on chromosome 13 may play an important role in preeclamptic pregnancies.

Finally, we asked whether this PROZ eQTL resulted in differential PROZ protein expression between PRE cases and PUN controls. Since the TDT identifies associated variants in the offspring, we analyzed the umbilical cord plasma of 8 PUN controls and 16 PRE cases by ELISA. In this limited sample, we detected no difference in PROZ levels in umbilical cord plasma (difference in means = 41.550 µg/mL, 95% CI -342.758 to 425.858, p = 0.85) collected after delivery (Supplementary Table 10, Supplementary Figure 10). However, future testing could evaluate PROZ levels in the placenta, where interaction with the maternal environment is more significant to the preeclampsia phenotype than in umbilical cord blood.

Clotting factor locus shows paternal inheritance

We next examined whether there were loci associated with preeclampsia that were disproportionately inherited either maternally or paternally. To this end, we performed parent-of-origin TDT GWAS in the same 88 trios tested above. This test investigates whether any of the associated SNPs are disproportionately inherited from fathers versus mothers, and vice versa. The most significant SNP from the TDT analysis, rs5960 in F10, is suggested to be paternally inherited more often than expected by chance (p=10⁻⁴, Figure 2, Table IV, Supplementary Figure 11). Other loci show evidence of paternal inheritance, such as rs79278805 (p = 1.77E-04), located within SPAG6 on chromosome 10, and rs9399401 (p=2.76E-04) in ADGRG6/GPR126 on chromosome 6. Similarly, we find several SNPs that show maternal origin bias. The most significant is rs130121 (p=1.91E-04) on chromosome 22 in the FAM19A5/TAFA5 gene, followed by rs10282765 (p=2.39E-04) on chromosome 8 within a ncRNA (Table IV, Supplementary Figures 12-13). Several genes in the vicinity of these SNPs have been implicated in reproduction. SPAG6 is recognized by anti-sperm antibodies and might be involved in infertility (Cooley et al., 2016, Neilson et al., 1999). ADGRG6/GPR126 is a G protein-coupled receptor involved in angiogenesis. It is upregulated in umbilical vein endothelial cells and was found previously to be upregulated in preeclamptic placentas (Cui et al., 2014, Sitras et al., 2009). Overall, these parent-of-origin effects support the hypothesis that maternal and/or paternal bias might contribute to preeclampsia disease.

Table IV. GWAS P values and genomic annotations of top hits (P<5×10⁻⁴) from the TDT-POO.
Table V. Statistics and annotations of the top SNPs (p<5×10⁻⁴) with biological relevance for preeclampsia of secondary phenotype and case-control GWAS analyses.

All SNPs in this table are described in the text (for a complete list of regions at p<5×10⁻⁴, see supplemental tables). Beta values are reported for linear regressions and odds ratio (OR) for logistic regressions. GA, gestational age; BP, blood pressure.

Case-control analysis, placental gene S100P is associated with preeclampsia in the offspring

While the TDT identifies preeclampsia risk variants from inheritance analysis, a more common way to test for disease risk variants is to compare cases and controls. The collection of control (PUN) mother-offspring duos allowed us to compare preeclamptic to healthy pregnancies in both the mothers and the offspring. To this end, we performed two case-control GWAS of preeclampsia using Plink (see Materials & Methods): (1) 268 PRE vs. 70 PUN mothers; and (2) 230 PRE and 60 PUN offspring. Several genetic regions showed suggestive association with preeclampsia in both test groups (Supplementary Table 11; Supplementary Figures 14-15). The most interesting association was the top SNP in the offspring, rs34360485 on chromosome 4 (p<2E-5, OR 3.615, 95% CI 2.003-6.524, MAF 0.36; Table V), located in a region that contains the placental gene S100P. S100P is a calcium-binding protein strongly expressed in the placenta (Zhu et al., 2015) that promotes trophoblast proliferation in culture (Zhou et al., 2016). The global distribution of allele frequencies for rs34360485 in 1KG reference populations is shown in Supplementary Figure 16 and noted in Supplementary Table 9.

Associations of secondary phenotypes reveal loci with roles in placental biology

Preeclampsia is a heterogeneous disease with varying potential markers of severity. For instance, the earlier in gestation preeclampsia occurs, the more severe it is considered to be (Gong et al., 2012, Wojtowicz et al., 2019). Likewise, all the characteristic clinical features associated with preeclampsia (such as proteinuria and elevated blood pressure) can present at varying levels of severity. Harnessing the availability of clinical records for all individuals in the PRE cohort, we next performed GWAS tests on six secondary phenotypes of preeclampsia measured at the time of diagnosis: (1) gestational age, maternal measurement; (2) gestational age, fetal measurement; (3) diastolic blood pressure; (4) systolic blood pressure; (5) proteinuria and (6) severity of diagnosis as stated by the clinician. It is worth clarifying that gestational age (the time of the fetus in the womb) was measured in two different ways throughout the study. The fetal measurement was done by the “Capurro” test, which combines five different measurements in the neonate, while the maternal measurement relies on the date of the mother’s last menstrual period before pregnancy.

To investigate possible genetic associations with secondary phenotypes of preeclampsia, we performed GWAS analyses by logistic and linear regression for each of the six phenotypes in 254 mothers and 225 offspring, separately. In total, we ran 12 GWAS tests. Logistic regression was applied to binary phenotypes (proteinuria and severity of diagnosis), while linear regression was applied to continuous phenotypes (gestational age and blood pressure measurements). All analyses were corrected for batch and the first three principal components were included as continuous covariates. With this analysis we found several strong associations of SNPs to secondary maternal phenotypes (Table V; Supplementary table 12). These findings point to several genetic regions containing relevant genes associated with pregnancy and the complex biology of preeclampsia, as detailed below.

Gestational Age

Gestational age was associated in mothers with one locus on chromosome 1 (rs952593, beta -1.66, 95% CI ± 0.61, p=3.12×10⁻⁷, MAF 0.13). This region is near TBX15 (Table V; Supplementary Table 12; Supplementary Figures 17-20), a T-box transcription factor shown to be downregulated in intrauterine growth restricted placentas (Chelbi et al., 2011). The association held true with both measurements of gestational age (by maternal last period and neonate Capurro test). The maternal measurement, but not the fetal measurement, of gestational age was associated with a multigenic locus on chromosome 11 (top SNP rs2581927, beta -2.03, 95% CI ± 0.85, p = 4.85×10⁻⁶; MAF 0.06). A gene of interest in this locus is APLNR, the receptor to ELABELA, which causes preeclampsia symptoms in mice (Supplementary Figures 21-22) (Ho et al., 2017).

Diastolic and Systolic Blood Pressure

Diastolic blood pressure reached genome-wide significance for one association in the maternal genome on chromosome 4 (top SNP rs1874237, p<5×10⁻⁸, beta -4.257, 95% CI -5.711 to -2.804, MAF 0.45; Table V; Figure 3). This SNP is within an uncharacterized non-coding RNA locus near NKX6-1, a gene involved in β-cell development and function (Taylor et al., 2013). In the offspring, both systolic and diastolic blood pressure were strongly associated with SNPs in KCNS3/K(V)9.3 (top SNP rs4553827, beta 7.44, 95% CI ± 2.82, p = 5.26×10⁻⁷, MAF 0.25), a voltage-gated potassium channel gene that is highly expressed in the human placenta, where it localizes to placental vascular tissues and syncytiotrophoblast cells (Fyfe et al., 2012) (Supplementary Table 13; Supplementary Figures 23-26).

Figure 3. Manhattan plot showing top association in the maternal genome with diastolic blood pressure.

SNP rs1874237 on chromosome 4 at p<5×10⁻⁸, genome-wide significance (shown in red).

Proteinuria and Severity of Diagnosis

Proteinuria was most strongly associated in the mothers with rs2760751 on chromosome 17 (OR 2.83 ± 1.02, p = 5.65E-06, MAF 0.29). This SNP is intronic to SMG6, a telomerase binding protein. A second association with proteinuria in the maternal genome was found with SNP rs12276362 (OR 0.41 ± 0.14, p = 1.19E-05, MAF 0.49) on chromosome 11, near the PIWIL4 gene (Supplementary Figures 27-30). This region is also correlated with severity of diagnosis in the mothers (rs1940640, OR 2.4 ± 0.8, p = 1.30E-05, MAF 0.43; Supplementary Figures 29, 31). It is not surprising that proteinuria and severity of diagnosis share a common association, since these two phenotypes are correlated: clinically severe cases generally have higher levels of protein in the urine. Aberrant PIWI proteins, which interact with piRNAs to drive post-transcriptional gene regulation, have been found in cancers (Wang et al., 2016), and theoretical evidence from piRNA evolution suggests a role in placentation, although this has yet to be proven empirically (Chirn et al., 2015). In the offspring genome, proteinuria showed an association with placental gene RARB, or retinoic acid (RA) receptor beta (rs4241542, OR 0.26 ± 0.14, p=7.04×10⁻⁶, MAF 0.21) (Comptour et al., 2016, Huebner et al., 2018), while the strongest association with proteinuria is in a different region, at a SNP intronic to STK32B (rs62297274, OR 0.35 ± 0.12, p=3.5104×10⁻⁶; Supplementary Table 13; Supplementary Figures 32-33). Interestingly, the minor allele for SNP rs62297274 is found at high frequencies in Peruvians compared to other global populations. In the Puno cohort MAF for this variant is 0.49, slightly higher than among Peruvians from Lima sampled in the 1KG (PEL MAF 0.41) (Supplementary Figure 34). In contrast, the minor allele is found at low frequencies in the rest of the Americas (1KG AMR MAF 0.19) and is rarely observed globally (1KG MAF <0.05) (Supplementary Table 9).

Discussion

In this analysis, we investigate the genetic diversity of a preeclampsia cohort of Andean families from Puno, Peru, a population with one of the highest incidences of this disease in the world (Bristol, 2009, Gil Cipirán, 2017). We harness the power of a trio study design to uncover maternal, paternal, and fetal genetic factors influencing the incidence and severity of preeclampsia in this cohort. In contrast to previous preeclampsia GWAS studies, which have been hampered by limited phenotyping and heterogeneous sampling (Williams and Broughton Pipkin, 2011), the present work includes a case-control cohort sampled from a single population, treated at the same hospital, and exposed to similar selective pressures due to long-term residence at high altitude. Thus, despite a small sample size, our family-based GWAS design permits identification of novel significant and suggestive associations with preeclampsia that would remain otherwise undiscovered (Tishkoff, 2015).

Most genetic studies on preeclampsia have not investigated whole family units (Boyd, et al., 2013, Cincotta and Brennecke, 1998, McGinnis, et al., 2017, Salonen Ros et al., 2000), despite the evidence of a complex genetic risk involving factors from both parents and the fetus (Valenzuela, et al., 2012). This reinforces the strength of our approach, where the top association in the trio study was rs5960, a synonymous variant in the clotting factor gene F10, in a locus with two other clotting factors: F7 and PROZ. PROZ, a vitamin K-dependent factor, is an anticoagulant protein with a role in factor X inhibition (Almawi et al., 2013). Several previous studies have suggested a hypercoagulative state in preeclampsia (reviewed in Ismail and Higgins, 2011), as spiral arteries of preeclamptic pregnancies often present thrombosis and atherosis (Haram et al., 2014). In fact, strong evidence supporting an effect of thrombotic processes on preeclampsia is based on the observation that aspirin, a known blood thinner, successfully delays preeclampsia onset (Wright and Nicolaides, 2019).

Low PROZ levels are associated with thrombotic disorders, and many adverse pregnancy outcomes have also been linked with maternal PROZ levels (Almawi et al., 2013). A small, prospective case-control study found low PROZ levels associated with intrauterine growth restriction (IUGR) and intrauterine fetal demise, but not preeclampsia (Bretelle et al., 2005). In contrast, a larger cross-sectional study found lower median levels of PROZ in preeclampsia outcomes but not IUGR or fetal demise (Erez et al., 2007). One study found a correlation between lower PROZ levels and severity of HELLP syndrome (a complication of preeclampsia that stands for haemolysis, elevated liver enzymes, and low platelets), which occurs in 10-20% of preeclamptic pregnancies (Haram, et al., 2014, Kaygusuz et al., 2011). However, no study on PROZ or other clotting factors in preeclampsia has been successfully replicated, likely due to the extreme heterogeneity of the disease and the mix of populations studied.

As most previous studies on PROZ have focused on the mother’s genome (Erez, et al., 2007, Xu et al., 2018), ours is the first study to suggest a correlation between the fetal PROZ/F7/F10 locus on chromosome 13 and preeclampsia. In a subset of our sample, we found no differences in protein plasma levels of PROZ between preeclamptic and healthy pregnancies in the mother or the offspring. However, this analysis was limited by small sample size and post-natal blood sampling. In other words, since samples were only collected immediately after birth, we were unable to monitor changes in PROZ protein levels throughout the pregnancy. Further longitudinal studies could analyze clotting factor levels and activity in this pregnant population to assess the impact of thrombosis in preeclampsia risk among Andean highlanders.

Expanding the TDT to a parent of origin analysis (POO), we found several associations to genetic regions with suggested paternal inheritance. For instance, the top TDT hit on F10, rs5960, is also the locus with the strongest paternal origin effect in the TDT-POO. Although future research examining variation at the PROZ/F7/F10 region in a larger population will be needed to confirm this finding, our results are of interest to studies investigating the role of paternal genetic factors, genomic imprinting and paternal-offspring conflict in preeclampsia and other pregnancy disorders (Christians et al., 2017, Galaviz-Hernandez, et al., 2018, Hollegaard et al., 2013, Pilvar et al., 2019, Wikstrom et al., 2012, Zadora et al., 2017).

Other top regions in the TDT-POO include biologically relevant genes SPAG6 and ADGRG6, previously described as being involved in infertility and the immune system (SPAG6) (Cooley, et al., 2016, Neilson, et al., 1999), or angiogenesis (ADGRG6/GPR126) (Cui, et al., 2014, Sitras, et al., 2009). Of these, only ADGRG6 has been associated with preeclampsia in previous research that found it upregulated in preeclamptic placentas (Cui et al., 2014, Sitras et al., 2009). Future work could investigate potential roles of these candidate genes in the maternal-fetal interface and elucidate their involvement in the pathophysiology of preeclampsia.

We also found several placental genes associated with secondary phenotypes that underline the severity of preeclampsia, such as hypertension, gestational age, and proteinuria. Differential expression of these genes may contribute to the insufficiency of placental development in early pregnancy that leads to hypertension and proteinuria in the third trimester. Some of our suggestive associations are near genes previously shown to have roles in pregnancy, vascular processes, and even preeclampsia. One such gene is APLNR, the receptor to ELABELA, which causes preeclampsia symptoms in mice (Ho, et al., 2017) and is lower in the serum and placentas of some women with late-onset, but not early-onset preeclampsia (Zhou et al., 2019). However, this gene is in a multigenic locus, and fine-mapping approaches with functional studies are required to discover the effect of this locus in our cohort.

Our study is one of only a few preeclampsia GWAS to include the offspring genome. One recent study with a large cohort found fetal variants near FLT1 associated with late (but not early) preeclampsia (Gray et al., 2018, McGinnis et al., 2017), suggesting that dysregulation of genes in the fetal genome contributes to preeclampsia. In our study, we found novel associations between the fetal genome and preeclampsia and its severity phenotypes. For instance, we found an association between severity of hypertension (systolic and diastolic pressure measurements) and KCNS3/K(V)9.3, a gene that is highly expressed in the human placenta, where it localizes to placental vascular tissues and syncytiotrophoblast cells (Fyfe et al., 2012). We also found an association between the retinoic acid (RA) signaling gene RARB in the fetal genome and severity of proteinuria. RA signaling is essential for healthy placental and fetal development in animal models, with evidence of a similar requirement in humans (reviewed in Comptour et al., 2016). RARB is expressed in the extravillous part of the placenta and its activation induces RARRES, shown in one study to be overexpressed in preeclamptic placentas (Huebner et al., 2018). Our study adds to this body of literature and highlights the role of RA in proper placentation. Lastly, the most interesting region in the offspring genome was identified in our case-control study: the S100P gene, which encodes a calcium-binding protein that is strongly expressed in the placenta (Zhu et al., 2015) and promotes trophoblast proliferation in culture (Zhou et al., 2016). This finding suggests that fetal biology, and specifically placental development driven by fetal genes, contributes substantially to the pathology of preeclampsia.

We examined the global distribution of allele frequencies for each of the candidate associated SNPs detailed above. Most alleles were shared among several global populations (see global distribution plots in Supplementary Figures). A notable exception is SNP rs62297274, an intronic variant in the gene STK32B that is associated with proteinuria in the offspring genome. The minor allele reaches its highest global frequency in Peruvian populations (Supplementary Figure 34). As of this writing, SNP rs62297274 has no reported clinical significance in dbSNP. However, intronic variants are known to have functional impacts on RNA splicing patterns (Cooper, 2010). To elucidate the functional significance of this variant, future research could evaluate its pathogenic potential in Peruvian populations (Joynt et al., 2020, Lin et al., 2019).
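
To make the comparison concrete, the frequency of a given allele in each population can be computed from genotype counts and then ranked across populations. The following is a minimal Python sketch under stated assumptions: the population labels and genotype counts are invented placeholders, the helper name allele_frequency is hypothetical, and none of the values correspond to rs62297274 or to any real reference panel.

```python
def allele_frequency(n_hom_other, n_het, n_hom_allele):
    """Frequency of the allele of interest from diploid genotype counts.

    n_hom_other:  individuals carrying zero copies of the allele
    n_het:        individuals carrying one copy
    n_hom_allele: individuals carrying two copies
    """
    total_alleles = 2 * (n_hom_other + n_het + n_hom_allele)
    if total_alleles == 0:
        raise ValueError("no genotyped individuals")
    return (n_het + 2 * n_hom_allele) / total_alleles


# Placeholder genotype counts per population (NOT real data).
toy_counts = {
    "POP_A": (81, 18, 1),
    "POP_B": (49, 42, 9),
    "POP_C": (64, 32, 4),
}

freqs = {pop: allele_frequency(*counts) for pop, counts in toy_counts.items()}
top_pop = max(freqs, key=freqs.get)

for pop, freq in sorted(freqs.items(), key=lambda item: item[1], reverse=True):
    print(pop, round(freq, 3))
print("highest frequency in:", top_pop)
```

Because the allele of interest is fixed before the comparison, the same allele is tracked in every population even where it is not locally the rarer one.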

As discussed, several genes found in our analyses are involved in placental function. Interestingly, morphological studies comparing placentas from Andean-descent and European-descent individuals in Bolivia, at both low and high altitudes, describe differences in placental composition (Jackson et al., 1987, Jackson et al., 1988). Highland placentas from individuals of both ancestries show more intervillous space but fewer villi, and the Andean highland placenta, compared to the European, has more trophoblast and villous stroma on average. Differences in placental morphology suggest an adaptive mechanism to the lower oxygen pressure at high altitude, but one that does not lower the risk of preeclampsia.

In conclusion, this study investigates a cohort of preeclamptic Highland Andean families from Puno, Peru to elucidate the genetic basis of this pregnancy disorder at high altitudes. We generated high-density genotype data at over 400,000 positions across the genome and used these data to determine ancestry patterns and map associations between genetic variants and preeclampsia phenotypes. Our trio-based recruitment strategy, including genotype data from mothers, fathers, and offspring, allowed us to identify novel genetic regions not previously reported in preeclampsia genome-wide association studies. Specifically, we identified strong associations with several variants near genes involved with placental and blood vessel function, and therefore of functional importance for human pregnancy biology. The strongest association hit involves a cluster of clotting factor genes on chromosome 13, including PROZ, F7 and F10, in the fetal genome. This finding provides supporting evidence that coagulation plays an important role in the pathology of preeclampsia and potentially underlies other pregnancy disorders exacerbated at high altitude.

Studying diverse human groups with unique genetic adaptations enables identification of the primary genetic factors underlying complex phenotypes and gene function. This research examined Andean populations as a model to understand human pregnancy physiology in hypoxic conditions. This natural experimental setting provides a unique opportunity to understand the genetic factors influencing human reproductive fitness in challenging environments worldwide and to discover population-specific variants underlying biomedical traits. Our work also underscores the importance of including diverse populations in genome wide association studies and functional variant discovery efforts to better understand human physiology and disease globally.

Data Availability

The data underlying this article are available in the European Genome-Phenome Archive (EGA) at https://ega-archive.org/ and can be accessed with Data Access Committee approval under Study EGAS00001004625.

Authors’ roles

K.M.B.R. and M.A.N.C. wrote the article with input from G.L.W., A.M.E. and J.C.B. K.M.B.R., M.A.N.C., J.W.C., E.T.Z., C.R.G. and G.L.W. performed data analyses. P.O.T. and K.S.M. designed and coordinated data collection. P.O.T., K.S.M., L.E.L., V.V.D., J.C.M.C., F.M.C. and G.P.Y.P. collected samples and medical records in Puno and handled fieldwork logistics. K.M.B.R., M.A.N.C., A.S., E.R., G.M.H., R.C.S., R.C., C.E., S.H., E.G.B., E.T.Z., G.P. and C.G. performed laboratory work. C.D.B., J.C.B., C.R.G., A.M.E., C.G., and M.A.N.C. provided resources, funding, and/or laboratory space. All authors revised the article and approved the final submitted version.

Funding sources

This work was supported in part by the National Science Foundation (NSF) Graduate Research Fellowship Program Grant No. DGE–1147470 awarded to K.M.B.R. (fellow no. 2014187481); NSF SBE Postdoctoral Research Fellowship Award No. 1711982 awarded to M.N.C.; an A.P. Giannini Foundation postdoctoral fellowship, a Stanford Child Health Research Institute postdoctoral award, and a Stanford Dean’s Postdoctoral Fellowship awarded to E.T.Z.; the Chan Zuckerberg Biohub Investigator Award to C.D.B.; a Burroughs Wellcome Prematurity Initiative Award to J.C.B.; the George Rosenkranz Prize for Health Care Research in Developing Countries, the International Center for Genetic Engineering and Biotechnology (ICGEB, Italy) grant CRP/MEX15-04_EC, and Mexico’s CONACYT grant FONCICYT/50/2016, each awarded to A.M.E. Further funding was provided by the Sandler Family Foundation, the American Asthma Foundation, the RWJF Amos Medical Faculty Development Program, Harry Wm. and Diana V. Hind Distinguished Professor in Pharmaceutical Sciences II, National Institutes of Health, National Heart, Lung, and Blood Institute Awards R01HL117004, R01HL128439, R01HL135156, R01HL141992, National Institute of Environmental Health Sciences Awards R01ES015794, R21ES24844, the National Institute on Minority Health and Health Disparities Awards R01MD010443, and R56MD013312, and the National Human Genome Research Institute Award U01HG009080, each awarded to E.G.B.

Conflicts of interest statement

J.W.C. is currently a full-time employee at Genentech, Inc. and holds stock in Roche Holding AG. E.G.B. reports grants from the National Institute of Health, Lung, Blood Institute, the National Institute of Health, General Medical Sciences, the National Institute on Minority Health and Health Disparities, the Tobacco-Related Disease Research Program, the Food and Drug Administration, and from the Sandler Family Foundation, during the conduct of the study.

Author notes

Keyla M. Badillo Rivera and Maria A. Nieves-Colón contributed equally as first authors. Christopher R. Gignoux, Genevieve L. Wojcik and Andrés Moreno-Estrada contributed equally as last authors.

Acknowledgements

We extend our deepest gratitude to the people of Puno, Peru who participated in this study at Hospital Regional Manuel Nuñez Butrón and Universidad del Altiplano. We are tremendously grateful to Javier Mendoza Revilla who provided commentary on the final version of this manuscript, and to the Mendoza Revilla family, who provided lodging and logistics support in Lima during fieldwork seasons of the technical team.

Footnotes

  • † The authors consider that the first two authors should be regarded as joint first authors.

References

  1.
    Abalos E, Cuesta C, Grosso AL, Chou D, Say L. Global and regional estimates of preeclampsia and eclampsia: a systematic review. Eur J Obstet Gynecol Reprod Biol 2013;170: 1–7.
  2.
    Almawi WY, Al-Shaikh FS, Melemedjian OK, Almawi AW. Protein Z, an anticoagulant protein with expanding role in reproductive biology. Reproduction 2013; 146(2): R73–R80.
  3.
    Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res 2009;19: 1655–1664.
  4.
    American College of Obstetricians and Gynecologists. Hypertension in Pregnancy. Report of the American College of Obstetricians and Gynecologists’ Task Force on Hypertension in Pregnancy. Obstetrics & Gynecology 2013; 122(5): 1122–1131.
  5.
    American College of Obstetricians and Gynecologists. Gestational Hypertension and Preeclampsia: ACOG Practice Bulletin, Number 222. Obstetrics & Gynecology 2020; 135(6): e237–260
  6.
    Barbieri C, Barquera R, Arias L, Sandoval JR, Acosta O, Zurita C, Aguilar-Campos A, Tito-Alvarez AM, Serrano-Osuna R, Gray RD et al. The Current Genomic Landscape of Western South America: Andes, Amazonia, and Pacific Coast. Mol Biol Evol 2019;36: 2698–2713.
  7.
    Beall CM. Adaptation to High Altitude: Phenotypes and Genotypes. Annual Review of Anthropology 2014;43: 251–272.
  8.
    Bigham AW, Lee FS. Human high-altitude adaptation: forward genetics meets the HIF pathway. Genes Dev 2014;28: 2189–2204.
  9.
    Bigham AW, Wilson MJ, Julian CG, Kiyamu M, Vargas E, Leon-Velarde F, Rivera-Chira M, Rodriquez C, Browne VA, Parra E et al. Andean and Tibetan patterns of adaptation to high altitude. Am J Hum Biol 2013;25: 190–197.
  10.
    Boyd HA, Tahir H, Wohlfahrt J, Melbye M. Associations of personal and family preeclampsia history with the risk of early-, intermediate- and late-onset preeclampsia. Am J Epidemiol 2013;178: 1611–1619.
  11.
    Bretelle F, Arnoux D, Shojai R, D’Ercole C, Sampol J, Dignat F, Camoin-Jau L. Protein Z in patients with pregnancy complications. Am J Obstet Gynecol 2005;193: 1698–1702.
  12.
    Bristol N. Dying to give birth: Fighting maternal mortality in Peru. Health Affairs 2009;28: 997–1002.
  13.
    Buniello A, MacArthur JAL, Cerezo M, Harris LW, Hayhurst J, Malangone C, McMahon A, Morales J, Mountjoy E, Sollis E et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res 2019;47: D1005–D1012.
  14.
    Carithers LJ, Ardlie K, Barcus M, Branton PA, Britton A, Buia SA, Compton CC, DeLuca DS, Peter-Demchok J, Gelfand ET et al. A Novel Approach to High-Quality Postmortem Tissue Procurement: The GTEx Project. Biopreserv Biobank 2015;13: 311–319.
  15.
    Carithers LJ, Moore HM. The Genotype-Tissue Expression (GTEx) Project. Biopreserv Biobank 2015;13: 307–308.
  16.
    Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 2015;4: 7.
  17.
    Chelbi ST, Doridot L, Mondon F, Dussour C, Rebourcet R, Busato F, Gascoin-Lachambre G, Barbaux S, Rigourd V, Mignot TM et al. Combination of promoter hypomethylation and PDX1 overexpression leads to TBX15 decrease in vascular IUGR placentas. Epigenetics 2011;6: 247–255.
  18.
    Chirn GW, Rahman R, Sytnikova YA, Matts JA, Zeng M, Gerlach D, Yu M, Berger B, Naramura M, Kile BT et al. Conserved piRNA Expression from a Distinct Set of piRNA Cluster Loci in Eutherian Mammals. PLoS Genet 2015;11: e1005652.
  19.
    Christians JK, Leavey K, Cox BJ. Associations between imprinted gene expression in the placenta, human fetal growth and preeclampsia. Biol Lett 2017;13.
  20.
    Cincotta RB, Brennecke SP. Family history of pre-eclampsia as a predictor for pre-eclampsia in primigravidas. Int J Gynaecol Obstet 1998;60: 23–27.
  21.
    Comptour A, Rouzaire M, Belville C, Bouvier D, Gallot D, Blanchon L, Sapin V. Nuclear retinoid receptors and pregnancy: placental transfer, functions, and pharmacological aspects. Cell Mol Life Sci 2016;73: 3823–3837.
  22.
    Cooley LF, El Shikh ME, Li W, Keim RC, Zhang Z, Strauss JF, Zhang Z, Conrad DH. Impaired immunological synapse in sperm associated antigen 6 (SPAG6) deficient mice. Sci Rep 2016;6: 25840.
  23.
    Cooper DN. Functional intronic polymorphisms: Buried treasure awaiting discovery within our genes. Human Genomics 2010;4: 284–288.
  24.
    Cui H, Wang Y, Huang H, Yu W, Bai M, Zhang L, Bryan BA, Wang Y, Luo J, Li D et al. GPR126 protein regulates developmental and pathological angiogenesis through modulation of VEGFR2 receptor signaling. J Biol Chem 2014;289: 34871–34885.
  25.
    Duley L. The global impact of pre-eclampsia and eclampsia. Semin Perinatol 2009;33: 130–137.
  26.
    Erez O, Hoppensteadt D, Romero R, Espinoza J, Goncalves L, Nien JK, Kusanovic JP, Fareed J, Gotsch F, Pineles B et al. Preeclampsia is associated with low concentrations of protein Z. J Matern Fetal Neonatal Med 2007;20: 661–667.
  27.
    Francis RM. Pophelper: An R package and web app to analyse and visualize population structure. Mol Ecol Resour 2017;17: 27–32.
  28.
    Fyfe GK, Panicker S, Jones RL, Wareing M. Expression of an electrically silent voltage-gated potassium channel in the human placenta. J Obstet Gynaecol 2012;32: 624–629.
  29.
    Galanter JM, Gignoux CR, Torgerson DG, Roth LA, Eng C, Oh SS, Nguyen EA, Drake KA, Huntsman S, Hu D et al. Genome-wide association study and admixture mapping identify different asthma-associated loci in Latinos: The Genes-environments & Admixture in Latino Americans study. J Allergy Clin Immunol 2014;134: 295–305.
  30.
    Galaviz-Hernandez C, Sosa-Macias M, Teran E, Garcia-Ortiz JE, Lazalde-Ramos BP. Paternal Determinants in Preeclampsia. Front Physiol 2018;9: 1870.
  31.
    Gil Cipirán F. Situación epidemiológica de la mortalidad materna en el Perú Boletín Epidemiológico del Perú. 2017. Centro Nacional de Epidemiología, Prevención y Control de Enfermedades, Ministerio de Salud, Lima, pp. 1514–1516.
  32.
    Gong YH, Jia J, Lu DH, Dai L, Bai Y, Zhou R. Outcome and risk factors of early onset severe preeclampsia. Chin Med J (Engl) 2012;125: 2623–2627.
  33.
    Gray KJ, Saxena R, Karumanchi SA. Genetic predisposition to preeclampsia is conferred by fetal DNA variants near FLT1, a gene involved in the regulation of angiogenesis. Am J Obstet Gynecol 2018;218: 211–218.
  34.
    Guevara Ríos E, Meza Santibáñez L. Manejo de la preeclampsia/eclampsia en el Perú. Revista Peruana de Ginecología y Obstetricia 2014; October: 385–393.
  35.
    Haram K, Mortensen JH, Nagy B. Genetic aspects of preeclampsia and the HELLP syndrome. J Pregnancy 2014;2014: 910751.
  36.
    Harris DN, Song W, Shetty AC, Levano KS, Caceres O, Padilla C, Borda V, Tarazona D, Trujillo O, Sanchez C et al. Evolutionary genomic dynamics of Peruvians before, during, and after the Inca Empire. Proc Natl Acad Sci U S A 2018;115: E6526–E6535.
  37.
    Ho L, van Dijk M, Chye STJ, Messerschmidt DM, Chng SC, Ong S, Yi LK, Boussata S, Goh GH, Afink GB et al. ELABELA deficiency promotes preeclampsia and cardiovascular malformations in mice. Science 2017;357: 707–713.
  38.
    Hollegaard B, Byars SG, Lykke J, Boomsma JJ. Parent-offspring conflict and the persistence of pregnancy-induced hypertension in modern humans. PLoS One 2013;8: e56821.
  39.
    Huebner H, Hartner A, Rascher W, Strick RR, Kehl S, Heindl F, Wachter DL, Beckmann Md MW, Fahlbusch FB, Ruebner M. Expression and Regulation of Retinoic Acid Receptor Responders in the Human Placenta. Reprod Sci 2018;25: 1357–1370.
  40.
    Ismail SK, Higgins JR. Hemostasis in pre-eclampsia. Semin Thromb Hemost 2011;37: 111–117.
  41.
    Jackson MR, Mayhew TM, Haas JD. The volumetric composition of human term placentae: altitudinal, ethnic and sex differences in Bolivia. J Anat 1987;152: 173–187.
  42.
    Jackson MR, Mayhew TM, Haas JD. On the factors which contribute to thinning of the villous membrane in human placentae at high altitude. II. An increase in the degree of peripheralization of fetal capillaries. Placenta 1988;9: 9–18.
  43.
    Joynt AT, Evans TA, Pellicore MJ, Davis-Marcisak EF, Aksit MA, Eastman AC, Patel SU, Paul KC, Osorio DL, Bowling AD et al. Evaluation of both exonic and intronic variants for effects on RNA splicing allows for accurate assessment of the effectiveness of precision therapies. PLoS Genet 2020;16: e1009100.
  44.
    Julian CG, Wilson MJ, Moore LG. Evolutionary adaptation to high altitude: a view from in utero. Am J Hum Biol 2009;21: 614–622.
  45.
    Kaygusuz I, Firatli-Tuglular T, Toptas T, Ugurel V, Demir M. Low levels of protein Z are associated with HELLP syndrome and its severity. Clin Appl Thromb Hemost 2011;17: 214–219.
  46.
    Keyes LE, Armaza JF, Niermeyer S, Vargas E, Young DA, Moore LG. Intrauterine growth restriction, preeclampsia, and intrauterine mortality at high altitude in Bolivia. Pediatr Res 2003;54: 20–25.
  47.
    Kidd JM, Gravel S, Byrnes J, Moreno-Estrada A, Musharoff S, Bryc K, Degenhardt JD, Brisbin A, Sheth V, Chen R et al. Population genetic inference from personal genome data: impact of ancestry and admixture on human genomic variation. Am J Hum Genet 2012;91: 660–671.
  48.
    Koboldt DC. Best practices for variant calling in clinical sequencing. Genome Med 2020;12: 91.
  49.
    Lin H, Hargreaves KA, Li R, Reiter JL, Wang Y, Mort M, Cooper DN, Zhou Y, Zhang C, Eadon MT et al. RegSNPs-intron: a computational framework for predicting pathogenic impact of intronic single nucleotide variants. Genome Biol 2019;20: 254.
  50.
    Machiela MJ, Chanock SJ. LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants. Bioinformatics 2015;31: 3555–3557.
  51.
    Maples BK, Gravel S, Kenny EE, Bustamante CD. RFMix: a discriminative modeling approach for rapid and robust local-ancestry inference. Am J Hum Genet 2013;93: 278–288.
  52.
    Marcus JH, Novembre J. Visualizing the geography of genetic variants. Bioinformatics 2017;33: 594–595.
  53.
    McGinnis R, Steinthorsdottir V, Williams NO, Thorleifsson G, Shooter S, Hjartardottir S, Bumpstead S, Stefansdottir L, Hildyard L, Sigurdsson JK et al. Variants in the fetal genome near FLT1 are associated with risk of preeclampsia. Nat Genet 2017;49: 1255–1260.
  54.
    Michita RT, Kaminski VL, Chies JAB. Genetic Variants in Preeclampsia: Lessons From Studies in Latin-American Populations. Front Physiol 2018;9: 1771.
  55.
    Moore DS, McCabe GP, Craig BA. Introduction to the Practice of Statistics, 2009. W.H. Freeman, New York.
  56.
    Moore LG, Charles SM, Julian CG. Humans at high altitude: hypoxia and fetal growth. Respir Physiol Neurobiol 2011;178: 181–190.
  57.
    Moore LG, Hershey DW, Jahnigen D, Bowes W, Jr. The incidence of pregnancy-induced hypertension is increased among Colorado residents at high altitude. Am J Obstet Gynecol 1982;144: 423–429.
  58.
    Moore LG, Shriver M, Bemis L, Hickler B, Wilson M, Brutsaert T, Parra E, Vargas E. Maternal adaptation to high-altitude pregnancy: an experiment of nature--a review. Placenta 2004;25 Suppl A: S60–71.
  59.
    Neilson LI, Schneider PA, Van Deerlin PG, Kiriakidou M, Driscoll DA, Pellegrini MC, Millinder S, Yamamoto KK, French CK, Strauss JF, 3rd. cDNA cloning and characterization of a human sperm antigen (SPAG6) with homology to the product of the Chlamydomonas PF16 locus. Genomics 1999;60: 272–280.
  60.
    O’Connell J, Gurdasani D, Delaneau O, Pirastu N, Ulivi S, Cocca M, Traglia M, Huang J, Huffman JE, Rudan I et al. A general approach for haplotype phasing across the full spectrum of relatedness. PLoS Genet 2014;10: e1004234.
  61.
    Osungbade KO, Ige OK. Public health perspectives on preeclampsia in developing countries: Implications for health system strengthening. J Pregnancy 2011;2011: 481095.
  62.
    Palmer SK, Moore LG, Young D, Cregger B, Berman JC, Zamudio S. Altered blood pressure course during normal pregnancy and increased preeclampsia at high altitude (3100 meters) in Colorado. Am J Obstet Gynecol 1999;180: 1161–1168.
  63.
    Pappa KI, Roubelakis M, Vlachos G, Marinopoulos S, Zissou A, Anagnou NP, Antsaklis A. Variable effects of maternal and paternal-fetal contribution to the risk for preeclampsia combining GSTP1, eNOS, and LPL gene polymorphisms. J Matern Fetal Neonatal Med 2011;24: 628–635.
  64.
    Patterson N, Price AL, Reich D. Population structure and eigenanalysis. PLoS Genet 2006;2: e190.
  65.
    Phipps EA, Thadhani R, Benzing T, Karumanchi SA. Pre-eclampsia: Pathogenesis, novel diagnostics, and therapies. Nat Rev Nephrol 2019; 15(5): 275–289.
  66.
    Pilvar D, Reiman M, Pilvar A, Laan M. Parent-of-origin-specific allelic expression in the human placenta is limited to established imprinted loci and it is stably maintained across pregnancy. Clin Epigenetics 2019; 11: 94.
  67.
    Pruim RJ, Welch RP, Sanna S, Teslovich TM, Chines PS, Gliedt TP, Boehnke M, Abecasis GR, Willer CJ. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics 2010;26: 2336–2337.
  68.
    Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 2007;81: 559–575.
  69.
    Purcell S, Sham P, Daly MJ. Parental phenotypes in family-based association analysis. Am J Hum Genet 2005;76: 249–259.
  70.
    Rana S, Lemoine E, Granger JP, Karumanchi SA. Preeclampsia: Pathophysiology, Challenges, and Perspectives. Circ Res 2019;124: 1094–1112.
  71.
    Salonen Ros H, Lichtenstein P, Lipworth L, Cnattingius S. Genetic effects on the liability of developing pre-eclampsia and gestational hypertension. Am J Med Genet 2000;91: 256–260.
  72.
    Segura-Vega L. New blood pressure levels in Peruvian high altitude populations and the new North American high blood pressure guidelines. Journal of Cardiology and Current Research 2019;12: 84–87.
  73.
    Silva LM, Coolman M, Steegers EA, Jaddoe VW, Moll HA, Hofman A, Mackenbach JP, Raat H. Low socioeconomic status is a risk factor for preeclampsia: the Generation R Study. J Hypertens 2008;26: 1200–1208.
  74.
    Sitras V, Paulssen RH, Gronaas H, Leirvik J, Hanssen TA, Vartun A, Acharya G. Differential placental gene expression in severe preeclampsia. Placenta 2009;30: 424–433.
  75.
    Tan D, Liang H, Cao K, Zhang Q. CUL4A enhances human trophoblast migration and is associated with pre-eclampsia. Int J Clin Exp Pathol 2017; 10: 1054—10551.
  76.
    Taylor BL, Liu FF, Sander M. Nkx6.1 is essential for maintaining the functional state of pancreatic beta cells. Cell Rep 2013;4: 1262–1275.
  77.
    R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, 2018.
  78.
    Tissot van Patot MC, Murray AJ, Beckey V, Cindrova-Davies T, Johns J, Zwerdlinger L, Jauniaux E, Burton GJ, Serkova NJ. Human placental metabolic adaptation to chronic hypoxia, high altitude: hypoxic preconditioning. Am J Physiol Regul Integr Comp Physiol 2009; 298: R166–R172.
  79.
    Tishkoff S. Strength in small numbers. Science 2015;349: 1282–1283.
  80.
    Turner SD. qqman: Q-Q and Manhattan Plots for GWAS Data. 2017.
  81.
    Valenzuela FJ, Perez-Sepulveda A, Torres MJ, Correa P, Repetto GM, Illanes SE. Pathogenesis of preeclampsia: the genetic component. J Pregnancy 2012;2012: 632732.
  82.
    Wang Z, Liu N, Shi S, Liu S, Lin H. The Role of PIWIL4, an Argonaute Family Protein, in Breast Cancer. J Biol Chem 2016;291: 10646–10658.
  83.
    Wickham H. ggplot2: Elegant Graphics for Data Analysis. 2 edn, 2016. Springer International Publisher, New York.
  84.
    Wikstrom AK, Gunnarsdottir J, Cnattingius S. The paternal role in pre-eclampsia and giving birth to a small for gestational age infant; a population-based cohort study. BMJ Open 2012;2.
  85.
    Williams PJ, Broughton Pipkin F. The genetics of pre-eclampsia and other hypertensive disorders of pregnancy. Best Pract Res Clin Obstet Gynaecol 2011;25: 405–417.
  86.
    Wojtowicz A, Zembala-Szczerba M, Babczyk D, Kolodziejczyk-Pietruszka M, Lewaczynska O, Huras H. Early- and Late-Onset Preeclampsia: A Comprehensive Cohort Study of Laboratory and Clinical Findings according to the New ISHHP Criteria. Int J Hypertens 2019;2019: 4108271.
  87.
    Wright D, Nicolaides KH. Aspirin delays the development of preeclampsia. Am J Obstet Gynecol 2019.
  88.
    Xu Z, Zhang Y, Liu W, Liu Y, Su Y, Xing Q, He X, Wei Z, Cao Y, Xiang H. Polymorphisms of F2, PROC, PROZ, and F13A1 Genes are Associated With Recurrent Spontaneous Abortion in Chinese Han Women. Clin Appl Thromb Hemost 2018;24: 894–900.
  89.
    Yang H, Wang K. Genomic variant annotation and prioritization with ANNOVAR and wANNOVAR. Nat Protoc 2015;10: 1556–1566.
  90.
    Yang J, Lee SH, Goddard ME, Visscher PM. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet 2011;88: 76–82.
  91.
    Yong HEJ, Murthi P, Brennecke SP, Moses EK. Genetic Approaches in Preeclampsia. Methods Mol Biol 2018;1710: 53–72.
  92.
    Zadora J, Singh M, Herse F, Przybyl L, Haase N, Golic M, Yung HW, Huppertz B, Cartwright JE, Whitley G et al. Disturbed Placental Imprinting in Preeclampsia Leads to Altered Expression of DLX5, a Human-Specific Early Trophoblast Marker. Circulation 2017;136: 1824–1839.
  93.
    Zamudio S. High-altitude hypoxia and preeclampsia. Frontiers in Bioscience 2007;12: 2967–2977.
  94.
    Zhou L, Sun H, Cheng R, Fan X, Lai S, Deng C. ELABELA, as a potential diagnostic biomarker of preeclampsia, regulates abnormally shallow placentation via APJ. Am J Physiol Endocrinol Metab 2019;316: E773–E781.
  95.
    Zhou T, Wang H, Zhang S, Jiang X, Wei X. S100P is a potential molecular target of cadmium-induced inhibition of human placental trophoblast cell proliferation. Exp Toxicol Pathol 2016;68: 565–570.
  96.
    Zhu HY, Tong XM, Lin XN, Jiang LY, Wang JX, Zhang SY. Expression and Distribution of Calcium-Binding Protein S100P in Human Placenta during Pregnancy. Int J Fertil Steril 2015;8: 445–452.
Posted May 21, 2021.
Subject Area

  • Genetic and Genomic Medicine