ancGWAS: a post genome-wide association study method for interaction, pathway and ancestry analysis in homogeneous and admixed populations

Bioinformatics. 2016 Feb 15;32(4):549-56. doi: 10.1093/bioinformatics/btv619. Epub 2015 Oct 27.

Abstract

Motivation: Despite numerous successful Genome-wide Association Studies (GWAS), detecting variants that have low disease risk still poses a challenge. GWAS may miss disease genes with weak genetic effects or strong epistatic effects due to the single-marker testing approach commonly used. GWAS may thus generate false negative or inconclusive results, suggesting the need for novel methods to combine effects of single nucleotide polymorphisms within a gene to increase the likelihood of fully characterizing the susceptibility gene.

Results: We developed ancGWAS, an algebraic graph-based centrality measure that accounts for linkage disequilibrium in identifying significant disease sub-networks by integrating the association signal from GWAS data sets into the human protein-protein interaction (PPI) network. We validated ancGWAS using an association study result from a breast cancer data set and the simulation of interactive disease loci in the simulation of a complex admixed population, as well as pathway-based GWAS simulation. This new approach holds promise for deconvoluting the interactions between genes underlying the pathogenesis of complex diseases. Results obtained yield a novel central breast cancer sub-network of the human interactome implicated in the proteoglycan syndecan-mediated signaling events pathway which is known to play a major role in mesenchymal tumor cell proliferation, thus providing further insights into breast cancer pathogenesis.

Availability and implementation: The ancGWAS package and documents are available at http://www.cbio.uct.ac.za/~emile/software.html.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Breast Neoplasms / epidemiology
  • Breast Neoplasms / genetics
  • Breast Neoplasms / pathology*
  • Female
  • Gene Regulatory Networks
  • Genetic Predisposition to Disease
  • Genetics, Population*
  • Genome-Wide Association Study / methods*
  • Humans
  • Linkage Disequilibrium
  • Polymorphism, Single Nucleotide / genetics*
  • Protein Interaction Mapping / methods*
  • Signal Transduction*
  • Software*