medRxiv

Machine learning methods applied to genotyping data capture interactions between single nucleotide variants in late onset Alzheimer’s disease

Magdalena Arnal Segura, Dietmar Fernandez Orth, Claudia Giambartolomei, Giorgio Bini, Eleftherios Samaras, Maya Kassis, Fotis Aisopos, Jordi Rambla De Argila, Georgios Paliouras, Peter Garrard, Gian Gaetano Tartaglia
doi: https://doi.org/10.1101/2021.08.30.21262815
Magdalena Arnal Segura
a Department of Biology ‘Charles Darwin’, Sapienza University of Rome, P.le A. Moro 5, Rome 00185, Italy
b Centre for Human Technologies, Istituto Italiano di Tecnologia, Via Enrico Melen, 83, 16152 Genova GE, Italy
c Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
Dietmar Fernandez Orth
c Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
Claudia Giambartolomei
b Centre for Human Technologies, Istituto Italiano di Tecnologia, Via Enrico Melen, 83, 16152 Genova GE, Italy
Giorgio Bini
b Centre for Human Technologies, Istituto Italiano di Tecnologia, Via Enrico Melen, 83, 16152 Genova GE, Italy
Eleftherios Samaras
d Stroke and Dementia Research Centre, St George’s, University of London, Cranmer Terrace, London SW17 0RE, UK
Maya Kassis
d Stroke and Dementia Research Centre, St George’s, University of London, Cranmer Terrace, London SW17 0RE, UK
Fotis Aisopos
e Institute of Informatics and Telecommunications, NCSR Demokritos, Athens, Greece
Jordi Rambla De Argila
c Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
Georgios Paliouras
e Institute of Informatics and Telecommunications, NCSR Demokritos, Athens, Greece
Peter Garrard
d Stroke and Dementia Research Centre, St George’s, University of London, Cranmer Terrace, London SW17 0RE, UK
Gian Gaetano Tartaglia
a Department of Biology ‘Charles Darwin’, Sapienza University of Rome, P.le A. Moro 5, Rome 00185, Italy
b Centre for Human Technologies, Istituto Italiano di Tecnologia, Via Enrico Melen, 83, 16152 Genova GE, Italy
c Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
f Catalan Institution for Research and Advanced Studies, ICREA, Passeig Lluís Companys 23, 08010 Barcelona, Spain
For correspondence: gian.tartaglia@iit.it

Abstract

INTRODUCTION Genome-wide association studies (GWAS) in late onset Alzheimer’s disease (LOAD) provide lists of individual genetic determinants. However, GWAS capture synergistic effects among multiple genetic variants poorly and have limited specificity.

METHODS We applied tree-based machine learning algorithms (MLs) to discriminate individuals with LOAD (> 700) from age-matched unaffected subjects using single nucleotide variants (SNVs) from AD studies, obtaining specific genomic profiles with the prioritized SNVs.
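As a minimal illustration of why tree-based methods can pick up interactions between variants, the sketch below computes Gini impurity reductions for nested splits on a toy genotype table. It is not the authors' pipeline: the SNV names `snv_a` and `snv_b`, the four samples, and the rule that `snv_b` cancels the risk carried by `snv_a` are all invented for the example, and genotypes are assumed to be coded as minor allele counts (0, 1 or 2).

```python
from collections import Counter


def gini(labels):
    """Gini impurity of a list of class labels (0 = control, 1 = LOAD)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def split_gain(samples, snv):
    """Impurity reduction from splitting samples into carriers and non-carriers of one SNV."""
    labels = [s["load"] for s in samples]
    non_carriers = [s["load"] for s in samples if s[snv] == 0]
    carriers = [s["load"] for s in samples if s[snv] > 0]
    n = len(samples)
    weighted = (len(non_carriers) * gini(non_carriers) + len(carriers) * gini(carriers)) / n
    return gini(labels) - weighted


# Hypothetical toy data: only carriers of snv_a who lack snv_b are cases.
toy_samples = [
    {"snv_a": 0, "snv_b": 0, "load": 0},
    {"snv_a": 1, "snv_b": 0, "load": 1},
    {"snv_a": 0, "snv_b": 1, "load": 0},
    {"snv_a": 2, "snv_b": 1, "load": 0},
]

# First split at the root, then a second split restricted to carriers of snv_a.
root_gain_a = split_gain(toy_samples, "snv_a")
carriers_of_a = [s for s in toy_samples if s["snv_a"] > 0]
nested_gain_b = split_gain(carriers_of_a, "snv_b")

print(root_gain_a, nested_gain_b)
```

In this toy table the root split on `snv_a` reduces impurity by 0.125, while the split on `snv_b` inside the carrier branch reduces it by 0.5 and separates the classes completely. A tree evaluates the second variant conditionally on the first, so an effect that depends on the combination of genotypes shows up as a path through the tree.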

RESULTS The MLs prioritized a set of SNVs located in the neighbouring genes PVRL2, TOMM40, APOE and APOC1. The genomic profiles captured in this region showed a clear interaction between rs405509 and rs1160985. Additionally, rs405509, located in the APOE promoter, interacts with rs429358, among others, seemingly neutralizing their predisposing effect. Interactions are characterized by their association with specific comorbidities and the presence of eQTLs and sQTLs.
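One common way to check whether one variant modifies the effect of another is to compare odds ratios across strata of the second variant. The sketch below does this for a hypothetical risk variant and a hypothetical modifier. All counts are invented for illustration and do not come from the study or describe rs405509 or rs429358; the function and key names are likewise assumptions.

```python
def odds_ratio(cases_carrier, cases_noncarrier, controls_carrier, controls_noncarrier):
    """Odds ratio of disease for carriers versus non-carriers of a risk variant."""
    return (cases_carrier * controls_noncarrier) / (cases_noncarrier * controls_carrier)


# Hypothetical 2x2 tables for the risk variant, split by modifier carrier status.
toy_counts = {
    "modifier_absent": {
        "cases_carrier": 60, "cases_noncarrier": 40,
        "controls_carrier": 30, "controls_noncarrier": 70,
    },
    "modifier_present": {
        "cases_carrier": 40, "cases_noncarrier": 60,
        "controls_carrier": 40, "controls_noncarrier": 60,
    },
}

# Odds ratio of the risk variant within each stratum of the modifier.
stratified = {stratum: odds_ratio(**counts) for stratum, counts in toy_counts.items()}

# Ratio of the two odds ratios: 1.0 would mean no interaction on the multiplicative scale.
interaction_ratio = stratified["modifier_absent"] / stratified["modifier_present"]

print(stratified, interaction_ratio)
```

With these invented counts the risk variant has an odds ratio of 3.5 when the modifier is absent and 1.0 when it is present, which is the pattern one would expect if the modifier neutralized the predisposing effect. Real data would also need confidence intervals and adjustment for covariates.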

DISCUSSION Our approach efficiently discriminates LOAD from controls, capturing genomic profiles defined by interactions among SNVs in a hot-spot region.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

The research leading to these results has been supported by the European Research Council [RIBOMYLOME_309545 and ASTRA_855923], the H2020 projects [IASIS_727658 and INFORE_825080] and the Spanish Ministry of Science and Innovation (RYC2019-026752-I and PID2020-117454RA-I00).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

No need for approval. The data used in this manuscript are publicly available upon request to UK Biobank.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

The data are available upon request from the first and corresponding authors.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Posted September 03, 2021.
Subject Area

  • Genetic and Genomic Medicine