TY - JOUR T1 - metPropagate: network-guided propagation of metabolomic information for prioritization of neurometabolic disease genes JF - medRxiv DO - 10.1101/2020.01.12.20016691 SP - 2020.01.12.20016691 AU - Emma Graham AU - Phillip A Richmond AU - Maja Tarailo-Graovac AU - Udo Engelke AU - Leo A.J. Kluijtmans AU - Karlien L.M Coene AU - Ron A. Wevers AU - Wyeth Wasserman AU - Clara D.M. van Karnebeek AU - Sara Mostafavi Y1 - 2020/01/01 UR - http://medrxiv.org/content/early/2020/01/16/2020.01.12.20016691.abstract N2 - Many inborn errors of metabolism (IEMs) are amenable to treatment, therefore early diagnosis before irreversible damage occurs is imperative. Despite recent advances, the genetic basis of many metabolic phenotypes remains unknown. For discovery purposes, Whole Exome Sequencing (WES) variant prioritization coupled with phenotype-guided clinical and bioinformatics expertise is currently the primary method used to identify novel disease-causing variants; however, it can be challenging to identify the causal candidate gene given the large number of plausible variants. Using untargeted metabolomics (UM) to prioritize metabolically relevant candidate genes is a promising approach to diagnosing known or novel IEMs in a single patient. Here, we present a network-based bioinformatics approach, metPropagate, that uses UM data from a single patient and a group of controls to prioritize candidate genes. We validate metProp on 107 patients with diagnosed IEMs and 11 patients with novel IEMs diagnosed through the TIDE gene discovery project at BC Children’s Hospital. The metPropagate method ranks candidate genes by considering the network of interactions between them. This is done by using a graph smoothing algorithm called label propagation to quantify the metabolic disruption in genes’ local neighbourhood. metPropagate was able to prioritize the causative gene in the top 20th percentile of candidate genes for 91% of patients with known IEM disorders. For novel IEMs, metPropagate placed the causative gene in the top 20th percentile in 9/11 patients. Using metPropagate, the causative gene was ranked higher than Exomiser’s phenotype-based ranking in 6/11 patients. The results of this study indicate that for diagnostic and gene discovery purposes, network-based analysis of metabolomics data can lend support to WES gene-discovery methods by providing an additional mode of evidence to help identify causal genes.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work is supported by the B.C. Children’s Hospital Foundation (1st Collaborative Area of Innovation), the Canadian Institutes of Health Research (#301221 grant; CvK), Rare Diseases Foundation (MTG, CvK) and Natural Sciences and Engineering Research Council of Canada Discovery Grants (awarded separately to both SM and WW). CvK is recipient of the Michael Smith Foundation for Health Research Scholar Award and a Stichting Metakids salary stipend. EG is a recipient of BC Children’s Hospital Healthy Starts Graduate Studentship Award, NSERC-CREATE scholarship and NSERC CGS-M Award. No authors at any time received payment or services from a third party for any aspect of the submitted work.Author DeclarationsAll relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.YesAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe raw metabolite z-scores from the Miller analysis were downloaded from the supplementary data section of their publication (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4626538/). Patient-specific variant and metabolomics data from the TIDEX project that support the findings of this study are available on request from the corresponding author (SM). This data is not publicly available due to them containing information that could compromise research participant privacy. ER -