%0 Journal Article %A Luciana Correia Alves %A Carlos Eduardo Beluzo %A Natália Martins Arruda %A Rodrigo Campos Bresan %A Tiago Carvalho %T Assessing the Performance of Machine Learning Models to Predict Neonatal Mortality Risk in Brazil, 2000-2016 %D 2020 %R 10.1101/2020.05.22.20109165 %J medRxiv %P 2020.05.22.20109165 %X Neonatal mortality figures are an important health’s problem, as the first month of life is the most vulnerable time for survival. Factors associated with neonatal mortality are complexly and influenced by the maternal and newborn biological characteristics, social conditions and the care provided by the health services. The aim of this study was investigated the association between features related and neonatal mortality risk in Brazil. Data came from two surveys: The Mortality Information System and Information System on Live Births. The final sample was composed of 302,943 children between 2006 and 2016. We highlight the proposition of a new approach based on machine learning to address the problem of neonatal mortality death risk classification. The results using three different machine learning classifiers points toward expressiveness of features, being newborn weight, Apgar at the first and fifth minute, congenital malformations, gestational weeks and number of prenatal appointments the six more expressive.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis research was supported by Bill & Melinda Gates Foundation (Process no: OPP1201970) and Ministry of Health of Brazil, through the National Council for Scientific and Technological Development (CNPq) (Process no: 443774/2018-8). It was also supported by NVIDIA, that donated a GPU XP Titan used by the research team.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This paper uses publicly available data (SIM and SINASC) that has been de-identified and was deemed exempt from approval from a human research ethics committee.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe dataset used in this paper is available. %U https://www.medrxiv.org/content/medrxiv/early/2020/05/25/2020.05.22.20109165.full.pdf