PT - JOURNAL ARTICLE AU - Salami, Donald AU - Sousa, Carla Alexandra AU - Martins, Maria do Rosário Oliveira AU - Capinha, César TI - Predicting dengue importation into Europe, using machine learning and model-agnostic methods AID - 10.1101/19013383 DP - 2020 Jan 01 TA - medRxiv PG - 19013383 4099 - http://medrxiv.org/content/early/2020/04/28/19013383.short 4100 - http://medrxiv.org/content/early/2020/04/28/19013383.full AB - The geographical spread of dengue is a global public health concern. This is largely mediated by the importation of dengue from endemic to non-endemic areas via the increasing connectivity of the global air transport network. The dynamic nature and intrinsic heterogeneity of the air transport network make it challenging to predict dengue importation.Here, we explore the capabilities of state-of-the-art machine learning algorithms to predict dengue importation. We trained four machine learning classifiers algorithms, using a 6-year historical dengue importation data for 21 countries in Europe and connectivity indices mediating importation and air transport network centrality measures. Predictive performance for the classifiers was evaluated using the area under the receiving operating characteristic curve, sensitivity, and specificity measures. Finally, we applied practical model-agnostic methods, to provide an in-depth explanation of our optimal model’s predictions on a global and local scale.Our best performing model achieved high predictive accuracy, with an area under the receiver operating characteristic score of 0.94 and a maximized sensitivity score of 0.88. The predictor variables identified as most important were the source country’s dengue incidence rate, population size, and volume of air passengers. Network centrality measures, describing the positioning of European countries within the air travel network, were also influential to the predictions.We demonstrated the high predictive performance of a machine learning model in predicting dengue importation and the utility of the model-agnostic methods to offer a comprehensive understanding of the reasons behind the predictions. Similar approaches can be utilized in the development of an operational early warning surveillance system for dengue importation.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work was partially funded by Fundação para a Ciência e a Tecnologia, Portugal (GHTM - UID/Multi/04413/2013). DS has a PhD grant from the Fundação para a Ciência e a Tecnologia, Portugal (PD/BD/128084/2016).Author DeclarationsAll relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.YesAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe air travel data used in this study, cannot be shared publicly because of a nondisclosure agreement with the International Air Travel Association (IATA). The same data can be purchased for use by any other researcher by contacting the International Air Travel Association (IATA)- Passenger Intelligence Services (PaxIS) (https://www.iata.org/services/statistics/intelligence/paxis/Pages/index.aspx). The disease (dengue) data are available by request from the European Centre for Disease Prevention and Control (ECDC) (https://www.ecdc.europa.eu/en/publicationsdata/european-surveillance-system-tessy). All other relevant data sources are referenced in the article. https://www.iata.org/services/statistics/intelligence/paxis/Pages/index.aspx https://www.ecdc.europa.eu/en/publicationsdata/european-surveillance-system-tessy