PT - JOURNAL ARTICLE AU - Jain, Sansiddh AU - Tiwari, Avtansh AU - Bannur, Nayana AU - Deva, Ayush AU - Shingi, Siddhant AU - Shah, Vishwa AU - Kulkarni, Mihir AU - Deka, Namrata AU - Ramaswami, Keshav AU - Khare, Vasudha AU - Maheshwari, Harsh AU - Dhavala, Soma AU - Sreedharan, Jithin AU - White, Jerome AU - Merugu, Srujana AU - Raval, Alpan TI - A Flexible Data-Driven Framework for COVID-19 Case Forecasting Deployed in a Developing-world Public Health Setting AID - 10.1101/2021.11.01.21260020 DP - 2021 Jan 01 TA - medRxiv PG - 2021.11.01.21260020 4099 - http://medrxiv.org/content/early/2021/11/10/2021.11.01.21260020.short 4100 - http://medrxiv.org/content/early/2021/11/10/2021.11.01.21260020.full AB - Forecasting infection case counts and estimating accurate epidemiological parameters are critical components of managing the response to a pandemic. This paper describes a modular, extensible framework for a COVID-19 forecasting system, primarily deployed during the first Covid wave in Mumbai and Jharkhand, India. We employ a variant of the SEIR compartmental model motivated by the nature of the available data and operational constraints. We estimate best fit parameters using Sequential Model-Based Optimization (SMBO), and describe the use of a novel, fast and approximate Bayesian model averaging method (ABMA) for parameter uncertainty estimation that compares well with a more rigorous Markov Chain Monte Carlo (MCMC) approach in practice. We address on-the-ground deployment challenges such as spikes in the reported input data using a novel weighted smoothing method. We describe extensive empirical analyses to evaluate the accuracy of our method on ground truth as well as against other state-of-the-art approaches. Finally, we outline deployment lessons and describe how inferred model parameters were used by government partners to interpret the state of the epidemic and how model forecasts were used to estimate staffing and planning needs essential for addressing COVID-19 hospital burden.CCS CONCEPTSApplied computing → Health care information systems; Forecasting;Computing methodologies → Modeling methodologies.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis study is made possible by the generous support of the American People through the United States Agency for International Development (USAID). The work described in this article was implemented under the TRACETB Project, managed by WIAI under the terms of Cooperative Agreement Number 72038620CA00006. The contents of this manuscript are the sole responsibility of the authors and do not necessarily reflect the views of USAID or the United States Government. This work is co-funded by the Bill and Melinda Gates Foundation, Fondation Botnar and CSIR - Institute of Genomics and Integrative Biology.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Since there were no medical data collection on our part, IRB approval is not requiredI confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe majority of the data used in this study was provided to us by Brihanmumbai Municipal Corporation (BMC), with whom we had a partnership. This data was also publically available on www.covid19india.org. The rest of the data used for this study was publically available on https://github.com/CSSEGISandData/COVID-19.