ABSTRACT
Forecasting infection case counts and estimating accurate epidemiological parameters are critical components of managing the response to a pandemic. This paper describes a modular, extensible framework for a COVID-19 forecasting system, primarily deployed during the first Covid wave in Mumbai and Jharkhand, India. We employ a variant of the SEIR compartmental model motivated by the nature of the available data and operational constraints. We estimate best fit parameters using Sequential Model-Based Optimization (SMBO), and describe the use of a novel, fast and approximate Bayesian model averaging method (ABMA) for parameter uncertainty estimation that compares well with a more rigorous Markov Chain Monte Carlo (MCMC) approach in practice. We address on-the-ground deployment challenges such as spikes in the reported input data using a novel weighted smoothing method. We describe extensive empirical analyses to evaluate the accuracy of our method on ground truth as well as against other state-of-the-art approaches. Finally, we outline deployment lessons and describe how inferred model parameters were used by government partners to interpret the state of the epidemic and how model forecasts were used to estimate staffing and planning needs essential for addressing COVID-19 hospital burden.
CCS CONCEPTS
Applied computing → Health care information systems; Forecasting;
Computing methodologies → Modeling methodologies.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This study is made possible by the generous support of the American People through the United States Agency for International Development (USAID). The work described in this article was implemented under the TRACETB Project, managed by WIAI under the terms of Cooperative Agreement Number 72038620CA00006. The contents of this manuscript are the sole responsibility of the authors and do not necessarily reflect the views of USAID or the United States Government. This work is co-funded by the Bill and Melinda Gates Foundation, Fondation Botnar and CSIR - Institute of Genomics and Integrative Biology.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Since there were no medical data collection on our part, IRB approval is not required
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
ACM Reference Format: Sansiddh Jain1, Avtansh Tiwari1, Nayana Bannur1, Ayush Deva1, Siddhant Shingi1, Vishwa Shah1, Mihir Kulkarni1, Namrata Deka1, Keshav Ramaswami1, Vasudha Khare1, Harsh Maheshwari2, Soma Dhavala1, Jithin Sreedharan1, Jerome White1, Srujana Merugu1, Alpan Raval1. 2021. A Flexible Data-Driven Framework for COVID-19 Case Forecasting Deployed in a Developingworld Public Health Setting. In. ACM, New York, NY, USA, 11 pages.
Data Availability
The majority of the data used in this study was provided to us by Brihanmumbai Municipal Corporation (BMC), with whom we had a partnership. This data was also publically available on www.covid19india.org. The rest of the data used for this study was publically available on https://github.com/CSSEGISandData/COVID-19.