Abstract
Accurate estimates of infection prevalence and seroprevalence are essential for evaluating and informing public health responses needed to address the ongoing spread of COVID-19 in the United States. A data-driven Bayesian single parameter semi-empirical model was developed and used to evaluate state-level prevalence and seroprevalence of COVID-19 using daily reported cases and test positivity ratios. COVID-19 prevalence is well-approximated by the geometric mean of the positivity rate and the reported case rate. As of December 8, 2020, we estimate nation-wide a prevalence of 1.4% [Credible Interval (CrI): 0.8%-1.9%] and a seroprevalence of 11.1% [CrI: 10.1%-12.2%], with state-level prevalence ranging from 0.3% [CrI: 0.2%-0.4%] in Maine to 3.0% [CrI: 1.1%-5.7%] in Pennsylvania, and seroprevalence from 1.4% [CrI: 1.0%-2.0%] in Maine to 22% [CrI: 18%-27%] in New York. The use of this simple and easy-to-communicate model will improve the ability to make public health decisions that effectively respond to the ongoing pandemic.
Biographical Sketch of Authors Dr. Weihsueh A. Chiu, is a professor of environmental health sciences at Texas A&M University. He is an expert in data-driven Bayesian modeling of public health related dynamical systems. Dr. Martial L. Ndeffo-Mbah, is an Assistant Professor of Epidemiology at Texas A&M University. He is an expert in mathematical and computational modeling of infectious diseases.
Summary Line Relying on reported cases and test positivity rates individually can result in incorrect inferences as to the spread of COVID-19, and public health decision-making can be improved by instead using their geometric mean as a measure of COVID-19 prevalence and transmission.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
National Science Foundation (NSF DEB RAPID 2028632) and National Institutes of Health, National Institute of Environmental Health Sciences (P30 ES029067). The funders have no role in the design of the study, collection, analysis, and interpretation of data.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
IRB was not required
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
Ethical approval: Ethical approval was not required for this work.
Data sharing: All data and code will be made publicly available on a github repository upon peer-review publication. In the meantime, the code can be may available upon request.
Funding Sources: National Science Foundation (NSF RAPID 2028632) and National Institutes of Health, National Institute of Environmental Health Sciences (P30 ES029067). The sponsors of the study had no role in the study design, analysis, results interpretation, writing of the report, or the decision to submit for publication. The corresponding author had full access to all data and had final responsibility for the decision to submit for publication.
Conflicts of Interest: No financial relationships with any organizations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted work.
Data Availability
The codes and data used to generate our results are available on GitHub