Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Sentinel Event Surveillance to Estimate Total Sars-CoV-2 Infections, United States

View ORCID ProfileAndrew A. Lover, View ORCID ProfileThomas McAndrew
doi: https://doi.org/10.1101/2020.03.17.20037648
Andrew A. Lover
Department of Biostatistics and Epidemiology, University of Massachusetts- Amherst, Amherst, MA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Andrew A. Lover
  • For correspondence: alover@umass.edu
Thomas McAndrew
Department of Biostatistics and Epidemiology, University of Massachusetts- Amherst, Amherst, MA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Thomas McAndrew
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Human infections with a novel coronavirus (SARS-CoV-2) were first identified via syndromic surveillance in December of 2019 in Wuhan China. Since identification, infections (coronavirus disease-2019; COVID-19) caused by this novel pathogen have spread globally, with more than 180,000 confirmed cases as of March 16, 2020. Effective public health interventions, including social distancing, contact tracing, and isolation/quarantine rely on the rapid and accurate identification of confirmed cases. However, testing capacity (having sufficient tests and laboratory throughput) to support these non-pharmaceutical interventions remains a challenge for containment and mitigation of COVID-19 infections.

We undertook a sentinel event strategy (where single health events signal emerging trends) to estimate the incidence of COVID-19 in the US. Data from a recent national conference, the Conservative Political Action Conference, (CPAC) near Washington, DC and from the outbreak in Wuhan, China were used to fit a simple exponential growth model to estimate the total number of incident SARS-CoV-2 infections in the United States on March 1, 2020, and to forecast subsequent infections potentially undetected by current testing strategies. Our analysis and forecasting estimates a total of 54,100 SARS-CoV-2 infections (80 % CI 5,600 to 125,300) have occurred in the United States to March 12, 2020.

Our forecast predicts that a very substantial number of infections are undetected, and without extensive and far-reaching non-pharmaceutical interventions, the number of infections should be expected to grow at an exponential rate.

1 Introduction

To date, testing for COVID-19 cases (clinical and otherwise) has been limited in the US. Hospitals and healthcare facilities that do have tests have limited the testing criteria to those with highest potential risk, to optimize the use of scarce resources [1]. Where widespread testing has occurred, increased prevalences in high-risk groups have been found [2, 3].

Evidence-based public health programming depends on data-driven estimates of disease burden to develop interventions. As such, accurate estimate of the underlying burden of SARS-CoV-2 infection within the US can provide critical information for hospital administrators, county- and state-level Departments of Health, and the US Centers for Disease Control and Prevention (CDC) to optimize and prioritize resource allocation.

The Conservative Political Action Conference (CPAC), that took place Feb. 26-29, 2020 outside Washington DC is a large annual event that draws approximately 20,000 participants from across the entire US, thus serving as a representative national sample. A single person was reported as ‘presumptive positive’ for COVID-19 on Mar 7, 2020, and exposure was reported to be prior to the CPAC event [4].

The point incidence of COVID-19 in this defined cohort was used to estimate the national incidence during this period, and subsequent forecasting of current SARS-CoV-2 infections across the entire US. The number of attendees and number of confirmed cases were used to fit a simple exponential growth model to estimate the total COVID-19 infections per day throughout the United States up to March 15th, 2020.

2 Overview of Data and Methods

This analysis proceeded in three stages. Firstly, the point incidence of SARS-CoV-2 infection was calculated for the CPAC cohort. Secondly, With an assumption that this group was a near-random sample of Americans, this proportion (daily incidence rate) was then extrapolated to the entire US population (329 million persons) to provide an estimated national daily incidence on March 1, 2020. Finally, this estimate with appropriate error structures, was projected forward in time using epidemiological parameters from the epidemic in Wuhan, PRC to forecast cumulative infections on March 12, 2020.

Data for this analysis were complied from routine sources; press reports from CPAC; and parameters estimated from modeling studies for the early (exponential growth phase) of the epidemic in China. The number of attendees and the number of confirmed COVID-19 cases at CPAC was collected from press reporting. The epidemic doubling time was taken from previous work which estimated from data on the outbreak in Wuhan, China. This doubling time from the Wuhan data was estimated to be 6.4 days, (95 CrI 5.8–7.1), and we used a lognormal distribution for D following literature values for other doubling times [5, 6].

3 Results and Conclusions

Projecting this total infection burden forward (Fig. 1) using doubling time parameters from Wuhan gives a national cumulative incidence estimate of 54,145 infections (80 % CI 5,647 to 125,274) across the United States up until March 12, 2020. This total includes all infections-asymptomatic, subclinical and clinical cases.

Figure 1:
  • Download figure
  • Open in new tab
Figure 1:

Projected cumulative national SARS-CoV-19 incidence, after March 1, 2020.

This forecasted cumulative number of infections is consistent with those of Perkins et al., from independent data sources and with entirely different modeling frameworks, and greatly strengthens the validity of both sets of estimates [7]. The large number of predicted infections compared against the currently reported total confirmed COVID-19 cases in the United States suggest an important proportion of infections may be currently undetected, while entering a phase of potentially very rapid growth of the epidemic.

4 Limitations of this analysis

This analysis is not without limitations. Our forecasts are based on parameter estimates using best-available data at the time of analysis to inform public health policy (coined “outbreak science” [8]). Demographic data for CPAC attendees was not available, so we are unable to assess how representative this sentinel event may be for the entire US population. This analysis also relies on simplistic models that assumes a well-mixed population, and uses data derived from transmission scenarios that may not represent transmission within the US. The upper limit of total infections should be interpreted with caution, as small uncertainties early in the projections also accumulate exponentially. Finally, this analysis does not account for spatial heterogeneity and diverse population mixing patterns throughout the United States, which are a critical driver of observed transmission.

5 Detailed Methods

5.1 The model

We assumed that cumulative infections up to day t, denoted by It, followed an exponential growth model: Embedded Image where I0 is the number of initial infections, t is the time, in days, since March 1, 2020, and D is the doubling time (in days). Estimates of I0 and D are needed to forecast the cumulative number of infections at time t.

5.2 Estimating I0

We estimated total incidence among the US population on March 1st, denoted by I0, as follows, Embedded Image where P denotes the size of the US population (329M, [9]), Embedded Image the incidence among CPAC attendees and P (CPAC) the number of CPAC attendees. The parameter I0 represents the assumed fraction of the US population that is infected with COVID19. The incidence estimated from CPAC was extrapolated to the US population based on the assumption of equal incidence in CPAC attendees and the US population.

The initial relative incidence Embedded Image was assumed to follow a beta distribution Embedded Image with a prior over θ, the probability of infection, equal to a Embedded Image The prior is generated by having observed one sample from the general population with incidence “as observed in the CPAC population”

5.3 Estimating D

The doubling time of the disease was taken from previous work using data on the outbreak in Wuhan, China, and was estimated to be 6.4 days, (95 CrI 5.8–7.1) and lognormally distributed [5, 6] Embedded Image

5.4 Model-based projections

Point estimates for projected incidence were obtained directly using Eq. 1 and the point estimates of doubling time and initial incidence. Uncertainty in the number of infections was obtained by taking 30, 000 Monte Carlo samples of I0 and D according to their probability densities, and constructing a probability mass function for trajectories, It, Embedded Image

Specifically, the 80% projection interval for It is given by the 10th and 90th percentiles of samples Embedded Image. Analysis was performed using R [10] for a 16-day window after March 1, in 1-day intervals. Standard 95% and 80% confidence intervals were constructed across the forecast period.

Data Availability

All data and statistical analysis code have been uploaded onto GitHub with open access (CC-by).

https://github.com/andrewlover/COVID_19_CPAC_estimation

Revision History

View this table:
  • View inline
  • View popup
  • Download powerpoint

6 Acknowledgements

The authors are grateful for feedback and insightful suggestions from Nick Reich and Leontine Alkema (UMass-Amherst).

Footnotes

  • ↵* https://www.umass.edu/sphhs/person/faculty/andrew-lover

  • ↵† http://www.thomasmcandrew.com/

  • mcandrew{at}umass.edu

References

  1. [1].↵
    Jinnong Zhang, Luqian Zhou, Yuqiong Yang, Wei Peng, Wenjing Wang, and Xuelin Chen. Therapeutic and triage strategies for 2019 novel coronavirus disease in fever clinics. The Lancet Respiratory Medicine, 2020.
  2. [2].↵
    US Centers for Disease Control and Prevention. CDC announces additional COVID-19 infections. Library Catalog: www.cdc.gov.
  3. [3].↵
    Robinson Meyer, Erin Kissane, and Alexis Madrigal. The COVID Tracking Project. https://covidtracking.com/.
  4. [4].↵
    J. Edward Moreno. CPAC attendee tests positive for coronavirus, March 2020. Library Catalog: thehill.com.
  5. [5].↵
    Joseph T. Wu, Kathy Leung, and Gabriel M. Leung. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. The Lancet, 395(10225):689–697, February 2020. Publisher: Elsevier.
    OpenUrl
  6. [6].↵
    Dong K. Kim. Statistical methods for estimating doubling time in in vitro cell growth. In Vitro Cellular & Developmental Biology. Animal, 33(4):289–293, 1997. Publisher: [Springer, Society for In Vitro Biology].
    OpenUrl
  7. [7].↵
    T. Alex Perkins, Sean M. Cavany, Sean M. Moore, Rachel J. Oidtman, Anita Lerch, and Marya Poterek. Estimating unobserved SARS-CoV-2 infections in the United States. March 2020.
  8. [8].↵
    Caitlin Rivers, Jean-Paul Chretien, Steven Riley, Julie A Pavlin, Alexandra Woodward, David Brett-Major, Irina Maljkovic Berry, Lindsay Morton, Richard G Jarman, Matthew Biggerstaff, et al. Using “outbreak science” to strengthen the use of models during epidemics. Nature Communications, 10(1):1–3, 2019.
    OpenUrl
  9. [9].↵
    US Census Bureau. Monthly population estimates for the United States (na-est2019-01).
  10. [10].↵
    R Core Team. R: A language and environment for statistical computing, 2013.
Back to top
PreviousNext
Posted March 20, 2020.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Sentinel Event Surveillance to Estimate Total Sars-CoV-2 Infections, United States
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Sentinel Event Surveillance to Estimate Total Sars-CoV-2 Infections, United States
Andrew A. Lover, Thomas McAndrew
medRxiv 2020.03.17.20037648; doi: https://doi.org/10.1101/2020.03.17.20037648
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Sentinel Event Surveillance to Estimate Total Sars-CoV-2 Infections, United States
Andrew A. Lover, Thomas McAndrew
medRxiv 2020.03.17.20037648; doi: https://doi.org/10.1101/2020.03.17.20037648

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Epidemiology
Subject Areas
All Articles
  • Addiction Medicine (164)
  • Allergy and Immunology (416)
  • Anesthesia (92)
  • Cardiovascular Medicine (867)
  • Dentistry and Oral Medicine (159)
  • Dermatology (98)
  • Emergency Medicine (251)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (397)
  • Epidemiology (8589)
  • Forensic Medicine (4)
  • Gastroenterology (390)
  • Genetic and Genomic Medicine (1772)
  • Geriatric Medicine (169)
  • Health Economics (375)
  • Health Informatics (1252)
  • Health Policy (625)
  • Health Systems and Quality Improvement (472)
  • Hematology (197)
  • HIV/AIDS (380)
  • Infectious Diseases (except HIV/AIDS) (10344)
  • Intensive Care and Critical Care Medicine (553)
  • Medical Education (193)
  • Medical Ethics (51)
  • Nephrology (214)
  • Neurology (1692)
  • Nursing (97)
  • Nutrition (252)
  • Obstetrics and Gynecology (330)
  • Occupational and Environmental Health (451)
  • Oncology (933)
  • Ophthalmology (265)
  • Orthopedics (104)
  • Otolaryngology (172)
  • Pain Medicine (115)
  • Palliative Medicine (40)
  • Pathology (255)
  • Pediatrics (539)
  • Pharmacology and Therapeutics (257)
  • Primary Care Research (210)
  • Psychiatry and Clinical Psychology (1785)
  • Public and Global Health (3871)
  • Radiology and Imaging (627)
  • Rehabilitation Medicine and Physical Therapy (322)
  • Respiratory Medicine (525)
  • Rheumatology (208)
  • Sexual and Reproductive Health (170)
  • Sports Medicine (158)
  • Surgery (191)
  • Toxicology (36)
  • Transplantation (101)
  • Urology (76)