Empirical Model of Spring 2020 Decrease in Daily Confirmed COVID-19 Cases in King County, Washington

Projections of the near future of daily case incidence of COVID-19 are valuable for informing public policy. Near-future estimates are also useful for outbreaks of other diseases. Short-term predictions are unlikely to be affected by changes in herd immunity. In the absence of major net changes in factors that affect reproduction number (R), the two-parameter exponential model should be a standard model – indeed, it has been standard for epidemiological analysis of pandemics for a century but in recent decades has lost popularity to more complex compartmental models. Exponential models should be routinely included in reports describing epidemiological models as a reference, or null hypothesis. Exponential models should be fitted separately for each epidemiologically distinct jurisdiction. They should also be fitted separately to time intervals that differ by any major changes in factors that affect R. Using an exponential model, incidence-count half-life (t1/2) is a better statistic than R. Here an example of the exponential model is applied to King County, Washington during Spring 2020. During the pandemic, the parameters and predictions of this model have remained stable for intervals of one to four months, and the accuracy of model predictions has outperformed models with more parameters. The COVID pandemic can be modeled as a series of exponential curves, each spanning an interval ranging from one to four months. The length of these intervals is hard to predict, other than to extrapolate that future intervals will last about as long as past intervals.

one of the four highest residuals; it occurred seven days after Memorial Day and could represent a response to social gathering on Memorial Day. However, that seems unlikely because the flanking days do not have similarly high residuals. There is also a mild weekly periodicity in the data, presumably corresponding to testing cycles ( Figure S2). Therefore, smoothing over at least a week's time is recommended; use of a two-parameter model achieves that goal.
Utility of R as a Public Health Policy Tool When R Hovers Near Unity. The half-life statistic is particularly useful when used separately for pre-and post-peak modeling. R is a more versatile and general statistic that has value in informing policy throughout an outbreak. It is particularly useful if the value of R hovers near 1, as there are profound implications for policy depending upon which side of unity the statistic lies. If R is less than but near 1, half-life is near infinity and is inelegant as a reportable metric. Half-life is much more useful if R is consistently somewhat less than 1, and in these circumstances is a key statistic for planning for healthcare demand and tempo of economic and social adjustments. Similarly, if R is greater than 1, the doubling time statistic grows in value for public communication. Also, due to the nonlinearity of the models, the confidence intervals for R can be asymmetric around the point estimate, increasing the likelihood of misinterpretations across a chain of public communication.
Rigor Metrics for Natural Language Processing. Sex as a biological variable data was not provided by Seattle & King County (PHSKC). Breakdown of daily data into subcategories including sex and ethnicity was not available in part due to privacy concerns related to small aggregate bin sizes (personal communication from PHSKC, May 28, 2020). This study did not require institutional review board review. This human-subjects exempt study did not require consent, randomization, or blinding. A power analysis was not applicable. No cell lines were used; no authentication was necessary. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 7, 2022. ;https://doi.org/10.1101/2020 doi: medRxiv preprint Figure S1. Model residuals. There is little structure to the data, and no significant structure. Therefore deviations of observed real values from model predictions are well described as noise, rather than attributed to failure of modeling assumptions. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 7, 2022. ;https://doi.org/10.1101/2020 doi: medRxiv preprint Figure S2. Autocorrelation of model residuals, as computed by the default parameters of the R acf function. Lag is expressed in days. A peak at seven days, and a smaller one at 14 days, indicates that there are weekly testing cycles, such as likely results from changes in availability of testing sites and in personal schedules over the weekend.

Series Residuals
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted February 7, 2022. ; https://doi.org/10. 1101/2020