Abstract
During February and March 2020, several countries implemented non-pharmaceutical interventions, such as school closures and lockdowns, with variable schedules, to control the COVID-19 pandemic caused by the SARS-CoV-2 virus. Overall, these interventions seem to have successfully reduced the spread of the pandemic. We hypothesise that the official and effective start date of such interventions can significantly differ, for example due to slow adoption by the population, or due to unpreparedness of the authorities and the public. We fit an SEIR model to case data from 12 countries to infer the effective start dates of interventions and contrast them with the official dates. We find both late and early effects of interventions. For example, Italy implemented a nationwide lockdown on Mar 11, but we infer the effective date on Mar 16 (±0.47 days 95% CI).In contrast, Spain announced a lockdown on Mar 14, but we infer an effective start date on Mar 8 (± 1.08 days 95% CI). We discuss potential causes and consequences of our results.
Introduction
The COVID-19 pandemic has resulted in implementation of extreme non-pharmaceutical interventions (NPIs) in many affected countries. These interventions, from social distancing to lockdowns, are applied in a rapid and widespread fashion. NPIs are designed and assessed using epidemiological models, which follow the dynamics of infection to forecast the effect of different mitigation and suppression strategies on the levels of infection, hospitalization, and fatality. These epidemiological models usually assume that the effect of NPIs on infection dynamics begins at the officially declared date7,9,14.
Adoption of public-health recommendations is often critical for effective response to infectious diseases, and has been studied in the context of HIV13 and vaccination4,19, for example. However, behavioural and social change does not occur immediately, but rather requires time to diffuse in the population through media, social networks, and social interactions. Moreover, compliance to NPIs may differ between different interventions and between people with different backgrounds. For example, in a survey of 2,108 adults in the UK during Mar 2020, Atchison et al.2 found that those over 70 years old were more likely to adopt social distancing than young adults (18-34 years old), and that those with lower income were less likely to be able to work from home and to self-isolate. Similarly, compliance to NPIs may be impacted by personal experiences. Smith et al.16 have surveyed 6,149 UK adults in late Apr 2020 and found that people who believe they have already had COVID-19 are more likely to think they are immune, and less likely to comply with social distancing guidelines. Compliance may also depend on risk perception as perceived by the the number of domestic cases or even by reported cases in other regions and countries. Interestingly, the perceived risk of COVID-19 infection has likely caused a reduction in the number of influenza-like illness cases in the US starting from mid-Feb 202020.
Here, we hypothesise that there is a significant difference between the official start of NPIs and their effective adoption by the public and therefore their effect on infection dynamics. We use a Susceptible-Exposed-Infected-Recovered (SEIR) epidemiological model and a Markov Chain Monte Carlo (MCMC) parameter estimation framework to infer the effective start date of NPIs from publicly available COVID-19 case data in 12 geographical regions. We compare these estimates to the official dates, and find that they include both late and early effects of NPIs on infection dynamics. We conclude by demonstrating how differences between the official and effective start of NPIs can confound assessments of the effectiveness of the NPIs in a simple epidemic control framework.
Results
Several studies have described the effects of non-pharmaceutical interventions in different geographical regions7,9,14. Some of these studies have assumed that the parameters of the epidemiological model change at a specific date (Eq. 6), and set the change date τ to the official NPI date τ* (Table 1). They then fit the model once for time t < τ* and once for time t ≥ τ*. For example, Li et al.14 estimate the infection dynamics in China before and after τ*, which is set at Jan 23, 2020. Thereby, they effectively estimate the transmission and reporting rates before and after τ* separately.
Here, we estimate the joint posterior distribution of the effective start date of the NPIs τ and the transmission and reporting rates before and after τ from the entire data, rather than splitting the data at τ. We then estimate the marginal posterior probability of τ by marginalising the joint posterior, and estimate as the posterior median.
We compare the posterior predictive plots of a model with a free τ with those of a model with τ fixed at τ* and a model without τ (i.e. transmission and reporting rates are constant). The model with free τ clearly produces better and less variable predictions (Figure S4a). When we compare the models using WAIC (Eq. 9, Table S1), the model with a free τ is preferred in 8 out of 12 of the regions (although only narrowly for 5 of the 8). The exceptions are Austria, Belgium, Norway, and United Kingdom.
We compare the official τ* and effective start of NPIs and find that in most regions the effective start of NPIs significantly differs from the official date (Figure 1), that is, the credible interval on does not include τ* (Figure 1). The exceptions are, as with the comparison to the simpler models, Austria, Belgium, and United Kingdom, as well as Switzerland (see below). Norway also has a relatively wide credible interval, maybe because it has the longest duration between the first and last NPIs (Table 1). In the following, we describe our findings in more detail.
Late effective start of NPIs
In both Wuhan, China, and in Italy we estimate that the effective start of NPIs is significantly later than the official date τ*.
In Italy, the first case was officially confirmed on Feb 21. School closures were implemented on Mar 57, a lockdown was declared in Northern Italy on Mar 8, with social distancing implemented in the rest of the country, and the lockdown was extended to the entire nation on Mar 119. That is, the first and last official NPI dates are Mar 8 and Mar 11. However, we estimate the effective date T at Mar 16 (±0.47 days 95% CI; Figure 2).
Similarly, in Wuhan, China, a lockdown was ordered on Jan 2314, but we estimate the effective start of NPIs to be more than a week later, at Feb 2 (±2.85 days 95% CI; Figure 2).
Early effective start of NPIs
In contrast, in other regions we estimate an effective start of NPIs that is earlier then the official date τ* (Figure 1).
In Spain, social distancing was encouraged starting on Mar 87, but mass gatherings still occurred on Mar 8, including a march of 120,000 people for the International Women's Day, and a football match between Real Betis and Real Madrid (final score: 2–1) with a crowd of 50,965 in Seville. A national lockdown was only announced on Mar 147. Nevertheless, we estimate the effective start of NPI on Mar 8-9 (±1.08 days 95 %CI), rather than Mar 14 (Figure 3).
Similarly, in France we also estimate the effective start of NPIs on Mar 8 or 9 (±6.27 days 95% CI, Figure 3). Although the credible interval is wider compared to Spain, spanning from Mar 2 to Mar 15, the official lockdown start at Mar 17 is later still, and even the earliest NPI, banning of public events, only started on Mar 13 7.
Interestingly, the effect of NPIs in both France and Spain is estimated to have started on Mar 8-9, although the official NPI dates differ significantly: the first NPI in France is only one day before the last NPI in Spain. The number of daily cases was similar in both countries until Mar 8, but diverged by Mar 13, reaching significantly higher numbers in Spain (Figure S2). This may suggest correlations between effective starts of NPIs due to global or international events.
Like a Swiss watch
We find one case in which the official and effective dates match: Switzerland ordered a national lockdown on Mar 20, after banning public evens and closing schools on Mar 13 and 147. Indeed, the posterior median is Mar 20 (±8.46 days 95% CI), and the posterior distribution shows two density peaks: a smaller one between Mar 10 and Mar 14, and a bigger one between Mar 17 and Mar 22 (Figure S3). It’s also worth mentioning that Switzerland was the first to mandate self isolation of confirmed cases7.
Consequences of late and early effect of NPIs on real-time assessment
The success of non-pharmaceutical interventions is assessed by health officials using various metrics, such as the decline in the growth rate of daily cases. These assessments are made a specific number of days after the intervention began, to accommodate for the expected serial interval3 (i.e. time between successive cases in a chain of transmission), which is estimated at about 4-7 days9.
However, a significant difference between the beginning of the intervention and the effective change in transmission rates can invalidate assessments that assume a serial interval of 4-7 days and neglect the late or early population response to the NPI. This is illustrated in Figure 4 using data and parameters from Italy: a lockdown was officially ordered on Mar 10 (τ*), but its late effect on the infection dynamics starts on Mar 16 . If health officials assumed the dynamics to immediately change at τ*, they will have expected the number of cases be within the red lines (posterior predictions assuming τ = τ*). This would have lead to a significant underestimation, which might have been interpreted by as ineffectiveness of the NPI, leading to further escalations. However, the number of cases would actually follow the blue lines (posterior predictions using ), which corresponds well to the real data (stars).
Discussion
We have inferred the effective start date of NPIs in several geographical regions using an SEIR epidemiological model and an MCMC parameter estimation framework. We find examples of both late and early effect of NPIs (Figure 1).
For example, in Italy and in Wuhan, China, the effective start of the lockdowns seems to have occurred five days or more after the official date (Figure 2). This difference might be explained by low compliance: In Italy, for example, the government intention to lockdown Northern provinces leaked to the public, resulting in people leaving those provinces9. Late effect of NPIs may also be due to the time required by both the government and the citizens to organise for a lockdown, and for the new guidelines to be adopted by the population.
In contrast, in most investigated countries (e.g., Spain and France), we inferred reduced transmission rates even before official lockdowns were implemented (Figure 3). An early effective date might be due to early adoption of social distancing and similar behavioural adaptations in parts of the population. Adoption of these behaviours may occur via media and social networks, rather than official government recommendations and instructions, and may have been influenced by increased risk perception due to domestic or international COVID-19-related reports. Indeed, the evidence supports a change in infection dynamics (i.e. a model with fixed or free τ) even for Sweden (Table S1, Figure S4a), where a lockdown was not implemented*.
Attempts to asses the effect of NPIs3,7 generally assume a seven-day delay between the implementationof the intervention and the observable change in dynamics, due to the characteristic serial interval of COVID-199. However, late and early effects such as we have inferred may bias these assessments and lead to wrong conclusions about the effects of NPIs (Figure 4).
We have found that the evidence supports a model in which the parameters change at a specific time point τ over a model without such a change-point in 9 out of 12 regions (i.e. free or fixed model in Table S1). It could be interesting to check if the evidence favors a model with two change-points, rather than one. Two such change-points could reflect escalating NPIs (e.g. school closures followed by lockdowns), or an intervention followed by a relaxation. However, interpretation of such models will be harder, as two change-points can also reflect a mix of NPIs and other events, such as changing weather, news of new treatments, and international outbreaks.
As several countries begin to relieve lockdowns and ease restrictions, we expect similar delays and advances to occur: in some countries the population will behave as if restrictions were eased even before the official date, and in some countries the population will continue to self-restrict even after restrictions are officially removed.
Conclusions
We have inferred the effective start date of NPIs and found that they often differ from the official dates. Our results highlight the complex interaction between personal, regional, and global determinants of behavioral response to an epidemic. Therefore, we emphasize the need to further study variability in compliance and behavior over both time and space. This can be accomplished both by surveying differences in compliance within and between populations2, and by incorporating specific behavioral models into epidemiological models1,5,18.
Models and Methods
Data
We use daily confirmed case data X = (X1,…, XT) from 12 regions during Jan–Mar 2020. These incidence data summarise the number of individuals Xt tested positive for SARS-CoV-2 RNA (using RT-qPCR) at each day t. Data for Wuhan, China retrieved from Pei and Shaman15, data for 11 European countries retrieved from Flaxman et al.7. Where there were multiple sequences of days with zero confirmed cases (e.g. France), we cropped the data to begin with the last sequence so that our analysis focuses on the first sustained outbreak rather than isolated imported cases. For official NPI dates see Table 1.
SEIR model
We model SARS-CoV-2 infection dynamics by following the number of susceptible S, exposed E, reported infected Ir, unreported infected Iu, and recovered R individuals in a population of size N. This model distinguishes between reported and unreported infected individuals: the reported infected are those that have enough symptoms to eventually be tested and thus appear in daily case reports, to which we fit the model. This model is inspired by Li et al.14 and Pei and Shaman15, who used a similar model with multiple regions and constant transmission and reporting rates to study COVID-19 dynamics in China and in the continental US.
Susceptible (S) individuals become exposed due to contact with reported or unreported infected individuals (Ir or Iu) at a rate βt or μβt, respectively. The parameter 0 < μ < 1 represents the decreased transmission rate from unreported infected individuals, who are often subclinical or even asymptomatic6,17. The transmission rate βt ≥ 0 may change over time t due to behavioural changes of both susceptible and infected individuals. Exposed individuals, after an average incubation period of Z days, become reported infected with probability αt or unreported infected with probability (1 − αt). The reporting rate 0 < αt < 1 may also change over time due to changes in human behaviour. Infected individuals remain infectious for an average period of D days, after which they either recover, or become ill enough to be quarantined. In either case, they no longer infect other individuals, and therefore effectively become recorved (R). The model is described by the following set of equations:
The initial numbers of exposed E(0) and unreported infected Iu(0) are free model parameters (i.e. inferred from the data), whereas the initial number of reported infected and recovered is assumed to be zero, Ir (0) = R(0) = 0, and the number of susceptible is S(0) = N − E(0) − Iu(0).
Likelihood function
For a given vector θ of model parameters the expected cumulative number of reported infected individuals (Ir) until day t, following Eq. 1, is
We assume that reported infected individuals are confirmed and therefore observed in the daily case report of day t with probability pt (note that an individual can only be observed once, and that pt may change over time, but t is a specific date rather than the time elapsed since the individual was infected). We denote by Xt the observed number of confirmed cases in day t, and by the cumulative number of confirmed cases until end of day t,
Therefore, at day t the number of reported infected yet-to-be confirmed individuals is . We therefore assume that Xt conditioned on is Poisson distributed, such that
Hence, the likelihood function for a parameter vector θ given the confirmed case data X = (X1,…, XT) is defined by the probability to observe X given θ,
NPI model
To model non-pharmaceutical interventions (NPIs), we set the start of the NPIs to day τ and define where 0 < λ < 1. The values for pt follow Li et al.14, who estimated the average time between infection and reporting in Wuhan, China, at 9 days before the start of NPIs and 6 days after start of NPIs.
Parameter estimation
To estimate the model parameters from the daily case data X, we apply a Bayesian inference approach. We start our model Δt days 9 before the outbreak (defined as consecutive days with increasing confirmed cases) in each country. The model in Eq. 1 is parameterised by the vector θ, where
The likelihood function is defined in Eq. 5. The posterior distribution of the model parameters P(θ | X) is estimated using the affine-invariant ensemble sampler for Markov chain Monte Carlo (MCMC)11 implemented in the emcee Python package8.
We defined the following prior distributions on the model parameters P(θ): where the prior for τ is a truncated normal distribution shaped so that the date of the first and last NPI, τ0 and τ* (Table 1), are at minus and plus one standard deviation, and taking values only between 1 and T – 2, where T is the number of days in the data X. We also tested an uninformative uniform prior, Uniform(1, T – 2). WAIC (Eq. 9) of a model with this uniform prior was either higher, or lower by less than 2, compared to WAIC of a model with the truncated normal prior. The uninformative prior resulted in non-negligible posterior probability for unreasonable τ values, such as Mar 1 in the United Kingdom. This was probably due to MCMC chains being stuck in low posterior regions of the parameter space. We therefore decided to use the more informative truncated normal prior for τ. Other priors follow Li et al.14, with the following exceptions. λ is used to ensure transmission rates are lower after the start of the NPIs (λ < 1). We checked values of Δt larger than five days and found they generally produce lower likelihood and unreasonable parameter estimates, and therefore chose Uniform(1,5) as the prior for Δt.
Model comparison
We perform model selection using two methods. First, we compute WAIC (widely applicable information criterion)10, where and are the expectation and variance operators taken over the posterior distribution P(θ | X). We compare models by reporting their relative WAIC; lower is better (Table S1). A minority (<5%) of MCMC chains that fail to fully converge can lead to overestimation of the variance (the second term in Eq. 9). Therefore, we exclude from the computation of WAIC chains with mean log-likelihood that is three standard deviations or more from the overall mean.
We also plot posterior predictions: we sample 1,000 parameter vectors from the posterior distribution P(θ | X), use these parameter vectors to simulate the SEIR model (Eq. 1), and plot the simulated dynamics (Figure S4a). Both the accuracy (i.e. overlap of data and prediction) and the precision (i.e. the tightness of the predictions) are good ways to visually compare models.
Source code
We use Python 3 with the NumPy, Matplotlib, SciPy, Pandas, Seaborn, and emcee packages. All source code will be publicly available under a permissive open-source license at github.com/yoavram-lab/EffectiveNPI. Samples from the posterior distributions will be deposited on FigShare.
Supplementary Material
Acknowledgements
We thank Lilach Hadany and Oren Kolodny for discussions and comments. This work was supported in part by the Israel Science Foundation 552/19 and 1399/17.
Footnotes
↵* Sweden banned public events on Mar 12, encouraged social distancing on Mar 16, and closed schools on Mar 187.