Abstract
The outbreak of the novel Coronavirus (COVID-19) has lead countries worldwide to administer quarantine policies. However, each country or state decides independently what mobility restrictions to administer within its borders, while aiming to maximize its own citizens’ welfare. In turn, since individuals travel between countries and states, at least during periods when quarantines are less restrictive, the policy in one country may ultimately affect the infection level in other countries. Therefore, major questions are whether the policy dictated by a decentralized government is efficient, and if not, how the governments can coordinate a better policy. Here, we focus on the decision regarding the timing of releasing the quarantines. We consider a game theory model in which each of two governments decides when to switch from a restrictive to a non-restrictive quarantine and vice versa. We used parameter values driven by the literature and publically available data. We show that, if travel is sufficiently frequent during the non-restrictive quarantine periods, then the strategies are sub-optimal: Each governor tends to release the quarantine sooner, which ultimately leads to longer periods of restrictive quarantines and a higher prevalence of the disease. In turn, if the governments restrict international and interstate travel to a low level even when the quarantines are non-restrictive, the policy dictated by the decentralized governance may become optimal.
Introduction
The outbreak of the novel Coronavirus (COVID-19) necessitates quarantine policies [1-4]. While mobility restriction measures have been taken during several disease outbreaks in history [5-11], the quarantines needed for the COVID-19 pandemic is at a global scale that encompasses entire countries. Such quarantines have economic and social costs [12, 13], and major relevant questions are how restrictive the quarantines should be, and what would be the right timing to release some of the mobility restrictions [1-3]. In practice, this decision is made independently by multiple countries [14-16], or independently by multiple states in countries like the U.S., or even independently by multiple municipal authorities. Each governor is expected to dictate the strategy that best serves her/his own citizens; however, in periods when the quarantines are less restrictive, travelers can transmit the disease between countries, sates, and cities. Consequently, the quarantine policy in one country/state may ultimately affect the outcome in other countries/states.
Decentralized governance has a benefit: Each country or state may have a better knowledge of its own citizens’ lifestyle and needs and may dictate a policy that better suits its own citizens. However, the decentralized policy also comes with a cost: Each government might ignore the cost borne to citizens of other countries due to international and interstate travel. Accordingly, various previous game theory studies have suggested that agents (individuals and countries) under-invest in the prevention and control of diseases [17-22].
In this paper, we examine the case where each governor decides independently about the timing of releasing the quarantine, and we ask what the inefficiency is due to such decentralized governance. Namely, we examine how the strategy of different governments differ from the socially optimal strategy of a hypothetical centralized government that aims to maximize the welfare of all the citizens in all the countries (price of anarchy). Specifically, we consider two countries/states, and we analyze the following three cases (Fig. 1) [14-16, 23]: (1) the countries have approximately the same population size and infection level (e.g., Italy and Spain); (2) the countries have approximately the same population size, but one country experiences a more severe outbreak at a given point in time (e.g., Italy and Germany); and (3) one state experiences a more severe outbreak compared to the rest of the country (e.g., New York State vs. the rest of the U.S.).
Model
Our model assumptions and parameterization are explicitly motivated by the COVID-19 outbreak. We consider two states or countries, 1 and 2, and in line with concurrent data about the COPVID-19 outbreak [4, 15, 16, 23-26], we assume that the number of infected people in each country is very small compared to the country’s total population size. Accordingly, we consider only the dynamics of the infection level in each country, Ii, defined as the portion of individuals that are infected in country i. (Namely, in contrast to traditional SIR models [27], here we consider shorter timescales during which the number of susceptible individuals is constant, which is in line with the COVID-19 data as of May 5, 2020 [4, 15, 16, 23-26]). In turn, we assume for simplicity that each government can administer one of two types of quarantine at any given time: restrictive and non-restrictive. Each government can choose when to switch between these two quarantines. Specifically, we assume that under a restrictive quarantine in country i, Ii decreases exponentially after the first two weeks at a rate −r0 as long as Ii > Imin. (Even a restrictive quarantine is not expected to eliminate the disease entirely, and Imin characterizes some minimal infection level that persists in the population.) In turn, according to evidence showing that the infection level may still increase under non-restrictive quarantine conditions [4, 15, 16, 25, 26], we assume that Ii increases at a rate ri under a non-restrictive quarantine, where ri is country-specific and can be higher in those countries or states where interactions among individuals are more frequent. Also, we assume that if a country is under a restrictive quarantine, there is no travel from or to that country, whereas if two countries are both under a non-restrictive quarantine, some individuals travel between these countries (we denote μij as the number of individuals that reside in country i but are in travel to country j). We describe in more detail the dynamics of I1 and I2 in Methods: Dynamics of the infection levels.
In turn, the government in each country dictates the timing at which it switches from the non-restrictive quarantine to the restrictive one and vice versa. We assume that a government will not allow the health system in its country to collapse [1], and therefore, it will always switch to a restrictive quarantine if Ii approaches some maximum capacity, Ii = Imax. In turn, under this constraint, the objective of each government is to maximize the proportion of time during which its own citizens are under a non-restrictive quarantine. We describe in more detail the objective functions in Methods: Objective functions.
Finally, we calculate the strategies of the governments in a Nash equilibrium (decentralized governance), and we compare them to the socially optimal solution. In general, the Nash equilibria in such a dynamic game depend on various assumptions about the information that each government has, and the resulting strategies may depend on how the governments coordinate their actions [28]. Here, we make the simplifying assumption that each government does not get feedback about the infection level in the other country/state and decides in advance when it releases the quarantine. In our model, this results in a unique Nash equilibrium per a given set of parameter values (open-loop Nash equilibrium [28]). The numerical method that we used for finding the open-loop Nash equilibria and the optimal solutions is described in Methods: Numerical methods.
Results & Discussion
Our results show that, in all cases, if the number of travelers between the countries is greater than a certain threshold, the governments switch to a non-restrictive quarantine sooner compared to the socially optimal solution (Figs. 2,3). Namely, in Nash equilibrium, the governors administer the restrictive quarantine for shorter periods compared to the socially optimal solution, which in turn results in shorter periods before the infection level approaches its full capacity and the restrictive quarantine is administered again (Figs. 2,3). Consequently, under decentralized governance, the solution is sub-optimal, the total amount of time during which a restrictive quarantine is administered is greater, and the average infection level is higher. This result is consistent with previous results that suggested that agents under-invest in control of diseases [17-22]. In turn, notice that in both the optimal solution and the Nash equilibrium, the governments tend to be coordinated in the sense that the second country to switch does so when its infection level is similar to that of the other country (Fig. 2D,F).
The difference between the optimal solution and the Nash equilibrium emerges if the number of travelers between the authorities is above a certain threshold, in which case the price of anarchy increases with the number of travelers (Fig. 3). On the other hand, if the number of travelers is below the threshold, the Nash equilibrium and the optimal solution are identical (Fig. 3). This suggests that one way to prevent the inefficiency due to the decentralized governance is to declare in advance that travel will be restricted to a low level even when the quarantines are non-restrictive. This may be applicable at the international scale, but much more difficult at the municipal scale.
There are two mutually dependent reasons why decentralized governance results in releasing the restrictive quarantine sooner. First, each governor ignores the damages that its own travelers inflict on other countries, and thus, keeping a higher level of infection is perceived by the governor as less costly. Second, as a consequence of the first reason, each country hosts more infected travelers from the other country, and its own travelers are also hosted in countries with higher infection levels. Consequently, if the infection level in a given country is low, it increases rapidly due to travel, and therefore, it is not worthwhile for the country to reduce its infection level beyond a certain value (where due to travel, this value may be lower than Imin).
Finally, note that we have made numerous simplifying assumptions in our model, which suggests various future directions for the examination of the consequences of relaxing these assumptions. First, we considered only two types of quarantine, whereas in reality, more options are available. In particular, governments can try to administer an intermediate level of quarantine that keeps the infection at a constant level. Examining whether this policy is better than the ones that we considered is beyond the scope of this paper (see [1-3]); however, it is likely that a similar result will hold: Decentralized governance might maintain a higher infection level than the optimum. Second, we assumed that travel is allowed under non-restrictive quarantine. However, it would be interesting to examine policies that also dictate how to best integrate the quarantine policy with travel policy. Specifically, further restrictions on travel might mitigate the problem (Fig. 3); however, it should be noted that restrictions on travel come with economic cost [13]. Third, we considered only two countries, whereas considering more countries is generally expected to increase the price of anarchy [28]. And fourth, we considered open-loop solutions in which the governments pre-determine their policy, but communication among the governments might lead to the formation of agreements and coordination of a more global quarantine policy.
Data Availability
Some parameter values are taken from estimates in the literature and some are estimated from publically available datasets (see Method: Parameterization). All the data and parameter values needed to regenerate the results shown in the figures are given in Methods.
Methods
Dynamics of the infection levels
Here we describe in detail the dynamics of I1(t) and I2(t), which characterize the proportions of infected individuals in the populations of countries 1 and 2, respectively. Assume that each of the two countries can be under a restrictive quarantine or a non-restrictive quarantine, and the type of quarantine can change over time. The dynamics of Ii at time t depend on the type of quarantine administered in both countries at that time.
Option 1: Country i is under a restrictive quarantine
If country i is under a restrictive quarantine at time t, Ii (t) declines at a daily rate −r0, regardless of what happens in the other country (no travel). However, there are two exceptions. First, Ii cannot decline below a certain threshold, Imin, characterizing the minimal level of infection in the population. Second, if the government switches from restrictive to non-restrictive quarantine, Ii starts to decline only after a delay of Tdelay days. Therefore, in summary, where ts is the time (in days) from the day when the quarantine became restrictive.
Option 2: Country i is under a non-restrictive quarantine, but the other country, j ≠ i. is under a restrictive quarantine
In this case, there is still no travel between the countries due to the restrictive quarantine in country j. Consequently, Ii grows exponentially at a daily rate ri:
Option 3: Both countries are under a non-restrictive quarantine
In this case, individuals travel between the countries, where μij of the residents of country i are in travel to country j. If we assume that the number of travelers from a country is much smaller than the number of the country’s own residents (μij ≪ Ni and μij ≪ Nj), it follows that
Specifically, (i) the term riIi characterizes residents of country i that get infected inside country i by other residents of country i; (ii) the term riμij/Ni characterizes residents of country i that get infected inside country i by residents of country j; and (iii) the term μij/Ni characterizes residents of country i that get infected inside country j by residents of country j. Note that we assume that travel is temporary (Lagrangian approach [29, 30]); that is, a traveler from country 1 to country 2 is still a resident of country 1 who will return to country 1, and therefore, is s/he gets infected, it increases I1, even the infection has occurred in country 2.
In turn, without the simplifying assumption that μij ≪ Ni, Eq. (3) can be written in a more general form:
In our simulations, we used values of μij that are much smaller than Ni (see Parameterization subsection), and therefore, the results obtained using Eq. (3) and those obtained using Eq. (4) were almost indistinguishable.
Objective functions
The government in each country chooses the time periods during which each type of quarantine is administered in that country. We assume that the objective of each government is to maximize the relative time during which the quarantine is non-restrictive in its country, under the constraint that the health system does not collapse, and therefore, a restrictive quarantine has to be administered in country i whenever Ii approaches Imax [1].
Specifically, denote as the time to complete a cycle during which Ii increases to Imax and returns back to its initial value, Ii(0). In turn, denote as the total time within such a cycle during which a non-restrictive quarantine is administered in country i. Then, we define the utility of country i as
In turn, we calculate both the optimal solution and the open-loop Nash equilibrium. Specifically, the optimal strategy is the one that maximizes N1u1 + N2u2. The open-loop Nash equilibrium is given by the set of pre-determined strategies that are such that no country can increase its own utility by unilaterally change its strategy.
Numerical methods
Since the growth rates of I1 and I2 are piecewise-linear, and since we considered a time delay after the government switches to restrictive quarantine before the infection level starts to decline, it follows from Pontryagin’s maximum principle [28, 31] that the optimal strategy of each government is to choose a time, Ti ≥ 0, at which it switches to a restrictive quarantine and then to wait until Ii = Imax before switching to a restrictive quarantine again (Fig. 2).
Then, , is the period from Ti until Ii approaches Imax, and the time during which the quarantine is restrictive, , is given by the sum of three distinct periods: (i) between t = 0 and t = Ti, (ii) Tdelay days after Ii approaches Imax, and (iii) the time until Ii declines back to its initial value, Ii (0).
To find the optimal solution and the open-loop Nash equilibrium, we first generate a matrix in which each cell characterizes a set of strategies, (T1, T2), and we simulate dynamics to calculate the utility for each country in every cell. Then, the optimal solution is given by T1 and T2 that correspond to the cell in which N1u1 + N2u2 is maximized. In turn, the open-loop Nash equilibrium is found by calculating the best response of each country to every strategy of the other country, where the intersection of the curves characterizes the Nash equilibrium (Fig. 2 A,C,E). The values of T1 and T2 in the matrix vary between 0 and Tmax, where Tmax is the time when a restrictive quarantine reduces the infection level in both countries to Imin. (It is never worthwhile to keep a restrictive quarantine for more than Tmax days.) In turn, the size of the matrix determines the resolution at which the strategies T1 are T2 are examined, and in Fig. 2, we used a 200 × 200 matrix.
Parameterization
The estimations of the parameter values used for the simulation are taken from datasets and literature related to COVID-19 [4, 15, 16, 23-26]. Notice that some parameter values vary significantly between countries and states, and therefore, we performed sensitivity analyses to verify that the main results are general and hold within wider parameter ranges.
Under restrictive quarantine conditions, a daily decline rate of 5% in the infection level after the first two weeks is a reasonable estimate. For example, in China, the number of infected individuals declined from ~86,000 to ~3,000 within 76 days, which implies r0 ≈ −5% per day. In turn, in several countries, it took about 14-21 days before any decline occurred following a quarantine, which suggests that considering Tdelay ≈ 14 days is reasonable. Next, note that ri depends on the restrictions used in a given non-restrictive quarantine, and it may vary between states and countries. We used estimates that reflect the weeks before a restrictive quarantine was administered in countries in Europe, where 5% ≤ ri ≤ 15% per day is a reasonable estimate.
In turn, we assume that Imax is given approximately by the infection level that was approached in countries in Europe before the restrictive quarantine was administered or two weeks after it was administered, where considering Imin ≈ 0.01% − 0.1% of the total population size is a reasonable estimate. Next, parameters like Imin and the travel rates μ12 and μ21 are harder to estimate, as they depend on the specific location and scale of the countries and state considered. Therefore, we performed sensitivity analyses and examined a variety of values (e.g., Fig. 3). Finally, the ratio between N1 and N2 reflects the relative population sizes of the two countries: N1 = N2 characterizes the cases demonstrated in Fig. 1A,B and 2A-D, while N1 < N2 characterizes the case demonstrated in Fig. 1C and 2E-F.
Parameter values used to generate the figures
Fig. 2A,B: r0 = −5% (day−1), r1 = r2 = 7% (day−1), Imax = 0.1%, Imin = 0.005%, Tdelay = 14 (days), N1 = N2 = 1, μ12 = μ21 = 0.2% × N1.
Fig. 2C,D: r0 = −5% (day−1), r1 = 9% (day−1), r2 = 7% (day−1), Imax = 0.1%, Imin = 0.005%, Tdelay = 14 (days), N1 = N2 = 1, μ12 = μ21 = 0.25% × N1.
Fig. 2E,F: r0 = −5% (day−1), r1 = 11% (day−1), r2 = 5% (day−1), Imax = 0.1%, Imin = 0.005%, Tdelay = 14 (days), N1 = 0.3, N2 = 1, μ12 = 0.25% × N1, μ21 = 0.25% × N2.
Fig. 3: μ12 = μ21 varies (x-axis), and the other parameters are the same as those in Fig. 2A,B.
Data availability
Some parameter values are taken from estimates in the literature [4, 15, 16, 23, 24] and some are estimated from publically available datasets [25, 26] (see Method: Parameterization). All the data and parameter values needed to regenerate the results shown in the figures are given in Methods.