Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Spatialized Epidemiological Forecasting applied to Covid-19 Pandemic at Departmental Scale in France

Matthieu Oliver, Didier Georges, Clémentine Prieur
doi: https://doi.org/10.1101/2021.11.03.21265855
Matthieu Oliver
bUniv. Grenoble Alpes, CNRS, Inria, Grenoble INP*, LJK, 38000 Grenoble, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Didier Georges
aUniv. Grenoble Alpes, CNRS, Grenoble INP*, GIPSA-lab, F-38000 Grenoble, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Clémentine Prieur
bUniv. Grenoble Alpes, CNRS, Inria, Grenoble INP*, LJK, 38000 Grenoble, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: clementine.prieur@univ-grenoble-alpes.fr
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

In this paper, we present a spatialized extension of a SIR model that accounts for undetected infections and recoveries as well as the load on hospital services. The spatialized compartmental model we introduce is governed by a set of partial differential equations (PDEs) defined on a spatial domain with complex boundary. We propose to solve the set of PDEs defining our model by using a meshless numerical method based on a finite difference scheme in which the spatial operators are approximated by using radial basis functions. Such an approach is reputed as flexible for solving problems on complex domains. Then we calibrate our model on the French department of Isère during the first period of lockdown, using daily reports of hospital occupancy in France. Our methodology allows to simulate the spread of Covid-19 pandemic at a departmental level, and for each compartment. However, the simulation cost prevents from online short-term forecast. Therefore, we propose to rely on reduced order modeling tools to compute short-term forecasts of infection number. The strategy consists in learning a time-dependent reduced order model with few compartments from a collection of evaluations of our spatialized detailed model, varying initial conditions and parameter values. A set of reduced bases is learnt in an offline phase while the projection on each reduced basis and the selection of the best projection is performed online, allowing short-term forecast of the global number of infected individuals in the department.

1. Introduction

Over the past year, the spread of Covid-19 has had more serious consequences than expected. This pandemic has brought about massive organizational changes in many countries, with repeated lockdowns and the use of remote solutions for work, study and personal life. It has also had massive impacts on hospital care services with the need to expand hospital capacity and postpone operations for non-Covid-19 patients. Finally, the pandemic has killed several million people around the world (see, e.g., Pifarré i Arolas et al., 2021). One of the most difficult specifics to model regarding the spread of Covid-19 is the amount of asymptomatic infected individuals, as described, for example, in Al-Tawfiq (2020), which makes the pandemic difficult to track and has led to massive testing campaigns to assess the evolution of the pandemic and the effectiveness of health measures. The efficiency of different strategies of lockdown and testing has been studied (see, e.g., Berger et al., 2020). The incubation period of more than 10 days resulted in a lag between the implementation of restriction rules and their effect on hospital admissions, and made the effectiveness of restrictions more difficult to measure. Forecasting the number of infections and modeling hospital needs is critical to pandemic management and has been the subject of numerous articles over the past year.

Since the start of the epidemic, many studies have been carried out for modeling and forecasting the spread Covid-19. The asymptomatic infected individuals were typically taken into account by building compartmental models that are more detailed than the usual susceptible-infected-recovered (SIR) model (see, e.g., Liu et al., 2021; Berger et al., 2020; Roques et al., 2020; Charpentier et al., 2020, and references therein). These works focus specifically on the effect of lockdown on the propagation of Covid-19 by taking into account detected and undectected infected individuals as well as hospitalizations. Forecasting the number of infections of Covid-19 has been studied in Liu et al. (2021) by using a 3-step approach on the transmission rate during a pandemic outbreak with linear, then exponential increase in infections and finally an exponential decrease of the transmission rate.

In this paper, we propose a spatialized version of a detailed compartmental model inspired by Charpentier et al. (2020), we then restrict the spatial domain to the French department of Isère and propose a calibration procedure over the period of the first French confinement, extending the whole methodology exposed in Guan et al. (2020). This procedure can be adapted to any scale, provided that corresponding data is available. It allows to reproduce the macroscopical behaviour of Covid-19 pandemic on different geographical areas. A preliminary step to the calibration and a second contribution of this paper is the simulation of the model. Our model is defined as a set of spatio-temporal partial differential equations over a complex domain, namely the French department of Isère in our study. We propose a meshless method based on a finite difference scheme relying on radial basis function approximation of the spatial operators (the RBF-FD method, see Fornberg and Flyer (2015)) to solve it, as such an approach is known to be flexible to solve problems on complex domains. To the best of our knowledge, it is the first time RBF-FD is applied in the framework of epidemiology to study the spatio-temporal spread of a disease. Our methodology allows to simulate the spread of Covid-19 pandemic at a departmental level, and for each compartment.

However the simulation cost prevents from online short-term forecast. Therefore we propose as last contribution of the paper to rely on reduced order modeling tools introduced in Bakhta et al. (2021) to compute short-term fore-casts of infection number. The strategy consists in learning a time-dependent reduced order model with few compartments from a collection of evaluations of our spatialized detailed model, by varying initial conditions, initial times and parameter values. A set of reduced bases is learnt in an offline phase while the projection on each reduced basis and the selection of the best projection is performed online, allowing short-term forecast of the global number of infections in the department.

The paper is organized as follows. In Section 2, we present an extension of the classical SIR epidemiological model that models the infections, recoveries and hospitalizations spatially over a given geographical area of interest. In Section 3, we present the meshless RBF-FD scheme for solving the set of differential equations from model proposed in Section 2. In Section 4 the model is calibrated using hospital data from Covid-19 outbreak in the department of Isère, France. In Section 5, we use model order reduction techniques to build a surrogate model and extrapolate the number of infected individuals on a 10-day horizon.

2. Spatialized SIRHUD+/− model

In this section, we present the usual susceptible-infected-recovered (SIR) compartmental model, then we add compartments to adapt to Covid-19 specificities. Finally, we propose an extension of this detailed model to account for the spatial spread of Covid-19 pandemic on a geographical setting, in our framework the Isère department in France.

2.1. Usual SIR model

The model presented in this paper is an extension of the usual SIR epidemiological model from Brauer and Castillo-Chavez (2001). The typical SIR model describes a population of constant size, where each individual can be either Susceptible S to a disease, Infected I with the disease or Recovered R from the disease. The transmission of the disease is ruled by the following equations: Embedded Image with Embedded Image where β is the transmission rate of the disease and γ the recovery rate.

2.2. The SIRHUD+/− model

More detailed models such as the SIRHUD+/− model in Charpentier et al. (2020) can be better suited for modeling Covid-19 pandemic as they account for detected and undetected infected individuals I+ and I−, detected and undetected recovered individuals R+ and R− as well as hospitalized individuals H, individuals receiving intensive care U and deceased individuals D. This approach is particularly useful in the case of Covid-19 because undetected infections and hospital saturation both play a key role in the handling of the pandemic.

The flow diagram of this model is sketched out in Figure 1.

Figure 1:
  • Download figure
  • Open in new tab
Figure 1:

Flow diagram of the pandemic model described by the set of equations in (2)

The dynamics of each compartment is modelled by the following equations: Embedded Image with Embedded Image

This model was calibrated in Guan et al. (2020) to fit the Covid-19 outbreak at the regional scale in France. In this paper, we aim to enrich this model to better account for spatial non-homogeneity of Covid-19 propagation.

2.3. Spatialization of the SIRHUD+/− model

In our model, we keep the same variables as in the SIRHUD+/− model from Charpentier et al. (2020), but each variable will be depending on space and time. The governing equations of our model are then: Embedded Image where ∇ denotes the spatial gradient operator.

Note that the variables S, I−, R− and R+ now feature a diffusion term. This diffusion term translates the fact that individuals from these compartments move locally around their position, whereas individuals from compartments I+, H, U and D are isolated and do not transmit the disease. The amount of local movement is quantified by the variable k(x, t) that may depend on the position x and vary in time due to changes in sanitary measures. The model that we choose for the diffusion term is the following: Embedded Image where ρ(x) is the local population density, β(t) the transmission rate and adiff a positive constant. It seems indeed reasonable to assume that in high population density, more local movement occurs, and when sanitary measures are taken, individuals reduce their local movements. The variable adiff will be part of the calibration variables.

Parameters aAB are derived from the probability of moving from compartment A to B and the average duration in compartment A. These variables are more precisely described in Guan et al. (2020). Parameter λ1(t) corresponds to the proportion of undetected infected I− that are being tested positive, hence moving to compartment I+, this models virological testing such as PCR tests. λ2(t) corresponds to serological testing of recovered people from R− that are moved to R+ when tested. These parameters are time-dependent because the testing policies and capabilities have changed during the pandemic. Moreover they can also be used for optimal control, because individuals from I+ isolate themselves and do not spread the virus. In this work, we considered that the serological testing was negligible: λ2(t) = 0. For the calibration presented in Section 4, virological testing was set to λ1(t) = 0.01 because there was very few testing during the period on which we calibrate the model. However, in the forecasting part of the study that is described in Section 5, this parameter is varied over a wide range of values to replicate different phases of the testing policies and capabilities. In this model, we do not account for the loss of immunity (which could be done by adding a transition from the recovered compartment to the susceptible one), this is because our modeling is done at scale of days to a couple of months, at which scale the loss of immunity is neglected.

3. A meshless approach for solving partial differential equations

In this work, we used the radial basis function finite difference (RBF-FD) method from Fornberg and Flyer (2015) to solve the set of partial differential equations defined by (3) in Section 2. To the best of our knowledge, this is the first implementation of the RBF-FD approach to simulate an epidemiological model. Meshless approaches are particularly well-suited to our framework because the domain on which we solve PDE’s is a geographical area that has a complex boundary. In the following, we take Nd points denoted Embedded Image in the interior of the domain. We also take Nb points denoted Embedded Image over the boundary. In the following of this section, we focus on compartment of susceptible individuals S. Note that all the compartments involved in (3) are treated in a similar way.

3.1. Radial Basis Functions

Radial basis functions (RBF) are functions ϕ(xk, xl) that depend on the distance r = ‖xk − xl‖ between xk and xl. We will use the set of points Embedded Image to get an approximation of variable S. For each point xi we denote In(xi) the indices of the n nearest neighbors of xi in Embedded Image. The set of points indexed by In(xi) defines the RBF-FD stencil of xi. A local approximation of S around xi can then be obtained from the radial basis functions ϕk(x) = ϕ(x, xk) for k ∈ In(xi). In this paper, to compute the approximation Sa of S, we used the inverse multi quadratic functions: Embedded Image

Equation (4) below gives a local approximation of S around xi: Embedded Image

Similarly, we denote Xa the approximation of X ∈{I−, R−, R+} whose dynamic is governed by reaction-diffusion PDEs of system (3).

3.2. Radial Basis Function Finite Differences

The approximation of the variables around stencil points allows us to locally discretize the diffusion operators that appears in the set of equations (3).

3.2.1. Operator discretization on the domain

Using (4), we can approximate the diffusion operator for variable S appearing in (3) as follows: Embedded Image

The sum can be written in matrix form Embedded Image with B(xi) containing the operator values at points in the stencil of xi, indexed by In(xi) = {j1, …, jn}, Embedded Image

K a n × n symmetric matrix containing the values of the radial basis functions at each couple of points in the stencil of xi, Embedded Image

Embedded Image a vector containing the values of Sa at each point of the stencil of xi: Embedded Image

The same diffusion operator approximations as given by (5) hold for the compartment variables X ∈{I−, R−, R+} governed by reaction-diffusion PDEs of system (3).

3.2.2. Discretization of the boundary conditions

Boundary conditions can be defined by operator Bc, leading for variable S to: Embedded Image where Sb(x, t) is the boundary term that is imposed on point x. Applying the same method as in the previous section to the boundary points, we obtain for points Embedded Image of the boundary: Embedded Image where K(xi) and Embedded Image are defined by (6) and (7) respectively, and Φ(xi) is the vector containing the values of the radial basis functions between xi and the n nearest neighbors of xi: Φ(xi) = (ϕj1 (xi), …, ϕjn (xi)). Similar boundary operator approximations as given by (9) hold for other compartment variables X ∈ {I−, R−, R+} governed by reaction-diffusion PDEs of system (3).

3.3. Reducing the equations to a system of ordinary differential equations (ODEs)

We finally apply (5) to the diffusion terms in (3) which leads, for points Embedded Image in the interior of the domain, to: Embedded Image where Fα is the source term of compartment α in (3). Additionally, for points xi on the boundary, the boundary condition for each compartment variable X ∈ {I−, R−, R+} can be written as Embedded Image. This set of equations with the boundary conditions can be summarized as the following algebraic-differential equations: Embedded Image where G is the vector of boundary conditions for the compartment variables at each boundary point: Embedded Image

Embedded Image and Embedded Image are the vectors of variables S, I−, R−, R+ evaluated at each point of the interior and boundary of the domain respectively, Z2 is the vector of variables I+, H, U, D evaluated at each of the points (interior and on the boundary) of the domain, and matrices D1, D2, D3, D4 contain the coefficients of the discretized diffusion and boundary operators using (5) and (9).

When applying Dirichlet conditions as proposed in what follows, we have D3 = 0 and D4 = Id, leading to the following system of nonlinear ODEs: Embedded Image

3.4. Application to the epidemiological model

The methodology described in the previous section allows us to solve the set of differential equations (3) on a domain given a set of collocation points in the interior of the domain and over its boundary. In our application we want to model the spread of Covid-19 in the department of Isère, France. However our methodology is generic and could be applied to any geographical area of interest, provided corresponding data is available.

3.4.1. Collocation points

For generating the collocation points in the interior of the domain, we used a two-dimensional Halton sequence (see, e.g., Lemieux, 2009) defined on a square and only selected points in the interior of the region, as shown in Figure 2. We thus get a low discrepancy sequence of Nd = 1000 points in the interior of the domain.

Figure 2:
  • Download figure
  • Open in new tab
Figure 2:

1000 collocation points in the interior of the domain obtained by Halton sequence.

Each collocation point was given the population density value of the closest city taken from INSEE (2014). A polygon describing the boundary of the department was obtained from Lexman (2019) then interpolated to obtain Nb = 800 points along the boundary.

3.4.2. Boundary conditions

As shown in Figure 3, the population density is very heterogeneous over the domain, with the south-east being very rural with densities under 100 hab/km2 and the north-east and center of the department being a lot more urbanised. We chose to apply null boundary condition on the boundary points xi with density ρ(xi) < dmin = 200 hab/km2. On the rest of the boundary we assumed that the population of each compartment was equal to the average population of that compartment over the domain. More precisely, for Nd +1 ≤ i ≤ Nd +Nb with ρ(x) ≥ dmin, we assumed Embedded Image.

Figure 3:
  • Download figure
  • Open in new tab
Figure 3:

Population density attributed to each collocation point.

3.4.3. Initial Conditions

At t = 0, the proportion of individuals in each compartment should be defined in every point of the domain, which represents 8 × Nd = 8000 parameters. These proportions are not known everywhere nor for each compartment and thus have to be calibrated. To reduce the number of parameters to be calibrated, the proportions of infected and hospitalized individuals are modelled by a uniform distribution over the domain. For i ≤ Nd, we assume: Embedded Image where i0/2 and h0/2 are the initial proportions of infected and hospitalized individuals in the domain.

Then we used a numerical integrator based on backward differentiation formulas to solve the stiff system of ordinary differential equations expressed in (11). The time step was set equal to one day.

4. Model Calibration

In this section, we propose a calibration procedure for our spatialized detailed model, extending the procedure proposed in Guan et al. (2020), based on hospital data from Santé Publique France (2021).

4.1. Initial calibration

To initialize the calibration procedure of our spatialized model, we took the result of the calibration of its non-spatialized version performed in Guan et al. (2020).

4.2. Variables to fit

We used data from Santé Publique France (2021), namely the number of hospitalizations Hobs, the number of intensive care hospitalizations Uobs, the death toll Dobs and the number of recoveries from hospitalization Rhobs. The variables H, U and D are already involved in the model description (3), we only added to the model the number of returns from hospitalization: Embedded Image

Because of the weekly fluctuations of the data, a rolling average of 7 days was applied to the data from Santé Publique France (2021).

4.3. Calibration period

We calibrated our model over the first wave of Covid-19 in France which took place between March 17th and May 11th as was done in Guan et al. (2020). This period is suitable for calibration purposes because the sanitary conditions remained stable over this time period. Moreover, we used hospital data as this data set is the more reliable one, in opposition to test data that is submitted to changes in testing capabilities and policies.

4.4. Calibration parameters

Our model has numerous parameters that are presented in this section. Because of the amount of parameters we only calibrated a selection of them, the rest staying at the value given in Guan et al. (2020) as detailed hereafter.

4.4.1. Transmission rate β

The transmission rate is one of the most sensitive parameters to tune the model (see, e.g., Da Veiga et al., 2021), hence its parametrization has to be chosen carefully. We used the one proposed in Liu et al. (2021) and added a non-zero asymptotic value β∞: β(t) = (β0 − β∞) exp−μ(t−τ) + β∞. Note that we chose a transmission rate that is only dependent on time. This assumption is motivated by the fact that sanitary measures were the same all around the region of interest. However, for a larger scale implementation of this model, the transmission rate could be defined region-wise depending on the scale at which sanitary measures are enforced.

This model is quite reasonable for lockdown conditions: after time τ, the transmission rate decreases exponentially at speed μ from it’s original value β0 to an asymptotic value β∞. An example can be seen on Figure 4 above. We then have 4 parameters to calibrate β.

Figure 4:
  • Download figure
  • Open in new tab
Figure 4:

Transmission rate β over time with coefficients β0 = 0.25, μ = 0.2, τ = 5 and β∞ = 0.05.

4.4.2. Diffusion coefficient

The diffusion coefficient adiff appearing in (3) accounts for the non-homogeneity of the transmission of the disease.

4.4.3. Compartment transition variables

The parameters aXY appearing in (3) depend on transition times and probabilities. The detailed formulas to obtain aXY are written in Guan et al. (2020). The transition times and probabilities coming into play are:

  • - pa, ph and pu the respective probabilities of needing no hospital care, hospitalization and intensive care,

  • - phu, phd, pud the respective probabilities of needing intensive care or dying while hospitalized or in intensive care,

  • - Na and Ns the recovery times of asymptomatic and lightly symptomatic individuals with no hospitalization,

  • - Nih, Nhd, Nhu, Nud, Nhr and Nur the average transition times between compartments I±, H, U and D.

This totals 14 constants to be calibrated to account for recoveries.

4.4.4. Initial Conditions

As detailed in Section 3.4.3, the initial conditions are reduced to the initial proportions i0/2 and h0/2 of I− and H. If the calibration was done in a later phase of the pandemic, we could add initial proportions of recovered individuals (R±) as well as intensive care occupation (U).

4.5. Calibration process

We presented the variables to fit and the corresponding French hospital data, the set of parameter values for the initialization, the time period over which we performed the calibration and finally all the parameters to tune, including initial conditions. In this section we detail the optimization procedure and we present the results of the calibration.

4.5.1. Loss function

As we used hospital data aggregated on the department of Isère to fit our spatialized model, we first need to define Hsim, Usim, Dsim and Rhsim the total number of individuals in the department of Isère of the corresponding compartment in the simulation output. We then define the loss function to be minimized over the set of parameters p as: Embedded Image with tmax = 55 days, which is length of the first lockdown in France.

4.5.2. Minimization method

For minimizing the calibration loss (13), we used a stochastic optimization Python package noisyopt Mayer (2017). We also gave to the solver bounds for each parameter equal to [v0/5, 5v0] where v0 is the nominal value obtained from Section 4.1.

4.5.3. Results

Only a subset of parameters was calibrated. Parameters pa, ph, pu, phu, phd, pud, Na and Ns were kept to the value obtained in the calibration of the non-spatialized version of our model performed in Guan et al. (2020): pa = 0.9, ph = 0.9, pu = 0.2, phu = 0.001, phd = 0.176, pud = 0.3, Na = 7.82 and Ns = 15. This last point is commented in Section 4.5.4.

The optimal parameters to fit the data as viewed on Figure 5 are given in Table 1.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1:

Optimal calibration parameters.

Figure 5:
  • Download figure
  • Open in new tab
Figure 5:

Fitting of compartments H, U, D and Rh in Isère during the first lockdown in France (March-May) using the spatialized model described by (3).

4.5.4. Calibration limitations

This calibration was done under lockdown condition where the evolution of β(t) was shown to be exponentially decreasing Liu et al. (2021). This assumption does not hold in the general case; the evolution of the transmission rate can follow more complex variations as will be shown in the forecasting of β that is carried out in Section 5. We also assumed that the probabilities of moving from one compartment to the other and the average duration in each compartment remained constant regardless of the sanitary measures, as being epidemiological parameters. Note however that this assumption may not hold when studying subsequent variants of Covid-19 and their propagation in a partially vaccinated population, hence these parameters should be included in the calibration procedure if studying this later phase of the pandemic.

5. Forecasting using reduced order modeling

We saw in Section 4 that our spatialized model is able to reproduce the outbreak of Covid-19 at the scale of the department while giving a very localized information on hospital needs and infection number. However this model is computationally expensive compared to a non-spatialized model, this is why we propose to rely on reduced order modeling to fasten the online computations required for short-term forecasts of the infection number. In this section, we extend the model reduction tools introduced in Bakhta et al. (2021) to the treatment of our spatialized model. In the following of this section, we explain how to build a reduced basis for extrapolating variables Embedded Image and Embedded Image defined in (14). We then detail how it is possible to use the reduced basis to compute online the nforecast-day forecasts of the global infection number in the department of Isère from an observation period of ncalib = 10 days.

5.1. Fitting epidemiological data with a time dependent SIR model

In this section, we project our spatialized 8-compartment model into a time-dependent SIR model described by (1). This is done by aggregating the spatial output of each compartment over the domain and regrouping separate compartments in either S, I or R as follows: Embedded Image

Starting from an initial epidemiological state (I(t = 0) = i0, R(t = 0) = r0), the evolution of the SIR model is ruled by parameters β(t) and γ(t). As a consequence, forecasting the total number of infections and recoveries over the domain can be done by extrapolating β(t) and γ(t). Let Sobs(t), Iobs(t) and Robs(t) denote the evolution of S(t), I(t), R(t) observed over a time period. We then define: Embedded Image

Under mild assumptions that are detailed in Bakhta et al. (2021), equations in (14) allow a simple SIR modeling of the observed data starting from the initial state Embedded Image. Note that in (14), one can replace observation data by simulation data from a model that outputs S, I and R over time, leading to the definition of Embedded Image and Embedded Image that we will use for forecasting as detailed in Section 5.5.2. It is possible to infer Iobs and Robs from the data provided by Santé Publique France (2021), applying an adjustment factor (see Bakhta et al. (2021)[Section 4.1] for more details): Iobs = f Hobs, Robs = f Rhobs + Dobs, with f = 15 computed from the literature (Mizumoto et al., 2020; Di Domenico et al., 2020) and assessing the proportion of individuals needing hospitalization when infected with Covid-19.

The evolution of Embedded Image is shown on Figure 6, on which one observes the influence of lockdown periods and sanitary measures. Note that Embedded Image takes negative values around July, 2020. This reflects the very low tendency of hospital data in Isère during the summer of 2020. During that period, the SIR dynamics did not fit well the hospital data, leading to negative values for Embedded Image. However this only happened in a short interval of time during which the pandemic was stable.

Figure 6:
  • Download figure
  • Open in new tab
Figure 6:

Evolution of Embedded Image obtained from hospital data between March 2020 and July 2021. We can see that the dates of the sanitary measures match the change of variations of Embedded Image At the very end of the profile we can see the start of the “delta wave” that occured during the redaction of this work.

5.2. Detailed model evaluations

In order to fasten the online computations we built a reduced order model. A key stone in the reduction of the model is the construction of a reduced basis from a set of evaluations of the detailed model. We built K different reduced bases (ℬk)1≤k≤K for extrapolating the short-term evolution of βobs and γobs, by moving the initial time. We then selected the most suited reduced basis and used it for actual forecasting. Note that for the construction of the reduced bases, the detailed model described in (3) was slightly modified by considering that β and λ1 do not depend on time. Then the model was run with different set of initial conditions and parameter values. The range of values we considered for initial conditions and for parameters β and λ1 is provided in Table 2. Even if the bounds for each parameter were chosen to be consistent with the sanitary condition at the start of the second wave of Covid-19 in France (September, 2020), the idea is that the range of admissible values is chosen wide enough to match different calibration periods. Each parameter was sampled from a Halton sequence of length nruns = 500. The use of a Halton sequence allows to better explore the interval in which each parameter is defined. Other parameters were fixed from Table 1 in Section 4.5.3.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 2:

Bounds for the training set of parameters.

Then, the detailed model was run with these nruns combinations of parameter values over a period of neval = 200 days, producing a set of detailed outputs - the number of individuals in each of the 8 compartments over time - that are collapsed into SIR outputs using the following rules: Embedded Image

For each run, we computed a realization of Embedded Image using (14) on the SIR outputs. The set of realizations of Embedded Image and Embedded Image are respectively denoted B and G in the following sections. The infection number over time is shown in Figure 7 for a selection of 50 sets of parameter values. In Sections 5.3 and 5.4 we detail the construction of the reduced order model from the set of realizations of Embedded Image.

Figure 7:
  • Download figure
  • Open in new tab
Figure 7:

Evolution of the infection number over neval days whith 50 sets of parameter values from the training set.

On most of the simulations in Figure 7, we observe a wave growing over 50 days after the start of the simulation, as the bounds in Table 2 were chosen to be coherent with the sanitary conditions in France at the start of the second wave in France (September, 2020). However, recall that we widen the bounds in order to build a more flexible set of reduced bases, which explains why we observe other scenarios, such as simulations with a wave starting around 110 days after the starting time, or even simulations without any wave.

Even though Figure 8 was obtained from detailed model evaluations with β not depending on time, the projection to a simple SIR model of the output as described in (15) led to a set of realizations of time-dependent Embedded Image.

Figure 8:
  • Download figure
  • Open in new tab
Figure 8:

Subsets of B (left) and G (right) obtained by applying (14) to the SIR trajectories outputted by the detailed model.

5.3. Simulation output reduction

From the set of realizations of Embedded Image shown in Figure 8 we built respective sets of functions ℬ and 𝒢 defined on neval = 200 days for β and γ that can reproduce the typical variations of the detailed model. We used two different methods for building ℬ and 𝒢: one based on Singular Value Decomposition and another one on a greedy algorithm. In the following, we detail the construction of ℬ from B, the same steps were applied to construct 𝒢 from G.

5.3.1. Singular Value Decomposition

This approach consists in discretizing the functions in B at a 1-day timestep. We then obtain a neval × nruns matrix denoted Mβ containing all the Embedded Image in B. We then compute eigenvalues and eigenvectors of the matrix Embedded Image. The ordered eigenvalues of A are shown in Figure 9. Then ℬSVD is composed with the nspan eigenvectors with the largest eigenvalues. The basis we obtained for nspan = 4 is shown on Figure 10. We remark on Figure 9 that the magnitude of the 10 largest eigenvalues is significantly larger than the rest, which is why with this approach we never used more than 10 functions to build ℬSVD in the following.

Figure 9:
  • Download figure
  • Open in new tab
Figure 9:

Eigenvalues of matrix A.

Figure 10:
  • Download figure
  • Open in new tab
Figure 10:

The set ℬSVD of functions obtained through the Spectral Values Decomposition-based algorithm with nspan = 4.

As can be seen on Figure 10, the elements in ℬSVD are not constrained to remain positive. Section 5.3.2 tackles this issue by constructing in a greedy way a basis of positive functions.

5.3.2. Greedy algorithm

The method is based on the algorithm called Enlarged Nonnegative Greedy (ENG) presented in Bakhta et al. (2021). We first use a greedy algorithm to build a set 𝒜 of nonnegative functions selected from B. This set is initialized as follows: Embedded Image.

Then, while 𝒜 has k ≤ nspan elements (𝒜 = {a1, …, ak}), we add the function anew ∈ B that has the maximal distance from Span (𝒜): Embedded Image

We thus obtain a set of nspan nonnegative elements of B that will be used to build the set ℬENG using the enlarge cone algorithm from Bakhta et al. (2021):

Algorithm 1:

Enlarge Cone

Figure
  • Download figure
  • Open in new tab

Note that, due to numerical integration errors, some of the curves in Figure 8 take a few values below zero. However in practice, applying Algorithm 1 to these functions, we obtained positive functions in ℬENG as can be seen on Figure 11. The same happened for 𝒢ENG (defined from G as ℬENG from B).

Figure 11:
  • Download figure
  • Open in new tab
Figure 11:

The set ℬENG of functions obtained through the greedy algorithm combined with Algorithm 1 for nspan = 4.

Figure 12:
  • Download figure
  • Open in new tab
Figure 12:

Extraction of a reduced basis from the functions obtained with the detailed model evaluations and the reduced order modeling algorithms presented in Sections 5.3.1 and 5.3.2.

5.4. Building a set of reduced bases for short-term forecasting by moving the initial time

In this section, we detail the way ℬ (obtained from SVD or ENG) was used to build a set of reduced bases for extrapolating Embedded Image.

5.4.1. Extraction of a reduced basis

Given a set Embedded Image of functions defined on neval days, we extract a set of reduced bases (ℬ k)1≤k≤K by applying a sliding window to these functions. More precisely, we first choose a window step a of 5 days. Then, for 1 ≤ k ≤ Embedded Image, with E(x) the integer part of x, we denote ℬk the set of functions Embedded Image corresponding to the restriction of bi to the window [(k − 1) × a, (k − 1) × a + ncalib + nforecast], for 1 ≤ i ≤ nspan.

5.4.2. Enrichment of the reduced bases

In Section 5.4.1 we built a set of reduced bases (Bk)1≤k≤K for extrapolating Embedded Image from the reduction of our detailed model evaluations. Additionally, we can add well-chosen functions to these reduced bases thanks to prior knowledge of the evolution of Embedded Image. As can be seen on Figure 6, the variations of Embedded Image are mostly exponential. Starting at time t0 and given a ncalib-day evolution of Embedded Image, we can fit the following exponential function: Embedded Image to Embedded Image using least squares and add the function Efit([t0, t0 + ncalib + nforecast]) to the reduced basis. This is illustrated in Figure 13. In the following, this raw exponential extrapolation will be used as benchmark extrapolation of Embedded Image and Embedded Image or in combination with the reduced bases obtained from ℬSVD and ℬENG.

Figure 13:
  • Download figure
  • Open in new tab
Figure 13:

Exponential fit and forecast of the observed β in Isère department over a 20-day window starting on March 17th, 2020. Here t0 = 0, ncalib = 10, nforecast = 10.

5.5. Short-term forecasts of the infection number

We now have K potential candidates for the reduced basis of Embedded Image extrapolation. In this section, we present the way we selected the best reduced basis, and then how we used it to get a forecast on the infection number.

5.5.1. Selection of the best forecast for beta

For computing a forecast on [t0 + ncalib + 1, t0 + ncalib + nforecast], we divided the calibration period in T1 = [t0, t0 + n1 − 1] and T2 = [t0 + n1, t0 + ncalib]. Then for each reduced basis Embedded Image, we defined the loss function: Embedded Image

By optimizing it, we computed a function Embedded Image defined on [t0, t0 + ncalib + nforecast] fitting at best Embedded Image on T1. We then computed: Embedded Image and selected the function Embedded Image with the lowest evaluation of the loss Embedded Image for extrapolating Embedded Image on [t0 + ncalib + 1, t0 + ncalib + nforecast].

5.5.2. Short-term forecast of the infection number

By applying the methodology presented in Section 5.5.1 to the parameter β, we obtained a calibration-forecast function βcalib as presented on Figure 14. The methodology was applied in parallel to the parameter γ. From βcalib and γcalib, we could run the time-dependent SIR model defined by (1) with initial conditions (Iobs(t0), Robs(t0)) and parameters β = βcalib, γ = γcalib. The model output on the infection number is shown on Figure 15.

Figure 14:
  • Download figure
  • Open in new tab
Figure 14:

10-day fit followed by a 10-day forecast of Embedded Image using the SVD basis presented in 5.3.1.

Figure 15:
  • Download figure
  • Open in new tab
Figure 15:

Forecast of the infection number in Isère department using the SVD reduced basis with nspan = 4.

5.6. Comparison of the SVD and greedy algorithms

To evaluate the error of our models, we generated 10-day forecasts every 5 days between March 2020 and May 2021. In this section and in the following, the forecast error for a calibration starting at t0 is: Embedded Image

We compared the SVD and Greedy algorithms with and without the addition of an exponential extrapolation of Embedded Image, the number of basis elements was set to nspan = 5. The average error on the full timeframe are shown on Figure 16. The ENG and SVD algorithms performed very similarly, and the exponential enrichment of the reduced bases did not improve significantly the forecasting score. Therefore in the following we computed 10 day forecasts with the SVD algorithm, as it is much faster. Also, in the following sections, we did not use the exponential enrichment.

Figure 16:
  • Download figure
  • Open in new tab
Figure 16:

10-day forecast error for the greedy algorithm (ENG), the enriched greedy algorithm (ENGexp), the SVD algorithm (SVD), the enriched SVD algorithm (SVDexp) and the exponential benchmark algorithm (exp). We can see that the enrichment does not improve significantly the forecasting score, and that the SVD and ENG algorithms perform very similarly.

5.7. Building an aggregated model

In this section, we chose to focus on the SVD algorithm that we evaluated on 10-day forecasts. The choice of nspan is crucial because fewer basis functions may not be sufficient for capturing the complexity of the problem while a too large span of basis functions could lead to overfitting over the calibration and deteriorating the forecast accuracy of the reduced basis. We compared the models obtained by using different sizes of reduced basis, and built an aggregated model (Agg) that averages the forecasts obtained with the different basis size.

We computed the forecast errors on the infection number during a full year of 10-day forecasts with different sizes of the reduced basis, keeping the same number of basis functions for β and γ. The results are shown on Figure 17 for the SVD algorithm. We can see that our aggregated model performed better than any individual model, and that this model reached a 5.5% error on 10 day forecasts. The visualization of some individual model predictions can be seen on Figure 19. Moreover, the aggregated model seems more robust than the exponential extrapolation, preventing from very large errors as can be seen, e.g., on Figure 18.

Figure 17:
  • Download figure
  • Open in new tab
Figure 17:

Forecast error of the aggregated model and of each individual model obtained with nspan ∈ [1, 8]. The aggregated model is the average of models 3, 4 and 5. We can see that the aggregated model performed slightly better than any of these individual models.

Figure 18:
  • Download figure
  • Open in new tab
Figure 18:

Comparison of forecasts obtained with the aggregated and exponential models at periods where the dynamics of Covid-19 changed quickly. We can see that the exponential extrapolation of Embedded Image is more prone to exploding forecasts while the aggregated model has a more robust behaviour. Additionnaly, the third plot shows an example of a change in the dynamics that creates a large error as the forecasts can not anticipate a brutal change, e.g., in sanitary restrictions.

Figure 19:
  • Download figure
  • Open in new tab
Figure 19:

Visualization of the individual model forecasts in different phases of the pandemic. The bottom left plot illustrates the fact that the aggregated model performs better than any individual model.

5.8. Epidemiological forecast using the detailed model

To evaluate the efficiency of the reduced modeling approach for forecasting, we compared its cost with the one consisting in forecasting the infection number directly from the detailed model. All the computation times mentioned below were obtained on a regular laptop. In the reduced basis approach, the offline phase consists of nruns = 500 evaluations of the detailed model over ndays = 200 days. The detailed model runs in τ = 0.15 s.day−1, which totals Coff = nruns ndays τ = 15000 seconds. The online phase has a cost of Con = 26 seconds, which is the time for projecting Embedded Image and Embedded Image onto the reduced bases (ℬ k)1≤k≤K and (𝒢 k)1≤k≤K, as described in 5.5.1. The cost of running the reduced model itself is negligible because it is run only once, with the optimal extrapolations of β and γ. Hence, the total cost of the reduced model approach for κ forecasts is κ Con + Coff.

In the direct approach, the detailed model is evaluated over ncalib+nforecast = 20 days and the optimization of the 9 parameters presented in Section 5.2 required neval = 600 evaluations, totalling a direct time of C = (ncalib + nforecast) × neval × τ = 1800 seconds per forecast.

For κ forecasts, the gain is Embedded Image With κ ≈ 10 forecasts, the cost of the offline phase is compensated by the gain in the online phase.

5.9. Discussion

This study has shown that extrapolating Embedded Image and Embedded Image using the exponential function as presented in 5.4.2 is a reasonable solution for forecasting the infection number at a 10-day timescale. However our approach led to a slightly better forecasting score with a model that is more robust, hence more reliable. We would like to point out that the forecasting score has an underlying error due to the changes in dynamics of the epidemic with sanitary measures that are taken. In fact, the model has large overestimations of the infection number every time a new measure is taken, as can be seen on the bottom plot of Figure 18. As a consequence, the forecasting error could not drop to a very low value without inputting the sanitary measures into the model. The model would perform better with no changes in the dynamics, however this model is still very useful because it gives the evolution of the pandemic if no measures were taken, which can help to measure the efficiency of the restriction rules.

6. Conclusion

In this paper, we presented a spatialized extension of a detailed mutli-compartmental epidemiological model that allowed us to reproduce the evolution of hospital needs at the scale of the department of Isère during the Covid-19 outbreak, while giving a detailed information about the geographical heterogeneity of the sanitary conditions. The partial differential equations that define the model were solved with a very efficient meshless scheme on a very irregular domain. From this model we built a reduced basis for extrapolating the transmission rate β and recovery rate γ obtained from hospital data and we used them in a surrogate model to output forecasts of the global infection number in the department of Isère. A simple exponential extrapolation proved to be efficient for the extrapolation of the transmission and recovery rates but the aggregation of surrogate models using different reduced bases gave a more robust forecast. The work presented in the present paper could be used to evaluate the efficiency of restriction rules that are taken by comparing the forecasts as a reference for the evolution of the pandemic with the actual evolution of the infection number when restriction rules are taken. Also, testing policies could be evaluated by controlling parameters λ1 and λ2 and by evaluating the dynamics of the pandemic while more people are being isolated after being tested. Note that the spatialized detailed model we introduced could be completed to account, e.g., for vaccination or emergence of a variant. We could add compartments for the vaccinated population by calibrating their own probability of transmission and severity of symptoms. In particular, this could allow to compare different vaccination policies. The spatialized model could take into account age categories using contact, testing and vaccination rates as well as symptom severity in each age category. This could be solved with the same approach, but would benefit from more precise hospital data involving the age of patients for calibration.

Data Availability

All data produced in the present study are available upon reasonable request to the authors

Footnotes

  • * Institute of Engineering and Management, Univ. Grenoble Alpes

  • Email addresses: matthieu.oliver{at}mines-paristech.fr (Matthieu Oliver), didier.georges{at}gipsa-lab.grenoble-inp.fr (Didier Georges), clementine.prieur{at}univ-grenoble-alpes.fr (Clémentine Prieur)

References

  1. ↵
    Al-Tawfiq, J., 2020. Asymptomatic coronavirus infection: Mers-cov and sars-cov-2 (covid-19). Travel Medicine and Infectious Disease 35, 101608. doi:10.1016/j.tmaid.2020.101608.
    OpenUrlCrossRef
  2. Pifarré i Arolas, H., Acosta, E., Léopez-Casasnovas, G., Lo, A., Nicodemo, C., Riffe, T., Myrskylä, M., 2021. Years of life lost to covid-19 in 81 countries. Scientific Reports 11. doi:10.1038/s41598-021-83040-3.
    OpenUrlCrossRefPubMed
  3. ↵
    Bakhta, A., Boiveau, T., Maday, Y., Mula, O., 2021. Epidemiological forecasting with model reduction of compartmental models. application to the covid-19 pandemic. Biology 10.
  4. ↵
    Berger, D., Herkenhoff, K.F., Mongey, S., 2020. An seir infectious disease model with testing and conditional quarantine. Political Economy - Development: Public Service Delivery eJournal.
  5. ↵
    Brauer, F., Castillo-Chavez, C., 2001. Mathematical Models in Population Biology and Epidemiology. Texts in Applied Mathematics, Springer New York.
  6. ↵
    Charpentier, A., Elie, R., Laurière, M., Tran, V.C., 2020. Covid-19 pandemic control: balancing detection policy and lockdown intervention under icu sustainability. Mathematical Modelling of Natural Phenomena 15, 57.
    OpenUrl
  7. ↵
    Da Veiga, S., Gamboa, F., Iooss, B., Prieur, C., for Industrial, S., Mathe-matics, A., 2021. Basics and Trends in Sensitivity Analysis: Theory and Practice in R. Society for Industrial and Applied Mathematics. URL: https://books.google.fr/books?id=Tqt9zgEACAAJ.
  8. ↵
    Di Domenico, L., Pullano, G., Sabbatini, C.E., Boëlle, P.Y., Colizza, V., 2020. Impact of lockdown on covid-19 epidemic in île-de-france and possible exit strategies. BMC medicine 18, 1–13.
    OpenUrlCrossRef
  9. ↵
    Fornberg, B., Flyer, N., 2015. Solving PDEs with radial basis functions. Acta Numerica 24, 215–258.
    OpenUrl
  10. ↵
    Guan, L., Prieur, C., Zhang, L., Prieur, C., Georges, D., Bellemain, P., 2020. Transport effect of COVID-19 pandemic in France. Annual Reviews in Control 50, 394–408.
    OpenUrl
  11. ↵
    INSEE, 2014. Population villes de France. https://sql.sh/ressources/sql-villes-france/.
  12. ↵
    Lemieux, C., 2009. Quasi-monte carlo constructions, in: Monte Carlo and Quasi-Monte Carlo Sampling. Springer, pp. 1–61.
  13. ↵
    Lexman, A., 2019. Carte des départements. https://www.data.gouv.fr/fr/datasets/carte-des-departements-2-1/.
  14. ↵
    Liu, Z., Magal, P., Webb, G., 2021. Predicting the number of reported and unreported cases for the covid-19 epidemics in china, south korea, italy, france, germany and united kingdom. Journal of theoretical biology 509, 110501.
    OpenUrlCrossRef
  15. ↵
    Mayer, A., 2017. Noisyopt: A python library for optimizing noisy functions. Journal of Open Source Software 2, 258.
    OpenUrl
  16. ↵
    Mizumoto, K., Kagaya, K., Zarebski, A., Chowell, G., 2020. Estimating the asymptomatic proportion of coronavirus disease 2019 (covid-19) cases on board the diamond princess cruise ship, yokohama, japan, 2020. Eurosurveillance 25, 2000180.
    OpenUrl
  17. ↵
    Roques, L., Klein, E.K., Papaïx, J., Sar, A., Soubeyrand, S., 2020. Impact of lockdown on the epidemic dynamics of covid-19 in france. Frontiers in medicine 7, 274.
    OpenUrl
  18. ↵
    Santé Publique France, 2021. Données hospitalières relatives a l’épidémie de COVID-19. https://www.data.gouv.fr/en/datasets/r/63352e38-d353-4b54-bfd1-f1b3ee1cabd7.
Back to top
PreviousNext
Posted November 04, 2021.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Spatialized Epidemiological Forecasting applied to Covid-19 Pandemic at Departmental Scale in France
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Spatialized Epidemiological Forecasting applied to Covid-19 Pandemic at Departmental Scale in France
Matthieu Oliver, Didier Georges, Clémentine Prieur
medRxiv 2021.11.03.21265855; doi: https://doi.org/10.1101/2021.11.03.21265855
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Spatialized Epidemiological Forecasting applied to Covid-19 Pandemic at Departmental Scale in France
Matthieu Oliver, Didier Georges, Clémentine Prieur
medRxiv 2021.11.03.21265855; doi: https://doi.org/10.1101/2021.11.03.21265855

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Infectious Diseases (except HIV/AIDS)
Subject Areas
All Articles
  • Addiction Medicine (175)
  • Allergy and Immunology (421)
  • Anesthesia (97)
  • Cardiovascular Medicine (901)
  • Dentistry and Oral Medicine (170)
  • Dermatology (102)
  • Emergency Medicine (257)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (407)
  • Epidemiology (8792)
  • Forensic Medicine (4)
  • Gastroenterology (405)
  • Genetic and Genomic Medicine (1864)
  • Geriatric Medicine (179)
  • Health Economics (388)
  • Health Informatics (1292)
  • Health Policy (644)
  • Health Systems and Quality Improvement (492)
  • Hematology (207)
  • HIV/AIDS (395)
  • Infectious Diseases (except HIV/AIDS) (10570)
  • Intensive Care and Critical Care Medicine (564)
  • Medical Education (193)
  • Medical Ethics (52)
  • Nephrology (218)
  • Neurology (1758)
  • Nursing (103)
  • Nutrition (267)
  • Obstetrics and Gynecology (343)
  • Occupational and Environmental Health (461)
  • Oncology (965)
  • Ophthalmology (283)
  • Orthopedics (107)
  • Otolaryngology (177)
  • Pain Medicine (118)
  • Palliative Medicine (43)
  • Pathology (264)
  • Pediatrics (557)
  • Pharmacology and Therapeutics (266)
  • Primary Care Research (220)
  • Psychiatry and Clinical Psychology (1846)
  • Public and Global Health (3989)
  • Radiology and Imaging (655)
  • Rehabilitation Medicine and Physical Therapy (344)
  • Respiratory Medicine (536)
  • Rheumatology (215)
  • Sexual and Reproductive Health (178)
  • Sports Medicine (166)
  • Surgery (197)
  • Toxicology (37)
  • Transplantation (107)
  • Urology (80)