Modeling risk of infectious diseases: a case of Coronavirus outbreak in four countries

Md. Mazharul Islam; Md. Monirul Islam; Md. Jamal Hossain; Faroque Ahmed

doi:10.1101/2020.04.01.20049973

Abstract

Background The novel coronavirus (2019-nCOV) outbreak has been a serious concern around the globe. Since people are in tremor due to the massive spread of Coronavirus in the major parts of the world, it requires to predict the risk of this infectious disease. In this situation, we develop a model to measure the risk of infectious disease and predict the risk of 2019-nCOV transmission by using data of four countries—US, Australia, Canada and China.

Methods The model underlies that higher the population density, higher the risk of transmission of infectious disease from human to human. Besides, population size, case identification rate and travel of infected passengers in different regions are also incorporated in this model.

Results According to the calculated risk index, our study identifies New York State in United States (US) to be the most vulnerable area affected by the novel Coronavirus. Besides, other areas (province/state/territory) such as Hubei (China, 2^nd), Massachusetts (US, 3^rd), District of Columbia (US, 4^th), New Jersey (US, 5^th), Quebec (Canada, 20^th), Australian Capital Territory (Australia, 29^th) are also found as the most risky areas in US, China, Australia and Canada.

Conclusion The study suggests avoiding any kind of mass gathering, maintaining recommended physical distances and restricting inbound and outbound flights of highly risk prone areas for tackling 2019-nCOV transmission.

1. Introduction

Novel Coronavirus, Middle East Respiratory Syndrome-Coronavirus (MERS-CoV) related to Coronavirus that caused Severe Acute Respiratory Syndrome (SARS) epidemic 10 years ago, was first detected in Saudi Arabia and it has been spreading in other countries since 2012[1]. In December 2019, China noticed few pneumonia cases relating to novel Coronavirus (2019-nCOV). Told case incidence has been increasing dramatically and reached the hundreds, but it is likely to be an under-estimate[2]. The Coronavirus menace has now been spread all over the countries in globe. As of March 24, 2020, there were 372757 confirmed Novel Coronavirus-infected pneumonia (COVID-19) cases in 196 countries of which 16224 death cases were traced out worldwide[3]. This endemic of novel Coronavirus has been caused due to the mobility and interaction of people who moved from one country to another by outbound fights. Therefore, we argue that densely populated countries as well as cities are more likely to be infected with novel Coronavirus because of inbound and outbound flights and other means of transports. It implies that both in-bound and out-bound flights of passengers are significant factors in triggering Coronavirus infection risk in more densely populated countries than in less densely populated countries. However, previous studies did not consider the population density factor while modeling the risk of 2019-nCOV transmission. Recent studies estimates the risk of Coronavirus transmission from the Chinese cities to other destinations[4,5]. More importantly, Haider et al. (2020) built a risk index using the number of air travelers multiplied by the weight of the number of infected cases. Apart from previous studies, our study can fill the research gap constructing a risk index that takes population density and size, case identification rate, and the number of air travel into account.

2 Methods

2.1 Risk Index

We present a mathematical model that assesses the risk of infectious diseases such as Coronavirus. The model includes population size and density, the number of identified cases, case identification rate, and passengers’ travel among different regions. Risk Index (RI) of a certain infectious disease in a specific area a_ij of a country C_i where i = 1,2,3,…, n and j = 1,2,3, …, m_i is estimated for a given Calculation Period (CP) which is usually days, weeks, months or even years. RI is unit free which makes it comparable among areas; higher the RI, the greater the risk. Therefore, RI of an area a_ij is defined as where, A_ij and T_ij are the area and the travel specific risk components respectively.

2.1.1 Area specific risk

Area specific component estimates the risk of infectious diseases posed by the factors such as the number of identified cases, population size, and case identification rate. We also consider mobility—interaction factor to estimate the risk of spreading infectious diseases as mobility and interaction among people contribute to spread infectious diseases according to some studies [6–8]. So, it is expected that the higher the mobility and interaction among people in an area, the higher the risk of spreading infectious diseases. We can call it “Mobility-Interaction Effect”. The reasoning behind such hypothesis is that people have the tendency to amass in an area which provides better environment, livelihood, education, and other amenities. The aggregation process continues further with the economic development, forcing people to move low density areas to high density areas. As a result, people living in high density areas have higher mobility and interaction among themselves than the people living in low density areas[9]. Therefore, we propose that Relative Mobility-Interaction Effect (RMIE) can be estimated by the relative population density of that area within a country.

For an area a_ij, having the population density D_ij in country C_i, RMIE is measured as

Now using RMIE along with the number of identified cases, case identification rate, calculation period of risk measurement, and assuming that infectious disease spread at geometric rate, we write the area specific risk component as

Here, within the risk calculation period (CP),

P_i = Population of country i

P_ij = Population of area/city a_ij of country i

I _i = The total identified case of infectious disease in country i

I _ij = The total identified cases of infectious disease in a_ij

t _s = Starting time of CP

t _e = Ending time of CP

t_ij = Time when first case was identified in area/city a_ij

t _i = Time when first case was identified in country i

t _g= Time when first case was identified globally

Δt_g = t_e − t _g the duration of global identification

Δt_i = t_e − t_i the duration of first case identification in country i

Δt_ij = t_e − t _ij the duration of first case identification in area a_ij.

It accumulates two risk components: one is the area/city risk and the other is the country risk. An important feature of the measure (3) is that if an area has no identified case, we can assess area specific risk by country specific risk since through mobility-interaction effect of other areas create pressure of spreading disease to that particular area.

2.1.2 Travel specific risk

Drawing experience from previous outbreak of infectious diseases including the recent epidemic of Coronavirus, literatures suggest addressing the risk of travelling among different areas and countries[10–12]. In our study, the travel specific risk component for area a_ij of country C_i is measure as where V_pq is the number of travelers who left the area a _pq for different destinations within the time duration, V_{pq, ij} is the number of travelers who arrive at the area a_ij from a_pq within the same time duration, and A_pq is the area specific component for a_pq as described in equation 3. The travel risk component incorporates contribution of other area specific risks proportionately to the proportion of travelers from those areas. Combining equation (3) and (4), we get the final RI as follows:

Some interesting features of this index are worthwhile to mention. The index says that there is a risk of Coronavirus spreading even though the travel specific risk component becomes zero. That means even if a country imposes air travel restriction internationally, the risk of transmission still exists. The area specific component of the index actually portrays the dynamics of infectious disease spreading influenced by mobility and interaction among people. One of the important implications of the area specific component is that restricting international movement is not sufficient to contain the risk of Coronavirus spreading in particular and infectious diseases in general. Internal restriction is also important to suppress transmission risk in an area which is exactly what the current situation says. In that sense, we expect that our index has the higher probability of mimicking the current situation of Coronavirus.

2.2 Data sources

Data on Coronavirus identified cases, population size and density, and air travel among areas of the four countries – US, Canada, Australia and China are collected from three different sources. First, the total number of confirmed cases (up to 26 March 2020) and the dates of first identification of Coronavirus cases in geographic areas (territory/province/state) are extracted from Johns Hopkins University Center for Systems Science and Engineering (JHU CCSE) (https://systems.jhu.edu/research/public-health/ncov/)[13,14]. Second, data on population size and geometric area are collected from the website https://www.citypopulation.de. Finally, because of insufficient passengers’ travel data, we incorporate air travel data only among these areas. Air travel data are obtained from FLIRT (https://flirt.eha.io), a flight network analysis tool developed by EcoHealth Alliance which provides simulated passengers’ data for the final destination from different airports based on flight schedule database of 800 airlines[12,15].

2.3 Data analysis

We have defined the calculation period (CP) from 31^st December 2019 to 27^th March 2020.The total number of confirmed Coronavirus cases are accumulated up to 26^th March 2020 for different areas of respective countries. Population density (population per square kilometer) of these areas is calculated by dividing population size by the respective geometric area. At first, we calculate area specific component using equation 3. To estimate travel specific risk component presented in equation 4, we calculate the proportion of travelers arrived at area a_ij from other areas directly by using simulated data from FLIRT. In total 781 international airports among these areas are included in this study. All analysis are performed using statistical software R version 3.6.0[16].

3. Results

In this study, we present Coronavirus risk ranking of 104 areas (territory/province/state) of four countries based on our developed risk index (RI). Table 1 presents the rank of these areas along with values of area specific component (A), travel specific component (T) and overall RI score as of time from 31 December 2019 to 26 March 2020.

View this table:

Table 1:

Areas based on risk index for Coronavirus transmission

Figure 1:

Risk Index of Coronavirus infection (as of 26th March 2020)

Empirical findings depict that New York State in US is the most risky area in terms of Coronavirus infection among US, Canada, Australia, and China. As per report of 26 March 2020, about 38262 Coronavirus cases are detected in this state, and the value of RI is 0.00691484 which clearly shows the severity of infection comparing to others. Besides, some other areas such as Hubei (China), Massachusetts (US), District of Columbia (US), New Jersey (US), Georgia (US), Colorado (US), Illinois (US), Michigan (US) and Maryland (US) are ranked 2nd, 3rd, 4th, 5th, 6th, 7th, 8th, 9th and 10th respectively as per calculation of RI. Besides, Quebec is ranked 20th whereas Alberta is ranked 57th among Provinces in Canada. Moreover, Australian Capital Territory, Australia is ranked 29th followed by New South Wales in 49th position. More interestingly, remaining Chinese Provinces are ranked at the bottom of the table, implying less risk of Coronavirus infection in China comparing to US, Canada and Australia.

4. Discussion

China (Hubei Province) as a breeding ground was the highest in the importation to Coronavirus infection across the globe. Later, this pandemic rapidity of this disease has now captured US and some other parts of the world. It has been spread at a geometric rate due to the mobility and interaction of people through air-borne movement from one country to another or from one city to another. In this respect, our study models a risk index to measure the human to human Coronavirus transmission mainly caused by people’s mobility and interaction. Therefore, we recommend avoiding any mass gathering, maintaining recommended physical distance and restricting inbound and outbound flights across different areas/cities and countries as well.

Data Availability

All data are publicly available. 1. The total number of confirmed COVID-19 cases (up to 26 March 2020) and the dates of first identification in geographic areas (territory/province/state) are extracted from Johns Hopkins University Center for Systems Science and Engineering (JHU CCSE) (https://systems.jhu.edu/research/public-health/ncov/) Current Github repository - https://github.com/CSSEGISandData/COVID-19 2. Data on population size and geometric area are collected from the website https://www.citypopulation.de 3. Air travel data are obtained from FLIRT (https://flirt.eha.io), a flight network analysis tool developed by EcoHealth Alliance.

References

↵
Sumdani H, Frickle S, Le M, et al. Effects of Population Density on the Spread of Disease.
↵
Nishiura H, Jung S, Linton NM, et al. The Extent of Transmission of Novel Coronavirus in Wuhan, China, 2020. J Clin Med 2020;9:330. doi:10.3390/jcm9020330
OpenUrl CrossRef
↵
World Health organization (WHO). Coronavirus disease 2019 (COVID-19) Situation Report – 64. 2020. https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200324-sitrep-64-covid-19.pdf?sfvrsn=703b2c40_2
↵
Chinazzi M, Davis JT, Ajelli M, et al. The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. Science (80-) 2020;:eaba9757. doi:10.1126/science.aba9757
OpenUrl Abstract/FREE Full Text
↵
Bogoch II, Watts A, Thomas-Bachli A, et al. Potential for global spread of a novel coronavirus from China. J Travel Med 2020;27. doi:10.1093/jtm/taaa011
OpenUrl CrossRef PubMed
↵
Crooks AT, Hailegiorgis AB. An agent-based modeling approach applied to the spread of cholera. Environ Model Softw 2014;62:164–77. doi:10.1016/j.envsoft.2014.08.027
OpenUrl CrossRef
Wesolowski A, Eagle N, Tatem AJ, et al. Quantifying the Impact of Human Mobility on Malaria. Science (80-) 2012;338:267–70. doi:10.1126/science.1223467
OpenUrl Abstract/FREE Full Text
↵
Prothero RM. Disease and mobility: a neglected factor in epidemiology. Int J Epidemiol 1977;6:259–67.
OpenUrl CrossRef PubMed Web of Science
↵
Fujimoto S, Mizuno T, Ohnishi T, et al. Relationship between population density and population movement in inhabitable lands. Evol Institutional Econ Rev 2017;14:117–30.
OpenUrl
↵
Stoddard ST, Morrison AC, Vazquez-Prokopec GM, et al. The Role of Human Movement in the Transmission of Vector-Borne Pathogens. PLoS Negl Trop Dis 2009;3:e481. doi:10.1371/journal.pntd.0000481
OpenUrl CrossRef PubMed
Epstein JM, Goedecke DM, Yu F, et al. Controlling Pandemic Flu: The Value of International Air Travel Restrictions. PLoS One 2007;2:e401. doi:10.1371/journal.pone.0000401
OpenUrl CrossRef PubMed
↵
Haider N, Yavlinsky A, Simons D, et al. Passengers’ destinations from China: low risk of Novel Coronavirus (2019-nCoV) transmission into Africa and South America. Epidemiol Infect 2020;148:e41. doi:10.1017/S0950268820000424
OpenUrl CrossRef
↵
Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect Dis Published Online First: February 2020. doi:10.1016/S1473-3099(20)30120-1
OpenUrl CrossRef PubMed
↵
JHU CSSE. 2019 Novel Coronavirus COVID-19 (2019-nCoV) Data Repository by Johns Hopkins CSSE. 2020.https://github.com/CSSEGISandData/COVID-19 (accessed 27 Mar 2020).
↵
EcoHealth alliance. FLIRT: a product of EcoHealth alliance. 2020.https://flirt.eha.io (accessed 16 Mar 2020).
↵
Team RC. R: a language and environment for statistical computing computer program. R Core Team, Vienna, Austria. 2019.