Summary
The COVID-19 outbreak is triggering a global crisis that is challenging governments, health systems and the scientific community worldwide1. A central question in the COVID-19 pandemic is whether climatic factors modulate its progression. This information is key to epidemiologists and healthcare decision-makers for improving their management plans2. Previous attempts to assess the impact of climatic parameters have yielded ambiguous results, either because they were using geographically or temporally restricted data, or because they were comparing infection rates across countries, which were measured differently depending on local screening strategies. In March 2020, the spread of COVID-19 dramatically increased the number of countries recording deaths3, providing an opportunity to use mortality rate, which is measured more homogeneously across countries than infection rates, as a descriptor of the COVID-19 outbreak over a large latitudinal range. Here, using data recorded in 208 territories from 88 countries, we show that mortality rate is negatively influenced by warmer air temperature and positively affected by higher relative humidity. Each additional Celsius degree decreases mortality rate by ∼4%, while a 1% increase in relative humidity raises mortality rate by ∼2%. Temperature is positively correlated with UV-index, for which one unit of increase results in a ∼15% decrease in the mortality rate. We also show that other factors contribute to the dynamics of the COVID-19 outbreak, such as the proportion of vulnerable age classes in the population, access to a non-overwhelmed health system, as well as governmental travel restrictions for controlling the spread of the disease. All of them are critical factors impacting the mortality rate of COVID-19. The influence of climatic factors is a warning to all southern hemisphere countries where winter is coming soon. Northern hemisphere countries should also be warned that climatic factors alone will likely not be sufficient to contain the disease.
Introduction
The current COVID-19 outbreak is due to the coronavirus SARS-CoV-2 that originated in the area of Wuhan, Hubei province, in China4,5. On March 11, COVID-19 was officially declared a pandemic by the World Health Organization (WHO), as it spread extremely rapidly since December 2019, following an exponential curve worldwide6. As of April 20, the pandemic provoked more than 2.4 million diagnosed infections worldwide, and at least 170,000 deaths3. The health, social and economic consequences of this pandemic are huge7, with more than three billion humans under different regimes of containment at the beginning of April 2020. If most of the infections occur in the northern hemisphere, the southern hemisphere is not spared by the disease, despite the summer season. Indeed, many viral respiratory infections show seasonal fluctuation, especially in temperate regions8,9. Air temperature and humidity have been shown to influence the spread of influenza viruses10, as well as the coronaviruses SARS-CoV11, responsible of the severe acute respiratory syndrome (SARS) and MERS-CoV12, responsible of the Middle East respiratory syndrome.
One debated question about COVID-19 is whether its spread is being affected by season-related climatic factors, as are other seasonal viruses2. This is of utmost importance, as the possible climatic dependency of COVID-19 could help decision-makers to adopt the most suitable strategy to control the disease in the near future. There are several recent studies that have tackled this question since the onset of the COVID-19 outbreak, available on online archive repositories. As for other viruses, temperature and humidity seem to be factors with some influence on the spread of SARS-CoV-2, but recent studies led to contrasting results. In general, a negative relationship is found between air temperature and the spread of SARS-CoV-2 virus13,14, but a positive one15 and a lack of effect16 have also been reported. In the case of humidity, limited17,18, positive19 and negative13 effects on the infection rate have been reported. Almost all of those studies were restricted to data from China at the early stage of the epidemic. The few exceptions13,18,19 considered up to 47 countries, but the time frame was limited at a maximum to February the 29th and these studies were based on the number of COVID-19 confirmed positive cases, which is unevenly assessed among countries.
To analyse the potential effects of climatic factors on the COVID-19 pandemic, an accurate descriptor of the outbreak that can be compared among countries is needed. The main descriptors that have been used to date include the cumulative number of reported COVID-19 positive cases at a given date20, the daily number of new positive cases14,21,22, and the daily mean number of positive cases for a given time period19. Yet, these descriptors are all based on the number of COVID-19 positive patients, which in turn depends on the countries’ screening strategies.
As of 20 March, many countries were testing patients exhibiting clear COVID-19 symptoms, such as Italy, Spain, France, Belgium, Switzerland, the UK, and the USA. Other countries, such as South Korea, Singapore, Australia, United Arab Emirates and more recently Germany, have opted to screening their population more widely. It results that countries performing more tests are either following a strategy of wide screening, whatever the proportion of positive patients, or are having numerous patients with clear COVID-19 symptoms. These two contrasting situations lead to infection estimates that are hardly comparable among countries.
To show that descriptors based on COVID-19 positive patients might be strongly biased when analysing data reported from multiple countries, we performed a correlation analysis between the mean maximum air temperature between mid-February and mid-March 2020 and the total number of tests performed per country, as of March 20. We found a strong negative correlation, where countries with lower temperatures tend to perform more tests (r =-0.48, n = 41, p value = 0.001). This correlation might be influenced by the higher impact of this disease in the northern hemisphere during the first months of 2020. However, the reasons for this correlation remain unclear, and may lead to temperature-biased numbers of COVID-19 positive patients across countries. Therefore, we argue that descriptors based on the number of tests performed should not be used to assess the contribution of air temperature (or other climatic factors) in the trans-country COVID-19 outbreak.
To reach a better comprehension of the COVID-19 pandemic, the global extent of the disease in March 2020 brings new data to test the effects of climatic factors. Indeed, the number of countries recording deaths increased from nine by the end of February 2020 to more than 120 by the end of March 20203, providing a first-hand opportunity to use the mortality rate as the descriptor of COVID-19 outbreak over a large latitudinal range. Here we use the mortality rate expressed as the number of deaths per inhabitant over a standardized time period of 15 days, corresponding to the 15 days following the first reported COVID-19 death in the country. This metric is much less dependent on the local testing strategy. In almost all countries, the number of COVID-19 deaths publicly reported corresponds to the cases of COVID-19 positive patients and patients displaying clear diagnostic COVID-19 symptoms who died on a given date. Although this number may not reflect the exact number of deaths due exclusively to COVID-19 infection, because death can result from other underlying health problem exacerbated by the viral infection or because patients may display symptoms that are mistakenly attributed to COVID-19 (ascertainment bias), these biases are similar across countries allowing credible country-wise comparisons. Consequently, mortality rate is a reliable descriptor for the worldwide COVID-19 outbreak. However, when it comes to use the mortality rate as the outbreak descriptor, it implies that the disease is well established, and thus we included other known explanatory factors in our analyses: the proportion of the most susceptible age group in the population, the number of beds in hospitals (reflecting access to health care), and the timing and strength of governmental travel measures for controlling the spread of the disease.
Results
We examined the possible effects of three climatic variables on the worldwide COVID-19 outbreak: air temperature, UV-index and relative humidity. All these variables are averaged over 30 days, which are the 15 days before and the 15 days after the first registered death in each territory or country. We used a linear mixed model (LMM) with the log transformed mortality rate per region as the response variable. The model considers the country-level as a random intercept and implements an exponential geographical correlation structure (see Methods). We included the climatic variables together with other variables that may explain the extent of the epidemic outbreak, namely (i) the proportion of the population older than 64 years old; (ii) the number of beds in hospitals (per 10K inhabitants), and (iii) the timing and strength of the governmental travel measures to control the spread of the disease. As air temperature is highly correlated with UV-index (see Methods), we tested a temperature-based model (excluding UV-index) and a UV-based model (excluding temperature). We ranked candidate models that can be built with combinations of the six variables according to their resulting weighted AIC (ωAIC)23. We then computed a model average of all candidate models included in a confidence set of 95% (Σωi ≥ 0.95) (Table S1, Supporting Information).
Very similar results were obtained with the temperature-based and the UV-based averaged models, yet with slightly higher significance values for the latter (Table 1). Both approaches resulted in sets of best models explaining between 12% and 25% of variation in mortality rate by the fixed variables (R2 GLMM(m), Table S1), and between 49% and 54% when considering both fixed and random variables (R2 GLMM(c), Table S1). In these models, temperature and UV-index influenced negatively the mortality rate, while the opposite was observed for the relative humidity (Fig. 1). All other explanatory variables also contributed to explain the COVID-19 outbreak. The proportion of the population older than 64 years old had a positive effect on mortality rate, while the number of beds in hospitals and the travel restrictions implemented in each country showed a negative effect (Table 1).
The results for the temperature-based averaged model showed that every extra degree Celsius decreases the mortality rate by 3.74% (e-0.04-1, Table 1). The UV-based averaged model showed that every unit increase in UV-index results in a 14.72% decrease in the mortality rate (e-0.16-1, Table 1). In this last model, humidity had a positive effect, in which every 1% increase of relative humidity results in a 1.85% mortality rate increase (e-1.85·0.01-1, Table 1). Regarding the other variables in our models, every 1% increase in the proportion of the population older than 64 years old increased the mortality rate by 11.31% (e0.11-1, Table 1). Every extra bed in hospitals for 10K inhabitants decreased the mortality rate by 2.22% (e-0.02-1, Table 1). Finally, strong and early governmental travel restrictions resulted in a geometric mean of mortality rate 61.8% lower than when restrictions were weak and imposed late (e-0.96-1, Table 1). No effect is observed concerning intermediate timing and strength of governmental travel restrictions (Table 1).
Discussion
We stress that warm air temperatures, intense UV light, or low relative humidity may act by reducing the transmission of the SARS-CoV-2 virus or it may directly or indirectly inactivate this virus. Preliminary results indicate that, under laboratory conditions, the SARS-Cov-2 virus survives less than 4 days at 37° C24 while at 4 °C it may remain stable for more than 14 days25, offering one possible explanation to the negative relation between air temperature and the COVID-19 mortality rate. Some evidence suggests that the transmission efficiency of influenza viruses by respiratory droplet increases when air is cold and dry26. Moreover, in periods of the year or places of the world with cold and dry air and reduced UV light, other human respiratory diseases are more frequent27 and may facilitate inter-individual transmission via coughing or sneezing. When the weather is cold, people also tend to spend more time in closed and populated environments, which also facilitates inter-individual transmission. A causative link between exposure to sunlight (including UV light), a higher synthesis of vitamin D, and better efficiency of the human immune system has been evidenced28, which may explain in part the negative relation between the UV-index and the COVID-19 mortality rate. Yet, these modes of action remain hypotheses to be tested in the case of COVID-19.
Humankind is facing an unprecedented worldwide crisis, forcing decision makers to move forward into the labyrinth of unknowns. Understanding how climatic factors have contributed to the COVID-19 outbreak until now could strengthen the projections of its future spread. Previous analyses were unable to provide an unambiguous view because of limited data or biased descriptors for country-wise comparisons. Using global datasets and a homogeneous descriptor of the COVID-19 outbreak, we show that air temperature and UV-index are related climatic variables that play a substantial role in explaining the COVID-19 mortality rate. A higher value in one or other of the variables is associated with lower mortality rates. Relative humidity is a third climatic variable able to explain a reduction in COVID-19 mortality rate when it is lower. We also show the importance of age class structure, access to healthcare systems and governmental travel restrictions in explaining the COVID-19 outbreak. Although our results may somewhat relieve some of the anxiety of people living in regions entering the warm season or living in warm countries, it is clear that climatic factors alone will not passively stop the outbreak. However, our results do represent a warning for the populations and decision-makers of countries with cold climates or moving towards the cold season. Importantly, the clear contribution of climatic factors on the COVID-19 outbreak, however, does not ensure that this disease will evolve into a winter-seasonal epidemic viral disease like the influenza virus29. More research is urgently needed to better understand the dynamics of the COVID-19 pandemic in order to save lives.
Methods
COVID-19 dataset
We collected the number of deaths per day due to the COVID-19 in each country affected by SARS-CoV-2 from the dataset hosted by the Centre for Systems Science and Engineering at Johns Hopkins University3. This dataset is maintained daily and is verified by registers provided by various health state authorities as well as the World Health Organisation (WHO). The dataset provides information at the country level but also about sub-regions of China, The United States, Canada, and Australia. French and British overseas territories are also included separately in this dataset. Due to the uncertainly regarding the initial stage of the spread of this disease4, we decided to exclude the dataset for mainland China. We considered the date of first registered death in each country as well as the cumulative number of deaths during the first 15 days of deaths registration in each country or territory. We used the cumulative mortality at day 15 in order to include the maximum available information and to compare data extracted from similar epidemic phases across countries. We considered all regions registering at least 15 days of deaths until 6th April 2020. The final dataset includes 208 territories in 88 countries.
Climatic dataset
The historical weather conditions in each country or territory included in the COVID-19 dataset were obtained from the Dark Sky API dataset by using the R package “darksky”30. We considered the geographical coordinates associated with each record in the COVID-19 dataset. We included in the analysis the maximum temperature (°C), relative humidity, and UV-index. These variables were taken by day for the 15 days before and the 15 days after the first registered death in each country or territory recorded. We computed the average across these 30 days for each climatic variable.
Non-climatic dataset
The proportion of the population older than 64 years old per country was calculated on the basis of the population indicators available from the World Bank database31. The indicator of the number of hospital beds per 10K inhabitants was obtained from the Global Health Observatory (GHO) database of the World Health Organization (WHO)32. We used the latest available data per country, which varied from 2005 to 2015, but most data were from the period 2010-2015. Governmental immigration restrictions per country were used as a proxy for the governmental travel measures for controlling the spread of the disease. This last dataset was taken from the COVID-19 Travel Restriction Monitoring database, compiled by the International Organization for Migration (IOM)33. We added some missing data directly from the primary source database, the International Air Transport Association (IATA)34. For each country and date, from March 8 to March 26, we summed the number of other countries for which immigration restrictions were imposed. We considered all the IOM traveling restriction categories with a link to COVID-19 spread control. Based on these daily immigration restrictions, we grouped the countries into three categories. Category 1 (soft restriction) includes countries that imposed virtually no restrictions or otherwise late in time, not before March 18, and only on a subset of countries. Category 2 (middle restriction) contains the countries that started imposing noticeable immigration restrictions from March 11 to 17 with a fast daily-increase in the number of countries with restricted entrance. Category 3 (strong restriction) groups the countries that decided immigration restrictions early in time, before March 11, with a fast daily-increase in the number of countries from which immigration was restricted. The rates of COVID-19 tests performed per country were obtained from the database of STATISTA35.
Statistical analysis
Our response variable (mortality rate) is the number of deaths per inhabitant during the first 15 days of deaths registration in each country or territory included in the COVID-19 dataset. This variable is computed by considering the cumulative number of deaths on day 15 divided by the population size of each country or region. This variable was log-transformed in order to maintain a linear relationship with the predictor variables. The fixed predictor variables included temperature, UV-index, relative humidity, proportion of the population older than 64 years old, number of beds in hospitals for every 10K inhabitants, and the governmental travel measures implemented in each country to contain the spread of this disease. Since temperature and UV-index are highly correlated (r ∼ 0.9, p-value < 0.001), we included those variables in two separated models. We verified the collinearity among predictor variables by computing the variance inflator factor (VIF). All variables included in the separate models with either temperature or UV-index did not show collinearity with other predictor variables (VIFs < 1.6)36. We also estimated the effect of including a spatial correlation structure as well as the effect of the country-level as a random effect, following Zuur, et al. 36. The model selection was based on the lowest Akaike Information Criterion value (AIC)23. The final model structure was a linear mixed model (LMM) that considers an exponential spatial correlation and country-level as a random intercept. We explored all combinations of predictor fixed variables. These models were ranked by using the Akaike weights (ωi), which provide a measure of the probability of each model being the most suitable among all of the candidate models. Since there is no single model that has a high Akaike weight (ωi > 0.95), we averaged all models in a confidence set at 95% (Σωi ≥ 0.95)23. We reported the conditional (R2GLMM(c)) and marginal (R2GLMM(m)) coefficient of determination for each model included in the set of candidate best models included in the model averaging. These values represent the variance explained by the fixed predictors and both the fixed predictors and the random variable, respectively37. In the case of the relationship between the number of COVID-19 tests and the temperature of each country, the model with the lowest AIC was a simple linear relationship without the influence of a spatial correlation structure or the inclusion of a country-level random effect. All statistical analyses were performed using R 3.6.238. We used the package ‘nlme’39 for the linear mixed models and the package ‘MuMIn’40 for the model averaging and the computation of the determination coefficients.
Data Availability
The datasets generated and analysed in the current study are available from the corresponding author upon request
Author contributions
C.S.Q., M.C. and J.I.M.B. conceived the original idea. C.S.Q. performed the statistical analysis. All authors analysed and discussed the results, and contributed critically in the writing of the manuscript.
Data availability
The datasets generated and analysed in the current study are available from the corresponding author upon request.
Supporting Information
Acknowledgements
All authors are supported by grants from the Swiss National Science Foundation: n° 31003A_182577 to MC, n° P400PB_183930 to CSQ, and 310030_185327/1 to JMB.
Footnotes
↵* e-mail: claudio.quilodran{at}zoo.ox.ac.uk