Association of BCG vaccination policy with prevalence and mortality of COVID-19

There is some evidence that tuberculosis vaccine bacillus Calmette-Guérin (BCG) has non-specific beneficial effects against non-related infections. Here, we examined the possible association between BCG vaccination with prevalence and mortality by COVID-19 by using publicly available data of COVID-19 in 199 countries/regions and the BCG World Atlas. By using linear regression modeling, we found that the number of total cases and deaths per one million population were significantly associated with the country's policy concerning BCG vaccine administration. The amount of variance in cases and deaths explained by BCG vaccination policy ranged between 12.5% and 38%. Importantly, this effect remained significant after controlling for the country's life expectancy and the average temperature in February and March 2020, which themselves are significantly correlated with the cases and deaths indices, respectively. By contrast, the ratio between deaths and cases was weakly affected. This latter outcome suggested that BCG vaccination may have hindered the overall spread of the virus or progression of the disease rather than reducing mortality rates (i.e., deaths/cases ratio). Finally, by roughly dividing countries into three categories showing high, middle, or low growth rate of the cases, we found a highly significant difference between the slope categories among the BCG groups, suggesting that the time since the onset of the spread of the virus was not a major confounding factor. While this study potentially suffers from a number of unknown confounding factors, these associations support the idea that BCG vaccination may provide protection against SARS-CoV-2, which, together with its proven safety, encourages consideration of further detailed epidemiological studies, large-scale clinical trials on the efficacy of this vaccine on COVID-19, and/or re-introduction of BCG vaccination practice in the countries which are currently devoid of the practice.


Introduction
The recent outbreak of coronavirus disease 2019  caused by the 2019 novel coronavirus (SARS-CoV-2) is one of the most serious threats to the human race in this millennium. There appears to be a large variability among countries in the risk of infection and mortality by this virus 1 . For example, Japan, which has close physical and social distance from China, is one of the first countries that recorded the first case in the earliest phase of this pandemic. As of March 30th, while many governments have 'locked-down' numerous cities and even the entire country, the Japanese government still keeps applying mild measures to mitigate the spread of the virus. The citizens have simply been 'advised' to stay home.
Nevertheless, most people in Japan, with the notable exception of Tokyo metropolitan, are currently working as usual. Surprisingly, despite such a liberal measure taken and relatively low number of PCR tests conducted, Japan is ranked at 117 th and 69 th among 199 countries and regions 1 , as of March 28th, regarding the number of cases per 1 million population and the number of deaths per 1 million population, respectively. Also, there seem to be visible differences in COVID-19 prevalence and mortality between countries in Western Europe and Eastern Europe. There are many potential geographical, social, and biological factors, such as temperature, humidity, life expectancy, average income, social norms, and ethnical genetic background, that could potentially explain such variabilities among countries, but it remained fully explored. In case they are successfully identified, such information could potentially contribute to the mitigation of the spread of the virus. Some vaccines (e.g., measles and Bacillus Calmette-Guérin (BCG) vaccines) are associated with a lower risk of illness and death from other disorders [2][3][4] . A systematic review of epidemiological studies provided evidence for the non-specific beneficial effects of BCG vaccine on all-cause mortality 5,6 . Induction of cytokines associated with trained immunity is proposed to underly such protection against non-related viral infections 7 . Researchers in four countries will soon start a clinical trial of BCG vaccine on this disease, based on the proposed beneficial effect of BCG 4 . Interestingly, a blogger noticed that there might be a correlation between the administration of BCG vaccine in each country and the extent to which the spread of COVID-19 occurs 8 . In this study, we evaluated the hypothesis that BCG vaccination has some protective effect against COVID-19 by utilizing publicly available datasets through linear regression modeling. Among the potentially confounding factors, we assessed two factors, life expectancy and the average . CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10.1101/2020.03.30.20048165 doi: medRxiv preprint temperature of the countries. We also tried to evaluate the potential effect of BCG vaccination policy on the growth rate of COVID-19, controlled by the onset of its propagation in each country.

Methods and Results
The data on COVID-19 and life expectancy were retrieved on March the 28 th from "COVID-19 CORONAVIRUS PANDEMIC" page 1 and "Life Expectancy of the World Population" page 9 in worldmeter website. The average temperature data were retrieved from "List of cities by average temperature" page 10 in Wikipedia.
Average temperature of each country was obtained by averaging those of the cities in each country. The raw data on which our analyses are based is attached as Supplementary Three sets of linear regressions were run. The dependent variables (DVs) were the rate of total cases per one million population (Model 1), the rate of death per one million population (Model 2), and the ratio between these two variables (Model 3).
The country's BCG policy was the independent variable (IV) of interest. In accordance with the BCG world atlas 12 , this variable had three levels: Group A, the country currently administers the BCG; Group B, the country used to administer the BCG and currently does not; and Group C, the country has never administered the vaccine. The country's life expectancy and mean temperature (°C) in February and March 2020 were added as control variables in all the models in order to rule out potentially confounding effects. In fact, infection rates and mortality rates may have been influenced by the subject's age (i.e., the older, the more vulnerable) and climate conditions (i.e., the higher the temperature, the less contagious the virus).
Finally, the Order Quantile (ORQ) normalization transformation was used to standardize the continuous variables. Beyond meeting the statistical assumption of linear modeling (normalization), this procedure is also useful to mitigate potentially confounding effects of outliers.
This study included 136 countries or regions for which complete data about total cases per one million population, life expectancy, and temperature were available (Model 1). The data regarding total deaths per one million population and deaths/cases ratio were available for 91 countries (Models 2 and 3). All the analyses were run with the R software 13 , and the graphs were produced by using the package ggplot2 14 . The ORQ transformation was performed with the . CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10.1101/2020.03.30.20048165 doi: medRxiv preprint bestNormalize R package 15 . Finally, the rsq R package was employed to calculate partial R 2 statistics (i.e., unique variances) 16 . Descriptive statistics of each variable (e.g., means, medians, and quantiles) are retrievable from the R codes provided online.
Regarding total cases per one million population, the linear model showed a statistically significant effect for both Group B (b = 0.6483, p = .0002) and Group C (b = 0.8666, p = .0025; as compared to Group A). Temperature was barely significant (Figure 1a; b = -0.1232, p = .0424). As anticipated, life expectancy was significant (Figure 1c Again, since life expectancy was a strong predictor in the model, we reran the analyses only with those countries with a life expectancy higher than 78 (n = 40).
The pattern of results was the same. There was a statistically significant effect for . CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity. While based on a rough classification, this analysis was useful to explore the potential impact of the country's onset of the virus on our findings.
The Group A countries significantly tend to be classified in lower growth group compared to Groups B and C countries (as a single group) with all the 68 countries with known BCG policy (Figure 3b; Chi-squared test: p = 0.0002; B and C groups were combined for this test). As in the linear regression models, given that life expectancy may affect growth rate, we also conducted a Chi-squared test after excluding the countries that had a shorter life expectancy (≤78 years old, n = 40), and found that the difference remained significant after the exclusion (Figure 3c; Chi-squared test: p < 0.0001; B and C groups were combined for this test), where pvalue became even smaller adjusting life expectancy despite the decrease of sample number. Both the analyses thus showed that Group A countries tended to exhibit a . CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10.1101/2020.03.30.20048165 doi: medRxiv preprint lower growth rate compared to Groups B and C countries regardless of their life expectancy.

Discussion
Associations of BCG policy with COVID-19 after controlling two major confounding factors In this study, we have shown that the countries that currently adopt universal BCG vaccination programs (Group A) have, compared to the other countries, a lower number of cases and deaths per one million people. The countries that no longer recommend BCG vaccination for everyone (Group B) and those that have never had universal BCG vaccination programs (Group C) reports more cases and deaths. We have also evaluated the impact of two potentially confounding factors, the country's life expectancy and the average temperature in February and March 2020. Crucially, the effects of BCG groups on the two dependent variables remain significant, even when the country's life expectancy and temperature are controlled for. The amount of variance explained by BCG vaccination is about 12.50% and 20% for cases and deaths per one million population, respectively. The percentage of explained variance is greatly increased when only countries with a life expectancy above 78 years are considered (about 20% and 38% for cases and deaths, respectively). Also, only Group C appears to play a role in the deaths/cases ratio and explains little more than 3% of the observed variance. This latter result may suggest that, unlike cases and deaths per one million population, the death rate is weakly affected by the type of BCG vaccination policy adopted by the country. We thus hypothesize that the protective effect of the vaccine, if any, may consist of a significant reduction of the spread of the virus rather than a reduced mortality rate. It is also possible that the vaccine prevents progression of the disease after infection, since only the persons with severe symptoms tend to be able to receive PCR tests in many of the countries at present. This hypothesis requires additional empirical corroboration. Finally, in line with the results of the linear regression analysis, the Chi-squared analyses highlight that BCG vaccination policy appears to affect how quickly this virus spreads. Taken together, these results suggest that the overall impact of BCG vaccination on the COVID-19related cases and deaths is, at the very least, non-negligible and worth of further empirical investigation.
. CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10.1101/2020.03.30.20048165 doi: medRxiv preprint We have reported that a large part of the variance in the two dependent variables is explained by life expectancy, while the effect of temperature seems to be marginal at best. This is not surprising, considering the fact that age is a major risk factor in COVID-19 18,19 . Life expectancy is highly correlated with the income level of individuals 20 and countries 21 , which would also be related to the quality of medical care that people can receive. If life expectancy is controlled, higher income and quality of medical care may decrease the risk and mortality of diseases in general, which is expected to decrease the positive effect of life expectancy on the risks in COVID-19. Considering the highly significant contribution of life expectancy on the risks, we could consider that the income level or the quality of medical care cannot be the confounding factor that explains the association between BCG vaccination and the risks in COVID-19. Regarding the tight relationships between age and mortality in COVID-19 18,19 , BCG protection may wane with time since vaccination 22 , and it could be potentially contributing to the higher mortality in aged populations to some extent. These possibilities need to be examined by further analyses.
As seen, the average temperature in countries exhibits a small and sometimes even non-significant negative correlation with the risks of COVID-19 infection and mortality. That said, we have not assessed the effect of humidity, which we plan to do in the near future, since absolute humidity is considered to be associated with influenza infection 23,24 . However, it is unlikely that absolute humidity significantly affects the results. In fact, absolute humidity does not seem to play any major role in the transmission rates of the COVID-19 25 . Moreover, absolute humidity and temperature are usually highly correlated, and it is thus unlikely that humidity may play a major role in explaining the spread of the COVID-19 if the temperature does not.

BCG vaccination policy is associated with a lower COVID-19 growth rate
We are still in the midst of the pandemic, during which the virus keeps spreading in most of the countries (except for China, where the rise has nearly been stopped).
Also, the role of the observed between-country differences concerning the date of the spread of the COVID-19 is still unclear. In brief, the current state of affairs might change quickly and, therefore, our findings should be interpreted with caution. That being said, we have found a highly significant difference among the BCG groups of the slope of increase of the cases in log scale, that is, adjusted by the . CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity. Potential mechanisms BCG, originally developed against tuberculosis, is hypothesized to develop 'frontline' immunity, training it to respond non-specifically to certain viruses with greater intensity 7,3,4 . This idea is supported by clinical and epidemiological studies, which showed that BCG appeared to lower overall mortality in children 5,2,6 . BCG, which can remain alive in the human skin for several months, triggers not only specific memory B and T cells, but also stimulates the innate blood cells for a prolonged period 7 . In a randomized placebo-controlled study, it was shown that  author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10.1101/2020.03.30.20048165 doi: medRxiv preprint policies, and culture/life style (e.g., shaking hands and hugging, wearing masks, and washing hands). Some of these elements might affect the observed relationships between COVID-19 indices and BCG vaccination policy by affecting both the susceptibility to SARS-CoV-2 and BCG vaccination policy in an indirect manner. These and other factors need to be examined in future studies (or even before formal publication of this manuscript in a journal with peer review; see "production note" below).
As outlined above, we have employed a rough classification for the statistical test for the relationship between BCG groups and the growth curve of COVID-19 in each country with a limited number of observations ( Figure 3a). Obviously, it would be preferable to use continuous data to produce a more accurate estimation of the impact of BCG vaccination policy on how quickly the virus spreads.
Also, examining variables such as BCG strain 26 and the timing and the number of vaccinations (including the ones for other diseases) would be worth some attention.
These variables might be easier to be examined in those countries where mandatory BCG vaccination was terminated by investigating the history of vaccination of COVID-19 patients and their symptoms and condition severity.
In our study, we used the data that are publicly available from several websites (see Method and results section), which have not been validated by any official institutions (to the best of our knowledge). Therefore, further analyses are recommendable once the data's reliability has been certified.
It may also be worth looking into the relationship between the age of BCG vaccination and the risk of infection and severity /mortality in COVID-19 patients 18,19 . An SNP in IL-1b (IL1B; rs16944) was found to affect the trained immunity response induced by BCG 7 , and it is of interest to evaluate the association of this SNP and the protective effect of BCG vaccine against SARS-CoV-2.

Conclusion
The present study reports significant associations of BCG vaccination with prevalence and mortality from COVID-19 in 136 countries, even when the country's life expectancy and average temperature are controlled for. Together with the previous studies showing potential non-specific protection of BCG, we believe that further studies examining this hypothesis are warranted. In countries with no mandatory BCG vaccination at present, policymakers involved in health service . CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10.1101/2020.03.30.20048165 doi: medRxiv preprint may consider, with careful evaluation of its safety, increasing/implementing BCG vaccination during this outbreak of COVID-19, at least for the categories of people who are currently receiving the vaccination in other countries.

Acknowledgement
We thank Yoko Kagami, and Harumi Mitsuya for their assistance in inputting the data and making Figures, Mamoru Tejima for letting us know the existence of potential correlation, and Hideo Hagihara, Tomoyuki Murano, Hironori Funabiki and Shinichi Nakagawa for useful discussions. We also thank Tomoko Tatsumi for helping us drafting the R codes.

Conflict of Interest
The authors declare that there is no conflict of interest regarding the publication of this article.

Production note
We welcome the contributions from readers of any data related to variables of interest (including potentially confounding factors), which might be considered as a collaboration. It is preferable that data contributed be sorted by country name and in excel format (as in our dataset; See Supplementary Table 1). Using the same naming convention for the name of the countries as those in supplementary table 1 is also helpful for us to carry out analyses quickly.
The data we'd like to add to the excel file includes, but not limited to, When you provide us data, specify where you obtained data so that we can confirm the data.
We look forward to collaborating with you in this war against COVID-19.
. CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint .

24.
Shaman  . CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10.1101/2020.03.30.20048165 doi: medRxiv preprint Figure 2: Effect of BCG vaccination policy on COVID-19 a. The boxplot of total cases per one million population sorted by BCG Group in countries with life expectancy higher than 78 years. Groups B and C (no current BCG vaccination) show a significantly higher rate of cases of COVID-19 compared to Group A (countries currently implementing BCG vaccination). Groups B and C (no current BCG vaccination) show a significantly (p = .0024 and p = .0326) higher rate of cases of COVID-19 compared to Group A (countries currently implementing BCG vaccination). b. The boxplot of total deaths per one million population sorted by BCG Group in countries with life expectancy higher than 78 years. Groups B and C (no current BCG vaccination) show a significantly higher rate of cases of COVID-19 compared to Group A (countries currently implementing BCG vaccination). Groups B and C (no current BCG vaccination) show a significantly (p = .0011 and p < .0001) higher rate of deaths from COVID-19 compared to Group A (countries currently implementing BCG vaccination). . CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint . https://doi.org/10.1101/2020.03.30.20048165 doi: medRxiv preprint Figure 3: BCG vaccination policy and COVID-19 growth rate a. Categorial classification of growth rate of cases in 70 countries. A, B, or C designates BCG group and l, m, or h designates low, middle or high growth rate. "?" indicates that information is lacking. Countries marked by asterisk have average life expectancy below 78 years. b. Table for 68 countries on BCG vaccination groups and growth rate. c. Table for 40 countries on BCG vaccination groups and growth rate, by excluding the countries whose life expectancy is below 78 years.