Maximum entropy method for estimating the reproduction number: An investigation for COVID-19 in China
=====================================================================================================

* Yong Tao

## Abstract

The key parameter that characterizes the transmissibility of a disease is the reproduction number *R*. If it exceeds 1, the number of incident cases will inevitably grow over time, and a large epidemic is possible. To prevent the expansion of an epidemic, *R* must be reduced to a level below 1. Estimating the reproduction number requires the probability distribution function of the generation interval of the infectious disease; however, this distribution is often unknown. In this letter, given incomplete information about the generation interval, we propose a maximum entropy method to estimate the reproduction number. Based on this method, given the mean value and variance of the generation interval, we first determine its probability distribution function and in turn estimate the real-time values of the reproduction number of COVID-19 in China. By substituting these estimated reproduction numbers into the susceptible-infectious-removed epidemic model, we simulate the evolutionary track of the epidemic in China, which agrees well with that of the real incident cases. The simulation results predict that China’s epidemic will gradually tend to disappear by May 2020 if the quarantine measures continue to be enforced.

Keywords

* Maximum entropy
* Reproduction number
* Generation interval
* COVID-19
* Incomplete information

In December 2019, a cluster of pneumonia cases in Wuhan, China was caused by a novel coronavirus, COVID-19 [1-4]. At first the local governments did not take effective measures, which led local people to pay too little attention to the risk.
However, as the epidemic in Wuhan expanded further, the Chinese Government took emergency action and locked down the city of Wuhan on January 23, 2020. Despite this, the epidemic still spread throughout the entire country. By March 12, 2020, controlling the spread of the epidemic had become a global challenge.

One of the key parameters in epidemic models is the basic reproduction number *R*, defined as the number of secondary infections that arise from a typical primary case in a completely susceptible population [5]. As an infection spreads through a population, it is more convenient to work with an effective reproduction number *R<sub>t</sub>*, defined as the number of secondary infections that arise from a typical primary case at time *t* [5]. The magnitude of *R<sub>t</sub>* is a useful indicator for evaluating the risk of an infectious disease and the effectiveness of control measures. If *R<sub>t</sub>* exceeds 1, the number of incident cases will inevitably grow over time, and a large epidemic is possible. To prevent the expansion of an epidemic, *R<sub>t</sub>* must be reduced to a level below 1.

Using the parameter *R<sub>t</sub>*, one can establish the susceptible-infectious-removed (SIR) epidemic model as below [6-8]:

![Formula][1]

![Formula][2]

![Formula][3]

where *S*(*t*), *I*(*t*) and *R*(*t*) are the numbers of susceptible, infectious, and removed (recovered or dead) individuals at time *t*, and *τ* denotes the generation interval, that is, the time from infection of an individual to the infection of a secondary case by that individual, namely the “contagion period” of an infection [5]. The generation interval *τ* should be treated as a random variable. If we denote the total population size by *N*, we have:

![Formula][4]

If *R<sub>t</sub>* and *τ* of an epidemic are known, one can employ the SIR model (1)-(4) to simulate the evolutionary track of this epidemic.
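As a minimal sketch of such a simulation, the Python snippet below integrates an SIR system with forward Euler steps. Because equations (1)-(4) are not reproduced in text form here, the snippet assumes the common parametrization with transmission rate *R<sub>t</sub>*/*τ* and removal rate 1/*τ*, which may differ in detail from the author's equations. The function name `simulate_sir`, the step size `dt`, and the choice to start from 41 infectious individuals are illustrative assumptions; the period-wise reproduction numbers, *τ* ≈ 8 and *N* ≈ 1.4 × 10<sup>9</sup> are the values quoted later in this letter.

```python
def simulate_sir(n_pop, i0, r_by_period, tau=8.0, period_days=14, dt=0.1):
    """Integrate an assumed SIR form with forward Euler; return daily (S, I, R).

    Assumed equations (not taken verbatim from the paper):
        dS/dt = -(R_t / tau) * I * S / N
        dI/dt =  (R_t / tau) * I * S / N - I / tau
        dR/dt =  I / tau
    """
    s, i, r = float(n_pop - i0), float(i0), 0.0
    steps_per_day = round(1.0 / dt)
    daily = [(s, i, r)]
    for r_t in r_by_period:  # R_t is held constant within each period
        for _ in range(period_days):
            for _ in range(steps_per_day):
                new_infections = (r_t / tau) * i * s / n_pop * dt
                removals = (i / tau) * dt
                s -= new_infections
                i += new_infections - removals
                r += removals
            daily.append((s, i, r))
    return daily


# Period-wise reproduction numbers quoted in the text (Table 1).
table1 = [3.7069, 3.122, 1.2114, 0.6028]
# Starting from 41 infectious individuals is an illustrative assumption.
track = simulate_sir(n_pop=1.4e9, i0=41, r_by_period=table1)
```

Holding *R<sub>t</sub>* fixed within each 14-day period mirrors the step-function treatment used later in the text; a smaller `dt` trades speed for accuracy.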
To estimate the reproduction number *R<sub>t</sub>*, the probability distribution function of the generation interval of an infectious disease, *p*(*τ*), must be available [5, 9-12]; however, this distribution is often unknown. In the existing literature, many scholars have used the exponential distribution [5], normal distribution [5], Weibull distribution [9, 10], and Gamma distribution [3, 12] to approximate *p*(*τ*). In principle, using these distributions to approximate *p*(*τ*) requires enough information about the symptom onsets of all cases, namely a large sample of observed values of *τ*. To cope with incomplete information, researchers have also applied the Monte-Carlo method [4] and Bayesian statistical inference [11] to estimating *p*(*τ*). However, thus far there is scant literature discussing the potential application of the maximum entropy method (MaxEnt) [13-15] in estimating the reproduction number. Our letter fills this gap.

In statistical inference, MaxEnt is a powerful tool for predicting probability distributions. The main idea of MaxEnt is to estimate a target probability distribution by finding the probability distribution of maximum entropy, subject to a set of constraints that represent our incomplete information about the target distribution [13]. Owing to its predictive capacity, MaxEnt has been widely applied in thermodynamics [13], economics [16-19], artificial intelligence [20-21], and ecology [22-26]. In this letter, we apply MaxEnt to determine the functional form of *p*(*τ*). Before doing so, we first introduce the relationship between *R<sub>t</sub>* and *p*(*τ*).

Here, we adopt Wallinga and Lipsitch’s method [5] for deriving the reproduction number. Following their method, the number of infectious individuals at time *t* can be written as [5]:

![Formula][5]

where *n*(*τ, t*) denotes the number of cases infected by a *τ*-day infectious individual at time *t*. Here *τ<sub>max</sub>* denotes the maximum symptom duration.
Wallinga and Lipsitch assumed [5] *τ<sub>max</sub>* = +∞. To make the model more realistic, we assume that *τ<sub>max</sub>* is a finite number. Thus, the reproduction number *R<sub>t</sub>* can be defined as [5]:

![Formula][6]

Let us define

![Formula][7]

Substituting equation (7) into equation (6) yields:

![Formula][8]

where *p*(*τ*) is the probability distribution function of the generation interval *τ* [5]. Using equation (8), the mean value of the generation interval can be written as:

![Formula][9]

If the mean value ![Graphic][10] is known, then by using equation (8) one can obtain the variance of the generation interval:

![Formula][11]

Substituting equation (7) into equation (5) we finally obtain:

![Formula][12]

Equation (11) is the basic formula for calculating the reproduction number. If *I*(*t*) and *p*(*τ*) are known, one can calculate the reproduction number by using equation (11).

Generally speaking, the number of infectious individuals *I*(*t*) is reported for each day, while the functional form of *p*(*τ*) is unknown. Therefore, many scholars have used the exponential distribution [5], normal distribution [5], Weibull distribution [9, 10] and Gamma distribution [3, 12] to approximate *p*(*τ*). To do so, one needs to collect enough information about *τ*, which requires examining a large number of cases. From a practical point of view, it is easier to collect a small sample of cases (at least 30) and use it to calculate approximate estimates of the mean value ![Graphic][13] and the variance *σ*<sup>2</sup>. Taking the approximate estimates of ![Graphic][14] and *σ*<sup>2</sup> as prior information, we maximize the information entropy of the generation interval *τ* to infer the functional form of the probability distribution *p*(*τ*). This is the basic idea of MaxEnt, which agrees with everything that is known, but avoids assuming anything that is not known [15].
The resulting statistical inference gives the least biased predictions of the shapes of probability distributions consistent with prior knowledge [23].

Now we apply MaxEnt to determine the probability distribution function *p*(*τ*). To this end, we assume that the mean value ![Graphic][15] and the variance *σ*<sup>2</sup> of the generation interval *τ* are known. By equation (8), we define the information entropy of the generation interval *τ* as:

![Formula][16]

Because we only know the mean value ![Graphic][17] and the variance *σ*<sup>2</sup>, maximizing the information entropy (12) leads to the constrained problem:

![Formula][18]

To solve the optimization problem (13), we construct the Lagrange function:

![Formula][19]

where *α*′, *β*, and *γ* are Lagrange multipliers. Setting the functional derivative of equation (14) to zero, δℒ[*p*(*τ*)]/δ*p*(*τ*) = 0, we get the optimal solution:

![Formula][20]

where *α* = exp(−*α*′ − 1). In principle, substituting equation (15) into equations (8), (9), and (10) determines the values of *α*, *β*, and *γ*. However, it is difficult to evaluate the integrals (8), (9) and (10) analytically. To carry out the numerical calculation for equations (8), (9), and (10), we assume that *p*(*τ*) quickly tends to 0 as *τ* ≫ 1. The validity of this assumption can be justified by checking equation (15); therefore, equations (8), (9) and (10) can be written as:

![Formula][21]

![Formula][22]

![Formula][23]

Based on equations (16), (17) and (18), we propose a numerical method to calculate the approximate values of *α*, *β*, and *γ*. To this end, let us define:

![Formula][24]

The partial derivatives of equation (19) with respect to *β* and *γ* yield:

![Formula][25]

![Formula][26]

Using equations (19), (20) and (21), equations (8), (9) and (10) can be rewritten in the form:

![Formula][27]

where we have used the approximations (16), (17) and (18). By solving equation (22), one can obtain the approximate values of *α*, *β*, and *γ*.
Substituting equation (15) into equation (19) we have

![Formula][28]

where ![Graphic][29] exp(−*x*<sup>2</sup>) d*x* denotes the error function. From equation (23) it is easy to get:

![Formula][30]

![Formula][31]

Solving equation (22) is equivalent to minimizing the following function:

![Formula][32]

Here we use Matlab to evaluate *e*[*α, β, γ*] on a 100 × 100 × 100 lattice with spacing 0.01. At this accuracy of 0.01, inputting the observed values of ![Graphic][33] and *σ*<sup>2</sup> yields *α*, *β*, and *γ*.

We first apply equation (26) to the SARS epidemic in Singapore in 2003, where ![Graphic][34] (days) and *σ* = 3.8 (days) [9]. Substituting both observed values into equation (26), we find the lattice point that minimizes equation (26):

![Formula][35]

Substituting equation (27) into equation (15) we have:

![Formula][36]

The shape of equation (28) is shown in Figure 1 (blue curve). It agrees well with the sample data for the generation interval of SARS in Singapore in 2003 (red histogram in Figure 1). This result supports the validity of MaxEnt.

The latest clinical research [1] showed that the generation interval of COVID-19 is similar to that of SARS. Therefore, we assume that the generation interval of COVID-19 shares the same probability function shape as that of SARS. This assumption was also adopted by Wu et al. [4]. Based on this assumption, we apply equation (28) to estimating the reproduction number of COVID-19 in China. To this end, substituting equation (28) into equation (11) we have:

![Formula][37]

![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/03/20/2020.03.14.20035659/F1.medium.gif)

[Figure 1.](http://medrxiv.org/content/early/2020/03/20/2020.03.14.20035659/F1) The functional form of equation (28) is shown by the blue curve. The sample data for the generation interval of SARS in Singapore (2003) are shown by the red histogram; the data source is [9].

Because the observed data for *I*(*t*) were reported daily, we measure time *t* in days. To calculate *R<sub>t</sub>* by using equation (29), we need to rewrite the integral (29) as a summation formula. To do so, from Figure 1 we observe *p*(*τ* = 14) ≈ 0. By equations (16)-(18), this means *τ<sub>max</sub>* ≈ 14. Therefore, the maximum generation interval of COVID-19 can be taken to be approximately 14 days. Based on this setting, equation (29) can be rewritten in the form:

![Formula][38]

where *a* = 1, 2, … denotes the ordinal number of the period and *τ<sub>max</sub>* = 14. In equation (30) we have approximately identified the reproduction number of the last day of a period with the reproduction number of that period. Viewed over the entire time span of an epidemic, this approximation is in the spirit of the statistical mean-field method.

By using equation (30) we can report the reproduction number every 14 days. Before doing so, we first determine the starting point of each contagion period for COVID-19 in China. According to the report of China CDC [2], January 8, 2020 was considered the last day of a contagion period in Wuhan, China; therefore, we mark January 9, 2020 as the first day of the next contagion period. Based on this setting, the first period we report runs from January 9, 2020 to January 22, 2020. This setting agrees with the date of the Wuhan lockdown, January 23, 2020. Here, we have collected the national-level data on the cumulative infected, recovered, and death cases of China’s epidemic from January 10, 2020 to March 4, 2020, see Figure 2. From the data in Figure 2, it is easy to calculate the number of real-time infected cases, *I*(*t*), in China for each day. The result is shown in Figure 3.
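The period-wise estimate described above can be sketched in Python as follows. Since equation (30) is not reproduced in text form here, the snippet assumes the natural discretization of equation (11), namely *R* ≈ *I*(*t*) / Σ<sub>*τ*=1…*τ<sub>max</sub>*</sub> *p*(*τ*) *I*(*t* − *τ*) evaluated on the last day of each period; the function names `period_bounds` and `reproduction_number` are illustrative and not taken from the paper.

```python
from datetime import date, timedelta


def period_bounds(first_day, n_periods, length=14):
    """Return (first day, last day) for consecutive periods of `length` days."""
    bounds = []
    for a in range(n_periods):
        start = first_day + timedelta(days=a * length)
        bounds.append((start, start + timedelta(days=length - 1)))
    return bounds


def reproduction_number(active, p, t):
    """Assumed discrete form of equation (11).

    active[k] is the number of infectious individuals on day k,
    p[tau - 1] is the generation-interval probability for a lag of tau days,
    and t is the index of the day on which R is evaluated.
    """
    if t < len(p):
        raise ValueError("need at least len(p) days of history before day t")
    denominator = sum(p_tau * active[t - tau] for tau, p_tau in enumerate(p, start=1))
    return active[t] / denominator


# The four reporting periods used in the text, starting January 9, 2020.
periods = period_bounds(date(2020, 1, 9), 4)
```

In a toy check, a series that doubles every day combined with a generation interval concentrated on a one-day lag gives `reproduction_number([1, 2, 4, 8], [1.0], 3) == 2.0`, as expected.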
Using the data in Figure 3, we can report the reproduction number every 14 days by using equation (30). The results are listed in Table 1. The first period (from January 9, 2020 to January 22, 2020) can be regarded as a free propagation stage of COVID-19, because most Chinese people became aware of the outbreak only after January 21, 2020 and local governments did not take effective measures to control the epidemic during this period. Unfortunately, the data for the first period are very incomplete. Only the reported infected cases on January 10 (41 cases) and January 22 (571 cases) can be roughly used. Considering that the first period is a free propagation stage, we use these two data points to approximately restore the real-time data of this period through the exponential growth formula 571 = 41 · exp(12 · *r*), where *r* denotes the growth rate. Using the restored data, the estimated value of the reproduction number for the first period is 3.7069, see Table 1, which implies that the intensity of free transmission of COVID-19 is quite high. The World Health Organization announced [27] that, up to March 12, 2020, COVID-19 had spread to 118 countries. The rapid worldwide spread of COVID-19 is evidence supporting our calculation.

After January 22, 2020, the Chinese Government took emergency action to lock down the city of Wuhan and quickly implemented quarantine measures in every province. These strong quarantine measures substantially reduced the contagion probability among individuals. Therefore, the subsequent periods no longer belong to free transmission. For these periods, the reproduction numbers are listed in Table 1. Owing to the quarantine measures, the reproduction number for the second period (from January 23, 2020 to February 5, 2020) fell to 3.122, a reduction of 15.78%.
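The arithmetic behind the restored first-period data and the quoted reduction can be checked with a few lines of Python. This is a sketch under the assumption that the restored series is simply the exponential curve through the two reported points; the variable names are illustrative.

```python
import math

cases_jan10, cases_jan22 = 41, 571  # reported cases quoted in the text
days_between = 12

# Growth rate r solving 571 = 41 * exp(12 * r).
r = math.log(cases_jan22 / cases_jan10) / days_between

# Restored daily values from January 10 (day 0) to January 22 (day 12).
restored = [cases_jan10 * math.exp(r * day) for day in range(days_between + 1)]

# Relative reduction of the reproduction number from period 1 to period 2.
reduction = (3.7069 - 3.122) / 3.7069
```

Here `r` comes out at roughly 0.22 per day, and `reduction` reproduces the 15.78% quoted above.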
For the third period (from February 6, 2020 to February 19, 2020), the reproduction number dropped markedly to 1.2114, which is close to 1. This implies that the epidemic had been effectively controlled. It should be pointed out that the last day of the third period (February 19) is precisely the turning point of the epidemic, see Figure 3; therefore, the real data support our calculations of the reproduction number. When the epidemic entered the fourth period (from February 20, 2020 to March 4, 2020), the reproduction number finally fell to 0.6028, a level below 1. In this sense, China’s quarantine measures achieved a preliminary success.

[Table 1:](http://medrxiv.org/content/early/2020/03/20/2020.03.14.20035659/T1) The reproduction number for each period (2020)

![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/03/20/2020.03.14.20035659/F2.medium.gif)

[Figure 2.](http://medrxiv.org/content/early/2020/03/20/2020.03.14.20035659/F2) National-level data on the cumulative infected, recovered, and death cases of China’s epidemic from January 10, 2020 to March 4, 2020. The data for January 8 and 9 are simply assumed to be the same as those for January 10. **Data source:** [https://voice.baidu.com/act/newpneumonia/newpneumonia/?from=osari\_pc\_3](https://voice.baidu.com/act/newpneumonia/newpneumonia/?from=osari_pc_3)

![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/03/20/2020.03.14.20035659/F3.medium.gif)

[Figure 3.](http://medrxiv.org/content/early/2020/03/20/2020.03.14.20035659/F3) The number of real-time infected cases for each day = the number of cumulative infected cases for that day − the number of cumulative recovered cases for that day − the number of cumulative death cases for that day. The data come from Figure 2.
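The bookkeeping stated in the Figure 3 caption amounts to a per-day subtraction, sketched below in Python. The function name `active_cases` and the three-day input series are hypothetical and serve only to illustrate the calculation; they are not the data behind Figure 2.

```python
def active_cases(confirmed, recovered, deaths):
    """Real-time infected cases per day from three cumulative daily series."""
    if not (len(confirmed) == len(recovered) == len(deaths)):
        raise ValueError("the three series must cover the same days")
    return [c - r - d for c, r, d in zip(confirmed, recovered, deaths)]


# Hypothetical cumulative counts for three consecutive days (illustration only).
example = active_cases(
    confirmed=[100, 150, 210],
    recovered=[5, 12, 30],
    deaths=[2, 3, 5],
)
```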
To further test the validity of the reproduction numbers in Table 1, we substitute them into the SIR model (1)-(4) to simulate the evolution of China’s epidemic. To this end, let us first check the scope of application of the reproduction number formula (11), which is derived from equation (5). By the mean value theorem for integrals, equation (5) can be rewritten in the form:

![Formula][39]

where 0 ≤ *τ<sub>c</sub>* ≤ *τ<sub>max</sub>*. By equations (6) and (31) we have:

![Formula][40]

On the other hand, if we assume that *R<sub>t</sub>* is constant within each period (a step function), from equations (1)-(4) it is easy to get:

![Formula][41]

Let us define:

![Formula][42]

By using equation (34), equation (33) can be approximately written as:

![Formula][43]

which implies

![Formula][44]

where equation (35) is derived by using the approximations |*R<sub>t</sub>* − 1| · *R*(*t*) ≪ *I*(*t*) and *S*(0) ≈ *N*. Comparing equations (32) and (36), we find that the reproduction number formula (11) can be applied to the SIR model (1)-(4) if the approximation (34) holds. Equation (34) implies (*N* − *S*(*t*))/*N* ≪ 1. This approximation clearly holds for China, where *N* ≈ 1.4 × 10<sup>9</sup> and *N* − *S*(*t*) ≈ 1 × 10<sup>5</sup>. Therefore, we substitute the reproduction numbers in Table 1 into the SIR model (1)-(4) to simulate the evolution of China’s epidemic. The result is shown in Figure 5, where the evolutionary track of the epidemic (red circles) agrees well with that of the real incident cases in China (black circles). The simulation requires *τ* ≈ 8, which agrees with our previous setting *τ* = 8.4 ± 3.8 [9].

Furthermore, we find that the reproduction numbers of the quarantine periods in Table 1 can be fitted by an exponential function with *R*<sup>2</sup> = 0.9924, see Figure 4. Therefore, we use this exponential function to predict the reproduction numbers for the next seven periods (from March 5, 2020 to June 10, 2020).
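The exponential fit can be sketched in Python as an ordinary least-squares line through the logarithms of the three quarantine-period values quoted above. The sketch assumes that the periods are indexed 2, 3, 4 and that the goodness of fit is measured in log space; both choices, and the function name `fit_exponential`, are assumptions rather than details given in the letter.

```python
import math


def fit_exponential(xs, ys):
    """Fit y = a * exp(b * x) by least squares on ln(y).

    Returns (a, b, r2), where r2 is the coefficient of determination
    computed in log space.
    """
    logs = [math.log(y) for y in ys]
    n = len(xs)
    x_bar = sum(xs) / n
    l_bar = sum(logs) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (l - l_bar) for x, l in zip(xs, logs))
    b = sxy / sxx
    ln_a = l_bar - b * x_bar
    ss_res = sum((l - (ln_a + b * x)) ** 2 for x, l in zip(xs, logs))
    ss_tot = sum((l - l_bar) ** 2 for l in logs)
    return math.exp(ln_a), b, 1.0 - ss_res / ss_tot


period_index = [2, 3, 4]              # quarantine periods (assumed indexing)
r_values = [3.122, 1.2114, 0.6028]    # values quoted in the text
a, b, r2 = fit_exponential(period_index, r_values)

# Extrapolated reproduction numbers for the next seven periods (5 to 11).
predicted = {k: a * math.exp(b * k) for k in range(5, 12)}
```

Under these assumptions `r2` should come out close to the 0.9924 reported above, and every extrapolated value lies below 1. With only three fitted points the extrapolation is best read as indicative.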
The results are listed in Table 2, where we also present the predicted number of real-time infected cases on the last day of each period. These predicted values imply that China’s epidemic will gradually tend to disappear by May 2020, see the blue circles in Figure 5.

[Table 2:](http://medrxiv.org/content/early/2020/03/20/2020.03.14.20035659/T2) Predicted values (2020)

![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/03/20/2020.03.14.20035659/F4.medium.gif)

[Figure 4.](http://medrxiv.org/content/early/2020/03/20/2020.03.14.20035659/F4) The reproduction numbers of the quarantine periods in Table 1 are fitted by an exponential function.

![Figure 5.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/03/20/2020.03.14.20035659/F5.medium.gif)

[Figure 5.](http://medrxiv.org/content/early/2020/03/20/2020.03.14.20035659/F5) The SIR simulation using the reproduction numbers in Table 1 is shown by red circles, and the SIR simulation using the reproduction numbers in Table 2 is shown by blue circles, where *N* ≈ 1.4 × 10<sup>9</sup> and *τ* ≈ 8. The real-time infected cases in Figure 3 are shown by black circles.

In conclusion, estimating the reproduction number requires the probability distribution function of the generation interval of an infectious disease; however, this distribution is often unknown. In the existing literature, many scholars have used the exponential, normal, Weibull, and Gamma distributions to approximate the generation interval distribution. To do so, one needs to collect enough information about the symptom onsets of all cases, which requires examining a large number of cases. By contrast, the maximum entropy method is better suited to predicting probability distributions from incomplete information.
In this letter, we argue that, given the mean value and variance of the generation interval, one can determine its probability distribution function by using the maximum entropy method. Because the complete population data for the generation interval are never available, the maximum entropy method is a more convenient approach for estimating the probability distribution function of the generation interval. Using the maximum entropy method, we first determine the probability distribution function of the generation interval of COVID-19 and then apply it to estimating the real-time values of the reproduction number of China’s epidemic. Substituting these estimated reproduction numbers into the susceptible-infectious-removed epidemic model, we simulate the evolutionary track of the epidemic in China, which agrees well with that of the real incident cases.

## Data Availability

I confirm the availability of all data in the paper.

* Received March 14, 2020.
* Revision received March 14, 2020.
* Accepted March 20, 2020.
* © 2020, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/)

## References

1. Huang, C. et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The Lancet (2020): [https://doi.org/10.1016/S0140-6736(20)30183-5](https://doi.org/10.1016/S0140-6736(20)30183-5)
2. Li, Q. et al. Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus–Infected Pneumonia. The New England Journal of Medicine (2020): DOI: 10.1056/NEJMoa2001316
3. Zhao, S. et al. Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in China, from 2019 to 2020: A data-driven analysis in the early phase of the outbreak. International Journal of Infectious Diseases (2020): [https://doi.org/10.1016/j.ijid.2020.01.050](https://doi.org/10.1016/j.ijid.2020.01.050)
4. Wu, J. T. et al. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. The Lancet (2020): [https://doi.org/10.1016/S0140-6736(20)30260-9](https://doi.org/10.1016/S0140-6736(20)30260-9)
5. Wallinga, J. and Lipsitch, M. How generation intervals shape the relationship between growth rates and reproductive numbers. Proceedings of the Royal Society B 274 (2007) 599–604
6. Kenah, E. and Robins, J. Network-based analysis of stochastic SIR epidemic models with random and proportionate mixing. Journal of Theoretical Biology 249 (2007) 706–722
7. Kenah, E. and Robins, J. Second look at the spread of epidemics on networks. Physical Review E 76 (2007) 036113
8. Cauchemez, S. and Ferguson, N. M. Likelihood-based estimation of continuous-time epidemic models from time-series data: application to measles transmission in London. Journal of the Royal Society Interface 5 (2008) 885–897
9. Lipsitch, M. et al. Transmission Dynamics and Control of Severe Acute Respiratory Syndrome. Science 300 (2003) 1966
10. Wallinga, J. and Teunis, P. Different Epidemic Curves for Severe Acute Respiratory Syndrome Reveal Similar Impacts of Control Measures. American Journal of Epidemiology 160 (2004) 509–516
11. Cauchemez, S. et al. Real-time Estimates in Early Detection of SARS. Emerging Infectious Diseases 12 (2006) 110–113
12. Donnelly, C. A. et al. Epidemiological determinants of spread of causal agent of severe acute respiratory syndrome in Hong Kong. The Lancet 361 (2003) 1761–1766
13. Jaynes, E. T. Information Theory and Statistical Mechanics. Phys. Rev. 106 (1957) 620–630
14. Jaynes, E. T. On the rationale of maximum-entropy methods. Proceedings of the IEEE 70 (1982) 939–952
15. Jaynes, E. T. Notes on present status and future prospects. In: Grandy Jr., W. T., Schick, L. H. (Eds.), Maximum Entropy and Bayesian Methods. Kluwer, Dordrecht, The Netherlands (1990) 1–13
16. Judge, G. and Miller, D. Maximum Entropy Econometrics: Robust Estimation with Limited Data. John Wiley (1996)
17. Tao, Y. Competitive market for multiple firms and economic crisis. Physical Review E 82 (2010) 036118
18. Tao, Y. Spontaneous economic order. Journal of Evolutionary Economics 26 (2016) 467–500
19. Tao, Y. et al. Exponential structure of income inequality: evidence from 67 countries. Journal of Economic Interaction and Coordination 14 (2019) 345–376
20. Wissner-Gross, A. D. and Freer, C. E. Causal entropic forces. Physical Review Letters 110 (2013) 168702
21. Tao, Y. Self-referential Boltzmann machine. Physica A 123775 (2020): [https://doi.org/10.1016/j.physa.2019.123775](https://doi.org/10.1016/j.physa.2019.123775)
22. Phillips, S. J. Maximum entropy modeling of species geographic distributions. Ecological Modelling 190 (2006) 231–259
23. Harte, J. et al. Maximum entropy and the state-variable approach to macroecology. Ecology 89 (2008) 2700–2711
24. Dewar, R. C. and Porte, A. Statistical mechanics unifies different ecological patterns. Journal of Theoretical Biology 251 (2008) 389–403
25. Frank, S. A. The common patterns of nature. Journal of Evolutionary Biology 22 (2009) 1563–1585
26. Harte, J. and Newman, E. A. Maximum information entropy: a foundation for ecological theory. Trends in Ecology & Evolution 29 (2014) 384–389
27. [https://www.who.int/redirect-pages/page/novel-coronavirus-(covid-19)-situation-dashboard](https://www.who.int/redirect-pages/page/novel-coronavirus-(covid-19)-situation-dashboard)

[1]: /embed/graphic-1.gif
[2]: /embed/graphic-2.gif
[3]: /embed/graphic-3.gif
[4]: /embed/graphic-4.gif
[5]: /embed/graphic-5.gif
[6]: /embed/graphic-6.gif
[7]: /embed/graphic-7.gif
[8]: /embed/graphic-8.gif
[9]: /embed/graphic-9.gif
[10]: /embed/inline-graphic-1.gif
[11]: /embed/graphic-10.gif
[12]: /embed/graphic-11.gif
[13]: /embed/inline-graphic-2.gif
[14]: /embed/inline-graphic-3.gif
[15]: /embed/inline-graphic-4.gif
[16]: /embed/graphic-12.gif
[17]: /embed/inline-graphic-5.gif
[18]: /embed/graphic-13.gif
[19]: /embed/graphic-14.gif
[20]: /embed/graphic-15.gif
[21]: /embed/graphic-16.gif
[22]: /embed/graphic-17.gif
[23]: /embed/graphic-18.gif
[24]: /embed/graphic-19.gif
[25]: /embed/graphic-20.gif
[26]: /embed/graphic-21.gif
[27]: /embed/graphic-22.gif
[28]: /embed/graphic-23.gif
[29]: /embed/inline-graphic-6.gif
[30]: /embed/graphic-24.gif
[31]: /embed/graphic-25.gif
[32]: /embed/graphic-26.gif
[33]: /embed/inline-graphic-7.gif
[34]: /embed/inline-graphic-8.gif
[35]: /embed/graphic-27.gif
[36]: /embed/graphic-28.gif
[37]: /embed/graphic-29.gif
[38]: /embed/graphic-31.gif
[39]: /embed/graphic-35.gif
[40]: /embed/graphic-36.gif
[41]: /embed/graphic-37.gif
[42]: /embed/graphic-38.gif
[43]: /embed/graphic-39.gif
[44]: /embed/graphic-40.gif