Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Epidemiological characteristics of novel coronavirus infection: A statistical analysis of publicly available case data

View ORCID ProfileNatalie M. Linton, View ORCID ProfileTetsuro Kobayashi, View ORCID ProfileYichi Yang, Katsuma Hayashi, View ORCID ProfileAndrei R. Akhmetzhanov, View ORCID ProfileSung-mok Jung, View ORCID ProfileBaoyin Yuan, View ORCID ProfileRyo Kinoshita, View ORCID ProfileHiroshi Nishiura
doi: https://doi.org/10.1101/2020.01.26.20018754
Natalie M. Linton
1Graduate School of Medicine, Hokkaido University, Sapporo, Hokkaido, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Natalie M. Linton
Tetsuro Kobayashi
1Graduate School of Medicine, Hokkaido University, Sapporo, Hokkaido, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Tetsuro Kobayashi
Yichi Yang
1Graduate School of Medicine, Hokkaido University, Sapporo, Hokkaido, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yichi Yang
Katsuma Hayashi
1Graduate School of Medicine, Hokkaido University, Sapporo, Hokkaido, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Andrei R. Akhmetzhanov
1Graduate School of Medicine, Hokkaido University, Sapporo, Hokkaido, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Andrei R. Akhmetzhanov
Sung-mok Jung
1Graduate School of Medicine, Hokkaido University, Sapporo, Hokkaido, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Sung-mok Jung
Baoyin Yuan
1Graduate School of Medicine, Hokkaido University, Sapporo, Hokkaido, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Baoyin Yuan
Ryo Kinoshita
1Graduate School of Medicine, Hokkaido University, Sapporo, Hokkaido, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ryo Kinoshita
Hiroshi Nishiura
1Graduate School of Medicine, Hokkaido University, Sapporo, Hokkaido, Japan
2CREST, Japan Science and Technology Agency, Honcho 4-1-8, Kawaguchi, Saitama 332-0012, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Hiroshi Nishiura
  • For correspondence: nishiurah@med.hokudai.ac.jp
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

The geographic spread of persons infected with the 2019 novel coronavirus (2019-nCoV) provides an opportunity to study the natural history of the newly emerged virus. Migration events put travelers at risk of infection for the duration of their exposure to an area where transmission is known to occur. Using publicly available data of the ongoing epidemic of 2019-nCoV where event dates for cases have been shared, the present study estimated the incubation period and other time intervals that govern interpretation of the epidemiological dynamics of 2019-nCoV infections. Our results show that the incubation periods falls within the range of two to nine days with 95% confidence, and the median incubation period is 4–5 days when approximated using the Weibull distribution, which was the best fit model. The median time from illness onset to hospitalization was estimated at 3 days. Based on the estimate of the 95th percentile estimate of the incubation period, we recommend that the length of isolation and quarantine should be at least nine days. We also note that the median time delay of 13.8 days from illness onset to death should be considered when estimating the case fatality risk of this novel virus.

1 Introduction

As of 24 January 2020, 1287 cases of novel coronavirus (2019-nCoV) infections were reported in main-land China, causing 41 deaths. While infections in the first case cluster were initially thought to be mostly due to zoonotic (animal-to-human) transmission—possibly due to wild animals sold at a local seafood wholesale market [1, 2] – the growth of case incidence in Wuhan after closure of the market and exportation of cases across China and internationally shows compelling evidence of increasing human-to-human secondary transmission, fueled by human migration. Cases have now been detected in many other parts of the world [3], including other Asian countries, the United States, and France. This geographic expansion beyond the initial epicenter of Wuhan provides an opportunity to study the natural history 2019-nCoV infection, as migration events limit the windows of risk to the time interval during which the person traveled to the area where exposure could occur.

The incubation period is defined as the time from infection to illness onset. Knowledge of the incubation period of a directly transmitted infectious disease is critical to determine the time period required for movement restriction of healthy individuals (i.e. quarantine period) [5, 6]. We therefore undertook the incubation period estimation for the 2019-nCoV to assess how long exposed persons must be monitored. The distribution of the incubation period may also aid in understanding the relative infectiousness of 2019-nCoV over the course of infection.

Another important epidemiologic issue in infectious disease is the inherent time delays governing each event of infection, e.g. hospitalization and death, which inform the temporal dynamics of epidemics. That is, the epidemic curve based on the date of hospitalization for each case is better interpreted and analyzed by understanding the time from symptom onset to hospitalization. A published clinical study has already shown that the average time delay from illness onset to admission is approximately 7 days [7], but variations by patients must be carefully monitored. The time from hospitalization to death is also critical in avoiding the underestimation of case fatality risk [8].

Using publicly available data of the ongoing epidemic of 2019-nCoV with known event dates, the present study aims to estimate the incubation period and other time intervals that govern the interpretation of epidemiological dynamics of 2019-nCoV. We perform the estimation of percentile points using a bootstrapping method.

2 Methods

2.1 Epidemiological data

We retrieved information on cases with confirmed 2019-nCoV infection and diagnosis outside of the epicenter of Hubei Province, China, based on official reports from governmental institutes. We collected the data either directly from governmental websites or from news sites that directly quoted governmental statements. The data were collected in real time, and thus may be updated as more details on cases becomes publicly available. The arranged data are available as the Online Supplementary Material (Table S1). The latest update to the dataset was on 25 January 2020 for cases reported through 24 January.

Specifically, we collected the dates of exposure (entry and/or exit from Wuhan), illness onset, hospitalization, and death. Cases included both residents from other locations who travelled to Wuhan, as well as Wuhan residents who were diagnosed while outside of Wuhan and reported by the governments of the locations where illness was detected. We thus estimated the incubation period by (i) examining visitors to Wuhan and (ii) examining both visitors to and residents from Wuhan who were diagnosed outside of Hubei Province. The former may be more precise in defining the interval of exposure, but the sample size is greater for the latter.

2.2 Statistical model

We used the dates of three critical points of the course of illness (i.e., dates of onset, hospitalization and death) to calculate four time intervals: the time periods (a) from exposure to illness onset (i.e., incubation period), (b) from illness onset to hospitalization, (c) from illness onset to death, and (d) from hospitalization to death. All these intervals were subject to a doubly interval-censored likelihood function to estimate the parameter values (which can be analyzed by using coarseDataTools package of the statistical language R) [9]: Embedded Image

Here, for example in the case of (a), g(.) is the probability density function (p.d.f.) of exposure following a uniform distribution, and f (.) is the p.d.f. of the incubation period independent of g(.). D represents a dataset among all observed cases i. Exposure and symptom onset obey the upper and lower bounds, (ER, EL) and (SR, SL), respectively. For instance, if the date of illness onset is for one day, the respective interval is (SR, SR + 1), where SR is the reported date of illness onset.

We performed a bootstrap method, based on case resampling, to compute the 95% confidence intervals (CI). Likewise, we were able to calculate distributions of (b), (c) and (d). We also assume that the probability density function f (.) follows three different distributions, i.e., lognormal, Weibull and gamma distributions. Akaike Information Criterion (AIC) was used to identify the best fit model for each time interval.

3 Results

Table 1 shows estimated percentiles and AIC values for each combination of time interval and distribution. For the incubation period estimates, the best fit was found with the Weibull distribution for data both excluding and including Wuhan residents. The median incubation period using the Weibull distribution was estimated at 4.6 days (95% CI: 3.3, 5.7) when excluding Wuhan residents (n = 12) and 5.0 days (95% CI: 4.1, 5.8) when including Wuhan residents (n = 31). Figure 1 shows the cumulative distribution function of the incubation period, and the 5th and 95th percentiles are shown in addition to the median. The 95th percentiles were estimated at 7.3 days (95% CI: 5.6, 8.4) days for non-Wuhan residents and at 7.6 days (95% CI: 6.0, 8.8) when including Wuhan residents.

View this table:
  • View inline
  • View popup
  • Download powerpoint
Table 1.

Bootstrap estimates from 1000 iterations. All cases were diagnosed with laboratory-positive 2019-nCoV outside of Hubei Province. WR: Wuhan residents. AIC = 2 L(θ* ; D) + 2 6 (all estimates had 6 parameters). Ranges for onset, hospitalization, and death calculated as left = reported date; right = reported date + 1 day. Shaded cells indicate the model with the minimal AIC value.

Figure 1.
  • Download figure
  • Open in new tab
Figure 1.

Estimated cumulative distribution for the incubation period of 2019-nCoV from outbreak cases reported in January 2020. Data are from public case reports published by governments outside of Hubei Province, China. Left: excludes Wuhan (Hubei Province) residents from the estimates. Right: includes Wuhan residents in the estimates.

The median time from illness onset to hospitalization was estimated at 2.7 days (95% CI: 1.7, 4.2) using the gamma distribution, which yielded the lowest AIC value (Table 1). Figure 2A shows the corresponding p.d.f. Time from symptom onset and hospitalization to death were also computed (Table 1 and Figure 2BC). The best-fit models for each interval were the lognormal and Weibull distributions, respectively. The median time from onset to death was 13.8 days (95% CI: 11.8, 16.0) and the median time from hospitalization to death was 8.3 days (95% CI: 6.4, 10.5).

Figure 2.
  • Download figure
  • Open in new tab
Figure 2.

Probability distributions of time from onset or hospitalization to hospitalization or death for 2019-nCoV outbreak cases reported through 24 January 2020. (A) Probability density of the time from illness onset to hospitalization in days set to the best-fit gamma distribution. (B) Probability density of the time from illness onset to death in days set to the best-fit lognormal distribution. (C) Probability density of the time from hospitalization to death in days set to the best-fit Weibull distribution.

4 Discussion

Our results show that 95% of incubation periods fall within the range of 2 to 9 days, and the median incubation period was 4–5 days when the Weibull distribution was used as the best-fit model. The median time from illness onset to hospitalization was approximately 3 days. The median time from illness onset to death was 13.8 days, the delay of which is key to appropriate estimation of the case fatality risk for 2019-nCoV [10].

The present study advances the public discussion on 2019-nCoV infections as both the incubation period and the time from illness onset to death were explicitly estimated using publicly available data. Our estimated median incubation period of 2019-nCoV is comparable to known median values of the incubation period for severe acute respiratory syndrome (SARS)—estimated at 4.0–6.4 days [8, 11, 12]. In addition to empirically showing the comparability to SARS, the present study has also shown that the 95th percentile of the incubation period is around 7–8 days, indicating that a nine-day quarantine period could mostly ensure the absence of disease among exposed healthy individuals.

The time from illness onset to death is also comparable to SARS [8], and the 13.8-day median delay that we calculated indicates that the crude estimation of the ratio of the cumulative number of deaths to that of cases tends to result in underestimation of the case fatality risk, especially during the early stage of the epidemic. During the SARS epidemic in Hong Kong, 2003, the time from illness onset to hospitalization was shown to have shortened as a function of calendar time, reflecting that contact tracing practice had worked out gradually. Moreover, the study on pandemic influenza H1N1-2009 has demonstrated a negative association between the time from illness onset to hospitalization and the basic reproduction number, i.e., the average number of secondary cases generated by a single primary case in a fully susceptible population [13]. While our estimate was approximately 3 days, consistent with high mortality at hospital settings, this may be thus shortened in the future course of the epidemic. Several limitations of the present study exist. First, the dataset relies on published information, and the defined event date (e.g. the date of illness onset) depends on the decision-making of each governmental authority. Given the novelty of the illness, it is possible that symptom onset and other event data may have been dealt with differently between jurisdictions (e.g., was onset the date of fever or date of dyspnea?). Second, the sample size was limited, and the variance was likely to be biased. Third, we were not able to examine heterogeneity of estimates by different attributes of cases (e.g. age and risk groups).

While several future tasks remain, we believe that the present study has been successful in clarifying the epidemiological characteristics of novel coronavirus infection. The length of quarantine should be at least nine days, and the time delay from illness onset to death of fourteen days must be addressed when estimating the case fatality risk.

Data Availability

Used dataset is available as the Supplementary Material

Supplementary material

Table S1 Event dates for cases included in the analysis.

Author Contributions

N.M.L., T.K., A.R.A., and H.N. conceived the study and participated in the study design. All authors assisted in collecting the data. N.M.L., T.K. and H.N. analyzed the data and T.K., H.N., N.M.L. and Y.Y. drafted the manuscript. All authors edited the manuscript and approved the final version.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. 1.↵
    Peng, W., Xinxin H., Eric, H.Y., Jessica, W.Y., Kathy, L., Joseph, W.T., Benjamin C., Gabriel, L. Real-time tentative assessment of the epidemiological characteristics of novel coronavirus infections in Wuhan, China, as at 22 January 2020. Euro Surveillance 2020, 25(3), pii=2000044 (doi:10.2807/1560-7917.ES.2020.25.3.2000044).
    OpenUrlCrossRefPubMed
  2. 2.↵
    Center for Disease Control and Prevention. 2019 Novel Coronavirus, Wuhan, China. Available online: (reference link) (accessed on 24 January 2020).
  3. 3.↵
    European Centre for Disease Prevention and Control data. Geographical distribution of 2019-nCov cases. Available online: (reference link) (accessed on 24 January 2020).
  4. 4.
    Nishiura, H., Lee, H.W., Cho, S.H., Lee, W.G., In, T.S., Moon, S.U., Chung, G.T., Kim, T.S. Estimates of short- and long-term incubation periods of Plasmodium vivax malaria in the Republic of Korea. Trans R Soc Trop Med Hyg. 2007 Apr;101(4):338–43 (doi:10.1016/j.trstmh.2006.11.002).
    OpenUrlCrossRefPubMed
  5. 5.↵
    Lessler, J., Reich, N.G., Cummings, D.A. New York City Department of Health and Mental Hygiene Swine Influenza Investigation Team, Nair HP, Jordan HT, Thompson N. Outbreak of 2009 pandemic influenza A (H1N1) at a New York City school. N Engl J Med. 2009, 361(27), 2628–36. (doi:10.1056/NEJMoa0906089).
    OpenUrlCrossRefPubMedWeb of Science
  6. 6.↵
    Nishiura, H. Determination of the appropriate quarantine period following smallpox exposure: an objective approach using the incubation period distribution. Int J Hyg Environ Health. 2009, 212(1), 97–104. (doi:10.1016/j.ijheh.2007.10.003).
    OpenUrlCrossRefPubMed
  7. 7.↵
    Huang, C., Wang, Y., Li, X., Ren, L., Zhao, J., Hu, Y., Zhang, L., Fan, G., Xu, J., Gu, X., Cheng, Z., Yu, T., Xia, J., Wei, Y., Wu, W., Xie, X; Yin, W., Li, H., Liu, M., Xiao, Y., Gao, H., Guo, L., Xie. J; Wang, G., Jiang, R., Gao, Z., Jin, Q., Wang, J., Cao, B. Lancet e. in press, (doi:10.1016/S0140-6736(20)30183-5).
    OpenUrlCrossRefPubMed
  8. 8.↵
    Donnelly, C.A., Ghani, A.C., Leung, G.M., Hedley, A.J., Fraser, C., Riley, S., Abu-Raddad, L.J., Ho, L.M., Thach, T.Q., Chau, P., Chan, K.P., Lam, T.H., Tse, L.Y., Tsang, T., Liu, S.H., Kong, J.H., Lau, E.M., Ferguson, N.M., Anderson, R.M. Epidemiological determinants of spread of causal agent of severe acute respiratory syndrome in Hong Kong. Lancet. 2003 May 24;361(9371):1761–6 (doi:10.1016/S0140-6736(03)13410-1).
    OpenUrlCrossRefPubMedWeb of Science
  9. 9.↵
    Reich, N.G., Lessler, J., Cummings, D.A.: Brookmeyer, R. Estimating incubation period distributions with coarse data. Statistics in Medicine 2009, 28(22), 2769–84. (doi:10.1002/sim.3659).
    OpenUrlCrossRefPubMedWeb of Science
  10. 10.↵
    Ghani AC, Donnelly CA, Cox DR, Griffin JT, Fraser C, Lam TH, Ho LM, Chan WS, Anderson RM, Hedley AJ, Leung GM. Methods for estimating the case fatality ratio for a novel, emerging infectious disease. Am J Epidemiol. 2005,162(5), 479–86 (doi:10.1093/aje/kwi230).
    OpenUrlCrossRefPubMedWeb of Science
  11. 11.↵
    Cowling, B.J., Park. M., Fang. V.J., Wu, P., Leung, G.M., Wu, J.T. Preliminary epidemiological assessment of MERS-CoV outbreak in South Korea, May to June 2015. Euro Surveillance 2015, 20(25), pii: 21175 (doi:10.2807/1560-7917.es2015.20.25.21163).
    OpenUrlCrossRef
  12. 12.↵
    Lessler, J., Reich, N.G., Brookmeyer, R., Perl, T.M., Nelson, K.E., Cummings, D.A. Incubation periods of acute respiratory viral infections: a systematic review. Lancet Infect Dis. 2009, 9(5), 291–300 (doi:10.1016/S1473-3099(09)70069-6).
    OpenUrlCrossRefPubMedWeb of Science
  13. 13.↵
    Fraser, C., Donnelly, C.A., Cauchemez, S., Hanage, W.P., Van Kerkhove, M.D., Hollingsworth, T.D., Griffin, J., Baggaley, R.F., Jenkins, H.E., Lyons, E.J., Jombart, T., Hinsley, W.R., Grassly, N.C., Balloux, F., Ghani, A.C., Ferguson, N.M., Rambaut, A., Pybus, O.G., Lopez-Gatell, H., Alpuche-Aranda, C.M., Chapela, I.B., Zavala, E.P., Guevara, D.M., Checchi, F., Garcia, E., Hugonnet, S., Roth, C. WHO Rapid Pandemic Assessment Collaboration. Pandemic potential of a strain of influenza A (H1N1): early findings. Science. e Jun 19, 324(5934), 1557–61 (doi:10.1126/science.1176062)
    OpenUrlAbstract/FREE Full Text
Back to top
PreviousNext
Posted January 28, 2020.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Epidemiological characteristics of novel coronavirus infection: A statistical analysis of publicly available case data
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Epidemiological characteristics of novel coronavirus infection: A statistical analysis of publicly available case data
Natalie M. Linton, Tetsuro Kobayashi, Yichi Yang, Katsuma Hayashi, Andrei R. Akhmetzhanov, Sung-mok Jung, Baoyin Yuan, Ryo Kinoshita, Hiroshi Nishiura
medRxiv 2020.01.26.20018754; doi: https://doi.org/10.1101/2020.01.26.20018754
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Epidemiological characteristics of novel coronavirus infection: A statistical analysis of publicly available case data
Natalie M. Linton, Tetsuro Kobayashi, Yichi Yang, Katsuma Hayashi, Andrei R. Akhmetzhanov, Sung-mok Jung, Baoyin Yuan, Ryo Kinoshita, Hiroshi Nishiura
medRxiv 2020.01.26.20018754; doi: https://doi.org/10.1101/2020.01.26.20018754

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Infectious Diseases (except HIV/AIDS)
Subject Areas
All Articles
  • Addiction Medicine (174)
  • Allergy and Immunology (420)
  • Anesthesia (97)
  • Cardiovascular Medicine (896)
  • Dentistry and Oral Medicine (169)
  • Dermatology (102)
  • Emergency Medicine (257)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (405)
  • Epidemiology (8764)
  • Forensic Medicine (4)
  • Gastroenterology (404)
  • Genetic and Genomic Medicine (1857)
  • Geriatric Medicine (177)
  • Health Economics (387)
  • Health Informatics (1286)
  • Health Policy (642)
  • Health Systems and Quality Improvement (490)
  • Hematology (206)
  • HIV/AIDS (392)
  • Infectious Diseases (except HIV/AIDS) (10543)
  • Intensive Care and Critical Care Medicine (564)
  • Medical Education (193)
  • Medical Ethics (52)
  • Nephrology (218)
  • Neurology (1748)
  • Nursing (102)
  • Nutrition (265)
  • Obstetrics and Gynecology (342)
  • Occupational and Environmental Health (460)
  • Oncology (962)
  • Ophthalmology (281)
  • Orthopedics (107)
  • Otolaryngology (176)
  • Pain Medicine (117)
  • Palliative Medicine (41)
  • Pathology (263)
  • Pediatrics (556)
  • Pharmacology and Therapeutics (264)
  • Primary Care Research (218)
  • Psychiatry and Clinical Psychology (1841)
  • Public and Global Health (3975)
  • Radiology and Imaging (650)
  • Rehabilitation Medicine and Physical Therapy (341)
  • Respiratory Medicine (534)
  • Rheumatology (215)
  • Sexual and Reproductive Health (178)
  • Sports Medicine (166)
  • Surgery (196)
  • Toxicology (37)
  • Transplantation (106)
  • Urology (79)