Abstract
At the end of 2019, a coronavirus disease 2019 (COVID-19) outbroke in Wuhan, China, and spread to Guizhou province on January of 2020. To acquire the epidemiologic characteristics of COVID-19 in Guizhou, China, we collected data on 162 laboratory-confirmed cases related to COVID-19. We described the demographic characteristics of the cases and estimated the incubation period, serial interval and basic reproduction number. With an estimation of 8 days incubation period and 6 days serial interval, our results indicate that there may exist infectiousness during the incubation period for 2019-nCoV. This increases the difficulty of screening or identifying cases related to COVID-19.
Background
At the end of 2019, a Novel Coronavirus Infected Pneumonia (NCIP) outbroke in Wuhan, China. This coronavirus was named as 2019-Novel Coronavirus (2019-nCoV), and then renamed this disease caused by the virus strain from 2019-nCoV acute respiratory disease to “coronavirus disease 2019” (COVID-19) by World Health Organization (WHO) officially0. This incident caused a public health issue within China immediately, and spread to several countries over the world2–5.
Guizhou province is in Southwest of China, close (∼1,000 km) to Wuhan. With the increasing number of COVID-19 patients in mainland China, Guizhou detected its first confirmed case on January 21, and was officially confirmed by Chinese Center for Disease Control and Prevention (China CDC) on January 23. Before the confirmation of the first case, Guizhou CDC identified and screened suspected COVID-19 cases by “pneumonia of unknown etiology” surveillance mechanism6. After the confirmation, Guizhou CDC started to identify and screen every case with pneumonia.
Until February 16, over 140 laboratory-confirmed cases were officially reported by Guizhou Health Committee7. In this study, we analyzed the data on the laboratory-confirmed cases in Guizhou to demonstrate the epidemiologic characteristics of COVID-19.
Methods
Case Definitions
According to standard clinical guidelines, suspected case of COVID-19 was defined as the combination of clinical characteristics and epidemiologic histories. Clinical characteristics of suspected case must fit at least 2 of 3 following criteria:
“fever and/or symptoms in respiratory system; radiographic evidence of pneumonia; low or normal white-cell count or low lymphocyte count.” 8 Epidemiologic histories must fit at least 1 of 4 following criteria: “A history of traveling Hubei Province or other districts that has confirmed cases reported within 14 days of symptom onset; A history of contacting with patient who has fever or symptoms in respiratory system from Hubei Province or other districts that has confirmed cases reported within 14 days of symptom onset; Any person who has had a close contact with confirmed cases; cluster cases.”8 With a stricter definition, Guizhou Health Committee counted all cases that fit clinical characteristics criteria but not any criteria of epidemiologic histories described above as suspected cases. A confirmed case was defined as a case with respiratory specimens that tested positive for the 2019-nCoV by at least one of the following two methods: positive result by real-time reverse-transcription– polymerase-chain-reaction (RT-PCR) assay for 2019-nCoV or a genetic sequence that matches 2019-nCoV. Asymptomatic carrier (i.e. case that displays no signs or symptoms to 2019-nCoV) was not announced by Health Committee officially, but was included in this study.
Sources of Data
Suspected case was screened by local hospital or local Center for Disease Control and Prevention (CDC). Once the patient was identified as confirmed case by laboratory, a joint field epidemiology team comprising members from Guizhou CDC and local CDC would open a detailed field investigation on demography information, epidemiologic histories, timelines of key events, and close contacts.
All data from epidemiological reports were inputted into standardized forms from technical protocols designed by China CDC. At least two team members independently review the full report of each case report to ensure that data is correctly input.
Epidemiological investigation
Once a case was evaluated as suspected case, local CDC would finish an initial investigation report in 24 hours and collect respiratory specimens for centralized laboratory testing. When case was confirmed as positive to 2019-nCoV, province CDC would send a special epidemiology team to investigate with local CDC, in order to acquire more detailed information. Information was collected from infected individuals, family members, medical workers, close contacts, GPS-info, and CCTV camera, etc. Information included basic demography data, detailed life trace and all close contacts, clinical characteristics, exposure history, etc.
Laboratory confirmation
With the guidelines of protocol by World Health Organization (WHO) 9, specimens of suspected cases were taken from whose upper and lower respiratory tract by professional medical worker from CDC or hospital. RNA was extracted and tested by RT-PCR with primers and probes for 2019-nCoV. Cross-reactivity with other known respiratory viruses and bacteria such as Influenza A (H1N1, H3N2, H5N1, or H7N9), Influenza B (Victoria or Yamagata), MERS-CoV, Adenovirus, etc. are also be tested.
Statistical analysis
The onset date was defined as the self-reported first date to have symptoms related to 2019-nCoV or the date of visiting to clinical facilities if it was unavailable to acquire the accurate onset date, and as the confirmation date for confirmed cases without symptoms. As the date of infecting exposure for a given individual falls within a finite interval, an exposure window was clarified to denote this interval. Criteria of infecting exposure was described above in the Case Definitions section. Continuous variables were expressed as the means and standard deviations or medians, the interquartile range (IQR) equals the difference between 75th and 25th percentiles. Categorical variables were summarized as the counts and percentages in each category. The incubation period is defined as the delay from viral infection to the onset of illness10, which was estimated by fitting a parametric accelerated failure time model with log-normal distribution of cases with detailed exposure window data, and standard errors of parameters was computed using a 1,000 times bootstrap routine, this was performed by R package coarseDataTools11. The serial interval distribution (i.e. the duration between symptom onset of a primary/index case and symptom onset of its secondary cases) was fitted by a Weibull distribution with data from cluster events. The basic reproduction number (R0) was defined as the expected number of cases directly generated by one case in a population where all individuals are susceptible to infection, it was estimated by maximum likelihood estimation via R package R012.
All analyses and statistical graphs were conducted with R software version 3.6.2 (R Foundation for Statistical Computing).
Ethics approval
Data collection and analysis of cases and their close contacts were determined by Guizhou Health Commission of the People’s Republic of China. It is part of a continuing public health outbreak investigation and exempted from institutional review board assessment.
Results
There were 759 cases in total. Among all patients, 144 (18.97%) cases were confirmed cases with symptoms to 2019-nCOV, 18 (2.37% of all 759 cases, 11.11% of all 162 confirmed cases) cases were confirmed as cases without symptoms, 597 (78.65%) cases were diagnosed as other diseases after clinical diagnosing or laboratory testing, 34 (4.47%) cases were suspected cases that were pending for laboratory testing. Figure 1 shows the geographic distribution of confirmed cases throughout China and Guizhou. The median age of all confirmed cases was 37 years (ranges from 1 month to 89 years), 82 (50.62%) confirmed cases were male, 84 (51.85%) confirmed cases were exposed to Wuhan or other districts outside Guizhou, the detailed demographic data are shown in Table 1.
The first case was detected on January 21 and was officially confirmed by China CDC on January 23. It was reported by Guiyang (capital city of Guizhou) CDC. This case was a 51-year-old male, infected in Wuhan where he was there to do some business affairs. His onset date was on January 9 when he was still in Wuhan. After returning to Guizhou on January 14, he visited a local hospital for several times until January 16 when he was isolated by the Division of Infection. Figure 2 shows the distribution of the onset date of all cases related to 2019-nCoV over time. It is notable that there was a decline of cases after February 7th, which is most likely attributed to the delay of reporting by the local hospital and laboratory confirmation.
We included 93 confirmed cases with detailed exposure window and onset date information to estimate the incubation period (Fig. 3 A-B). The median incubation period was 8.06 days (95% confidence interval [CI]: 6.89 – 9.36). The 95% percentile was 21.90 days (95% CI: 18.21 – 25.45). Other percentiles are shown on Table 2. We obtained 57 observations from cluster cases and estimated the serial interval distribution (Fig. 3C) with the mean (±SD) of 6.37±4.15 days. The estimated basic reproduction number (R0) was 1.09 (95% CI: 0.94 – 1.26).
Discussions
This study provides an initial analysis among epidemiologic characteristics and two typical transmission phenomena of COVID-19. Most index cases were exposed to Wuhan or confirmed cases related to Wuhan, but it is also necessary to prevent the community transmission of COVID-19. Measures such as community screening, isolation of close contacts, public transportation limitation, extended population of PCR testing, enhanced disinfection, and health education etc. could be taken. Our findings provide more epidemiologic data to this incident of COVID-19, which contribute to the further analysis and the control of this disease.
In addition to the evidence of the human-to-human transmission, it is likely that there exists infectiousness to some extend during the incubation period, and the presence of “super-spreaders”. The median incubation period was 8.2 days (95% CI: 7.9 – 9.5) in our study, which is longer than a recent report of 425 patients (8.2 days vs. 5.2 days), this may be a results of recall bias, during epidemiological investigation, we found that some cases actually have had shown mild symptoms days before the date they reported so as to their ignorance. We estimated the R0 as 1.09 (95% CI: 1.9 – 1.2), which means each patient can infect other 1.09 people on average. In commonly used infection models, when R0 > 1 the infection will be able to start spreading in a population, but not if R0 < 1.
However, it should be treated with caution that the basic reproduction number R0 was calculated under the assumptions that an infected individual but has no symptoms does not yet infect others, which assumes that all index cases should show symptoms before their secondary cases. However, as the investigation went on, Guizhou CDC found that there existed index cases who showed symptoms after that of their secondary cases. This may lead to that the estimated R0 was not robust as we do not hold enough evidence and information to the presence of asymptomatic carrier, those data were not included in the estimation model.
There are several limitations in our study. First, we could not collect all complete exposure and onset data from all confirmed cases, due to the recall bias and the lack of standardization on epidemiological investigation at the beginning of the incident. Second, some key data from asymptomatic carrier, especially the onset date or onset window, and left bound of exposure window from cases of Wuhan residents were unavailable, other statistical model may be used to fit such kind of censored data to estimate serial interval and R0 more accurately.
Data Availability
The data that support the findings of this study are available from the corresponding author upon reasonable request.
Funding
The authors received no specific funding for this research.
Author Bio
Kaike Ping acquires his Master degree of Biostatistics in Southern Medical University on 2017. Now he is a researcher at Center for Disease Control and Prevention of Guizhou, China, where he has been investigating respiratory diseases and their epidemiologic Characteristics. His research interests include disease distributions, time series analysis, survival analysis, and machine learning.