Epidemiology of CoVID-19 and predictors of recovery in the Republic of Korea ============================================================================ * Ashis Kumar Das * Saji Saraswathy Gopalan ## Abstract **Background** The recent CoVID-19 pandemic has emerged as a threat to global health. Though current evidence on the epidemiology of the disease is emerging, very little is known about the predictors of recovery. **Objectives** To describe the epidemiology of confirmed CoVID-19 patients in Republic of Korea and identify predictors of recovery. **Materials and methods** Using publicly available data for confirmed CoVID-19 cases from the Korea Centers for Disease Control and Prevention from January 20, 2020 to April 30, 2020, we undertook descriptive analyses of cases stratified by sex, age group, place of exposure, date of confirmation and province. Correlation was tested among all predictors (sex, age group, place of exposure and province) with the Pearson’s correlation coefficient. Associations between recovery from CoVID-19 and predictors were estimated using a multivariable logistic regression model. **Results** Majority of the confirmed cases were females (56%), from 20-29 age group (24.3%), and primarily from three provinces –– Gyeongsangbuk-do (36.9%), Gyeonggi-do (20.5%) and Seoul (17.1%). Case fatality ratio was 2.1% and 41.6% cases recovered. Older patients, patients from provinces such as Daegu, Gyeonggi-do, Gyeongsangbuk-do, Jeju-do, Jeollabuk-do and Jeollanam-do, and those contracting the disease from healthcare settings had lower recovery. **Conclusions** Our study adds to the very limited evidence base on potential predictors of survival among confirmed CoVID-19 cases. We call additional research to explore the predictors of recovery and support development of policies to protect the vulnerable patient groups. ## Introduction For the first time, a novel coronavirus disease 2019 (CoVID-19) originating from Wuhan in China was reported to the World Health Organization in December of 2019.1 This novel coronavirus has taken the form of a major pandemic and has affected almost all major nations in the world. There have been more than 3.6 million confirmed cases and about 252,000 deaths as of May 05, 2020.2 The very first CoVID-19 case was diagnosed in the Republic of Korea (South Korea) on January 20, 2020.3 During the first two months of this global epidemic, South Korea had the second highest cases globally following China. According to the Korea Centers for Disease Control and Prevention (KCDC), there have been 10,804 confirmed cases and 254 deaths due to CoVID-19 as of May 5, 20204 We present the epidemiology of CoVID-19 in the Republic of Korea using data from Korea Centers for Disease Control and Prevention and identify the predictors of recovery from the disease. ## Materials and methods ### Data source The data were obtained from the Korea Centers for Disease Control and Prevention’s publicly shared sources. The dataset contains information about 3,388 confirmed COVID-19 cases in the Republic of Korea from January 20, 2020 through April 30, 2020. After excluding cases with missing values, 3,299 cases were included in the analysis. ### Variables A confirmed case was defined as a person with laboratory confirmed positive test. The data contained the following patient details – age (in groups), sex, province, date of diagnosis, mode of exposure and outcome. There were 11 age groups – below 10 years, 10-19 years, 20-29 years, 30-39 years, 40-49 years, 50-59 years, 60-69 years, 70-79 years, 80-89 years, 90-99 years and above 100 years. We combined the last two age groups to create 90 years and above, and thus recategorized age to 10 groups. All seventeen provinces of the Republic of Korea were represented – Busan, Chungcheongbuk-do, Chungcheongnam-do, Daegu, Daejeon, Gangwon-do, Gwangju, Gyeonggi-do, Gyeongsangbuk-do, Gyeongsangnam-do, Incheon, Jeju-do, Jeollabuk-do, Jeollanam-do, Sejong, Seoul, and Ulsan. We categorized the dates of diagnosis by weeks, and they were as follows – 20-26 Jan 2020, 27 Jan-02 Feb 2020, 03-09 Feb 2020, 10-16 Feb 2020, 17-23 Feb 2020, 24 Feb-01 Mar 2020, 02-08 Mar 2020, 09-15 Mar 2020, 16-22 Mar 2020, 23-29 Mar 2020, 30 Mar-05 Apr 2020, 06-12 Apr 2020, 13-19 Apr 2020, 20-26 Apr 2020, 27-30 Apr 2020. Patients were exposed to potential CoVID-19 sources in multiple settings. The settings were nursing home, hospital, religious gathering, call center, community center, shelter and apartment, gym facility, overseas inflow, contact with patients and others. There were three outcomes – death, recovery and isolation. The confirmed patients after spending some days in isolation were retested. They were considered as recovered only after receiving a negative COVID-19 test. ### Statistical methods We undertook descriptive analyses for the patient characteristics and presented the results stratified by sub-groups for each characteristic. Correlation was tested among all patient characteristics with the Pearson’s correlation coefficient. Associations between recovery from CoVID-19 and predictors (age group, sex, province and exposure) were estimated using a multivariable logistic regression model. We considered associations statistically significant if the p-value was below 0.05. The statistical analyses were performed using Python programming language Version 3.7 (Python Software Foundation, Wilmington, DE, USA) and Stata Version 15 (StataCorp LLC. College Station, TX). ## Results ### Pattern of the epidemic As shown in figure 1, the first case of CoVID-19 was confirmed on January 20, 2020. There were a few daily cases of new infections for about a month. After a month, the curve suddenly rose starting February 19, 2020 to reach the peak around end of February and early March. It reached it peak on the 29th of February with 813 confirmed cases. Though the curve descended after this date, still there were on an average 200 daily new confined cases until March 11, 2020. The curve continued its downward trend, however, adding at least 100 new daily cases through April 05, 2020. Towards the end of April, daily new confirmed cases were below 10. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/05/11/2020.05.07.20094094/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2020/05/11/2020.05.07.20094094/F1) Figure 1. Daily new confirmed CoVID-19 cases in the Republic of Korea between January 20, 2020 and April 30, 2020 ### Patient profile Table 1 shows the profile of the patients. Out of 3,299 confirmed patients, a slightly more than half were females (56%). While there were cases from all age groups, the maximum patients were from 20-29 years (24.3%), followed by 50-59 years (18.1%), 40-49 years (13.8%), 30-39 years (13.3%) and 60-69 years (12.2%). Three provinces – Gyeongsangbuk-do (36.9%), Gyeonggi-do (20.5%) and Seoul (17.1%) – together accounted for the maximum patients. With respect to the exposure, it was unknown for the most (44%) followed by direct contact with patients (29%), from overseas (16.8%) and religious gathering (4.9%). According to this available data source, 85% percent of the patients were confirmed of their diagnosis between 24 February and 05 April of 2020. There were 61 deaths accounting for 2.1 percent (case fatality rate) of the patients. More than half were isolated (56.3%) and 41.6% recovered. View this table: [Table 1.](http://medrxiv.org/content/early/2020/05/11/2020.05.07.20094094/T1) Table 1. Sample characteristics (N=3,299) ### Predictors of recovery As shown in figure 2, there were strong no correlations between the predictors. Compared to younger age groups (table 2), older patients had lower recovery – 70-79 years (adjusted odds ratio 0.31; p value 0.01), 80-89 years (aOR 0.22; p value 0.001) and 90 years and above (aOR 0.13; p value <0.001). Provinces such as Daegu, Gyeonggi-do, Gyeongsangbuk-do, Jeju-do, Jeollabuk-do and Jeollanam-do had statistically significant lower recovery rates than Busan. When compared with exposure from nursing homes, patients who were exposed to COVID-19 infection from religious gatherings, community dwellings, and others had higher recovery rates. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/05/11/2020.05.07.20094094/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2020/05/11/2020.05.07.20094094/F2) Figure 2. Correlation among predictors View this table: [Table 2.](http://medrxiv.org/content/early/2020/05/11/2020.05.07.20094094/T2) Table 2. Predictors of recovery ## Discussion Due to multipronged approaches (proactive surveillance, higher testing, isolation, quarantine, use of technology, masks and social distancing campaigns) by the government, incidence of new cases came down sharply in South Korea by mid-March and further to less than 10 new cases by mid-April.5 Our study shows that females constituted the majority of confirmed cases, whereas males accounted for most of the confirmed cases in China and Italy.6–9 Around a fourth of the cases were from 20-29 years age group unlike in most other countries where the infected were older.6,7 The possible reason for higher representation of younger population in our sample could be specific exposure to cluster of cases through participation in religious activities or workplaces.5,10 The case fatality rate was much lower (2.1%) compared to other countries such as Italy (13.3%) and China (4%). Similar to findings from several other countries, we found the elderly to be more vulnerable with lower probabilities of recovery.6,8,11> It is quite possible that presence of pre-existing medical conditions in elderly predispose them to delayed recovery. We also found cases contracting the infection in non-healthcare setting had higher recovery. While there is no such evidence currently, there could be a possibility that the exposure outside non-healthcare setting might have involved relative younger and healthier cases. Considering our study findings, we suggest additional measures to protect the vulnerable cases who are less likely to recover from the infection. Thus, elderly and cases contracting infection from healthcare settings should be given special attention. Our study has two potential limitations. First, we used publicly available data of only a third of confirmed cases in the country. Thus, we are unable to ascertain the representativeness of the data for all confirmed cases in South Korea. So, the findings will have to be interpreted with caution. Secondly, the data lacks information of patients’ symptoms and clinical features. Inclusion of these potential predictors would have enhanced the relevance of this study further. Despite these limitations, our study adds to the very limited evidence base on potential predictors of recovery among confirmed CoVID-19 cases.12 However, we believe the evidence base be strengthened with further relevant research as authorities make more data publicly available or through primary hospital based studies. ## Conclusions The CoVID-19 pandemic has emerged as a great threat to global health challenging health systems across the world to efficiently deal with this situation. Emerging evidence on vulnerability to COVID-19 and predictors of recovery will inform providers and policy makers to effectively triage and prioritize limited resources. Therefore, we call for additional research to explore the predictors of recovery and support development of policies to protect the vulnerable patient groups. ## Data Availability Study uses publicly available data from Korea CDC ## Data Availability The data used to support the findings of this study are available publicly through the Korea Centers for Disease Control and Prevention. ## Conflicts of interest The authors declare that there is no conflict of interest. The views expressed in the paper are that of the authors and do not reflect that of their affiliations. This particular work was conducted outside of the authors’ organizational affiliations. ## Funding statement This study did not receive funding from any source. ## Acknowledgements We are grateful to Korea Centers for Disease Control and Prevention for making this data publicly available. * Received May 7, 2020. * Revision received May 7, 2020. * Accepted May 11, 2020. * © 2020, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/) ## REFERENCES 1. 1.WHO. WHO Coronavirus disease (COVID-2019) situation reports 2020. 2. 2.COVID-19 Dashboard by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University (JHU). Available at: [https://gisanddata.maps.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6](https://gisanddata.maps.arcgis.com/apps/opsdashboard/index.html#/bda7594740fd40299423467b48e9ecf6) (accessed on May 05, 2020). 3. 3.Ryu, S. & Chun, B. C. Epidemiological characteristics of 2019 novel coronavirus: an interim review. Epidemiol. Health (2020). doi:10.4178/epih.e2020006 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.4178/epih.e2020006&link_type=DOI) 4. 4.KCDC. Korea Centers for Disease Control and Prevention; Seoul, Korea: 2020. 2020 5. 5.Jeong, E. K. et al. Coronavirus disease-19: The first 7,755 cases in the Republic of Korea. Osong Public Heal. Res. Perspect. (2020). doi:10.24171/j.phrp.2020.11.2.05 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.24171/j.phrp.2020.11.2.05&link_type=DOI) 6. 6.Livingston, E. & Bucher, K. Coronavirus Disease 2019 (COVID-19) in Italy. JAMA (2020). doi:10.1001/jama.2020.4344 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.2020.4344&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32181795&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F11%2F2020.05.07.20094094.atom) 7. 7.Guan, W. et al. Clinical Characteristics of Coronavirus Disease 2019 in China. N. Engl. J. Med. (2020). doi:10.1056/nejmoa2002032 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/nejmoa2002032&link_type=DOI) 8. 8.Li, Q. et al. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. New England Journal of Medicine (2020). doi:10.1056/NEJMoa2001316 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMoa2001316&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=31995857&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F11%2F2020.05.07.20094094.atom) 9. 9.Chen, N. et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet (2020). doi:10.1016/S0140-6736(20)30211-7 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(20)30211-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32007143&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F11%2F2020.05.07.20094094.atom) 10. 10.Park, S. Y. et al. Coronavirus Disease Outbreak in Call Center, South Korea. Emerg. Infect. Dis. (2020). doi:10.3201/eid2608.201274 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3201/eid2608.201274&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32324530&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F05%2F11%2F2020.05.07.20094094.atom) 11. 11.Nikpouraghdam, M. et al. Epidemiological characteristics of coronavirus disease 2019 (COVID-19) patients in IRAN: A single center study. J. Clin. Virol. (2020). doi:10.1016/j.jcv.2020.104378 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jcv.2020.104378&link_type=DOI) 12. 12.Al-Rousan, N. & Al-Najjar, H. Data Analysis of Coronavirus CoVID-19 Epidemic in South Korea Based on Recovered and Death Cases. J. Med. Virol. (2020). doi:10.1002/jmv.25850 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/jmv.25850&link_type=DOI)