A Machine Learning Explanation of the Pathogen-Immune Relationship of SARS-CoV-2 (COVID-19), and a Model to Predict Immunity and Therapeutic Opportunity: A Comparative Effectiveness Research Study

JMIRx Med. 2020 Oct 19;1(1):e23582. doi: 10.2196/23582. eCollection 2020 Jan-Dec.

Abstract

Background: Approximately 80% of those infected with COVID-19 are immune. They are asymptomatic unknown carriers who can still infect those with whom they come into contact. Understanding what makes them immune could inform public health policies as to who needs to be protected and why, and possibly lead to a novel treatment for those who cannot, or will not, be vaccinated once a vaccine is available.

Objective: The primary objectives of this study were to learn if machine learning could identify patterns in the pathogen-host immune relationship that differentiate or predict COVID-19 symptom immunity and, if so, which ones and at what levels. The secondary objective was to learn if machine learning could take such differentiators to build a model that could predict COVID-19 immunity with clinical accuracy. The tertiary purpose was to learn about the relevance of other immune factors.

Methods: This was a comparative effectiveness research study on 53 common immunological factors using machine learning on clinical data from 74 similarly grouped Chinese COVID-19-positive patients, 37 of whom were symptomatic and 37 asymptomatic. The setting was a single-center primary care hospital in the Wanzhou District of China. Immunological factors were measured in patients who were diagnosed as SARS-CoV-2 positive by reverse transcriptase-polymerase chain reaction (RT-PCR) in the 14 days before observations were recorded. The median age of the 37 asymptomatic patients was 41 years (range 8-75 years); 22 were female, 15 were male. For comparison, 37 RT-PCR test-positive patients were selected and matched to the asymptomatic group by age, comorbidities, and sex. Machine learning models were trained and compared to understand the pathogen-immune relationship and predict who was immune to COVID-19 and why, using the statistical programming language R.

Results: When stem cell growth factor-beta (SCGF-β) was included in the machine learning analysis, a decision tree and extreme gradient boosting algorithms classified and predicted COVID-19 symptom immunity with 100% accuracy. When SCGF-β was excluded, a random-forest algorithm classified and predicted asymptomatic and symptomatic cases of COVID-19 with 94.8% AUROC (area under the receiver operating characteristic) curve accuracy (95% CI 90.17%-100%). In total, 34 common immune factors have statistically significant associations with COVID-19 symptoms (all c<.05), and 19 immune factors appear to have no statistically significant association.

Conclusions: The primary outcome was that asymptomatic patients with COVID-19 could be identified by three distinct immunological factors and levels: SCGF-β (>127,637), interleukin-16 (IL-16) (>45), and macrophage colony-stimulating factor (M-CSF) (>57). The secondary study outcome was the suggestion that stem-cell therapy with SCGF-β may be a novel treatment for COVID-19. Individuals with an SCGF-β level >127,637, or an IL-16 level >45 and an M-CSF level >57, appear to be predictively immune to COVID-19 100% and 94.8% (AUROC) of the time, respectively. Testing levels of these three immunological factors may be a valuable tool at the point of care for managing and preventing outbreaks. Further, stem-cell therapy via SCGF-β and M-CSF appear to be promising novel therapeutics for patients with COVID-19.

Keywords: COVID-19; SARS-CoV-2; immunity; infectious disease; mass vaccinations; public health; stem-cell growth factor-beta; therapeutics.