ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Brief Report
Revised

Expected immune recognition of COVID-19 virus by memory from earlier infections with common coronaviruses in a large part of the world population

[version 2; peer review: 2 approved]
PUBLISHED 17 Jul 2020
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Emerging Diseases and Outbreaks gateway.

This article is included in the Coronavirus collection.

Abstract

SARS-CoV-2 is the coronavirus agent of the COVID-19 pandemic causing high mortalities. In contrast, the widely spread human coronaviruses OC43, HKU1, 229E, and NL63 tend to cause only mild symptoms. The present study shows, by in silico analysis, that these common human viruses are expected to induce immune memory against SARS-CoV-2 by sharing protein fragments (antigen epitopes) for presentation to the immune system by MHC class I. A list of such epitopes is provided. The number of these epitopes and the prevalence of the common coronaviruses suggest that a large part of the world population has some degree of specific immunity against SARS-CoV-2 already, even without having been infected by that virus. For inducing protection, booster vaccinations enhancing existing immunity are less demanding than primary vaccinations against new antigens. Therefore, for the discussion on vaccination strategies against COVID-19, the available immune memory against related viruses should be part of the consideration.

Keywords

Coronavirus, COVID-19, SARS-CoV-2, OC43, HKU1, 229E, NL63, MHC class I, Immunology, Vaccination

Revised Amendments from Version 1

Based on requests of the reviewers, we now have added more background information and references to the article. In Table 1, we added software predictions for HLA-C binding. In the Notifications section, in line with the comments by the reviewers, we additionally included some new information that appeared after the initial submission of our article.

See the authors' detailed response to the review by Anna Gil and Liisa K. Selin
See the authors' detailed response to the review by Andrea J. Sant

Introduction

SARS-CoV-2 and other human coronaviruses

From the end of 2019, the world experienced the coronavirus disease 2019 (COVID-19) pandemic caused by the emerging severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2; aka 2019 novel coronavirus or 2019-nCoV). SARS-CoV-2 shares ~80% nucleotide identity with SARS-CoV-1 (aka SARS-CoV), the causative agent of the SARS epidemy from 2002, and is even more similar to some coronaviruses in bats (Andersen et al., 2020; Ceraolo & Giorgi, 2020; Wu et al., 2020; Zhou et al., 2020). Coronaviruses are membrane-enveloped positive-strand RNA viruses with, for an RNA virus, a large genome of ~30 kb. That genome encodes several structural components of the virion including the nucleocapsid protein N and the membrane proteins S (spike), M, and E, plus also a number of nonstructural proteins involved in RNA replication and other—partly unknown—functions (Weiss & Navas-Martin, 2005). The coronaviruses infecting humans belong to the serological/phylogenetic clades group I (alphacoronaviruses) and group II (betacoronaviruses); group I includes HCoV-229E (human coronavirus 229E) and HCoV-NL63, while group II includes SARS-CoV-1, SARS-CoV-2, Middle East respiratory syndrome coronavirus (MERS-CoV), HCoV-OC43, and HCoV-HKU1. The viruses SARS-CoV-1 and MERS-CoV, on average, cause the most severe symptoms, and their outbreaks were successfully monitored and halted. At the other end of the spectrum, the viruses HCoV-229E, HCoV-NL-63, HCoV-OC43, and HCoV-HKU1 tend to cause only mild symptoms and are very common.

Prevalence and associated disease of the common human coronaviruses 229E, NL63, OC43, and HKU1

The Centers for Disease Control and Prevention (CDC; https://www.cdc.gov/coronavirus/general-information.html) states: “Common human coronaviruses, including types 229E, NL63, OC43, and HKU1, usually cause mild to moderate upper-respiratory tract illnesses, like the common cold. Most people get infected with one or more of these viruses at some point in their lives.” The same agency lists the common symptoms caused by these viruses as runny nose, sore throat, headache, fever, cough, and general feeling of being unwell, but also explains that they occasionally cause lower-respiratory tract illnesses, such as pneumonia or bronchitis. The viruses 229E and OC43 have been known since the 1960s (reviewed in Kahn & McIntosh, 2005), but NL63 (van der Hoek et al., 2004) and HKU1 (Woo et al., 2005) were only (conclusively) identified following the rise in interest in coronaviruses in the wake of the SARS epidemy. These common coronaviruses are believed to be the second most common cause of the common cold (Mäkelä et al., 1998). In the U.S.A., a 3-year RT-PCR surveillance of respiratory samples of patients revealed that the four viruses 229E, NL63, OC43, and HKU1 were present at levels varying by season and region, with all individual viruses peaking at >3% prevalence in each investigated region (Midwest, Northeast, South, West); co-infection with other coronaviruses was found in only ~2% of infected cases, but co-infection with another respiratory virus was found in a substantial ~30% of infected cases (Killerby et al., 2018). This pattern was reminiscent of findings in the United Kingdom (Gaunt et al., 2010) and Japan (Matoba et al., 2015). Serological investigations in countries as diverse as the U.S.A. (Bradburne & Somerset, 1972; Dijkman et al., 2012), China (Zhou et al., 2013), and Qatar (Al Kahlout et al., 2019), found that most healthy blood donors had antibodies against coronaviruses, supporting that these viruses are widespread indeed.

Since immune memory protection can be induced by related pathogens, as exemplified by the eradication of human smallpox virus (Variola) by immunization with a related “cowpox” virus (Vaccinia) (Plotkin & Plotkin, 2018), it is interesting to consider whether common human coronavirus infections may have induced some level of protection against SARS-CoV-2.

The possibility of matching linear epitopes between SARS-CoV-2 and the common human coronaviruses that may stimulate the immune system through MHC class I presentation

The major arms of immune memory concern antibody secretion by B cells, killing of infected cells by CD8+ T cells, and helper/regulatory immune activities (e.g. cytokine secretion) by CD4+ T cells. For a murine coronavirus infection in mouse, both antibody responses and cell-mediated cytotoxicity were needed to efficiently control the virus (reviewed by Weiss & Navas-Martin, 2005). In SARS-CoV-1-infected patients, B cell as well as T cell responses were observed (Li et al., 2008), and, in animal models of SARS, B cell responses (Bisht et al., 2005) as well as CD4+ and CD8+ T cell responses (Channappanavar et al., 2014; Liu et al., 2017; Zhao et al., 2010; Zhao et al., 2016) were shown to have protective value. Notably, in most individuals who had recovered from SARS, SARS-CoV-1-specific memory CD8+ T cells persisted for up to 6 years after SARS-CoV-1 infection whereas memory B cells and antivirus antibodies generally became undetectable (Tang et al., 2011).

Based on theoretical considerations alone, it is difficult to predict effective B cell memory across different virus species (Qiu et al., 2020), which makes it a poor topic for our present study which is based on sequence comparisons. We just note that a recent study concluded that sera from people that likely had been infected with the common human coronaviruses 229E, NL63, OC43, and/or HKU1, possessed no or negligible cross-reactivity with SARS-CoV-2 virus S protein (Amanat et al., 2020) and thus probably possess no neutralizing antibodies.

There may be some recognition of SARS-CoV-2 epitopes by CD4+ T cell memory derived from previous infections with common human coronaviruses. However, as discussed in the Results and Discussion section, the very limited lengths of identical sequence stretches between the viruses make theoretical predictions of such epitopes difficult, and therefore the current study only concentrates on potential CD8+ T cell memory.

For inducing CD8+ T cell memory, the core requirement is merely that an identical short peptide is presented by major histocompatibility complex (MHC) class I (MHC-I) molecules. MHC-I molecules present peptide fragments from intracellular proteins, thus also from viral proteins, at the cell surface for screening by CD8+ cytotoxic T cells (Neefjes et al., 2011). CD8+ T cells recognize the combination of MHC-I molecule with peptide by T cell receptors (TCR) that are unique per T cell clone, and if stimulated these clones can proliferate, kill the presenting (virus-infected) cell, and produce memory cells. MHC-I molecules are polymorphic in that they are represented by many diverse allelic forms that differ between human populations and individuals (Robinson et al., 2020), and mostly bind peptides of 9 amino acids (aa) length in their binding groove which is closed at either end (Bjorkman et al., 1987; Rammensee et al., 1995; Schellens et al., 2015).

In the present study, we analyzed whether there are linear 9 aa epitopes that are identical between proteins encoded by SARS-CoV-2 and one or more of the common human coronaviruses. We found many of such epitopes indeed, and, by using prediction software, found that some are expected to bind well to certain MHC-I alleles. We therefore expect that common human coronaviruses can induce some level of CD8+ T cell-mediated immune memory recognizing SARS-CoV-2, and consider the possibility of enhancing that immune memory by vaccination.

Methods

Proteins encoded by a reported genomic sequence for SARS-CoV-2 (GenBank MN908947; Wu et al., 2020) were compared with those for HCoV-OC43 (NC_005147; Vijgen et al., 2005), HCoV-HKU1 (NC_006577; Woo et al., 2005), HCoV-229E (NC_002645; Thiel et al., 2001), and HCoV-NL63 (NC_005831; van der Hoek et al., 2004) by performing BLAST homology searches at the NCBI database (https://blast.ncbi.nlm.nih.gov/Blast.cgi) and by making multiple sequence alignments using CLUSTALW software (https://www.genome.jp/tools-bin/clustalw); continuous stretches of 9 aa acids identical between SARS-CoV-2 and one of the other viruses were identified manually. All these shared 9 aa epitopes were screened by ANN 4.0 software at IEDB Analysis Resource (http://tools.immuneepitope.org/mhci/) for prediction of their affinity to a set of representative human MHC-I alleles.

Results and discussion

Table 1 lists the 9 aa epitopes that are identical between proteins encoded by SARS-CoV-2 and one or more of the common human coronaviruses. Many identical >9 aa stretches were found with ORF1ab encoded polyprotein, one such identical stretch (of 12 aa) was found with the N protein of the other two type II coronaviruses HCoV-OC43 and HCoV-HKU1, and no such stretches were found when comparing with any of the other gene products; ORF1ab-derived mature proteins with such stretches, expected from cleavage of the polyprotein precursor (Wu et al., 2020), were the transmembrane protein nonstructural protein 4 (NSP4), 3C-like cysteine protease NSP5, RNA binding protein NSP9, RNA dependent RNA polymerase NSP12, helicase NSP13, 3’-to-5’ exonuclease NSP14, nidoviral endoribonuclease specific for U NSP15, and S-adenosylmethionine-dependent ribose 2’-O-methyltransferase NSP16 (Table 1). Sequence alignment figures of the ORF1ab and N proteins are shown in Extended data (Dijkstra, 2020a) with highlighting of the interesting epitopes. It is of note that the S protein, which is the prime candidate for inducing neutralizing antibodies (Cohen, 2020), is poorly suitable for inducing an MHC-I-restricted immune memory across the investigated viral species as between S protein of SARS-CoV-2 and S proteins of the common human coronaviruses there are no 9 aa matches, and, among the virus isolates compared in this study, only a single 8 aa match (DRLITGRL with HCoV-NL63 and -229E) (not shown).

Table 1. Stretches of 9 consecutive amino acids that are identical between SARS-CoV-2 and at least one of the common human coronaviruses.

Compared sequences were derived from the following GenBank accessions: for ORF1ab SARS-CoV-2, QHD43415; OC43, NP_937947; HKU1, YP_173236; 229E, NP_073549; NL63, YP_003766; and for N SARS-CoV-2, QHD43423; OC43, NP_937954; HKU1, YP_173242; 229E, NP_073556; NL63, YP_003771 Source positions indicate the N-terminal position of the depicted 9 aa sequence in the SARS-CoV-2 ORF1ab protein or N protein (see Supplementary file 1). The ORF1ab protein is only a precursor polyprotein, and the column Mature protein indicates the probable mature protein that possesses the epitope: 3CLpro, 3C-like cysteine protease; RdRp, RNA dependent RNA polymerase; Hel, helicase; ExoN, 3’-to-5’ exonuclease; NendoU, nidoviral endoribonuclease specific for U; O-MT, S-adenosylmethionine-dependent ribose 2’-O-methyltransferase. Yellow blocks indicate the presence of identical sequences in the respective common human coronavirus, and gray blocks indicate the absence of such matches. Orange blocks highlight those peptides with predicted IC50 values of <500 nM for one of the twelf investigated MHC-I alleles.

No predicted IC50 values of <500 nM were found for HLA-C*0401

common human
coronaviruses
IC50 prediction by ANN 4.0 program of IEDB software (only IC50 valuesof <500 nM are shown)
MatureGroup IIGroup IHLA-AHLA-BHLA-C
SARS-CoV-2
source
proteinSequence
(9aa)
OC43HKU1229ENL63*0101*0201*0301*2402*2601*0702*0801*1501*2705*3901*4001*5801*0303*0401
ORF1ab 3004NSP4WVLNNDYYR
ORF1ab 3005NSP4VLNNDYYRS
ORF1ab 3006NSP4LNNDYYRSL
ORF1ab 3007NSP4NNDYYRSLP
ORF1ab 3008NSP4NDYYRSLPG
ORF1ab 3289NSP5 3CLproTLNGLWLDD
ORF1ab 3299NSP5 3CLproVYCPRHVIC
ORF1ab 4208NSP9ELEPPCRFV
ORF1ab 4551NSP12 RdRpKKDWYDFVE
ORF1ab 4552NSP12 RdRpKDWYDFVEN
ORF1ab 4553NSP12 RdRpDWYDFVENP
ORF1ab 4554NSP12 RdRpWYDFVENPD
ORF1ab 4555NSP12 RdRpYDFVENPDI
ORF1ab 4594NSP12 RdRpVGVLTLDNQ
ORF1ab 4595NSP12 RdRpGVLTLDNQD
ORF1ab 4596NSP12 RdRpVLTLDNQDL
ORF1ab 4597NSP12 RdRpLTLDNQDLN
ORF1ab 4598NSP12 RdRpTLDNQDLNG
ORF1ab 4599NSP12 RdRpLDNQDLNGN
ORF1ab 4608NSP12 RdRpWYDFGDFIQ
ORF1ab 4609NSP12 RdRpYDFGDFIQT
ORF1ab 4661NSP12 RdRpDLLKYDFTE
ORF1ab 4725NSP12 RdRpIFVDGVPFV158 nM
ORF1ab 4726NSP12 RdRpFVDGVPFVV10 nM143 nM
ORF1ab 4727NSP12 RdRpVDGVPFVVS
ORF1ab 4799NSP12 RdRpFQTVKPGNF
ORF1ab 4800NSP12 RdRpQTVKPGNFN
ORF1ab 4931NSP12 RdRpTQMNLKYAI278 nM21 nM
ORF1ab 4932NSP12 RdRpQMNLKYAIS
ORF1ab 4933NSP12 RdRpMNLKYAISA
ORF1ab 4934NSP12 RdRpNLKYAISAK198 nM
ORF1ab 4935NSP12 RdRpLKYAISAKN
ORF1ab 4936NSP12 RdRpKYAISAKNR
ORF1ab 4937NSP12 RdRpYAISAKNRA
ORF1ab 4938NSP12 RdRpAISAKNRAR
ORF1ab 4939NSP12 RdRpISAKNRART
ORF1ab 4940NSP12 RdRpSAKNRARTV58 nM
ORF1ab 4941NSP12 RdRpAKNRARTVA
ORF1ab 4942NSP12 RdRpKNRARTVAG
ORF1ab 4943NSP12 RdRpNRARTVAGV468
nM
ORF1ab 4944NSP12 RdRpRARTVAGVS
ORF1ab 4945NSP12 RdRpARTVAGVSI219
nM
ORF1ab 5006NSP12 RdRpLMGWDYPKC
ORF1ab 5007NSP12 RdRpMGWDYPKCD
ORF1ab 5008NSP12 RdRpGWDYPKCDR
ORF1ab 5009NSP12 RdRpWDYPKCDRA
ORF1ab 5010NSP12 RdRpDYPKCDRAM
ORF1ab 5011NSP12 RdRpYPKCDRAMP
ORF1ab 5012NSP12 RdRpPKCDRAMPN
ORF1ab 5043NSP12 RdRpRFYRLANEC
ORF1ab 5044NSP12 RdRpFYRLANECA
ORF1ab 5045NSP12 RdRpYRLANECAQ
ORF1ab 5046NSP12 RdRpRLANECAQV53 nM
ORF1ab 5047NSP12 RdRpLANECAQVL26 nM
ORF1ab 5048NSP12 RdRpANECAQVLS
ORF1ab 5049NSP12 RdRpNECAQVLSE
ORF1ab 5066NSP12 RdRpYVKPGGTSS
ORF1ab 5067NSP12 RdRpVKPGGTSSG
ORF1ab 5068NSP12 RdRpKPGGTSSGD
ORF1ab 5069NSP12 RdRpPGGTSSGDA
ORF1ab 5070NSP12 RdRpGGTSSGDAT
ORF1ab 5071NSP12 RdRpGTSSGDATT
ORF1ab 5072NSP12 RdRpTSSGDATTA
ORF1ab 5074NSP12 RdRpSGDATTAYA
ORF1ab 5075NSP12 RdRpGDATTAYAN
ORF1ab 5076NSP12 RdRpDATTAYANS
ORF1ab 5077NSP12 RdRpATTAYANSV
ORF1ab 5078NSP12 RdRpTTAYANSVF245
nM
29 nM
ORF1ab 5079NSP12 RdRpTAYANSVFN466 nM
ORF1ab 5080NSP12 RdRpAYANSVFNI56 nM
ORF1ab 5082NSP12 RdRpANSVFNICQ
ORF1ab 5083NSP12 RdRpNSVFNICQA
ORF1ab 5084NSP12 RdRpSVFNICQAV85 nM
ORF1ab 5085NSP12 RdRpVFNICQAVT
ORF1ab 5086NSP12 RdRpFNICQAVTA
ORF1ab 5087NSP12 RdRpNICQAVTAN
ORF1ab 5088NSP12 RdRpICQAVTANV
ORF1ab 5140NSP12 RdRpYLRKHFSMM128
nM
134
nM
4 nM47 nM
ORF1ab 5141NSP12 RdRpLRKHFSMMI313
nM
ORF1ab 5142NSP12 RdRpRKHFSMMIL310 nM
ORF1ab 5143NSP12 RdRpKHFSMMILS
ORF1ab 5144NSP12 RdRpHFSMMILSD
ORF1ab 5145NSP12 RdRpFSMMILSDD
ORF1ab 5177NSP12 RdRpVLYYQNNVF25 nM
ORF1ab 5178NSP12 RdRpLYYQNNVFM
ORF1ab 5179NSP12 RdRpYYQNNVFMS
ORF1ab 5180NSP12 RdRpYQNNVFMSE
ORF1ab 5196NSP12 RdRpDLTKGPHEF
ORF1ab 5197NSP12 RdRpLTKGPHEFC
ORF1ab 5198NSP12 RdRpTKGPHEFCS
ORF1ab 5199NSP12 RdRpKGPHEFCSQ
ORF1ab 5200NSP12 RdRpGPHEFCSQH
ORF1ab 5201NSP12 RdRpPHEFCSQHT
ORF1ab 5202NSP12 RdRpHEFCSQHTM54 nM10 nM
ORF1ab 5203NSP12 RdRpEFCSQHTML
ORF1ab 5204NSP12 RdRpFCSQHTMLV
ORF1ab 5205NSP12 RdRpCSQHTMLVK227 nM
ORF1ab 5217NSP12 RdRpDYVYLPYPD
ORF1ab 5218NSP12 RdRpYVYLPYPDP
ORF1ab 5219NSP12 RdRpVYLPYPDPS
ORF1ab 5220NSP12 RdRpYLPYPDPSR
ORF1ab 5221NSP12 RdRpLPYPDPSRI
ORF1ab 5222NSP12 RdRpPYPDPSRIL
ORF1ab 5223NSP12 RdRpYPDPSRILG
ORF1ab 5224NSP12 RdRpPDPSRILGA
ORF1ab 5225NSP12 RdRpDPSRILGAG
ORF1ab 5226NSP12 RdRpPSRILGAGC
ORF1ab 5227NSP12 RdRpSRILGAGCF
ORF1ab 5228NSP12 RdRpRILGAGCFV153 nM
ORF1ab 5229NSP12 RdRpILGAGCFVD
ORF1ab 5230NSP12 RdRpLGAGCFVDD
ORF1ab 5248NSP12 RdRpIERFVSLAI221 nM
ORF1ab 5249NSP12 RdRpERFVSLAID
ORF1ab 5250NSP12 RdRpRFVSLAIDA
ORF1ab 5251NSP12 RdRpFVSLAIDAY477
nM
223 nM
ORF1ab 5252NSP12 RdRpVSLAIDAYP
ORF1ab 5253NSP12 RdRpSLAIDAYPL21 nM487 nM
ORF1ab 5349NSP13 HelLCCKCCYDH
ORF1ab 5350NSP13 HelCCKCCYDHV
ORF1ab 5371NSP13 HelPYVCNAPGC
ORF1ab 5372NSP13 HelYVCNAPGCD
ORF1ab 5373NSP13 HelVCNAPGCDV
ORF1ab 5387NSP13 HelLYLGGMSYY
ORF1ab 5388NSP13 HelYLGGMSYYC89 nM
ORF1ab 5450NSP13 HelCTERLKLFA303
nM
ORF1ab 5451NSP13 HelTERLKLFAA
ORF1ab 5452NSP13 HelERLKLFAAE
ORF1ab 5453NSP13 HelRLKLFAAET
ORF1ab 5559NSP13 HelLSAPTLVPQ
ORF1ab 5560NSP13 HelSAPTLVPQE
ORF1ab 5605NSP13 HelQGPPGTGKS
ORF1ab 5606NSP13 HelGPPGTGKSH
ORF1ab 5634NSP13 HelSHAAVDALC
ORF1ab 5635NSP13 HelHAAVDALCE
ORF1ab 5636NSP13 HelAAVDALCEK
ORF1ab 5637NSP13 HelAVDALCEKA
ORF1ab 5656NSP13 HelRIIPARARV
ORF1ab 5657NSP13 HelIIPARARVE
ORF1ab 5658NSP13 HelIPARARVEC213
nM
ORF1ab 5716NSP13 HelRAKHYVYIG
ORF1ab 5717NSP13 HelAKHYVYIGD
ORF1ab 5718NSP13 HelKHYVYIGDP
ORF1ab 5719NSP13 HelHYVYIGDPA
ORF1ab 5720NSP13 HelYVYIGDPAQ312 nM
ORF1ab 5721NSP13 HelVYIGDPAQL206 nM
ORF1ab 5722NSP13 HelYIGDPAQLP
ORF1ab 5723NSP13 HelIGDPAQLPA
ORF1ab 5724NSP13 HelGDPAQLPAP
ORF1ab 5725NSP13 HelDPAQLPAPR
ORF1ab 5771NSP13 HelEIVDTVSAL16 nM298 nM463 nM
ORF1ab 5772NSP13 HelIVDTVSALV114 nM
ORF1ab 5773NSP13 HelVDTVSALVY
ORF1ab 5775NSP13 HelTVSALVYDN
ORF1ab 5776NSP13 HelVSALVYDNK
ORF1ab 5777NSP13 HelSALVYDNKL
ORF1ab 5778NSP13 HelALVYDNKLK
ORF1ab 5779NSP13 HelLVYDNKLKA
ORF1ab 5832NSP13 HelKAVFISPYN
ORF1ab 5833NSP13 HelAVFISPYNS
ORF1ab 5834NSP13 HelVFISPYNSQ
ORF1ab 5835NSP13 HelFISPYNSQN
ORF1ab 5855NSP13 HelQTVDSSQGS
ORF1ab 5856NSP13 HelTVDSSQGSE
ORF1ab 5857NSP13 HelVDSSQGSEY
ORF1ab 5858NSP13 HelDSSQGSEYD
ORF1ab 5859NSP13 HelSSQGSEYDY
ORF1ab 5860NSP13 HelSQGSEYDYV
ORF1ab 5861NSP13 HelQGSEYDYVI
ORF1ab 5862NSP13 HelGSEYDYVIF
ORF1ab 5880NSP13 HelCNVNRFNVA
ORF1ab 5881NSP13 HelNVNRFNVAI
ORF1ab 5882NSP13 HelVNRFNVAIT
ORF1ab 5883NSP13 HelNRFNVAITR65 nM
ORF1ab 5884NSP13 HelRFNVAITRA
ORF1ab 5885NSP13 HelFNVAITRAK
ORF1ab 6031NSP14 ExoNPLQLGFSTG
ORF1ab 6198NSP14 ExoNDAIMTRCLA
ORF1ab 6199NSP14 ExoNAIMTRCLAV29 nM24 nM
ORF1ab 6307NSP14 ExoNCLFWNCNVD
ORF1ab 6320NSP14 ExoNNSIVCRFDT
ORF1ab 6321NSP14 ExoNSIVCRFDTR
ORF1ab 6322NSP14 ExoNIVCRFDTRV
ORF1ab 6323NSP14 ExoNVCRFDTRVL
ORF1ab 6341NSP14 ExoNGGSLYVNKH
ORF1ab 6342NSP14 ExoNGSLYVNKHA
ORF1ab 6343NSP14 ExoNSLYVNKHAF279 nM154 nM
ORF1ab 6344NSP14 ExoNLYVNKHAFH
ORF1ab 6345NSP14 ExoNYVNKHAFHT
ORF1ab 6346NSP14 ExoNVNKHAFHTP
ORF1ab 6347NSP14 ExoNNKHAFHTPA
ORF1ab 6389NSP14 ExoNDYVPLKSAT
ORF1ab 6390NSP14 ExoNYVPLKSATC
ORF1ab 6391NSP14 ExoNVPLKSATCI
ORF1ab 6392NSP14 ExoNPLKSATCIT
ORF1ab 6393NSP14 ExoNLKSATCITR
ORF1ab 6394NSP14 ExoNKSATCITRC331 nM
ORF1ab 6395NSP14 ExoNSATCITRCN
ORF1ab 6396NSP14 ExoNATCITRCNL
ORF1ab 6397NSP14 ExoNTCITRCNLG
ORF1ab 6398NSP14 ExoNCITRCNLGG
ORF1ab 6399NSP14 ExoNITRCNLGGA
ORF1ab 6400NSP14 ExoNTRCNLGGAV
ORF1ab 6401NSP14 ExoNRCNLGGAVC
ORF1ab 6682NSP15 NendoUYAFEHIVYG177 nM
ORF1ab 6698NSP15 NendoUGGLHLLIGL
ORF1ab 6746NSP15 NendoUVIDLLLDDF
ORF1ab 6747NSP15 NendoUIDLLLDDFV
ORF1ab 6839NSP16 O-MTMMNVAKYTQ
ORF1ab 6840NSP16 O-MTMNVAKYTQL259 nM
ORF1ab 6841NSP16 O-MTNVAKYTQLC
ORF1ab 6842NSP16 O-MTVAKYTQLCQ
ORF1ab 6843NSP16 O-MTAKYTQLCQY
ORF1ab 6844NSP16 O-MTKYTQLCQYL139 nM
ORF1ab 6845NSP16 O-MTYTQLCQYLN
ORF1ab 6846NSP16 O-MTTQLCQYLNT
ORF1ab 6869NSP16 O-MTGAGSDKGVA
ORF1ab 6870NSP16 O-MTAGSDKGVAP
ORF1ab 6871NSP16 O-MTGSDKGVAPG
ORF1ab 6872NSP16 O-MTSDKGVAPGT
ORF1ab 6922NSP16 O-MTWDLIISDMY
ORF1ab 6923NSP16 O-MTDLIISDMYD
ORF1ab 6924NSP16 O-MTLIISDMYDP
ORF1ab 6943NSP16 O-MTSKEGFFTYI
ORF1ab 6958NSP16 O-MTKLALGGSVA
ORF1ab 6959NSP16 O-MTLALGGSVAI11 nM
ORF1ab 6960NSP16 O-MTALGGSVAIK110 nM
ORF1ab 6961NSP16 O-MTLGGSVAIKI
ORF1ab 6962NSP16 O-MTGGSVAIKIT
ORF1ab 6963NSP16 O-MTGSVAIKITE
ORF1ab 6973NSP16 O-MTSWNADLYKL
ORF1ab 6974NSP16 O-MTWNADLYKLM
ORF1ab 6993NSP16 O-MTTNVNASSSE
ORF1ab 6998NSP16 O-MTSSSEAFLIG
ORF1ab 7024NSP16 O-MTHANYIFWRN
N 106NucleocapsidPRWYFYYLG
N 107NucleocapsidRWYFYYLGT
N 108NucleocapsidWYFYYLGTG
N 109NucleocapsidYFYYLGTGP

In Table 1 (for Excel format see Extended data) it is shown that there are >200 linear epitopes of 9 aa that are identical between SARS-CoV-2 and at least one of the common human coronaviruses, most of them with OC43 and HKU1 which, like SARS-CoV-2, belong to the group II coronaviruses. In a simplified model, if people would have been exposed to many of these epitopes through common HCoV infections, this kind of equals immunization by a small intracellular protein under natural viral infection conditions. Whereas live virus is commonly considered the gold standard in regard to inducing strong immunity, unless the virus has some tricks up its sleeve to manipulate the immune system, which for common human coronaviruses is not well investigated, a research grant proposal suggesting this as a vaccination strategy would probably fail. Reviewers of such proposal would righteously point out that the strategy would not induce neutralizing antibodies, which for combating some viral infections can be very important, and that for inducing MHC-I-restricted cell-mediated cytotoxicity memory, ideally, a much larger protein or more proteins should be taken. Those reviewers would conclude that for such small intracellular protein to induce strong immune memory it would be too dependent on the MHC alleles of the immunized person and would need too much luck in regard to immunogenicity. Nevertheless, those reviewers would probably also agree that in most persons thus vaccinated some (small) level of immune memory protection would be established, even with such small non-surface protein (e.g. Polakos et al., 2001; Wasmoen et al., 1995; Zhao et al., 2005). Regardless of that this obviously is not the ideal way to induce a population-wide strong protective immunity (see the spread of COVID-19), together with other factors such as health and the number of encountered viruses (the strength of the viral challenge), the induced immune memory could make a difference for whether a person gets sick; at the population scale, it so may somewhat reduce the virus reproduction number. Importantly, by stimulating this HCoV-derived MHC-I restricted immune memory by vaccination (see below), it may become a more significant helper in fighting COVID-19.

Software predictions of MHC-I-binding epitopes

Based on combinations of experimental results and computer learning, various software has been created that with some degree of reliability can predict how efficiently peptides can bind to the grooves of various MHC-I alleles. In the present study, we used the artificial neural network (ANN) function (Lundegaard et al., 2008) of the IEDB Analysis Resource (http://tools.immuneepitope.org/mhci/) (Dhanda et al., 2019) which may achieve >75% reliability for predicting binding (Lundegaard et al., 2008). The software designers state that IC50 values of <50 nM and <500 nM are considered high and intermediate affinity, respectively, and are found for most epitopes known to stimulate cytotoxic T cells. Therefore, Table 1 only indicates the predicted IC50 values if lower than 500 nM. Table 1 shows these expected affinities for fourteen MHC-I alleles that are rather representative for sets of MHC-I alleles with similar binding properties (supertypes) and so represent a large part of the human MHC-I binding repertoire (Doytchinova et al., 2004; Lund et al., 2004): HLA-A*0101 (supertype A1), HLA-A*0201 (A2), HLA-A*0301 (A3), HLA-A*2402 (A24), HLA-A*2601(A26), HLA-B*0702 (B7), HLA-B*0801 (B8), HLA-B*1501 (B62), HLA-B*2705 (B27), HLA-B*3901 (B39), HLA-B*4001 (B44), HLA-B*5801 (B58), HLA-C*0303 (C1), and HLA-C*0401 (C4). It is of note that Li et al. (2008) found that a SARS-CoV-1 15 aa peptide sequence (their “Replicase 4701-4715” peptide) encompassing the SARS-CoV-2/HCoV-shared ORF1ab4725 and ORF1ab4726 epitopes that are predicted to bind well to the MHC-I alleles HLA-A*0201 and HLA-B*3901 (see our Table 1) was associated with a CD8+ T cell response against SARS-CoV-1 in humans. However, Li et al. (2008) also found such CD8+ T cell response associated with a SARS-CoV-1 15 aa peptide (their “Nucleocapsid 106-120” peptide) encompassing the SARS-CoV-2/HCoV-shared N 106, N 107, N 108, and N 109 epitopes for which our analyses did not predict MHC-I binding (see our Table 1).

The MHC-I binding affinity is considered the most selective in determining which peptides are presented, but also steps in the peptide processing and loading pathways may play selective roles which are difficult to capture in prediction software (Nielsen et al., 2005). We argue that, if such steps would be selective for presentation, in most cases they would probably not differentiate between the 9 aa epitope in the SARS-CoV-2 context versus the respective HCoV context, since most of those epitopes are within stretches that also show many similarities in the neighboring residues (Extended data).

Not all stable complexes of MHC-I with non-self peptides elicit a strong immune response, but “immunogenicity” features are hard to predict with meaningful reliability by in silico analysis (Calis et al., 2013), and in the present study we refrain from such predictions. Table 1 should, foremost, be understood as evidence of principle and a list of promising peptides, whereas only future experiments can prove MHC-I-mediated immune memory involving these or other peptides.

In regard to SARS-CoV-2 recognition, the common human coronaviruses may also induce some MHC-II-mediated immune memory by CD4+ helper T cells (as an example for shared epitope use by different coronaviruses see Zhao et al., 2016). CD4+ helper T cells can help stimulate cells involved in antibody or cell-mediated cytotoxic immune responses (Neefjes et al., 2011). However, for this topic, in the present article, we have refrained from detailed (software) predictions because comparison of MHC-II epitopes across different viruses is harder than for MHC-I epitopes. Namely, although the core of MHC-II bound peptides is also only 9 aa, the surrounding amino acids are also part of the bound peptide that tends to be 12–25 aa (Brown et al., 1993; Rammensee et al., 1995; Stern & Wiley, 1994) and can affect how the peptide interacts with the receptors on the CD4+ helper T cells (Arnold et al., 2002).

Vaccination potential

Immune memory means that a secondary immune response, upon renewed encounter with the same pathogen, is faster and stronger than the primary immune response during the first encounter with the pathogen. This is based on expansion of specific B and T cell clones, which specifically recognize pathogen(-derived) epitopes, with some of those cells becoming memory cells (Paul, 2013). This principle also causes that for a booster vaccination/immunization the requirements for efficiently inducing an immune response are lower than for a first vaccination/immunization (e.g. Du et al., 2008; Goding, 1996; Schulze et al., 2008). Especially in elderly people, who have a decreased ability to mount adaptive immune responses against new antigens, vaccination that stimulates an immune memory response may be beneficial (Kaml et al., 2006; Reber et al., 2012; Wagner & Weinberger, 2020). As discussed above, people’s past infections with common coronaviruses probably did not induce a B cell memory for making antibodies that can neutralize SARS-CoV-2. However, as the current study shows by analysis of linear 9 aa epitopes, these common human coronaviruses are expected to induce CD8+ T cells that may potentially kill SARS-CoV-2-infected cells and so can help eradicate the virus. There are several possible ways to exploit this probable immune memory. For example, if using RNA for immunization (Cohen, 2020), it may be best to also include SARS-CoV-2 genes that encode MHC-I epitopes that match those of the common coronaviruses. Alternatively, delivery of these epitopes to the MHC-I presentation system may be tried by peptide or protein based vaccines (e.g. Kohyama et al., 2009; Slingluff, 2011; van Montfoort et al., 2014; Yadav et al., 2014), possibly in combination with some of the strategies that are currently being explored for non-specific stimulation of the immune system against COVID-19 (Kupferschmidt & Cohen, 2020). Protein (-coding) vaccines, for example encompassing a large part of the SARS-CoV-2 ORF1ab product, would have an advantage over peptide-vaccines by including multiple possible MHC-I and also MHC-II epitopes, and be less dependent on MHC-allele matching and the quality of software predictions. Naturally, as for any new vaccine strategy, it should be carefully assessed whether the benefits of the induced type of immunity outweigh the potential deleterious health effects caused by, for example, an increased inflammation response (Cohen, 2020; Weingartl et al., 2004). Another fundamental concern is the maximum level of protection that can be generated by vaccination against coronavirus infections in humans, considering that infection of volunteers with HCoV-229E live virus gave only partial protection upon infection with the same virus one year later (Callow et al., 1990). Additional questions specifically related to the contents of our study are whether the history of previous—especially recent—infections with common coronaviruses, or people’s MHC alleles, affect people’s resistance to SARS-CoV-2. Most definitely, if discussing possible strategies for vaccination against SARS-CoV-2, pre-existing MHC-I-based immunity derived from previous infections with common coronaviruses should be part of the consideration.

Notifications

Although we were not aware of this at the time of writing, a recent paper appeared with overlapping contents (Nguyen et al., 2020). The Nguyen et al. study was more complete on SARS-CoV-2 MHC epitope predictions and made an association with global MHC allele distributions. The advantage of our study is a more concentrated focus on the MHC-I mediated memory expected from previous coronavirus infections, and the vaccination potential deriving from that memory.

After we had submitted our study, two studies reported in vitro responses of T cells against SARS-CoV-2 peptides, which might represent memory from previous infections with common coronaviruses (Braun et al., 2020; Grifoni et al., 2020). However, both studies only used peptide mixes without identifying the responsible peptide, and at least several of the observed responses necessitated the allowance of peptide ligand sequence mismatches for T cell receptor to MHC/peptide binding (T cell cross-reactivity). Negative control donors, who with certainty had never been infected with common coronaviruses, were not available for the experiments, and conclusions that the observed responses were from T cell memory from previous coronavirus infections, and have in vivo relevance, should be considered only cautiously. Discussion of this topic is important because the two studies concluded a potential of the common coronavirus S proteins to induce CD4+ T cell memory (Braun et al., 2020; Grifoni et al., 2020) and CD8+ T cell memory (Grifoni et al., 2020), whereas these proteins do not share 9 aa identical stretches with SARS-CoV-2 (see our article and Supplementary Fig. 1 in Braun et al., 2020), and would arguably necessitate the allowance of peptide sequence mismatches (T cell cross-reactivity) for inducing an efficient MHC-mediated T cell response. As we pointed out in our article, although SARS-CoV-2 S protein is the prime vaccine component candidate for inducing neutralizing antibodies, for a more realistic chance to efficiently boost existing T cell memory it probably would be better to additionally include other SARS-CoV-2 proteins that do share identical MHC epitopes with common coronaviruses.

Regarding the potential of existing CD8+ T cell memory cells to help fight COVID-19 disease, a recent observation by Liao et al., (2020) might be interesting. Their study suggests that in COVID-19 patients with pneumonia, ZNF683+ CD8+ T cell clonal expansion may protect the patient from more severe disease.

Data availability

Underlying data

Severe acute respiratory syndrome coronavirus 2 isolate Wuhan-Hu-1, complete genome, Accession number MN908947: https://www.ncbi.nlm.nih.gov/nuccore/MN908947

Human coronavirus OC43, complete genome, Accession number NC_005147.1: https://www.ncbi.nlm.nih.gov/nuccore/NC_005147.1?report=genbank

Human coronavirus HKU1, complete genome, Accession number NC_006577: https://www.ncbi.nlm.nih.gov/nuccore/NC_006577

Human coronavirus 229E, complete genome, Accession number NC_002645: https://www.ncbi.nlm.nih.gov/nuccore/NC_002645

Human Coronavirus NL63, complete genome, Accession number NC_005831: https://www.ncbi.nlm.nih.gov/nuccore/NC_005831

Extended data

Harvard Dataverse: Extended data. Sequence alignments of SARS-CoV-2 ORF1ab and N proteins with their counterparts in the common human coronaviruses, https://doi.org/10.7910/DVN/CNPUPA (Dijkstra, 2020a).

Harvard Dataverse: Excel format version of Table 1. https://doi.org/10.7910/DVN/LOBKLV (Dijkstra, 2020b).

Data are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).

Comments on this article Comments (3)

Version 2
VERSION 2 PUBLISHED 17 Jul 2020
Revised
Version 1
VERSION 1 PUBLISHED 23 Apr 2020
Discussion is closed on this version, please comment on the latest version above.
  • Author Response 02 Jun 2020
    Johannes M. Dijkstra, Institute for Comprehensive Medical Science, Fujita Health University, Toyoake, 470-1192, Japan
    02 Jun 2020
    Author Response
    In this comment I would just like to state that we submitted this article to F1000Research on April 15 (Japanese time). F1000Research only lists the publication date (April 23) which ... Continue reading
  • Author Response 28 Apr 2020
    Johannes M. Dijkstra, Institute for Comprehensive Medical Science, Fujita Health University, Toyoake, 470-1192, Japan
    28 Apr 2020
    Author Response
    Dear Dr. Bercovier,

    Thank you for your comments and interesting links. Especially the data on the protective effects against common coronaviruses by previous infections by homologous virus are relevant.
    ... Continue reading
  • Reader Comment 27 Apr 2020
    Herve Bercovier, Hebrew University of Jerusalem, Faculty of Medicine Ein Kerem, Israel
    27 Apr 2020
    Reader Comment
    Dear Colleagues,
    I am in the middle of writing a similar paper and I would like you to take in consideration the following data that will make my paper unnecessary ... Continue reading
  • Discussion is closed on this version, please comment on the latest version above.
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Dijkstra JM and Hashimoto K. Expected immune recognition of COVID-19 virus by memory from earlier infections with common coronaviruses in a large part of the world population [version 2; peer review: 2 approved] F1000Research 2020, 9:285 (https://doi.org/10.12688/f1000research.23458.2)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 23 Apr 2020
Views
28
Cite
Reviewer Report 22 Jun 2020
Andrea J. Sant, Department of Microbiology and Immunology, University of Rochester Medical Center, Rochester, NY, 14642, USA;  David H. Smith Center for Vaccine Biology and Immunology, Rochester, NY, USA 
Approved
VIEWS 28
These analyses of potential cross reactive CD8 T cell epitopes between the current SARS-CoV-2 and “seasonal” endemic human CoV is useful and timely and the discussion is balanced.
 
There are several modifications that I believe would improve ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Sant AJ. Reviewer Report For: Expected immune recognition of COVID-19 virus by memory from earlier infections with common coronaviruses in a large part of the world population [version 2; peer review: 2 approved]. F1000Research 2020, 9:285 (https://doi.org/10.5256/f1000research.25889.r63246)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 17 Jul 2020
    Johannes M. Dijkstra, Institute for Comprehensive Medical Science, Fujita Health University, Toyoake, 470-1192, Japan
    17 Jul 2020
    Author Response
    Dear Dr. Sant,
     
    Thank you for reviewing our article. We highly appreciate your comment that our article is timely and well balanced. Especially the latter is important, since we ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 17 Jul 2020
    Johannes M. Dijkstra, Institute for Comprehensive Medical Science, Fujita Health University, Toyoake, 470-1192, Japan
    17 Jul 2020
    Author Response
    Dear Dr. Sant,
     
    Thank you for reviewing our article. We highly appreciate your comment that our article is timely and well balanced. Especially the latter is important, since we ... Continue reading
Views
36
Cite
Reviewer Report 22 Jun 2020
Anna Gil, University of Massachusetts Medical School, Worcester, MA, USA 
Liisa K. Selin, Department of Pathology, University of Massachusetts Medical School, Worcester, MA, USA 
Approved
VIEWS 36
This manuscript reports a very useful study that extends our knowledge of peptide-MHC recognition by CD8+ T cells during emerging virus infections such as SARS-CoV-2. Detailed in silico analysis showed the presence of potential epitopes shared between new types of  ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Gil A and Selin LK. Reviewer Report For: Expected immune recognition of COVID-19 virus by memory from earlier infections with common coronaviruses in a large part of the world population [version 2; peer review: 2 approved]. F1000Research 2020, 9:285 (https://doi.org/10.5256/f1000research.25889.r63655)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 17 Jul 2020
    Johannes M. Dijkstra, Institute for Comprehensive Medical Science, Fujita Health University, Toyoake, 470-1192, Japan
    17 Jul 2020
    Author Response
    Dear Dr. Gil and Dr. Selin,
     
    Thank you for your kindness to review our article. We are glad that you find our study very useful, and that you agree ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 17 Jul 2020
    Johannes M. Dijkstra, Institute for Comprehensive Medical Science, Fujita Health University, Toyoake, 470-1192, Japan
    17 Jul 2020
    Author Response
    Dear Dr. Gil and Dr. Selin,
     
    Thank you for your kindness to review our article. We are glad that you find our study very useful, and that you agree ... Continue reading

Comments on this article Comments (3)

Version 2
VERSION 2 PUBLISHED 17 Jul 2020
Revised
Version 1
VERSION 1 PUBLISHED 23 Apr 2020
Discussion is closed on this version, please comment on the latest version above.
  • Author Response 02 Jun 2020
    Johannes M. Dijkstra, Institute for Comprehensive Medical Science, Fujita Health University, Toyoake, 470-1192, Japan
    02 Jun 2020
    Author Response
    In this comment I would just like to state that we submitted this article to F1000Research on April 15 (Japanese time). F1000Research only lists the publication date (April 23) which ... Continue reading
  • Author Response 28 Apr 2020
    Johannes M. Dijkstra, Institute for Comprehensive Medical Science, Fujita Health University, Toyoake, 470-1192, Japan
    28 Apr 2020
    Author Response
    Dear Dr. Bercovier,

    Thank you for your comments and interesting links. Especially the data on the protective effects against common coronaviruses by previous infections by homologous virus are relevant.
    ... Continue reading
  • Reader Comment 27 Apr 2020
    Herve Bercovier, Hebrew University of Jerusalem, Faculty of Medicine Ein Kerem, Israel
    27 Apr 2020
    Reader Comment
    Dear Colleagues,
    I am in the middle of writing a similar paper and I would like you to take in consideration the following data that will make my paper unnecessary ... Continue reading
  • Discussion is closed on this version, please comment on the latest version above.
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.