Abstract
A novel severe acute respiratory syndrome coronavirus (SARS-CoV-2) is the source of a current pandemic (COVID-19) with devastating consequences for public health and economic stability. Using a peptide array to map the antibody response of plasma from healing patients, we identified immunodominant linear epitopes corresponding to key proteolytic sites on the spike protein.
Introduction
In December 2019, a novel infectious disease causing pneumonia-like symptoms was identified in the city of Wuhan in the province of Hubei (China) [1]. This new coronavirus infectious disease (COVID-19), caused by the severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), is having a devastating impact on public health and economic stability on a global scale [2]. The World Health Organization declared it a pandemic on 11 March 2020.
Mapping the epitopes corresponding to the immune system’s antibody response against the virus is important for vaccine development [3, 4], for diagnostic serological tests [4], and for identifying neutralizing antibodies with therapeutic potential [5]. Indeed, epitope mapping of SARS-CoV-1 revealed immunodominant epitopes and identified neutralizing antibodies [6-13]. However, the observation of antibody-dependent enhancement (ADE) of SARS-CoV-1 in non-human primates is concerning and should be considered for vaccine development [14, 15]. While ADE mechanisms arising from binding-only (non-neutralizing) antibodies are well documented, an ADE mechanism with neutralizing antibodies for the related MERS-CoV was also reported [16]. In this case, it was shown that neutralizing antibodies targeting the receptor-binding domain (RBD) of the virus redirected viral entry to Fc-expressing cells, broadening the range of host cells targeted. Thus, antibodies generated by vaccination against SARS-CoV-2 could enhance viral entry instead of offering protection, leading to vaccine-associated enhanced respiratory disease (VARED) [17].
The homology between SARS-CoV-1 and SARS-CoV-2 rapidly led to the hypothesis that neutralizing antibodies identified from patients in the 2003 SARS-CoV-1 epidemic could also neutralize SARS-CoV-2 [18, 19]. Other antibodies with neutralizing activities have been discovered through different methodologies [20-25]. The rapid propagation of SARS-CoV-2 stimulated several studies predicting the antigenic parts of the viral proteins in silico [26-32] and analyzing SARS-CoV-1 epitopes that are conserved in this new coronavirus [33-36]. More recently, the first reports of experimental epitope mapping of SARS-CoV-2 were deposited on repositories [37-42].
Herein we report the preparation of a microarray to map the antibody response to linear epitopes of the spike protein of SARS-CoV-2 and the analysis of 12 laboratory-confirmed COVID-19 cases and 6 negative controls using the described peptide microarray.
Materials and methods
Plasma specimens from COVID-19 and healthy patients
Anonymized leftovers of whole blood-EDTA were used for this method evaluation, in accordance with our institution’s ethical committee and national regulations. We included 12 real-time RT-PCR confirmed COVID-19 cases hospitalized at the University Hospitals of Geneva, and 6 unmatched negative blood samples from asymptomatic donors, obtained during the same period (April 2020). Analyses (see below) were performed within 72 h of blood sampling without any freeze-thaw cycle.
SARS-CoV-2 RT-PCR analyses and SARS-CoV-2 IgG serology
As previously published [43], SARS-CoV-2 RT-PCR was performed according to manufacturers’ instructions on various platforms, including BD SARS-CoV-2 reagent kit for BD Max system (Becton, Dickinson and Co, US) and Cobas 6800 SARS-CoV-2 RT-PCR (Roche, Switzerland).
SARS-CoV-2 IgG serology against the S1 domain of the spike protein of SARS-CoV-2 was assessed using the CE-marked Euroimmun IgG ELISA (Euroimmun AG, Lübeck, Germany # EI 2606-9601 G). EDTA-plasma was diluted 1:101 and assessed with the IgG ELISA according to the manufacturer’s instructions, as extensively reported elsewhere [43]. The median time from RT-PCR to serology testing was 3 weeks, which is why the samples were considered healing rather than convalescent plasma. All 12 COVID-19 samples were considered reactive against SARS-CoV-2.
Synthesis of the peptide-PNA conjugate library
The library of peptide-PNA conjugates was synthesized by automated synthesis on an Intavis peptide synthesizer as previously described [44, 45]. The synthesis was initiated with the peptide followed by the PNA tag, using a capping cycle after each coupling. Hence, truncated peptides cannot hybridize to the microarray since they lack the necessary tag. A library of 200 linear peptides was constructed based on the sequence of the spike protein ectodomain from SARS-CoV-2 (residues 1-1213; GenBank: QHD43416.1), fragmenting the protein into two sets of 100 peptides (12mer) with an overlap of 6 residues. Each peptide-PNA conjugate was positively identified by MALDI analysis. See SI for full synthetic details and characterization data.
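The two-set fragmentation scheme can be illustrated with a short sketch. This is not the authors' library design script; the function names and the toy 30-residue sequence are hypothetical, and the handling of residues left over at the end of the sequence is an assumption (here they are simply dropped).

```python
def tile_peptides(sequence, length=12, offset=0):
    """Cut a sequence into consecutive, non-overlapping windows.

    Returns (start, end, peptide) tuples with 1-based inclusive positions.
    Trailing residues that do not fill a whole window are dropped
    (an assumption made for this sketch).
    """
    tiles = []
    for i in range(offset, len(sequence) - length + 1, length):
        tiles.append((i + 1, i + length, sequence[i:i + length]))
    return tiles


def two_fold_library(sequence, length=12, shift=6):
    """Combine two tilings shifted by `shift` residues (1-12, 7-18, 13-24, ...)."""
    set_a = tile_peptides(sequence, length, 0)
    set_b = tile_peptides(sequence, length, shift)
    return sorted(set_a + set_b)


# Hypothetical toy sequence, NOT the spike protein sequence.
toy = "ACDEFGHIKLMNPQRSTVWYACDEFGHIKL"
for start, end, peptide in two_fold_library(toy):
    print(f"{start}-{end}\t{peptide}")
```

With this layout every residue away from the termini is covered by two peptides, one from each set, so an epitope that is split by a window boundary in one set falls inside a window of the other.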
Microarray epitope mapping
Microarrays were obtained from Agilent (Custom microarray slides, Agilent ref:0309317100-100002). Each peptide-PNA is complementary to a DNA sequence that is present 23 times at random positions on the array.
The arrays were incubated with plasma (1:150 dilution) for 1 hour at room temperature, washed with PBS-T and dried by centrifugation prior to the next step. The arrays were then incubated with Cy-3 labeled goat anti-human IgG (ab97170 from Abcam, 1:500 dilution) for 30 min, washed with PBS-T and dried by centrifugation for scanning. See SI for detailed procedures.
The fluorescence intensity on the array was measured on a GenePix 4100A microarray scanner using the median value of fluorescence intensity. The data for each peptide (23 spots) were plotted as a heat map of the median value from the 23 spots. The high redundancy in the measurements and the use of a median function ensure that artifacts from a microarray experiment do not contribute to the consolidated data.
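The robustness of the median to spot-level artifacts can be shown with a minimal sketch. The data layout (a dictionary mapping a peptide identifier to its replicate intensities), the identifier, and the intensity values are hypothetical and are not taken from the scanner output.

```python
from statistics import mean, median


def consolidate_spots(spot_intensities):
    """Map each peptide id to the median intensity of its replicate spots."""
    return {pep: median(values) for pep, values in spot_intensities.items()}


# Hypothetical example: 22 well-behaved spots and one artifact (e.g. a dust speck).
spots = {"pep_001": [100] * 22 + [60000]}

print(consolidate_spots(spots)["pep_001"])  # median ignores the single artifact
print(round(mean(spots["pep_001"]), 1))     # mean is pulled up by it
```

A single aberrant spot out of 23 shifts the mean substantially but leaves the median unchanged, which is the reason for consolidating replicates with a median rather than an average.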
Validation of epitope 655-672
The peptide was synthesized according to the same protocol as for the library synthesis, replacing the PNA tag with a biotin.
Fluorescent bead assay
0.3 μL of Pierce™ High-Capacity Streptavidin Agarose Beads (catalog n°: 20357 from Thermo Scientific™) were mixed with 50 μL of the biotinylated peptide (10 μM in PBS-T). The beads were incubated for 20 minutes and thereafter blocked with 200 μL of fetal bovine serum for 10 minutes. The beads were then washed once with 100 μL of PBS-T, and 5 μL of serum from either positive or negative patients was added together with 450 μL of PBS-T and 50 μL of fetal bovine serum in order to block unspecific interactions. The beads were then incubated for 90 minutes and subsequently washed 4 times with 100 μL of PBS-T in order to remove all the non-binders. Finally, 200 μL of a 163 nM solution of anti-human IgG-FITC (ab6854 from Abcam) in PBS-T with 0.5% BSA was added and incubated for 1 hour. The excess of secondary antibody was washed away with 3 washes of 100 μL of PBS-T, and finally the beads were imaged with a Leica SP8 inverted confocal microscope.
Enzyme-linked immunosorbent assay (ELISA)
100 μL of an 80 nM solution of streptavidin (ref: S0677 from Sigma Aldrich) in PBS was added to a Corning® 96-well Clear Flat Bottom Polystyrene High Bind Microplate (catalog nº: 9018 from Corning) and incubated overnight at 4°C. The plate was then washed three times with 300 μL of PBS-T (60 seconds, room temperature), and 200 μL of an 800 nM solution of biotinylated peptide in PBS-T was added and incubated for 90 minutes at 36°C. The plate was then blocked with 300 μL of PBS-T with 0.5% non-fat dry milk (60 minutes at 36°C). The plate was washed 3 times with 300 μL of PBS-T (60 seconds, room temperature), and plasma diluted 1:300 in PBS-T with 0.5% non-fat dry milk was added to each well and incubated for 90 minutes at 36°C. After incubation of the plasma, the plate was washed 3 times with 300 μL of PBS-T (60 seconds, room temperature), 1 time with PBS-T with 0.5% non-fat dry milk (60 minutes, 37°C) and again 3 times with 300 μL of PBS-T (60 seconds, room temperature). 100 μL of HRP-conjugated goat anti-human IgG (ab97175 from Abcam) diluted 1:10000 in PBS-T with 0.5% BSA was added to each well and incubated for 90 minutes at 37°C. The plate was then washed 3 times with PBS-T (60 seconds, room temperature), and 200 μL of a 0.41 mM solution of 3,3′,5,5′-tetramethylbenzidine (TMB) (ref: 860336 from Sigma Aldrich) in 50 mM Na2HPO4, 25 mM citric acid and 0.0024% H2O2, pH 5.5, was added to the plate and incubated for 20 minutes at 37°C. Finally, 50 μL of a 1 M sulfuric acid solution was added and the absorbance was measured at 450 nm with a plate reader (SpectraMax, Molecular Devices). Each sample was measured in triplicate, and the reported absorbance value is the average of the 3 reads.
Sequence alignment
Sequence alignment was done using Clustal Omega [46].
Results and discussion
SARS-CoV-2 is composed of 4 major structural proteins: S (spike), M (membrane), N (nucleocapsid) and E (envelope) [47-49]. The spike protein is responsible for entry by binding the angiotensin-converting enzyme 2 (ACE 2) on the host cell [50, 51]. Accordingly, antibodies that bind the RBD and inhibit the interaction of the S protein with ACE 2 have been the center of attention. Based on the critical role of the S protein in CoV infection, we focused our work on this protein, dissecting it into two sets of overlapping linear 12mer peptides (two-fold sequence coverage with a 6-amino-acid overlap between the two sets; i.e., 1-12, 7-18, 13-24, …). The peptide array was prepared by hybridization of the PNA-tagged peptide library onto a DNA microarray (Figure 1) [52]. This technology ensures a high level of homogeneity across different arrays since individual arrays are prepared from the same library hybridized onto commercial DNA microarrays. Furthermore, the arrays are designed to have each sequence present 23 times, thus ensuring high accuracy by calculating the median of the observed fluorescence of the 23 spots.
The S protein of SARS-CoV-2 shares 76% homology with that of SARS-CoV-1 [48, 53], and this homology has already been harnessed to predict epitopes based on experimental results from SARS-CoV-1. However, the different infection outcome of SARS-CoV-2 relative to SARS-CoV-1 originates in part from differences in the S protein. SARS-CoV-2 has better affinity for ACE 2 than SARS-CoV-1, yielding more efficient cellular entry [54, 55]. Furthermore, the presence of a furin cleavage site [56-58] in the S protein of SARS-CoV-2 (not present in SARS-CoV-1), coupled to an extended loop at the proteolytic site, leads to higher cleavage efficacy, thus facilitating its activation for membrane fusion [55, 59-61].
Analysis of 12 different plasma samples from SARS-CoV-2 infected patients and comparison to 6 samples from uninfected patients clearly highlighted a strong response to specific epitopes (Figure 2). The three most frequently detected linear epitopes of the SARS-CoV-2 S protein were 655-672, 787-822, and 1147-1158. None of these epitopes was detected in all the positive samples tested, but each was detected in >40% of the positive patients. The 655-672 epitope is the most frequently detected in positive samples and corresponds to a peptide that is not part of a secondary structure (Fig 3A-B). The corresponding epitope had also been detected in SARS-CoV-1 [8] (89% homology for the 18mer peptide, Fig 4A-C) and predicted bioinformatically for SARS-CoV-2 [27, 31, 35, 36]; however, it had yet to be observed experimentally.
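As a point of arithmetic, a figure of 89% for an 18mer is what one would obtain if 16 of the 18 aligned positions were identical (16/18 ≈ 88.9%). The sketch below computes percent identity for two pre-aligned peptides of equal length; whether the quoted homology was computed this way is an assumption, and the two sequences are hypothetical toy strings, not the SARS-CoV-1 and SARS-CoV-2 peptides.

```python
def percent_identity(a, b):
    """Percentage of identical positions between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return 100.0 * matches / len(a)


# Hypothetical 18mers differing at two positions (3 and 18).
pep_a = "ACDEFGHIKLMNPQRSTV"
pep_b = "ACEEFGHIKLMNPQRSTI"

print(round(percent_identity(pep_a, pep_b), 1))
```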
Interestingly, this epitope is adjacent to the reported S1/S2 cleavage site (Fig 4A-C, furin/TMPRSS2) [50, 57]. The proteolytic cleavage of the loop 681-685 has been demonstrated to be necessary for viral entry into the host cell [50]. Moreover, the proteolytic cleavage of the S protein could be a determining factor for the capacity of the virus to cross species. For example, the S protein of Uganda bat MERS-like CoV is capable of binding human cells, but this is insufficient for entry [62]. However, if a protease (trypsin) is added, the protein is cleaved and viral entry occurs. Furthermore, the virus most closely related to SARS-CoV-2 is RaTG-13, from a bat found in Yunnan province in 2013, which does not contain the furin cleavage sequence [49]. Taken together, this evidence suggests that cleavage of the S protein is a barrier to zoonotic coronavirus transmission. The furin cleavage site could have been acquired by recombination with another virus, leading to human infection. Notably, the pathogenic avian H5N1 virus also contains a furin cleavage site, which leads to higher pathogenicity due to the distribution of furins in multiple tissues [63]. We speculate that the binding of an antibody to the epitope 655-672 would sterically block the proteolysis of S1/S2 and should thus be broadly neutralizing, since the proteolysis is required for viral entry, without promoting ADE.
Another epitope abundantly detected only in healing patients was 787-822, a peptide segment extending at the periphery of the solvent-exposed part of the protein (Fig 3A-B). It has also been experimentally observed in SARS-CoV-1 [9, 13] and SARS-CoV-2 [38, 39], and predicted bioinformatically [26, 27, 30, 31, 33, 36]. Interestingly, this epitope includes the S2’ cleavage site of the spike protein (Fig 4D-F), which has been reported to activate the protein for membrane fusion via extensive irreversible conformational changes [53, 64]. This epitope also includes the fusion peptide (816-833, Fig 4D-F) [65], which is highly conserved among coronaviruses [66, 67], suggesting a potential pan-coronavirus epitope at this location. It should be noted that a peptide-based fusion inhibitor was shown to exhibit broad inhibitory activity across multiple human CoVs [68] and that antibodies against that region have shown neutralizing activity in SARS-CoV-1 [69]. Taken together, the data support the hypothesis that antibodies inhibiting this proteolytic cleavage should be neutralizing [61, 65].
Finally, the epitope 1147-1158 is found at the C terminus of the spike protein. The structural data reported thus far did not suggest a defined structure for this portion of the S protein. This epitope extends from the helix bundle 1140-1147 (Fig 3A-B) and had also been experimentally observed in SARS-CoV-1 [9] and predicted bioinformatically for SARS-CoV-2 [27, 31, 35].
One limitation of epitope mapping with a peptide array is that it is restricted to linear epitopes. Antibodies binding to the RBD have been shown to participate in interactions spanning multiple peptide fragments. Indeed, we did not observe a strong response to linear peptides in the RBD. A control experiment with the AI334/CR3022 antibody [25, 70] showed only weak binding to the 367-378 peptide sequence of the RBD.
To validate the results observed on the microarray, a peptide (655-672) was synthesized as a biotin conjugate for pull-down and ELISA experiments. The sequence corresponding to 655-672-biotin and a scrambled version of the biotinylated peptide were individually immobilized on agarose streptavidin beads. Beads were exposed to serum from patients that were either positive or negative for that epitope based on the microarray data and subsequently treated with anti-human IgG-FITC. The fluorescence of the beads was quantified by confocal microscopy (Fig 5A). As can be seen in Fig 5B-E, the beads with the 655-672 peptide and a positive serum sample showed higher fluorescence than those with either negative serum or the scrambled peptide. To further probe the binding of the 655-672 peptide to antibodies of SARS-CoV-2 positive patients, the same 655-672 biotinylated peptide was used in an ELISA (Fig 6A). Three SARS-CoV-2 positive samples showing a strong 655-672 signal (Samples 7, 8 and 9) and three SARS-CoV-2 negative samples (Samples 14, 15 and 17) were analyzed; the positive samples showed clear binding to the 655-672 peptide and not to the scrambled version (Fig 6B).
Next, an alanine scan was performed to assess the contribution of individual amino acids to the interaction with the antibodies of two of the COVID-19 positive patients with antibodies against this epitope (Samples 1 and 6) at two different dilutions (1:100 and 1:400). For this purpose, 17 different peptide-PNA conjugates were synthesized, replacing one amino acid at a time with Ala (Fig 7A), and the intensity of the observed binding was measured on the microarray. This analysis revealed the key role of 5 residues that, if converted to Ala, lead to a dramatic loss of binding (amino acids in blue, Fig 7B). Thus, the key amino acids for antibody binding common to these two plasma samples are H655, Y660, C662, G669 and C671. The binding of the antibody present in sample 1 also seems to depend on P665. The similarity between the two patients is notable considering that the response is polyclonal.
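The construction of an alanine-scan panel can be sketched as follows. The function name, the mutation labels, and the toy six-residue peptide with its numbering are hypothetical; in particular, skipping positions that are already Ala is an assumption of this sketch rather than a documented detail of the library described here.

```python
def alanine_scan(peptide, start=1):
    """Generate single-residue Ala variants of a peptide.

    Returns (label, sequence) tuples, where the label follows the usual
    'wild-type residue + position + A' convention (e.g. 'K101A').
    Positions that are already Ala are skipped (assumption of this sketch).
    """
    variants = []
    for i, residue in enumerate(peptide):
        if residue == "A":
            continue
        label = f"{residue}{start + i}A"
        variants.append((label, peptide[:i] + "A" + peptide[i + 1:]))
    return variants


# Hypothetical toy peptide numbered from residue 101, NOT the 655-672 epitope.
for label, sequence in alanine_scan("KLAGWE", start=101):
    print(label, sequence)
```

Comparing the signal of each variant with that of the parent peptide then indicates which side chains the antibodies depend on: a variant whose signal collapses points to a residue that contributes to binding.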
Conclusion
We have developed a peptide array for the epitope mapping of the spike protein of SARS-CoV-2. Using this array to profile healing plasma of twelve laboratory-confirmed COVID-19 patients and six negative controls, we have discovered three immunodominant linear regions, each present in >40% of COVID-19 patients. Two of these epitopes correspond to key proteolytic sites on the spike protein (S1/S2 and S2’), which have been shown to be crucial for viral entry and to play an important role in virus evolution and infection. The fact that antibodies binding to the protease cleavage sites were identified from COVID-19 patients raises the possibility that mechanisms other than blocking the RBD-ACE2 interaction could be harnessed for neutralization. Furthermore, blocking proteolytic cleavage could be important to reduce antibody-dependent enhancement of viral entry, a key consideration for vaccine development. Full characterization of these antibodies is necessary, and efforts in this direction are under way.
Data Availability
All analytical data is included in the supplementary information file.
Supporting information
Detailed procedures and physical characterization of the synthetic products.
Acknowledgement
The authors gratefully acknowledge funding from the University of Geneva and the département d’instruction public du canton de Genève. The authors also gratefully acknowledge Prof. Cosson from the Geneva Antibody Facility for the generous gift of reagents, and Isabelle Arm-Vernez from the virology laboratory of the laboratory medicine division for the assessment of routine SARS-CoV-2 IgG serology.