Pre-vaccination and early B cell signatures predict antibody response to SARS-CoV-2 mRNA vaccine

SARS-CoV-2 mRNA vaccines are highly effective, although weak antibody responses are seen in some individuals with correlates of immunity that remain poorly understood. Here we longitudinally dissected antibody, plasmablast, and memory B cell (MBC) responses to the two-dose Moderna mRNA vaccine in SARS-CoV-2-uninfected adults. Robust, coordinated IgA and IgG antibody responses were preceded by bursts of spike-specific plasmablasts after both doses, but earlier and more intensely after dose two. Distinct antigen-specific MBC populations also emerged post-vaccination with varying kinetics. We identified antigen non-specific pre-vaccination MBC and post-vaccination plasmablasts after dose one and their spike-specific counterparts early after dose two that correlated with subsequent antibody levels. These baseline and response signatures can thus provide early indicators of serological efficacy and explain response variability in the population.


4
The pandemic caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-45 2) instigated rapid worldwide COVID-19 vaccine prioritization strategies. Several vaccine 46 candidates were developed, including two vaccines based on novel mRNA platforms (Moderna 47 mRNA-1273 and the Pfizer/BioNTech BNT162b2). Both mRNA vaccines encode a stabilized 48 ectodomain of the spike protein trimer (S-2P) derived from the Wuhan Hu-1 isolate 1 , and are 49 given in two vaccine doses, referred to henceforth as v1 and v2. Both mRNA vaccines have been 50 shown to be highly protective and elicit strong B cell and antibody responses 2,3 , although poorer 51 responses have also been seen in some individuals, such as the elderly 4 and transplant recipients 5-52 7 , raising the question of what determines antibody response levels and whether early correlates 53 of immunity can be defined. Studies on other vaccines have shown that pre-vaccination 54 signatures and early circulating B cell responses involving plasmablasts (PB) and activated 55 memory B cells (MBC) can predict the magnitude and longevity of neutralizing antibodies 56 following vaccination [8][9][10][11] . The Pfizer vaccine has been shown to induce robust PB and MBC 57 responses in blood and draining lymph nodes 4,12 , but the extent by which these responses differ 58 across individuals and whether they are associated with antibody levels have not been assessed. 59 To address gaps in correlates of humoral immunity to mRNA vaccines, we evaluated 60 antibody and B cell responses following vaccination with mRNA-1273 in 21 healthy SARS-61 CoV-2-uninfected adults (Extended Data Table 1). Blood was drawn serially over a period of 62 ~60 days (D) and paired serum and cellular assays were performed at each timepoint ( Fig. 1a and 63 Extended Data Table 1). Given the fragility of PB, cellular assays were performed on freshly 64 isolated cells while sera were cryopreserved for antibody assays. Antibody binding to S-2P, its 65 for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint this version posted July 7, 2021. ; https://doi. org/10.1101org/10. /2021 receptor-binding domain (RBD) and the nucleoprotein (NP), was measured using a multiplex 66 platform 1 . Strong IgG and IgA responses were induced, starting around D10, to both S-2P and 67 RBD (Fig. 1b), although the magnitude was highly variable across vaccinees at v2D28 (c.v. > 68 100%), spanning 2-3 orders of magnitude for both IgA and IgG titers (Fig. 1b, right panels). The 69 IgM response was weak across all vaccinees (Fig. 1b). This is consistent with recent reports for 70 the mRNA vaccines 13,14 , yet in contrast to strong responses in patients who recovered from mild 71 to severe COVID-19 14-17 (Fig. 1c). NP antibodies were also low in vaccinees (Fig. 1c), as 72 expected for SARS-CoV-2-uninfected people. Strong correlations were observed among RBD 73 and S-2P antibodies (Fig. 1d), but the correlation between IgA RBD and IgA S-2P was higher 74 than that between their IgG counterparts. The inhibition of RBD binding to the spike protein 75 receptor ACE2 by serum antibodies, a surrogate for neutralization capacity, also revealed a range 76 of responses (Fig. 1e) that correlated with RBD IgG and IgA binding antibodies (Extended Data 77 Fig. 1a). 78 In general, B cell responses to vaccination are detected in the peripheral blood in two 79 distinct phases; the first consisting of a short burst of PB, typically detected around D7, followed 80 by a slower phase that leads to the establishment of a pool of long-lived MBC 18 . Antigen-specific 81 B cells can be identified by flow cytometry using protein tetramers; this is an approach that has 82 been used to track SARS-CoV-2 spike-specific responses following COVID-19 infection or 83 vaccination 15,19-21 . We used a pair of RBD and spike subunit 1 (S1) tetramers to track spike-84 specific B cell responses of vaccinees, and as expected, dual RBD + S1 + and single S1 + PB 85 became detectable at v1D10 while corresponding MBC became detectable at v1D14 ( Fig. 1f and  86 for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint this version posted July 7, 2021. ; https://doi.org/10.1101/2021.07.06.21259528 doi: medRxiv preprint 6 Extended Data Fig. 1b). S-2P tetramers also clearly detected RBD + within S1 + and S1 + within S-87 2P + B cells, but they did not clearly identify S-2P + PB (Extended Data Fig. 1c). We thus focused 88 on RBD and S1 tetramers to simultaneously measure spike-specific responses among all B cell 89 populations. We validated the approach for PB by showing that 1), frequencies of RBD + and S1 + 90 PB measured by flow cytometry were strongly correlated to those measured by the conventional 91 enzyme-linked immunospot (ELISpot) assay (Extended Data Fig. 1d) This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint this version posted July 7, 2021. ; https://doi.org/10. 1101/2021 IgA + (C14), IgG + (C2 and C4) and IgD/M (C19), were prevalent. Both IgG (C9) and IgA (C13) 108 PB were identified, as well as eight nonconventional MBC with lower abundance (Fig 2b;  109 Extended Data Table 3 for details). IgG PB (C9) contained the highest proportion of RBD + S1 + 110 cells, followed by IgA PB (C13), and the nonconventional IgG + MBC C5 (Fig. 2b). Our 111 longitudinal tracking of vaccinees revealed that RBD + S1 + PB were first detected at v1D10, then 112 subsided until v2D5-7, while other RBD + S1 + B cells (namely MBC) became visible at v1D14 113 and intensified significantly following v2 (Fig. 2c, d), consistent with the analyses performed 114 with manual gating ( Fig. 1f and Extended Data Fig. 1b). 115 We next used a linear model to search for cell clusters (both antigen non-specific and 116 specific) whose frequencies varied significantly as a function of time in response to vaccination 117 ( Fig. 3a). Antigen non-specific clusters exhibited distinct patterns of response kinetics (Fig. 3b, 118 c). Most notable were the decreasing nonconventional CD27 -IgG + MBC C3 and C6 after v1 and 119 v2, while the two PB, IgG C9 and IgA C13, and the IgA pre-PB C12 were sharply increasing 120 after each dose. Other temporally changing clusters included several MBC and GC founder B 121 cells (C27) that underwent modest changes after v1 and v2 (Fig. 3b, c), possibly a reflection of 122 trafficking to and from lymphoid tissues. 123 Spike-specific B cell frequencies, measured as a fraction of RBD + S1 + cells among cells 124 within each cluster, also exhibited varying patterns of temporal responses (Fig. 3d). Two-peak 125 responses with stronger increases in v2 than v1 were observed for the two PB clusters C9 and 126 C13, nonconventional MBC C11 (CD27 lo IgA + ) and C6 (CD27 -IgG + ), albeit with differences in 127 timing. For example, C9 (IgG) and C13 (IgA) PB had an initial modest burst beginning at 128 for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint this version posted July 7, 2021. ; https://doi.org/10.1101/2021.07.06.21259528 doi: medRxiv preprint v1D10, followed by a second stronger but shorter burst at v2D5-7 (Fig. 3e). RBD + S1 + cells 129 among two pairs of conventional/nonconventional MBC, namely C2/C5 and C3/C4 respectively, 130 also underwent coordinated changes following v2 (Fig. 3d). It is notable, however, that while 131 RBD + S1 + cells in all four of these MBC clusters showed trends of declines from their peaks by 132 v2D28, those in the two nonconventional MBC (C3 and C5) appeared to drop more precipitously 133 than in the two conventional MBC (C2 and C4; Fig. 3e). These findings are consistent with 134 recent reports that spike-specific memory responses several months after SARS-CoV-2 infection 135 are enriched within CD21 + CD27 + MBC 17,21 , which have a phenotype similar to C2 and C4. 136 Despite the coherent changes observed across subjects, substantial heterogeneity in B cell 137 responses existed among vaccinees. Independent of antigen specificity, the magnitude of PB 138 increases is known to be a correlate of antibody responses for vaccines such as influenza 24,25 . We 139 thus assessed associations between the changes in non-specific cluster frequencies over the 140 course of v1 and v2 relative to the respective baselines with IgA and IgG RBD and/or S-2P titers 141 at v2D28 by using linear models accounting for age and gender (Extended Data Fig. 3a The frequency of spike-specific PB on v2D7 spanned a wide range (Fig. 3e), raising the 147 question of whether those with depressed PB responses also had lower antibody titers following 148 v2. Thus, we next used the same linear model to search for spike-specific correlates (Fig. 4a, b). 149 for use under a CC0 license.
This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint this version posted July 7, 2021. ; https://doi.org/10.1101/2021.07.06.21259528 doi: medRxiv preprint Indeed, at v2D7 and v2D10, IgG PB (C9) were correlated with IgG and IgA RBD antibodies, 150 while IgA PB (C13) at v2D7 were associated with IgA S-2P antibodies (Fig. 4b, c). Thus, in 151 contrast to antigen non-specific PB, spike-specific PB appeared as serological correlates only 152 after v2. However, the frequency of RBD + S1 + cells within IgA PB C13 at v1D10 predicted the 153 fraction of RBD + S1 + cells in the C2 MBC at v2D7 (Extended Data Fig. 3e), suggesting that the 154 magnitude of the spike-specific PB expansion after v1 is associated with the memory response 155 after v2. A recent preprint did not find a correlation between antibody response and 156 transcriptional modules enriched for PB at D7 following the second dose of the Pfizer vaccine 26 , 157 likely due to differences in assessing PB responses using blood transcriptional signatures versus 158 our direct measurement of fresh, antigen-specific PB. In addition to PB, spike-specific C6 was a 159 positive correlate at v2D0 (Fig. 4a, d) when these cells reached a first peak (Fig. 3d). C6 are 160 CD27 -IgG + MBC known to have lower mutational burdens than their CD27-expressing 161 counterparts 27 , and may as such, reflect products of early events of the antigen-driven maturation 162 process after v1. C6 are also CD21 hi , and similar to a stable pool of CD27 -MBC that are 163 generated in response to new influenza variants 28 . 164 Most of the other antigen-specific correlates were positively associated with the titer 165 response and reflect changes after the second dose ( Fig. 4b), including spike-specific MBC. At 166 v2D14 and v2D28, the frequency of RBD + S1 + cells in conventional MBC C2 was positively 167 correlated with RBD/S-2P IgA and IgG antibodies (Fig. 4b, e), consistent with their role in 168 sustaining immunological memory 17,21 . In contrast, it is notable that C5, the nonconventional 169 MBC with features of vaccine-induced activated MBC 24 , and that we found to contain a strong 170 for use under a CC0 license.
This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint this version posted July 7, 2021. ; https://doi.org/10.1101/2021.07.06.21259528 doi: medRxiv preprint RBD/S1 response following v2 (Fig. 3d, e), did not correlate with endpoint antibodies (data not 171 shown). Together, these may reflect a unique response feature of this mRNA vaccine, but timing 172 or insufficient statistical power may have played a role in the lack of detection of correlation 173 between C5 and antibodies. 174 Among the clusters that correlated with antibodies, several were not of the same isotype. 175 While it is possible that IgG B cells could give rise to IgA-secreting cells 29 , the reverse cannot 176 occur. Thus, the most likely explanation for inter-isotype correlations is that most are not causal, 177 but reflect a coordinated immunologic response driven by shared mechanisms, as indicated by 178 the strong correlations between IgG and IgA antibodies (Fig. 1d). Indeed, when we used the 179 correlated component (the first principal component) of the v2D28 IgG and IgA RBD and S-2P 180 antibody titers as an isotype-independent endpoint, we found many of the positive antigen-181 specific correlates highlighted above, including C6 MBC at v2D0, C9 PB at v2D7, and C2 MBC 182 at v2D14 (Extended Data Fig. 3f), indicating that these correlates reflected isotype independent 183 responses that potentially determined the magnitude of antibodies induced by the mRNA 184

vaccine. 185
Intriguingly, aside from confirming the early plasmablast correlates at v1D10 above, the 186 isotype independent analysis revealed additional non-specific correlates from as early as the 187 baseline before the first dose of vaccination (Fig. 4f): C30 MBC was a positive baseline predictor 188 independent of age and gender, suggesting that the frequency of these circulating unswitched 189 CD138 + MBC reflected an antigen non-specific "set point" for humoral responses to naïve 190 antigens 8 . Consistent with this notion, C30 did not appear again as a baseline (v2D0) correlate 191 for use under a CC0 license.
This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint this version posted July 7, 2021. ; https://doi.org/10.1101/2021.07.06.21259528 doi: medRxiv preprint for the second dose (data not shown), probably because v2 elicited a recall response. 192 Interestingly, by v1D7, this population was a negative correlate (Fig. 4f) This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Study design and participants 203
Twenty SARS-CoV-2-uninfected NIH employees and one community member who were This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

SARS-CoV-2-binding antibody assay 224
Serum samples were heat inactivated at 56 o C for 60 minutes. A 4-plex antibody binding 225 assay was performed using an electrochemiluminescence immunoassay analyzer (ECLIA) 226 This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

RBD-ACE2 blocking assay 246
Samples were prepared as for the 4-plex binding assay. 384-well plates precoated with 247 RBD were supplied by the manufacturer (MSD). Plates were blocked at room temperature (RT) 248 for 30 minutes with MSD blocker A solution containing 5% BSA. Plates were washed, test 249 samples were added at dilutions of 1:10, 1:20 and 1:40, and incubated with shaking at RT for 60 250 minutes. Human ACE2 conjugated with SULFO-TAG was added and plates were further 251 incubated to allow binding to RBD. Plates were washed, ECL substrate added, and plates read as 252 in antibody binding assay. Fold reduction in ECL response for each sample was calculated 253 against based on signal emitted in wells in absence of sample (assay diluent). 254 255

Recombinant biotinylated RBD protein 256
A SARS-CoV-2 RBD construct containing a His-tag and Avi-tag was generated, as 257 previously described 30 . The residues 319-541 of the S protein were codon optimized with N-258 terminal of signal peptide (MFVFLVLLPLVSSQ) and C-terminal of 6-His tag and Avi-tag 259 (GLNDIFEAQKIEWHE). The DNA encoding sequence was cloned into the mammalian cell 260

expression vector pCAGGS and confirmed by sequencing, prior to transient transfection in 261
FreeStyle 293-F cells with 293fectin transfection reagent (ThermoFisher). Culture supernatants 262 were harvested at 5 days post transfection, filtered, and purified by in-house packed affinity 263 for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint this version posted July 7, 2021. ; https://doi.org/10.1101/2021.07.06.21259528 doi: medRxiv preprint purification column with Complete His-tag purification resin (Roche). Elutes were buffer 264 exchanged with phosphate-buffered saline (PBS), and concentrated using an Amicon Ultra 10 265 kDa molecular weight cutoff concentrator (Millipore). Biotinylation was performed with a BirA 266 biotin-protein ligase standard reaction kit (Avidity), according to the manufacturer's instructions. 267 Excess biotin was removed by five buffer exchanges with an ultra 10K concentrator (Amicon). This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint this version posted July 7, 2021.  Table 5). The stained cells were acquired on a FACS Canto II flow cytometer 293 (BD Biosciences) and analyzed using FlowJo software v9.9.6 (BD Biosciences). 294 295

ELISpot assay to enumerate SARS-CoV-2 spike-specific PB 296
Spike protein S1 and RBD-specific antibody-secreting PB were enumerated by 297 modifying the antigen-specific portion of a previously described ELISpot assay 31,32 . Briefly, 298 wells of Immobilon-P polyvinylidene difluoride (PVDF) membrane 96-well plates (Millipore) 299 were coated with 5ug/ml anti-Ig light-chain antibodies (Rockland Immunochemicals) overnight 300 at 4 o C. Plates were washed and wells were blocked at RT for 2 hours with RPMI containing 10% 301 FBS. Duplicate wells were plated with PBMC containing 0.01-0.003 X 10 6 B cells for total 302 IgA/G/M-secreting PB enumeration and 0.1-0.03 X 10 6 B cells for RBD/S1-specific PB. Plates 303 were incubated at 37 o C for 5 hours, followed by overnight incubation at 4 o C with biotinylated 304 antibodies (Jackson Immunoresearch) against IgA (catalogue 109-066-011), IgG (catalogue 709-305 for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint this version posted July 7, 2021. ; https://doi.org/10.1101/2021.07.06.21259528 doi: medRxiv preprint 066-149), IgM (catalogue 709-066-073), or biotinylated proteins S1 (Acrobiosystems, catalogue 306 S1N-C82E8) or RBD (Biolegend, catalogue 790904). Plates were washed, streptavidin-AP 307 conjugate (R&D Systems) was added, followed by incubation at RT for 2 hours. Plates were 308 washed and spots were developed with ELISpot Blue Color Module (R&D Systems). 309 Biotinylated (Biorbyt) or unlabeled (Millipore-Sigma) keyhole limpet hemocyanin (KLH) was 310 used as negative control antigen to enumerate background spots. Spots were counted using an 311 ELISPOT reader (Cellular Technology Ltd). Frequencies of S1 and RBD-specific PB were 312 calculated as the fraction of total Ig-secreting PB after subtraction of background KLH spots. 313 314 Spectral flow cytometry data processing for FlowSOM clustering and UMAP embedding 315 FCS files generated from spectral flow cytometry with the 17-color panel and associated 316 manual gates were read into R using FlowWorkspace (4.2.0). Live, CD19 + single cells were 317 selected for downstream analysis. Data from all samples were merged and transformed with the 318 arcsinh transformation with a scale factor of 1/150. Data were not z-score scaled. The following 319 markers were used to perform clustering CD20, CD138, CD38, CD10, CD11c, CD19, CD27, 320 CD21, IgD, IgM, IgG, IgA, using FlowSOM (1.22.0) 33 , with the number of desired metaclusters 321 (nClus) set to 30. One small cluster, C8, determined to represent granulocytes, was removed 322 from downstream analysis. To visualize the clusters and RBD + S1 + cells in a UMAP 323 embedding 34 , 653,683 cells were subsampled from the roughly 3.2 million CD19 + cells to 324 include 3,667 cells per sample and all RBD + S1 + cells from the 21 longitudinal vaccine 325 for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Analysis of temporally varying FlowSOM clusters 329
Frequencies of cells within each sample were summarized into 1) the fraction of cells 330 within a cluster relative to the total number of CD19 + cells in the given sample; and, 2) the 331 fraction of cells within a cluster determined to be RBD + S1 + by manual gating, relative to the 332 number of cells in that cluster in the given sample. The extent of temporal variation was assessed 333 using a linear mixed effects model with the following formula in lme4 (1.1.26): 334 Timepoint is a factor variable representing the discrete timepoints (v1D0, v1D7, etc.) 336 The significance of the timepoint term was assessed with a type III ANOVA using 337 Satterthwaite's approximation using the lmerTest package This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint this version posted July 7, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 using Ward's method, as implemented in the hclust function (method = "ward.D2) in R (4.0.2). 347 After inspecting the respective dendrograms, 4 groups were determined to be appropriate, and 348 the hierarchical clustering trees were cut to produce 4 groups for both antigen non-specific and 349 specific cells. 350 351

Modeling of association between endpoint antibody concentrations and cluster frequencies 352
A linear model accounting for the age and gender of the longitudinal vaccine participants 353 was used to estimate whether cell cluster frequencies in response to vaccination were associated 354 with SARS-COV2 spike protein (S-2P/RBD) antibody concentration at endpoint (v2D28): 355 log2(endpoint concentration) ~ cluster frequency timepoint + age + sex 356 Analyses were carried out on both standardized antigen non-specific and specific 357 frequencies, i.e., cluster cell counts as a fraction of total CD19 + cell counts and RBD + S1 + cells 358 within the clusters, respectively. For the antigen non-specific models, only clusters whose post-359 vaccination frequencies at any timepoint changed significantly from the pre-vaccination baseline 360 (v1D0) were included. For the antigen-specific models, clusters with at least four RBD + S1 + cells 361 in any of the samples were considered. In addition, at each timepoint, a cluster was excluded if 362 there were fewer than five samples with any RBD + S1 + cells. Pre-vaccination baseline 363 frequencies (v1D0) were subtracted from frequencies post vaccine dose 1 (v1) timepoints, 364 including v1D7, v1D10, v1D14, and v2D0. Similarly, dose 2 baseline frequencies (v2D0) were 365 subtracted from frequencies of post-vaccine dose 2 timepoints, including v2D7, v1D10, v1D14, 366 for use under a CC0 license.
This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint this version posted July 7, 2021. ; https://doi.org/10.1101/2021.07.06.21259528 doi: medRxiv preprint and v2D28. P values were adjusted with the Benjamini-Hochberg method within each 367 combination of timepoint and antibody endpoint 36 . R version 3.6.3 was used for this analysis. across subjects. Associations between PC1 and RBD + S1 + cluster frequencies at each timepoint 376 were calculated using the same linear models and inclusion criteria described above. The same 377 analysis was carried out for all antigen non-specific clusters (cell counts as a fraction of total 378 CD19 + B cells) with the baseline pre-vaccination timepoint (v1D0) included. 379 380 for use under a CC0 license. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.  This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Fig. 2. Unsupervised clustering analysis identifies major B cell populations and SARS-CoV-515
2-specific B cells. a, UMAP projection of combined B cells (n = 653,683 cells), subsampled 516 from 3.2 million CD19 + cells to include 3,667 cells per sample and all RBD + S1 + cells from all 517 study participants (n = 21) at all timepoints with annotated major B cell populations identified by 518 FlowSOM clustering. b, MFI-based heatmap of FlowSOM clusters as indicated by cluster 519 number and marker. Rows ordered by hierarchical clustering. Summary of fraction of cells 520 binding both RBD and S1 within each cluster and cell counts per cluster (right). c, UMAP plots 521 with overlays of RBD + S1 + B cells (blue points with white center) at each timepoint. d, RBD + S1 + 522 cells within each cluster expressed as a fraction of total CD19 + B cells across all subjects at each 523 timepoint (n at each timepoint shown in Extended Data Table 1) This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprint this version posted July 7, 2021. for use under a CC0 license.
This article is a US Government work. It is not subject to copyright under 17 USC 105 and is also made available (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.