Emergence of a multidrug resistant and virulent Streptococcus pneumoniae lineage mediates serotype replacement after PCV13

Background Pneumococcal Conjugate Vaccine (PCV) which targets up to 13 serotypes of Streptococcus pneumoniae is very effective at reducing disease in young children; however, rapid increases in replacement with non-PCV serotypes remains a concern. Serotype 24F is one of the major invasive serotypes that mediates serotype replacement in France and multiple other countries. We aimed to identify the major pneumococcal lineage that has driven the increase of serotype 24F in France, and provide context for the findings by investigating the global diversity of serotype 24F pneumococci and characterise the driver lineage from a global perspective and elucidate its spatiotemporal transmission in France and across the world. Methods We whole-genome sequenced a collection of 419 serotype 24F S. pneumoniae from asymptomatic carriers and invasive disease cases among individuals <18 years old in France between 2003 and 2018. Genomes were clustered into Global Pneumococcal Sequence Clusters (GPSCs) and clonal complexes (CCs) so as to identify the lineages that drove the increase in serotype 24F in France. For each serotype 24F lineage, we evaluated the invasive disease potential and propensity to cause meningitis by comparing the proportion of invasive disease cases with that of carriers. To provide a global context of serotype 24F and the driver lineage, we extracted relevant genomes and metadata from the Global Pneumococcal Sequencing (GPS) project database (n=25,590) and additionally sequenced a collection of 91 pneumococcal isolates belonging to the lineage that were responsible for the serotype 24F increase in Spain during the PCV introduction for comparison. Phylogenetic, evolutionary, and spatiotemporal analysis were conducted to understand the mechanism underlying the global spread of serotype 24F, evolutionary history and long-range transmissions of the driver lineage. Findings A multidrug-resistant pneumococcal lineage GPSC10 (CC230) drove the serotype 24F increase in both carriage and invasive disease in France after PCV13 introduction. When compared with other serotype 24F lineages, it exhibited a 1.4-fold higher invasive disease potential and 1.6-fold higher propensity to cause meningitis. Globally, serotype 24F was widespread, largely due to clonal dissemination of GPSC10, GPSC16 (CC66) and GPSC206 (CC7701) rather than recent capsular switching. Among these lineages, only GPSC10 was multidrug-resistant. It expressed 17 serotypes, with only 6 included in PCV13 and none of the expected PCVs covered all serotypes expressed by this lineage. Global phylogeny of GPSC10 showed that all serotype 24F isolates except for one were clustered together, regardless of its country of origin. Long-range transmissions of GPSC10-24F from Europe to Israel, Morocco and India were detected. Spatiotemporal analysis revealed that it took ~5 years for GPSC10-24F to spread across French provinces. In Spain, we detected that the serotype 24F driver lineage GPSC10 underwent a rapid change of serotype composition from serotype 19A to 24F during the introduction of PCV13 (targets 19A but not 24F), indicating that pre-existence of serotype variants enabled GPSC10 to survive and expand under vaccine-selective pressure. Interpretation Our work further shows the utility of bacterial genome sequencing to better understand the pneumococcal lineages behind the serotype changes and reveals that GPSC10 alone is a challenge for serotype-based vaccine strategy. More systematic investigation to identify lineages like GPSC10 will better inform and improve next-generation preventive strategies against pneumococcal diseases. Funding Bill & Melinda Gates Foundation, Wellcome Sanger Institute, and the US Centers for Disease Control.


Summary Background
Pneumococcal Conjugate Vaccine (PCV) which targets up to 13 serotypes of Streptococcus pneumoniae is very effective at reducing disease in young children; however, rapid increases in replacement with non-PCV serotypes remains a concern. Serotype 24F is one of the major invasive serotypes that mediates serotype replacement in France and multiple other countries.
We aimed to identify the major pneumococcal lineage that has driven the increase of serotype 24F in France, and provide context for the findings by investigating the global diversity of serotype 24F pneumococci and characterise the driver lineage from a global perspective and elucidate its spatiotemporal transmission in France and across the world.

Methods
We whole-genome sequenced a collection of 419 serotype 24F S. pneumoniae from asymptomatic carriers and invasive disease cases among individuals <18 years old in France between 2003 and 2018. Genomes were clustered into Global Pneumococcal Sequence Clusters (GPSCs) and clonal complexes (CCs) so as to identify the lineages that drove the increase in serotype 24F in France. For each serotype 24F lineage, we evaluated the invasive disease potential and propensity to cause meningitis by comparing the proportion of invasive disease cases with that of carriers. To provide a global context of serotype 24F and the driver lineage, we extracted relevant genomes and metadata from the Global Pneumococcal Sequencing (GPS) project database (n=25,590) and additionally sequenced a collection of 91 pneumococcal isolates belonging to the lineage that were responsible for the serotype 24F increase in Spain during the PCV introduction for comparison. Phylogenetic, evolutionary, and spatiotemporal analysis were conducted to understand the mechanism underlying the global spread of serotype 24F, evolutionary history and long-range transmissions of the driver lineage.

Findings
A multidrug-resistant pneumococcal lineage GPSC10 (CC230) drove the serotype 24F increase in both carriage and invasive disease in France after PCV13 introduction. When compared with other serotype 24F lineages, it exhibited a 1.4-fold higher invasive disease potential and 1.6-fold higher propensity to cause meningitis. Globally, serotype 24F was widespread, largely due to clonal dissemination of GPSC10, GPSC16 (CC66) and GPSC206 (CC7701) rather than recent capsular switching. Among these lineages, only GPSC10 was multidrug-resistant. It expressed 17 serotypes, with only 6 included in PCV13 and none of the expected PCVs covered all serotypes expressed by this lineage. Global phylogeny of GPSC10 showed that all serotype 24F isolates except for one were clustered together, regardless of its . CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint country of origin. Long-range transmissions of GPSC10-24F from Europe to Israel, Morocco and India were detected. Spatiotemporal analysis revealed that it took ~5 years for GPSC10-24F to spread across French provinces. In Spain, we detected that the serotype 24F driver lineage GPSC10 underwent a rapid change of serotype composition from serotype 19A to 24F during the introduction of PCV13 (targets 19A but not 24F), indicating that pre-existence of serotype variants enabled GPSC10 to survive and expand under vaccine-selective pressure.

Interpretation
Our work further shows the utility of bacterial genome sequencing to better understand the pneumococcal lineages behind the serotype changes and reveals that GPSC10 alone is a challenge for serotype-based vaccine strategy. More systematic investigation to identify lineages like GPSC10 will better inform and improve next-generation preventive strategies against pneumococcal diseases.

Funding
Bill & Melinda Gates Foundation, Wellcome Sanger Institute, and the US Centers for Disease Control.
. CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint

Introduction
Pneumococcal conjugate vaccines (PCV), which targets up to 13 capsule serotypes of Streptococcus pneumoniae that account for most of the diseases in infants, have been very effective at reducing disease worldwide. 1 However, increases in replacement with non-PCV serotypes remain a concern, 2-6 such as serotype 19A after PCV7 in the USA. 7 In France, a sharp increase in pneumococcal meningitis cases occurred five years after the roll out of PCV13, mainly driven by a non-PCV13 serotype 24F. 8,9 This serotype also mediated serotype replacement in multiple other countries such as Argentina 10 , Canada, 11 Denmark, 12 Germany, 13 Israel, 14 Italy, 15 Japan, 16 Lebanon, 17 Norway, 18 Spain, 19,20 and UK 21 , and was reported to be the predominant serotype causing invasive pneumococcal disease (IPD) in Portugal 22 after PCV13 introduction.
Compared with most of the non-PCV13 serotypes, the serotype 24F capsule has a high invasive disease potential 23 and propensity to cause meningitis. 24 In France, the fatality rate for meningitis due to serotype 24F pneumococci was 13%, similar to that (11%) caused by pneumococci expressing other serotypes. 8 In some countries, the serotype 24F increase was concomitant with increasing prevalence of penicillin resistance in IPD overall 8,15 and IPDs due to non-vaccine serotypes. 10 Despite serotype 24F being an important emerging serotype with high invasiveness and potential for antimicrobial resistance, it is not included, to our knowledge, in any expected future PCV formulations (PCV15, 20, 24).
By delineating pneumococcal lineages into Global Pneumococcal Sequence Clusters (GPSCs, aka lineages) using variations across the entire genome 25 and/or clonal complexes (CCs) defined by nucleotide sequences of seven housekeeping genes, 26 we observed that pneumococcal lineages driving the increase in serotype 24F varied between countries. For example, the increase was mainly driven by GPSC10 (CC230) in Argentina, 10 Lebanon 17 and Spain 27 , by GPSC6 (CC156) in Denmark 12 , by GPSC106 (CC2572) in Japan. 16 Here we applied whole genome sequencing to investigate the pneumococcal lineages driving the increase in serotype 24F after PCV13 introduction in France. Merging with >25,000 pneumococcal genomes from 56 countries in the Global Pneumococcal Sequencing (GPS) project database, we provide context for the findings from France by investigating the global diversity of serotype 24F pneumococci. We characterise the driver lineage from a global perspective and elucidate its spatiotemporal transmission in France and across the world.

Study design
. CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; https://doi.org/10.1101/2021.11.24.21266813 doi: medRxiv preprint A representative set of French serotype 24F pneumococcal isolates collected through a nationwide hospital-based active surveillance for IPD 8,9 and carriage survey 28 across the country were whole-genome sequenced. The collection included isolates from invasive disease cases (n=190) and asymptomatic colonisation (n=229) among individuals <18 years old between 2003 and 2018 ( Figure S1). The study period spanned across different phases of PCV introduction: from PCV7 use in target groups of children (e.g. children in a daycare) to generalised use of PCV13 for all children and the subsequent increase of 24F in 2015. To identify the pneumococcal lineage(s) driving the increase in 24F, we grouped isolates into GPSCs and CCs, and evaluated each lineage's invasive disease potential and propensity to cause meningitis by calculating odds ratio by reference to carriage. 29 We then contextualised the serotype 24F and the driver lineage (GPSC10) from a global perspective by including additional genomes from the GPS project database (n=25,590 from 55 countries, last accessed on 2 nd October 2021). We additionally sequenced a collection of 91 pneumococcal isolates that were responsible for the serotype 24F increase in Spain for comparison. These isolates were collected from children aged <5 years old in the Catalan support laboratory for non-mandatory molecular surveillance of IPD located at Hospital Sant Joan de Déu, Barcelona between 2009 and 2018. 27 To understand the genetic diversity, phylogenetic analysis was carried out on 642 serotype 24F isolates from 29 countries across six continents (Africa, Asia, Australia, Europe, North America, and Latin America). An international collection of 888 GPSC10 isolates from 33 countries, regardless of serotype, were included for phylogenetic, evolutionary, and spatiotemporal analysis.

Sequencing and genomic characterisation
The pneumococcal isolates in this study were whole-genome sequenced at the Wellcome Sanger Institute (Hinxton, UK) on an Illumina HiSeq sequencer. The sequence reads were subjected to quality control based on the criteria as previously described 25 . We characterised each genome by assigning GPSC using PopPUNK 25,30 , sequence type (ST) by MLSTcheck 31 and then grouped STs into CC using Eburst 32 , predicted serotypes using SeroBA 33 , and resistance profile of 17 antibiotics, including penicillin, chloramphenicol, erythromycin, cotrimoxazole and tetracycline, using a pipeline developed by the Streptococcus Laboratory at the Centers for Disease Control and Prevention Atlanta, USA. 33,34 Multidrug resistance (MDR) was defined as an isolate resistant to ≥3 antibiotic classes. In our previous large-scale analysis, we showed a high concordance between GPSC and CC, therefore sequence type is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; extraction, genome quality control criteria, de novo assembly, annotation and in silico serotyping within serogroup 24 were described in supplementary file 2.

Phylogenetic analysis
We performed phylogenetic analysis on all serotype 24F isolates by constructing a maximum likelihood tree using FastTree version 2.1.10 with the general time reversible substitution model. 34  Bayesian phylogenetic analysis on a subset of serotype 24F isolates within GPSC10 was conducted to generate a time-resolved phylogeny and provide estimates of the median effective population size over time with a 95% highest posterior density to detect any exponential increase in population using BEAST Bayesian skyline model version 2.6.3 40 . The time-resolved phylogeny can be visualised at https://microreact.org/project/GPSC10-24F.
To calculate time taken for GPSC10-24F to spread across France, we inferred the evolutionary time between all pairs of genomes from the time-resolved phylogeny. We then utilized an risk ratio framework to calculate the odds that a pair of genomes from within the same province (0-50km) compared with pairs isolated from locations between 250-350km apart (the mean distance between different French provinces is 330km) had a specified timeto-most-recent-common-ancestor (tMRCA) across rolling 2 year divergence time windows from 2 to 20 years. 41 We restricted the analysis to pairs isolated within the same year to mitigate variable geographic sampling across years. A risk ratio close to one indicated an equal chance that a pair of isolates diverged from the same tMRCA within and between . CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; https://doi.org/10.1101/2021.11.24.21266813 doi: medRxiv preprint provinces. To determine uncertainty, we ran this with 100 bootstrapped iterations sampling with replacement. We determined the significance of the relationship between the tMRCA and risk ratio using a generalized linear mixed model. The spatiotemporal analysis was run using R version 3.6.0.

Statistical analysis
To determine if differences in proportions between two groups were significant, two-sided Fisher's exact test was used. Two-sided p values of less than 0.05 were considered significant.
Multiple testing correction was carried out using the Benjamini-Hochberg false discovery rate of 5%. 42 We grouped the French isolates into four vaccine periods as previously described by

Role of funding source
The funders of the study had no role in the study design, data collection, data analysis, data interpretation, or writing of the report. The corresponding author had full access to all the data in the study and had final responsibility for the decision to submit for publication.
The global collection of 642 serotype 24F isolates revealed that this serotype was widely Among these most common lineages, only GPSC10 was MDR with a majority of isolates exhibiting resistance to penicillin, cotrimoxazole, erythromycin and tetracycline ( Figure S4).
Phylogenetic analysis of the serotype 24F cps revealed a strong clonal but not geographical structure ( Figure S5). The cps belonging to the same GPSC were clustered together regardless of the isolates' country of origin. This finding indicated that after a single capsular switching to 24F, the serotype variants clonally disseminated across different geographical regions majorly at the genetic background of GPSC10, 16 and 206. Only a few capsular switching events were observed based on the high similarity in cps between lineages. For example, three GPSC18 isolates from France share highly similar cps with GPSC10, suggesting a capsular switching between GPSC10 and GPSC18. GPSC18 expressing serotype 24F was not detected elsewhere but in France according to two largest pneumococcal isolate databases, the GPS project and pubMLST database (last accessed on 12 th August 2021). GPSC18-24F was first detected in 2011 in this collection of isolates, a year after the introduction of PCV13 in France, without significant expansion ( Figure 1). GPSC18 was a MDR lineage exhibiting resistance to penicillin, cotrimoxazole and erythromycin and found to cause both invasive disease and asymptomatic colonisation. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; Of all serotype 24F lineages, GPSC10 was the only one detected with high invasive disease potential and multidrug resistance. It was responsible for the increase of serotype 24F in France, and one of the major lineages mediating the global spread of serotype 24F. We further investigated this lineage from a global perspective using an international collection of 888 GPSC10 isolates. This lineage was detected in 33 different countries across Africa, Asia, Europe, North and South America ( Figure S6 Concerningly, GPSC10 is among the top 5 lineages in India, 43 Pakistan and Nepal (Table S4), where pneumococcal disease burden is highest. 1 In these three countries, 15 serotype variants were detected in GPSC10 and only 6 were included in PCV13, underlining its potential to cause serotype replacement in the future. Internationally, GPSC10 was consistently found to express multiple serotypes and be multidrug-resistant.
The global GPSC10 phylogeny showed that all serotype 24F isolates but one were clustered together, regardless of their country of origin (Figure 2A). This finding is consistent with the 24F cps phylogeny, which indicated the global dissemination of GPSC10-24F was largely mediated by clonal spread rather than capsular switching. We analysed a collection of 91 GPSC10 isolates from Spain and detected a rapid change in serotype composition from serotype 19A to 24F after the implementation of PCV13 in the target group of children ( Figure   S7). The 19A and 24F serotype variants from Spain were separated in long branches on the global phylogeny, further indicating that the capsular switch predates the vaccine introduction and coexistence of both GPSC10 serotype variants in Spain. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; https://doi.org/10.1101/2021.11.24.21266813 doi: medRxiv preprint multiple transmissions from Europe to Morocco were detected and a single transmission from Europe to India in the early 2000s was followed by a clonal expansion.
We reconstructed the population dynamics over time on EU-clade-I and EU-clade-II using the Bayesian skyline model. The EU-clade-I was predicted to have three exponential increases in effective population size in around 1995, 2004 and 2013, while EU-clade-II had one in around 2010 ( Figure 3A and 3B). The most recent increase in EU-clade-I coincided with the observed prevalence increase of GPSC10-24F in both overall disease and carriage isolates from France ( Figure 3A, 3C-D). Spatiotemporal analysis of the 174 GPSC10-24F isolates from France indicated that a pair of pneumococcal isolates was more likely to be recovered within the same province if they were <5 years diverged from their MRCA (p=0.0083). Pairs which had diverged 5 or more years ago had a risk ratio stably surrounding 1 and no downward trend (p=0.4449). This suggests that the GPSC10-24F population was homogenous across French provinces after 4 years ( Figure 4 and Table S3).

Discussion
Our results showed the emergence of a virulent and MDR pneumococcal lineage GPSC10 that was responsible for the increase of invasive pneumococcal disease in France and the global spread of the invasive serotype 24F, which is not included in the current or expected PCVs. Due to its recombinogenic nature, 44 this lineage is capable of simultaneously expressing a wide range of serotypes to facilitate its adaptation under the vaccine-selective pressure. Together with its transmissibility, GPSC10 should therefore be regarded as a highrisk lineage that could diminish the benefits of the vaccination programme worldwide over time.
Over the two decades since the advent of PCV, GPSC10 has mediated serotype replacement in multiple countries. After the introduction of PCV7/10, 19A became the predominant serotype causing invasive disease in Europe, such increase was partially driven by GPSC10, together with GPSC1 (CC320). 45,46,47,48,49,50 In France and Spain, the major contributor of serotype 19A was GPSC10 in the post-PCV7 period. 50,51 The use of PCV13 (which targets 19A) effectively reduced serotype 19A but with a concurrent increase in serotype 24F in Europe. 8,15,22,27,52 Using the Spanish collection, we observed a rapid change in serotype composition within GPSC10 from 19A to 24F after PCV13. A similar serotype change within GPSC10 was also observed in Argentina 10 and Israel 53 after PCV13, demonstrating that the pre-existence of serotype variants enables GPSC10 to survive and expand under the vaccine-selective pressure. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; https://doi.org/10.1101/2021.11.24.21266813 doi: medRxiv preprint Serotype 24F pneumococci were infrequent causes of invasive disease in Europe in the 1980s 54 and the earliest GPSC10 lineage was detected in Denmark 1996 expressing serotype 14. 55 GPSC10-24F was first reported from three adult patients in Naples, Italy between 1997-1998 56 ; these findings coincide with our model prediction of the emergence of this clone in Europe. Since then, GPSC10-24F started to be more frequently detected in children 8,45 and adults 57,58 from Southern Europe. In spite of the geographical proximity between France and Spain, the increase in serotype 24F was mainly driven by two different clones, GPSC10 EUclade-I and EU-clade-II, respectively. This finding may suggest trans-border transmission is present, but may not be as frequent as transmission within a country, resulting in evolution of two clones in parallel.
Although the global spread of serotype 24F is largely due to the clonal spread of three pneumococcal lineages (GPSC10, 16 and 206), lineage that drove the serotype 24F increase differs between countries. This variation could be partially explained by the differences in antibiotic-selective pressure. For example, the 24F driver lineage in Denmark was GPSC6. 12 Unlike GPSC10, GPSC6 was susceptible to penicillin and erythromycin. This observation coincided with the lower consumption of penicillin and macrolide (a class of antibiotic includes erythromycin) in Denmark, as compared with other countries such as France 8,9 ,Lebanon 17 and Spain 19,20 where serotype 24F increase was mediated by the multidrug-resistant GPSC10 (Table S5). The latter three countries consumed 1.8-2.2 times more penicillin and 1.2-2.0 times more macrolide than that in Denmark. GPSC10 also mediated the serotype 24F increase in Argentina 10 where macrolide consumption was 1.2 times higher than Denmark, though penicillin consumption was almost similar. The high consumption of penicillin and/or erythromycin in Argentina, France, Lebanon and Spain potentially selected for GPSC10 while serotype replacement in low antibiotic consumption settings was mainly observed to be mediated by susceptible lineages. 12,59 Among countries with serotype 24F increase, Japan has the highest consumption of erythromycin and the least of penicillin, the serotype 24F driver lineage GPSC106 (CC2572) only exhibits erythromycin resistance. 16 These observations suggested that vaccine-and antibiotic-selective pressure are shaping the post-vaccine population structure, and gradually the reduction in antibiotic resistance achieved by PCVs may diminish. 53 However, resistance alone was not necessary for clonal success, as GPSC10-24F was detected in all six countries but was only expanded in high antibiotic use settings. This finding echoed the observation in other bacterial species such as Escherichia coli. 60   is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; https://doi.org/10.1101/2021.11.24.21266813 doi: medRxiv preprint The strength of this study lies in a large global collection of pneumococcal genomes, along with a comprehensive epidemiological metadata. Although the sampling strategy and collection time frame was not consistent between countries, the GPS project has been formulated to create pre-and post-PCV dataset in each participating country to evaluate the impact of PCV on pneumococcal population. This overarching study is complemented with the knowledge we gained from a number of country-specific analyses to provide an international perspective of GPSC10 and to highlight this lineage as a future risk in pneumococcal disease prevention. This study also underlines the need for policymakers to evaluate the overall impact of PCVs including changes in IPD incidence and detection of emerging non-vaccine serotypes.
Focusing only on vaccine serotypes in estimating impact could be misleading as it would identify a rise in serotype 19A, observed in some settings with PCV10 use, but would not detect the increase in serotype 24F, associated with PCV13 use, as noted in the current study.
Knowledge gained through implementation of an effective and sustainable surveillance system for S. pneumoniae could guide timely policy making including choice of PCV to stabilise the incidence of IPD cases at a low level or even further reduce incidence.
Our work further shows the usefulness of bacterial genome sequencing to better understand the pneumococcal lineages behind the serotype changes and reveals that GPSC10 alone is a challenge for a serotype-based vaccine strategy. More systematic investigation to identify lineages like GPSC10 will better inform and improve next-generation preventive strategies against pneumococcal disease.

Acknowledgement
The study was co-funded by the Bill and Melinda Gates Foundation is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; https://doi.org/10.1101/2021.11.24.21266813 doi: medRxiv preprint increase in 24F post PCV7 was reported amongst IPD cases from Spain, Italy and France, carriage and IPD cases in Norway and carriage in Portugal.

Added value of this study
We have an enhanced understanding of the multidrug resistant lineage of S. pneumoniae which has driven the increase in serotype 24F in France post PCV13. Additionally, we utilised a global collection of S. pneumoniae isolates from 56 countries to contextualise the isolates from France, identifying the predominant lineage (GPSC10) associated with the increase of multidrug resistance in serotype 24F globally. This study is complemented with the knowledge we gained from a number of country-specific analyses to demonstrate GPSC10 may pose a global threat after PCV13 due to a high risk of vaccine evasion.

Implications of all the available evidence
The increase in serotype 24F post PCV13 in France was attributed to the multidrug resistant lineage GPSC10. Concerningly, GPSC10 has a relatively high disease potential and propensity to cause meningitis independent of serotype. GPSC10 appears to be highly capable of acquiring DNA that may result in antimicrobial resistance and serotype switches.  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; https://doi.org/10.1101/2021.11.24.21266813 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; Figure 4 Spatiotemporal analysis of GPSC10-24F sub-lineage from France. A pairwise odds ratio was calculated for two samples being diverged from time-to-most-recent-commonancestor (tMRCA) and recovered from the same French province. Odds ratio higher than one indicates that a pair of isolates is more likely to be recovered within the same French province. Pairs diverged 5 or more years ago had an odd ~ 1 without up-or downward trend, indicating an equal chance to recover the sample pair within and between French provinces.
. CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; Figure S3 (A)The geographical distribution of serotype 24F Streptococcus pneumoniae (n=642) in the Global Pneumococcal Sequencing (GPS) project database, including 419 isolates from France. (B) Proportion of pneumococcal lineages or Global Pneumococcal Sequence Clusters (GPSCs) in an overall collection of serotype 24F pneumococci (n=642) and a sub-collection (n=123) includes isolates randomly selected from disease surveillance systems and carriage surveys. Pneumococcal lineages less than 3% in prevalence are grouped as others in the pie charts. The geographical distribution can be interactively visualised at https://microreact.org/project/global_24F/7d36573f . is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; Figure S7. Rapid changes in serotype composition within Global Pneumococcal Sequence Cluster (GPSC)10 during PCV introductions among 91 isolates from Spain. Serotype 19A is targeted by PCV13 but not PCV10. Serotype 24F is not targeted by either. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; Figure S8. The proportion of Global Pneumococcal Sequencing Cluster (GPSC)10 clades and other GPSCs in serotype 24F Streptococcus pneumoniae isolates causing invasive pneumococcal disease from France (A) and Spain (B). is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; Table S3 A pairwise odds ratio for two samples being diverged from time-to-most-recentcommon-ancestor (tMRCA) and recovered from the same French province. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ; Table S4 The prevalence, serotypes and resistance profile of GPSC10 by country in the Global Pneumococcal Sequencing (GPS) database is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted November 26, 2021. ;