Genomic epidemiology and longitudinal sampling of ward wastewater environments and patients reveals complexity of the transmission dynamics of blaKPC-carbapenemase-producing Enterobacterales in a hospital setting

Background Healthcare-associated wastewater reservoirs and asymptomatic gastrointestinal patient colonisation by carbapenemase-producing Enterobacterales (CPE) can be important in nosocomial CPE dissemination and infection. We characterised these niches and within-niche diversity in a blaKPC-associated CPE (KPC-E) endemic healthcare setting, to better understand transmission potential. Methods We systematically sampled wastewater sites and patients across three units (six wards) over 6-12 months in 2016 in a KPC-E endemic hospital. We used Illumina sequencing to characterise up to five isolates per sample. Recombination-adjusted phylogenies were used to define genetically related strains; assembly and mapping-based typing approaches were used to characterise antimicrobial resistance gene and insertion sequence profiles, and Tn4401 types/target site sequences. The wider accessory genome was evaluated in a subset of the largest clusters, and those crossing niches. Findings Wastewater site KPC-E-positivity was substantial (101/349 sites [28.9%] positive, 319/4,488 [7.1%] sampling events positive); 183/4,425 (4.1%) of patients were CPE culture-positive over the same timeframe. 13 species and 109 KPC-E strains were observed across niches, and 24% of wastewater and 26% of patient KPC-E-positive samples harboured [≥]1 strain. Most diversity was explained by the individual niche, suggesting highly localised factors are important in selection and spread. Tn4401+target site sequence (TSS) diversity was greater in wastewater sites (p<0.001), suggesting these might favour Tn4401-associated transposition/evolution and dissemination. Shower/bath and sluice/mop-associated sites were more likely to be KPC-E-positive (Adjusted Odds Ratio [95% CI]: 2.69 [1.44-5.01], p=0.0019 and 2.60 [1.04-6.52], p=0.0410, respectively). Different strains had different transmission and blaKPC dissemination dynamics. Interpretation There may be substantial KPC-E colonisation of wastewater sites and patients in KPC-E-endemic healthcare settings. Niche-specific factors, and different strains with different transmission dynamics influence carbapenemase gene dissemination. New transmission models incorporating complex, multi-level dynamics are needed to better quantify CPE dissemination to inform interventions and reduce transmission. Funding This study was supported by the National Institute for Health Research, UK.

is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 accessory genome was evaluated in a subset of the largest clusters, and those crossing 48 niches. 49 is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 Implications of all the available evidence 96 Through a detailed and resource-intensive study we captured the dynamics of seeding 97 and dissemination of important carbapenemase genes amongst patients and 98 environmental reservoirs within a hospital. We found differential colonisation and 99 dissemination dynamics for different species and lineages within and between niches. 100 Improved approaches incorporating variable within-niche diversity, accessory 101 distances and horizontal gene transfer in transmission evaluations are required to 102 better understand CPE dissemination and persistence, in order to direct interventions 103 limiting transmission. 104 . CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021

INTRODUCTION 105
Carbapenemase-producing Enterobacterales (CPE) are a global health threat 1 , and 106 commonly associated with multi-drug resistance, rendering treatment of CPE 107 infections difficult. Major transmissible global carbapenemases include the metallo-108 beta-lactamases (bla NDM , bla VIM , bla IMP ), some oxacillinases (blaOXA -48/48-like 109 variants), and the Klebsiella pneumoniae carbapenemase (KPC, encoded by bla KPC ) 2 . 110 Intra-and inter-species horizontal transfer of these genes is facilitated by mobile 111 genetic elements such as transposons and plasmids, resulting in rapid dissemination of 112 carbapenem resistance 3,4 . 113 114 CPE have been isolated from human and animal gastrointestinal tracts, and from 115 sewage, rivers and sink drains [5][6][7] . Recent studies have highlighted that hospital 116 wastewater sites may act as a CPE reservoir 5 , but the diversity within these niches, 117 genetic overlap with patient isolates and likely directionality and rates of transmission 118 remain to be elucidated. Studies evaluating the contribution of patient-to-patient 119 transmission as the sole explanatory factor explaining transmission have generally 120 been unable to robustly explain the majority of transmission events, suggesting that 121 patient, staff and/or environmental reservoirs remain insufficiently sampled to 122 account for the diversity observed 8 . Few studies have considered within-niche 123 diversity of strains by sampling multiple isolates per individual and/or site [9][10][11]  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 Infirmary (MRI; part of MFT), and subsequent ward closure and plumbing 130 infrastructure replacement 12 , we undertook a prospective study systematically 131 sampling all wastewater sites on six wards for 6-12 months alongside patient rectal 132 screening/clinical sampling, and used an anonymised electronic database to 133 characterise patient admission, ward movement, sampling profiles and culture results. 134 Multiple colonies were isolated from KPC-E-positive samples to define within-niche 135 diversity, and whole genome sequencing (Illumina) of individual isolates was used as 136 to characterise this diversity and consider modes of evolution and transmission. 137

METHODS 139
Study setting and epidemiological data 140 MFT is a large, regional referral centre in northwest England, UK, managing >10,000 141 patients/year. In response to the regional emergence of   is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ; https://doi.org/10. 1101 Analyses of within-niche diversity 180 To evaluate within-niche diversity in the environment and patients, up to five 181 different Enterobacterales colonies were individually sub-cultured; bla  in sub-cultures was re-confirmed by PCR and species identification performed by 183 MALDI-ToF prior to storage in nutrient broth at -80°C. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ; https://doi.org/10.1101/2021.11.26.21266267 doi: medRxiv preprint genome clusters characterised using Panaroo (default parameters including --clean-204 mode strict 22 ) and clustered using the R heatmap package. 205 206 Isolates were assembled using SPAdes (v3.6) 23 with default parameters. Multi-locus 207 sequence types were derived using BLASTn to relevant species-specific databases 208 hosted at https://pubmlst.org (Achtman 7-gene MLST for E. coli). AMR genes were 209 identified using ARIBA 24 with the CARD database 25 . bla KPC

Statistical analyses and data visualisation 223
Descriptive statistics were calculated in R v3.6.2 or STATA v16.1. To conduct 224 permutational analyses of variance, we used the adonis function from the vegan R 225 package (v2.5-7) 28 on a matrix of pairwise Gower distances based on genetic features 226 (i.e. composite of species-strain type, AMR gene profile, IS profile, and 227 Tn4401+flanking sequence profiles), which was calculated using the daisy function in 228 . CC-BY 4.0 International license It is made available under a perpetuity.
is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ; https://doi.org/10.1101/2021.11.26.21266267 doi: medRxiv preprint 13 identified as KPC-E-positive (Fig.S3); 7/349 (2.0%) sites appeared more likely to be 254 persistently KPC-E positive (≥10 consecutive KPC-E-positive sampling events) and 255 interestingly five of these were medicines/treatment room handwash basin drains 256 likely only accessible to healthcare staff (A59, F45, F113, T43, T44; Fig.S4). The 257 proportion of positive sites did not vary significantly over time (Fig.2, bottom panel). 258 Two utility room sink drains on the cardiac unit (across W3, W4) were positive for 259 KPC-E a day after W3 reopened to patient admissions (11 th January 2016) following 260 closure and complete plumbing replacement over three months; thereafter the unit 261 was rapidly re-colonised ( Fig.2)  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ; https://doi.org/10. 1101 15 level because of small numbers, see Methods). Of the 109 strains, 61 (56%) were 304 found only in the environment, 27 (24%) only in patients and 21 (19%) in both 305 patients and the environment (Fig.4); the median strain cluster size was 5 isolates 306 (range: 1-122), and the median number of unique niches (i.e. unique patients or 307 wastewater sites) affected by a strain was 2 (range: 1-23) (Fig.4A, Fig.4B is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ; https://doi.org/10.1101/2021.11.26.21266267 doi: medRxiv preprint

Environmental and patient niches harbour diverse KPC-E within single samples and 328 over time 329
Environmental sites showed evidence of major strain-level diversity (Fig.5) KPC-E strains, 6/97 (6%) with three strains and 2/97 (2%) with four strains (Fig.5)  is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 additional Tn4401 deletions occurring in multiple isolates at four distinct timepoints 377 in the three sites (Fig.6C). 378 379 Rapid patient-patient transmission -K. pneumoniae strain 9 380 K. pneumoniae strain 9 (kpne9, ST252) was highly related (Fig.S7; maximum 6 SNVs 381 across strain, 68 isolates within 0 SNVs of each other), and spread amongst 21 382 patients across three wards on two units (acute medicine AM1 and AM2 and 383 geratology W46), with a dense outbreak in Aug 2016, most consistent, at least 384 initially, with direct/indirect patient-patient transmission (Fig.7). bla KPC was 385 consistently nested in Tn4401-ATTGA-ATTGA, except for one isolate which 386 acquired a Tn4401-associated mutation (isolate: 2216698_14, C4620T), and another 387 in which the right target site sequence (TSS) could not be identified (2035791_11). 388 Amongst the 106 isolates there was evidence of four transient plasmid replicon 389 acquisitions on a background of the stable presence of a set of IncFIB/IncFII replicons 390 (Fig.S7). Transmission probabilities inferred by SCOTTI were weak (see Appendix; 391 Fig.S8), as previously 8 , but using accessory cluster profiling to attempt further 392 discrimination amongst strains suggested that there were several major transmission 393 clusters (Fig.S9), with some individuals involved in several of these (Fig.7). A single 394 shower drain site was involved (A13, serving a particular bay with four beds), which 395 was likely a seeded bystander from affected patients, but may have contributed to 396 transmission of sub-strain 3g (Fig.7). 397 is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 Given that Tn4401 transposition is associated with bla KPC dissemination 21 , we also 401 evaluated whether the number of different Tn4401-TSS types was different across 402 strains, noting that the observed frequency of different Tn4401-TSS combinations 403 was higher in K. pneumoniae strain 11 (kpne11, ST11) than in other strains (Fig.S10). 404 Kpne11 represented two sub-clusters, separated by ~40SNVs (Fig.7, designated 405 "cluster 1" and "cluster 2"), both of which were isolated from patients and wastewater 406 sites across ward settings, including the cardiac unit after plumbing replacement. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ;https://doi.org/10.1101https://doi.org/10. /2021 1.
Munoz is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ; https://doi.org/10.1101/2021.11.26.21266267 doi: medRxiv preprint is the author/funder, who has granted medRxiv a license to display the preprint in (which was not certified by peer review) preprint The copyright holder for this this version posted December 1, 2021. ; https://doi.org/10. 1101