Microbial context predicts SARS-CoV-2 prevalence in patients and the hospital built environment

Synergistic effects of bacteria on viral stability and transmission are widely documented but remain unclear in the context of SARS-CoV-2. We collected 972 samples from hospitalized patients with coronavirus disease 2019 (COVID-19), their health care providers, and hospital surfaces before, during, and after admission. We screened for SARS-CoV-2 using RT-qPCR, characterized microbial communities using 16S rRNA gene amplicon sequencing, and contextualized the massive microbial diversity in this dataset through meta-analysis of over 20,000 samples. Sixteen percent of surfaces from COVID-19 patient rooms were positive, with the highest prevalence in floor samples next to patient beds (39%) and directly outside their rooms (29%). Although bed rail samples increasingly resembled the patient microbiome over time, SARS-CoV-2 was detected less there (11%). Despite viral surface contamination in almost all patient rooms, no health care workers contracted the disease, suggesting that personal protective equipment was effective in preventing transmissions. SARS-CoV-2 positive samples had higher bacterial phylogenetic diversity across human and surface samples, and higher biomass in floor samples. 16S microbial community profiles allowed for high SARS-CoV-2 classifier accuracy in not only nares, but also forehead, stool, and floor samples. Across distinct microbial profiles, a single amplicon sequence variant from the genus Rothia was highly predictive of SARS-CoV-2 across sample types and had higher prevalence in positive surface and human samples, even compared to samples from patients in another intensive care unit prior to the COVID-19 pandemic. These results suggest that bacterial communities may contribute to viral prevalence both in the host and hospital environment.


Introduction 1
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the causative agent of 2 a novel infectious disease, COVID-19, that has reached pandemic proportions. COVID-19 was 3 first detected in Wuhan, China, in patients with pneumonia in December 2019. This pandemic has 4 been characterized by sustained human to human transmission and it has caused more than 44 5 million cases and over 1.2 million deaths worldwide (as of 1 November 2020, WHO report). The 6 United States now has the largest number of cases worldwide at over 11 million as of November 7 20th, 2020 (1). COVID-19 is primarily transmitted via either respiratory droplets or aerosols 8 produced by an infected person and inhaled by another individual. Other routes of transmission 9 have also been proposed including fecal oral transmission (2, 3) and fomite transmission (4) 10 although the relative importance of various transmission routes is uncertain (5-8). The potential 11 role of fomite transmission is especially concerning as SARS-CoV-2 has been detected on a variety 12 of surfaces including plastic, stainless steel, cardboard, and copper, and in aerosols (9). A more 13 comprehensive understanding of what influences SARS-CoV-2 stability, transmission, and 14 infectivity is crucial to implementing effective public health measures. 15 Viruses exist in a complex microbial environment, and virus-bacterial interaction has been 16 increasingly documented in humans. In the animal microbiome, the gastrointestinal tract contains CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint 2 been demonstrated to alter the human glycocalyx thereby modulating the ability of SARS-CoV-2 24 to bind host cells (16). Given the nature of known virus-bacterium interactions, we hypothesized 25 that virus-bacterium interactions may also exist in indoor spaces (the 'built environment'). 26 The risk of contracting SARS-CoV-2 is higher indoors than outdoors particularly in poorly 27 ventilated areas (17), and the built environment has a distinct microbiome (18). The built 28 environment microbiome is usually dominated by human-associated microbes (19), and it is 29 estimated that humans shed approximately 37 million bacterial genomes per hour into their built 30 environments (20). In a study following the building of a new hospital, we discovered that the 31 indoor spaces were colonized with microbes from patients and health care workers, and 32 metagenomic analysis was used to infer transmission between occupants via surface transmission 33 (21). To test whether specific bacterial taxa in the host or built environment influence SARS-CoV-34 2 persistence, we collected samples from hospital surfaces, patients, and health care workers in the 35 intensive care unit (ICU) and medical-surgical floor during the onset of the COVID-19 outbreak 36 and screened for viral presence and microbial context. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint assigned to these patients (113 samples), and 734 hospital surfaces either inside or immediately 47 outside of the patients' rooms over the span of two months (Fig. 1A). 48 The 16 patients enrolled in this study ranged from age 20 to 84, with a median age of 49.5 49 (Fig S1). 31% were female and 69% were male, consistent with reports that men tend to experience 50 more severe COVID-19 symptoms (26). Of the patients for whom antibiotic treatment information 51 was collected, 77% were on antibiotics, of which 80% were taking more than one antibiotic. The 52 number of days spent in the hospital ranged from 1 to 25, with a median stay of 9 days. 53 Each sample was screened for the presence of SARS-CoV-2 using three distinct 54 primer/probe sets: the U.S. Center for Disease Control N1 and N2 targets, and the World Health for SARS-CoV-2, including those touched primarily by health care workers (keyboard, ventilator 62 buttons, door handles inside, and outside the rooms) and those directly in contact with the patient 63 (toilet seats, bed rails). Of the patients enrolled in the study, we collected at least one positive 64 sample from 15/16 patients (nares, forehead, or stool) and from 14/15 associated hospital rooms.

67
Floor samples had the highest positivity rates (36% of samples collected from the floor 68 near the patients' bed, i.e. "Inside Floor", and 26% of samples collected from the floor immediately 69 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint 4 outside of the patient room, i.e. "Outside Floor") ( Fig. 1B, Fig. S2). In some cases, SARS-CoV-2 70 was detected on the floors of rooms with patients who tested negative for COVID-19 and in rooms 71 that had been cleaned following COVID-19 patient occupancy (Fig. 1B, Fig. S3B). Most of the 72 positive surface samples amplified only one or two out of the three SARS-CoV-2 targets (Fig. 1C) 73 and had significantly lower viral load over time compared to patient nares and stool samples 74 (p<0.003, non-parametric test from sparse functional principal components analysis) (28), but 75 similar viral load to patient forehead samples (Fig. 1D).

76
SARS-CoV-2 viral load tended to decrease in patients over time ( Fig. 1E)

E) F)
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint   on the meta-analysis we found that floor samples, which cluster separately from the rest of this 116 dataset (Fig. 2C), are similar to built environment samples from previous studies (Fig. S4).

117
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint Beta-diversity estimated using unweighted UniFrac distances (32) in this study showed 118 that floor samples, stool samples, and nares/forehead samples formed three distinct clusters with 119 other surfaces falling between the human skin and floor samples ( Fig. 2B-C). SARS-CoV-2 viral 120 load was weakly correlated with unweighted UniFrac beta-diversity (PERMANOVA R 2 <0.01, p-121 value = 0.043, Fig. S5). 122 We compared beta-diversity between human samples and paired built environment  (Fig. 2D). Notably, the percent of SARS-CoV-2 positive bed rail samples was lower than 128 floor (11% vs 39%) despite the high similarity of bed rail microbiomes to the corresponding patient 129 microbiomes.

130
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

D)
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint Longitudinal beta-diversity analysis reveals patient-surface microbial convergence 142 To account for the longitudinal nature of this dataset, we applied a compositional tensor 143 factorization method implemented through the Gemelli QIIME2 plugin (33, 34) (Fig. 3A). 144 Actinomycetales and Bacteroidales were the most highly ranked taxa driving the separation of  is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint 11 rail vs Nares (j), Bed rail vs Stool (k), Bed rail vs Forehead (l), Nares vs Stool (m), Nares vs 164 Forehead (n), Stool vs Forehead (o). Full statistics in Data File S1.

166
Positive association of microbial diversity and biomass with SARS-CoV-2 167 Next, we evaluated potential alpha diversity differences associated with SARS-CoV-2 168 detection. Overall, Faith's phylogenetic alpha-diversity was significantly higher among surface 169 samples than patient or health care worker samples (Fig. 4A). Across all sample types, Faith's 170 phylogenetic diversity tended to be higher in SARS-CoV-2 positive samples, and was significantly 171 higher in forehead, inside floor, and outside floor samples ( Fig 4B). CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint The high alpha-diversity of floor samples and significant association with SARS-CoV-2 180 detection led us to examine potential differences in biomass across floor samples. Two 181 independent metrics were used to assess biomass; 16S rRNA gene amplicon sequencing read 182 count, which because of our equal volume sequencing library pooling approach correlates with

188
The abundance of human RNAse P was also significantly higher in floor samples with SARS- CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint 13 had a small, yet significant effect size (0.8%, p-value=0.04). Importantly, SARS-CoV-2 detection 203 status also significantly contributed to microbial variation (3.4%, p-value = 0.0004).  CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint normally found in anterior nares samples (44-46), but are not commonly described in forehead 238 microbiome samples. Interestingly, from Figure 2C, we observed that the unweighted UniFrac 239 distance between samples from the same individual's nares and forehead were more similar in 240 COVID-positive room surfaces, suggesting that patients who shed virus into their environment 241 could be cross-contaminating bacteria between nares and forehead (Fig. S8).

242
One ASV with an exact match to Rothia dentocariosa (GenBank ID CP054018.1) was 243 highly ranked across all four disparate sample types: nares, forehead, stool, and inside floor.  CoV-2 in hospital settings, ranging from 25% to over 50% (52, [54][55][56]. The lower SARS-CoV-2 295 prevalence rates in this study could be due to differences in sampling strategy (e.g. area sampled, 296 storage and extraction methods), more careful environmental cleaning of high touch areas around 297 the patient, or due to physiological differences since different surface types differentially influence 298 viral persistence (57). Furthermore, contamination of hospital room surfaces with SARS-CoV-2 299 tends to be highest during the first 5 days after symptom onset (Chia et al., 2020). All patients 300 enrolled in our study had symptoms for at least 6 days before admission to the hospital and 301 enrollment in this study.

302
While SARS-CoV-2 was identified via RT-qPCR for both patient and hospital room 303 samples, it cannot be determined whether the detected virus was viable. Infectivity is both a 304 function of viral viability and abundance. One study assaying infectivity and RT-qPCR in parallel 305 showed that samples with Ct values >30 were not infectious (56). In our study, only 2 out of 79 306 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint positive surface samples amplified at least one SARS-CoV-2 target under 30 cycles, suggesting a 307 relative low viral abundance. Interestingly, both of these samples were from the floor directly next 308 to the patient bed in rooms that hosted patients who were mechanically ventilated during their stay.

309
One of these potentially infectious samples was collected after the patient was transferred to the  It should be acknowledged that transportation of samples in ethanol (to ensure the safety 317 of those handling samples, as well as to enable microbiome analysis) instead of using viral 318 transport media may have resulted in overall lower viral RNA yield. Despite these potential 319 sources of variation, we found that bed rail and patient samples were highly similar in microbiomes 320 to one another before cleaning, but this similarity disappeared after cleaning. Microbial community 321 composition was also more similar between humans and the surfaces they touched (including 322 between health care workers and keyboards, as well as patients and bed rails), supporting the 323 robustness of our microbial sample collection and processing protocols.  Table S1) which is representative of hospital 328 environmental practices worldwide. SARS-CoV-2 was amplified from floor samples, albeit at a 329 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.  Using Random Forest models to classify microbes associated with SARS-CoV-2 detection, 346 we found 16S microbial profiles had high predictive accuracy of SARS-CoV-2 presence in nares, 347 stool, forehead, and inside floor samples. Despite these sample types having distinct microbiomes 348 covering a broad range of microbial diversity (Fig. 2), we identified a single Rothia ASV that was CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review) preprint
The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint 21 study from an intensive care unit (30), we found that this signal is specific to SARS-CoV-2 positive 353 samples, and not other factors associated with an ICU admission such as antibiotic use. This 354 finding supports previous work reporting Rothia to be enriched in SARS-CoV-2 positive stool (59) 355 and bronchoalveolar lavage fluid (60), and further suggests a role in nares, forehead, and surfaces.

356
While the mechanism remains unclear, the consistent Rothia ASV prevalence trend across 357 both patient and surface sample types suggest an association of this bacteria with SARS-COV-2.  Interestingly, we also found that patients with cardiovascular disease comorbidities tended to have 363 higher prevalence of the Rothia ASV associated with SARS-CoV-2, compared to patients with 364 pre-existing cardiovascular disease (45% versus 26%, respectively). Rothia dentocariosa can    (Fig. 1A). Actual sample 412 collection timing varied by patient availability and duration in the hospital (Fig. S3). CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint was present, the intake was swabbed. For a subset of samples, patient care equipment such as 422 portable ultrasound and ventilator screen were also swabbed, as well as the toilet seat. After sample 423 collection, dual-tipped swabs were returned to the swab container. Surface samples were collected 424 at the same time as patient sample collection, as well as prior to patient admission and following 425 patient discharge and room cleaning, when possible.          CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint doi:10.1016/S1473-3099(20)30678-2.

608
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint G. Streicker, Using noninvasive metagenomics to characterize viral communities from wildlife,   CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint antimicrobial resistance and HAI-associated antimicrobial consumption and costs: a multicenter 699 study, Infect. Drug Resist. 12, 501-510 (2019).

700
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint Tigchelaar, L. Eeckhaudt, J. Fu, L. Henckaerts, A. Zhernakova, C. Wijmenga, J. Raes, 723 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

746
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint 769 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted November 22, 2020. ; https://doi.org/10.1101/2020.11.19.20234229 doi: medRxiv preprint