Viral genetic sequencing identifies staff transmission of COVID-19 is important in a community hospital outbreak

Background Whole-genome sequencing has been used successfully to provide additional information for transmission pathways in infectious spread. We report and interpret genomic sequencing results in clinical context from a large outbreak of COVID-19 with 46 cases across staff and patients in a community hospital in the UK. Methods Following multiple symptomatic cases within a two-week period, all staff and patients were screened by RT-PCR and staff subsequently had serology tests. Findings Thirty staff (25%) and 16 patients (62%) tested positive for COVID-19. Genomic sequencing data showed significant overlap of viral haplotypes in staff who had overlapping shift patterns. Patient haplotypes were more distinct from each other but had overlap with staff haplotypes. Interpretation This study includes clinical and genomic epidemiological detail that demonstrates the value of a combined approach. Viral genetic sequencing has identified that staff transmission of COVID-19 was important in this community hospital outbreak.


Introduction
Whole-genome sequencing and phylogenetic analysis of pathogen isolates can be combined with traditional epidemiology to define transmission pathways and track outbreaks. 1 Applied to bacterial pathogens such as methicillin-resistant Staphylococcus aureus (MRSA), 2 sequencing has distinguished between closely related genotypes that would be otherwise indistinguishable, thereby improving the resolution of analyses of pathogen transmission.
Viruses have high rates of mutation and replication, resulting in rapid genetic variation. 3 It has been proposed that viral genome sequencing and phylogenetic analyses could be key in viral epidemics by adding molecular precision 4 and this approach was pioneered in near real time during the West African ebola epidemic of 2014 to 2016 5 . The SARS-CoV-2 pandemic has precipitated unprecedented levels of application of viral genome sequencing, which has been used to inform clinical decision-making. 5 Clinically, spread of COVID-19 can occur in the community or within hospitals and can be acquired from multiple sources. Outbreak analysis best practice combines detailed clinical epidemiology with supporting microbiological testing. 6 However, it is extremely difficult to ascertain the source of a positive case and standard testing by reverse transcriptase-polymerase chain reaction (RT-PCR) 7 does not aid understanding of the transmission pathway. Phylogenetic analysis of genome sequences from outbreaks can add essential detail. This is exemplified in New Zealand where genomic surveillance has been successfully deployed to understand the origins of SARS-CoV-2 in circulation and patterns of their transmission. 8 The COVID-19 Genomics UK (COG-UK) Consortium 9 was established to support rapid whole-genome viral sequencing for confirmed COVID-19 cases 10 . As of June 2020, the epidemic in the UK comprised at least 1356 independently introduced lineages. Comparison of the genome sequences of viral isolates from a cluster of cases can be used to test . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted February 19, 2021. ; https://doi.org/10.1101/2021.02.18.21250737 doi: medRxiv preprint hypothetical epidemiological links between those cases and provide supporting or contradictory evidence.
We report the analysis of a clinical outbreak of COVID-19 at a community hospital in Devon, United Kingdom, and show whole-genome viral sequencing combined with clinical context to hypothesise likely transmission routes in this outbreak.

Study design and participants
We investigated a COVID-19 outbreak that occurred in a 32-bed community hospital with 118 staff. The community hospital focuses on rehabilitation, with the majority of patients transferred following inpatient stays at the acute hospital.

COVID-19 testing
The full inpatient cohort of 26 and 118 staff were tested for COVID-19 by reverse transcriptase-polymerase chain reaction (RT-PCR) on nasopharyngeal and throat swabs within the outbreak period.
Serological testing of COVID-19 via antibody detection was introduced in the UK in May 2020. Antibody testing on serology was performed in 115 staff when first available in the UK NHS, seven weeks after the initial RT-PCR screening. Total antibody count has reported to yield up to 100% sensitivity from two weeks after symptom onset 7,11 . Serology was performed to ascertain whether initial screening at outbreak accounted for the majority of cases in the staff.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)

Clinical analysis
Detailed information was obtained for clinical epidemiological evaluation from outbreak surveillance and supported by additional detail for participants recruited to the Exeter COVID-19 Sequencing project (ExCoSe). The sources used for clinical analysis consisted of inpatient medical records, staff symptom questionnaires, and staff rotas. We ascertained symptom timelines, exposures, co-working, COVID-19 severity and outcomes.  CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted February 19, 2021. ; https://doi.org/10.1101/2021.02.18.21250737 doi: medRxiv preprint subsequent filtering, assembly and variant calling.

Ethical Approval
The genomic sequencing was performed as part of the COG-UK study. COG-UK genomic Approval for the ExCoSe study (STB63/CTB58) was obtained from the RDETB Steering Committee, who are delegated to approve study specific collections. The RDETB is managed within the NIHR Exeter Clinical Research Facility.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Description of outbreak
Multiple cases were detected, following the onset of symptoms in both staff and patient groups, therefore screening was conducted on all inpatients (n=26) and staff (n=118).

Staff cases
Six of the staff cases were asymptomatic at the point of testing, with two remaining asymptomatic. The majority developed mild to moderate symptoms at worst but one case had severe COVID-19 requiring hospitalisation. The first cases of symptomatic staff members preceded any positive results in inpatients. All symptomatic staff remained away from work appropriately. This outbreak occurred before routine testing was available for healthcare workers. Two positive staff members had been symptomatic, followed Public Health England guidance on absence and return to work and had returned to work following their period of absence prior to their positive COVID-19 swab.

Patients
The mean age of the 16 COVID-19 positive patients was 82 years (SD 8.7) and 63% were female. All of the COVID-19 positive patients had recently been inpatients at the acute hospital trust prior to transfer to community hospital, with a mean time from transfer to confirmed COVID-19 of 18 days (SD 9.5). Any patients with a clinical suspicion of COVID-19 were tested and confirmed COVID-19 negative prior to transfer to the community . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 19, 2021. ; https://doi.org/10.1101/2021.02.18.21250737 doi: medRxiv preprint hospital, as per the policy at that point; guidance was subsequently updated to test all prior to transfer. The patients had significant co-morbidity (mean seven co-morbidities) and polypharmacy (mean eight co-prescribed regular medications). Seven of the patients were asymptomatic for COVID-19 at the point of positive swab. Thirteen of the patients were transferred back to the acute trust, although this did not correlate with severity of symptoms as it was early in the pandemic and these patients were anticipated to potentially require further medical input. Four patients died within 30 days of COVID-19 positive test.

Serology
Subsequent serological testing was performed in 115 of the 118 staff who were tested by RT-PCR. Twenty-nine staff were PCR positive and serology positive. Ten additional staff members had positive serology. There was one case who was PCR positive and had successful viral sequencing for whom the serology was negative on two repeated tests. is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 19, 2021. ; https://doi.org/10.1101/2021.02.18.21250737 doi: medRxiv preprint the outbreak samples. The relationships between the viral haplotypes are illustrated in Figure   1A, placing the local cases into regional context of viral genetic diversity, and  4 , while we cannot confidently propose links between cases whose haplotypes are differentiated by one or two SNPs, it is plausible that they are connected.
The sequencing-based networks in Figure 1 reveal that the most-frequent haplotype (A) in the outbreak cases is frequent across samples sequenced regionally (n= 37 (haplotype A) out of 141 sequenced at that point). This suggests that this haplotype is common and therefore its co-incidence among 16 of 31 cases could be explained by either spread from a single initial case or multiple independent acquisitions from the community, or a combination of both of these. However, the second-most frequent haplotype (B) among outbreak cases is not seen in other samples external to the outbreak (Figure 1), nor in other samples sequenced to date nationally, strongly suggesting that those cases are epidemiologically linked. Two further cases were separated by a single mutation from this haplotype (haplotypes E and F) suggesting they could have arisen from haplotype B, and so were included in analysis of this cluster. We therefore undertook a more detailed analysis on the cluster of cases with . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 19, 2021. ; https://doi.org/10.1101/2021.02.18.21250737 doi: medRxiv preprint haplotype B plus haplotypes E and F to try to identify the likely spread of infection to guide policy to prevent future outbreaks (Figure 3).

Cluster analysis
The patients with haplotypes E and F and three of the four patients with haplotype B were asymptomatic and screened for COVID-19 by PCR test as part of the outbreak analysis, with one complaining of a mild cough following positive result. Patient 4 was negative at initial screening and subsequently positive. Detailed case-note review revealed the timeline of exposure and symptom onset (Figure 3). Three staff members within the cluster were symptomatic and off work before any patient cases were confirmed positive and prior to the transfer of patients 1 and 2 from the acute hospital to the community hospital. It is noticeable that the cluster of staff who share haplotype B sequencing had overlapping shift patterns over the two-week period preceding symptom onset, with regular points of contact at shift handover. Additionally, the staff cases within this cluster all developed symptoms within a one-week period.

Discussion
Genomic sequencing has shown that the majority of staff cases in this outbreak had a limited lineage, while patient cases were more likely to be distinct. The most common haplotype in this outbreak maps to the lineage sequenced most frequently both regionally and nationally.
Distinct mutations have arisen in this viral lineage to result in the cluster of cases described with haplotypes B, E and F, which have only been identified in this outbreak.
The cluster of cases within this distinct haplotype suggests a likely transmission pathway within the community hospital. COVID-19 positive patients more frequently had a unique haplotype than did staff, suggesting that patient-to-staff transmission was rare. The staff's . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 19, 2021. ; https://doi.org/10.1101/2021.02.18.21250737 doi: medRxiv preprint viral genomes predominantly fell within two haplotypes, one of which was unique to this outbreak. At the time of this reported outbreak, the South West of England had the lowest COVID-19 incidence in the country. 18 Such a large cluster of cases, within two (closely related) haplotypes in staff, indicates it is highly likely that there was significant staff-to-staff transmission in this outbreak. Patients had a more diverse range of haplotypes, but it is also probable that there was some staff-to-patient transmission, given overlapping haplotypes between staff and inpatients who had been in the community hospital for an average of 18 days prior to screening.
Whilst the route of transmission between staff cannot be definitely assigned, it is noted that the staff with shared haplotypes had overlapping shift patterns. Healthcare workers have higher COVID-19 exposure than the general population due to the nature of their work 19 and have been commonly affected worldwide. 20 There have been prior reported outbreaks in which healthcare workers have represented the majority of cases affected, 21 and nosocomial infection rates likely transmitted by healthcare workers have risen, leading to regular healthcare worker testing. 22 In the UK COVID-19 was found to be RT-PCR positive in 14% of healthcare workers reporting symptoms. 23 Rivett et al. described large-scale screening of 1,032 asymptomatic healthcare workers, with 3% positive. 24 Those with symptoms can follow public health measures on isolation, but asymptomatic carriage is problematic.
In this outbreak viral sequencing provided key information on which cases are likely to be linked versus those from different sources. This illustrates the utility of sequencing and supports the use of rapid viral sequencing in the evaluation of COVID-19 outbreaks. We have shown that in future COVID-19 surge episodes or outbreaks whole-genome sequencing could be used effectively to support clinical evaluation. This method could be employed alongside contact tracing to track COVID-19 spread in real-time.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 19, 2021. ; https://doi.org/10.1101/2021.02.18.21250737 doi: medRxiv preprint unavailable samples or insufficient sequence, probably reflecting low viral load at testing. This outbreak was prior to widespread testing among healthcare workers or asymptomatic inpatients outside outbreak scenarios and if events repeated now it is likely that testing would have occurred earlier in individual symptomatic cases.
A key strength of this study is that it combined viral genomic sequencing with epidemiological clinical analysis, RT-PCR and subsequent serological testing to facilitate full outbreak analysis. The screening of all staff members first by RT-PCR and then by serology testing has given a complete picture in this outbreak. The subsequent antibody testing shows that the majority of cases occurred in the staff during this outbreak . The COG-UK genetic sequencing provides excellent coverage for genomic sequencing in the UK and better clinical corroboration could aid understanding in outbreaks and more generally, such as whether new COVID-19 cases over time in the same setting (e.g. a specific care home) are the same haplotype or if distinct viral haplotypes are being introduced by staff or visitors. This could significantly affect policy, but requires centres having access to both clinical detail and viral genomic sequencing, a particular strength of this study, in which the NHS trust and University teams work in close collaboration. It is particularly challenging to combine sequencing and clinical information from non-hospital settings such as care homes due to centralisation of testing practices.

Conclusion
This case demonstrates the added value of genomic epidemiology in outbreak analysis. The likely staff-to-staff transmission and potential staff-to-patient transmission should be heeded.
It reinforces the need for implementing infection control strategies and social distancing within health and social care environments between healthcare workers 19,25 , even in the . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

(which was not certified by peer review)
The copyright holder for this preprint this version posted February 19, 2021. ; https://doi.org/10.1101/2021.02.18.21250737 doi: medRxiv preprint absence of symptoms. Ward handovers and other meeting events, plus staff rest spaces and office areas may be particular risks and need to be urgently re-evaluated and adapted during the pandemic of this highly infectious virus. The lessons learnt from this are transferrable to hospitals and care home settings, in which staff groups provide care and assistance to older people. Viral genetic sequencing has identified that staff transmission of COVID-19 was important in this community hospital outbreak.

Acknowledgements
We would like to acknowledge the COVID-19 Genomics UK (COG-UK) consortium, which provides the infrastructure for COVID-19 sequencing across the UK.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)   Table 1). Each node represents a single haplotype and its area is proportional to the numbers of samples within that haplotype; the proportions of the circle shaded in red and yellow indicate the proportions of samples in that haplotype from outbreak . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted February 19, 2021. ; https://doi.org/10.1101/2021.02.18.21250737 doi: medRxiv preprint and non-outbreak samples respectively. The mutations separating the nodes are indicated by the cross hatches on the connecting edges.  test is represented by a red cross.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) The copyright holder for this preprint this version posted February 19, 2021. ; https://doi.org/10.1101/2021.02.18.21250737 doi: medRxiv preprint