%0 Journal Article %A Andrew J. Page %A Alison E. Mather %A Thanh Le-Viet %A Emma J. Meader %A Nabil-Fareed Alikhan %A Gemma L. Kay %A Leonardo de Oliveira Martins %A Alp Aydin %A Dave J. Baker %A Alexander J. Trotter %A Steven Rudder %A Ana P. Tedim %A Anastasia Kolyva %A Rachael Stanley %A Maria Diaz %A Will Potter %A Claire Stuart %A Lizzie Meadows %A Andrew Bell %A Ana Victoria Gutierrez %A Nicholas M. Thomson %A Evelien M. Adriaenssens %A Tracey Swingler %A Rachel A. J. Gilroy %A Luke Griffith %A Dheeraj K. Sethi %A Rose K. Davidson %A Robert A. Kingsley %A Luke Bedford %A Lindsay J. Coupland %A Ian G. Charles %A Ngozi Elumogo %A John Wain %A Reenesh Prakash %A Mark A. Webber %A S. J. Louise Smith %A Meera Chand %A Samir Dervisevic %A Justin O’Grady %A The COVID-19 Genomics UK (COG-UK) consortium %T Large scale sequencing of SARS-CoV-2 genomes from one region allows detailed epidemiology and enables local outbreak management %D 2020 %R 10.1101/2020.09.28.20201475 %J medRxiv %P 2020.09.28.20201475 %X The COVID-19 pandemic has spread rapidly throughout the world. In the UK, the initial peak was in April 2020; in the county of Norfolk (UK) and surrounding areas, which has a stable, low-density population, over 3,200 cases were reported between March and August 2020. As part of the activities of the national COVID-19 Genomics Consortium (COG-UK) we undertook whole genome sequencing of the SARS-CoV-2 genomes present in positive clinical samples from the Norfolk region. These samples were collected by four major hospitals, multiple minor hospitals, care facilities and community organisations within Norfolk and surrounding areas. We combined clinical metadata with the sequencing data from regional SARS-CoV-2 genomes to understand the origins, genetic variation, transmission and expansion (spread) of the virus within the region and provide context nationally. Data were fed back into the national effort for pandemic management, whilst simultaneously being used to assist local outbreak analyses. Overall, 1,565 positive samples (172 per 100,000 population) from 1,376 cases were evaluated; for 140 cases between two and six samples were available providing longitudinal data. This represented 42.6% of all positive samples identified by hospital testing in the region and encompassed those with clinical need, and health and care workers and their families. 1,035 cases had genome sequences of sufficient quality to provide phylogenetic lineages. These genomes belonged to 26 distinct global lineages, indicating that there were multiple separate introductions into the region.Furthermore, 100 genetically-distinct UK lineages were detected demonstrating local evolution, at a rate of ∼2 SNPs per month, and multiple co-occurring lineages as the pandemic progressed. Our analysis: identified a sublineage associated with 6 care facilities; found no evidence of reinfection in longitudinal samples; ruled out a nosocomial outbreak; identified 16 lineages in key workers which were not in patients indicating infection control measures were effective; found the D614G spike protein mutation which is linked to increased transmissibility dominates the samples and rapidly confirmed relatedness of cases in an outbreak at a food processing facility. The large-scale genome sequencing of SARS-CoV-2-positive samples has provided valuable additional data for public health epidemiology in the Norfolk region, and will continue to help identify and untangle hidden transmission chains as the pandemic evolves.Major points In Norfolk and surrounding regions100 distinct UK lineages were identified.16 UK lineages found in key workers were not observed in patients or in community care.172 genomes from SARS-CoV-2 positive samples sequenced per 100,000 population representing 42.6% of all positive cases.SARS-CoV-2 genomes from 1035 cases sequenced to a high quality.Only 5 countries, out of 103, have sequenced more SARS-CoV-2 genomes than have been sequenced in Norfolk for this paper.Samples covered the entire first wave, March to August 2020.Stable evolutionary rate of 2 SNPs per month.D614G mutation is the dominant genotype and associated with increased transmission.No evidence of reinfection in 42 cases with longitudinal samples.WGS identified a sublineage associated with care facilities.WGS ruled out nosocomial outbreaks.Rapid WGS confirmed the relatedness of cases from an outbreak at a food processing facility.Competing Interest StatementLG received a partial support for his PhD from Roche.Funding StatementThe authors gratefully acknowledge the support of the Biotechnology and Biological Sciences Research Council (BBSRC); this research was funded by the BBSRC Institute Strategic Programme Microbes in the Food Chain BB/R012504/1 and its constituent projects BBS/E/F/000PR10348, BBS/E/F/000PR10349, BBS/E/F/000PR10351, and BBS/E/F/000PR10352. DJB, NFA, TLV and AJP were supported by the Quadram Institute Bioscience BBSRC funded Core Capability Grant (project number BB/CCG1860/1). EMA was funded by the BBSRC Institute Strategic Programme Gut Microbes and Health BB/R012490/1 and its constituent project(s) BBS/E/F/000PR10353 and BBS/E/F/000PR10356. The sequencing costs were funded by the COVID-19 Genomics UK (COG-UK) Consortium which is supported by funding from the Medical Research Council (MRC) part of UK Research & Innovation (UKRI), the National Institute of Health Research (NIHR) and Genome Research Limited, operating as the Wellcome Sanger Institute. The author(s) gratefully acknowledge the UKRI Biotechnology and Biological Sciences Research Council's (BBSRC) support of The Norwich Research Park Biorepository. LG was supported by a DART MRC iCASE and Roche Diagnostics. APT was funded by Sara Borrell Research Grant CD018/0123 from ISCIII and co-financed by the European Development Regional Fund (A Way to Achieve Europe program) and APT QIB internship additionally funded by "Ayuda de la SEIMC". The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The COVID-19 Genomics UK Consortium has been given approval by Public Health England's Research Ethics and Governance Group (PHE R and D Ref: NR0195).All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesConsensus sequences were deposited in GISAID if they met its minimum QC threshold. Raw sequence data were deposited in the European Nucleotide Archive (ENA). Accession numbers detailed in Supplementary Tables 1-3. %U https://www.medrxiv.org/content/medrxiv/early/2020/09/30/2020.09.28.20201475.full.pdf