TY - JOUR T1 - Genomic Diversity of SARS-CoV-2 During Early Introduction into the United States National Capital Region JF - medRxiv DO - 10.1101/2020.08.13.20174136 SP - 2020.08.13.20174136 AU - Peter M. Thielen AU - Shirlee Wohl AU - Thomas Mehoke AU - Srividya Ramakrishnan AU - Melanie Kirsche AU - Oluwaseun Falade-Nwulia AU - Nídia S. Trovão AU - Amanda Ernlund AU - Craig Howser AU - Norah Sadowski AU - Paul Morris AU - Mark Hopkins AU - Matthew Schwartz AU - Yunfan Fan AU - Victoria Gniazdowski AU - Justin Lessler AU - Lauren Sauer AU - Michael C. Schatz AU - Jared D. Evans AU - Stuart C. Ray AU - Winston Timp AU - Heba H. Mostafa Y1 - 2020/01/01 UR - http://medrxiv.org/content/early/2020/08/15/2020.08.13.20174136.abstract N2 - Background The early COVID-19 pandemic has been characterized by rapid global spread. In the United States National Capital Region, over 2,000 cases were reported within three weeks of its first detection in March 2020. We aimed to use genomic sequencing to understand the initial spread of SARS-CoV-2, the virus that causes COVID-19, in the region. By correlating genetic information to disease phenotype, we also aimed to gain insight into any correlation between viral genotype and case severity or transmissibility.Methods We performed whole genome sequencing of clinical SARS-CoV-2 samples collected in March 2020 by the Johns Hopkins Health System. We analyzed these regional SARS-CoV-2 genomes alongside detailed clinical metadata and the global phylogeny to understand early establishment of the virus within the region.Results We analyzed 620 samples from the Johns Hopkins Health System collected between March 11–31, 2020, comprising 37.3% of the total cases in Maryland during this period. We selected 143 of these samples for sequencing, generating 114 complete viral genomes. These genomes belong to all five major Nextstrain-defined clades, suggesting multiple introductions into the region and underscoring the diversity of the regional epidemic. We also found that clinically severe cases had genomes belonging to all of these clades.Conclusions We established a pipeline for SARS-CoV-2 sequencing within the Johns Hopkins Health system, which enabled us to capture the significant viral diversity present in the region as early as March 2020. Efforts to control local spread of the virus were likely confounded by the number of introductions into the region early in the epidemic and interconnectedness of the region as a whole.Competing Interest StatementWT has two patents (8,748,091 and 8,394,584) licensed to Oxford Nanopore Technologies. Funding StatementThis work was supported by the Johns Hopkins University President's Fund Research Response (W.T., S.C.R., P.M.T), the Johns Hopkins University Applied Physics Laboratory Internal Research and Development (P.M.T), The Johns Hopkins Hospital (S.W. and J.L.), the Johns Hopkins Department of Pathology, NIH awards U41HG006620 (M.C.S.) and R01AI134384 (M.C.S.), and NSF award DBI-1627442 (M.C.S.).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:This study was approved by the Johns Hopkins University School of Medicine Institutional Review Board.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesRaw nanopore and Illumina data are deposited at SRA (Bioproject PRJNA629390). Consensus sequences are deposited at GISAID and Genbank (MT509452-MT509493, MT646048-MT646120) ER -