RT Journal Article SR Electronic T1 Variant abundance estimation for SARS-CoV-2 in wastewater using RNA-Seq quantification JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2021.08.31.21262938 DO 10.1101/2021.08.31.21262938 A1 Jasmijn A. Baaijens A1 Alessandro Zulli A1 Isabel M. Ott A1 Mary E. Petrone A1 Tara Alpert A1 Joseph R. Fauver A1 Chaney C. Kalinich A1 Chantal B.F. Vogels A1 Mallery I. Breban A1 Claire Duvallet A1 Kyle McElroy A1 Newsha Ghaeli A1 Maxim Imakaev A1 Malaika Mckenzie-Bennett A1 Keith Robison A1 Alex Plocik A1 Rebecca Schilling A1 Martha Pierson A1 Rebecca Littlefield A1 Michelle Spencer A1 Birgitte B. Simen A1 Yale SARS-CoV-2 Genomic Surveillance Initiative A1 William P. Hanage A1 Nathan D. Grubaugh A1 Jordan Peccia A1 Michael Baym YR 2021 UL http://medrxiv.org/content/early/2021/09/02/2021.08.31.21262938.abstract AB Effectively monitoring the spread of SARS-CoV-2 variants is essential to efforts to counter the ongoing pandemic. Wastewater monitoring of SARS-CoV-2 RNA has proven an effective and efficient technique to approximate COVID-19 case rates in the population. Predicting variant abundances from wastewater, however, is technically challenging. Here we show that by sequencing SARS-CoV-2 RNA in wastewater and applying computational techniques initially used for RNA-Seq quantification, we can estimate the abundance of variants in wastewater samples. We show by sequencing samples from wastewater and clinical isolates in Connecticut U.S.A. between January and April 2021 that the temporal dynamics of variant strains broadly correspond. We further show that this technique can be used with other wastewater sequencing techniques by expanding to samples taken across the United States in a similar timeframe. We find high variability in signal among individual samples, and limited ability to detect the presence of variants with clinical frequencies <10%; nevertheless, the overall trends match what we observed from sequencing clinical samples. Thus, while clinical sequencing remains a more sensitive technique for population surveillance, wastewater sequencing can be used to monitor trends in variant prevalence in situations where clinical sequencing is unavailable or impractical.Competing Interest StatementN.D.G. is an infectious diseases consultant for Tempus Labs. W.P.H. is a scientific advisory board member to Biobot Analytics and has received compensation for expert witness testimony on the expected course of the pandemic. N.G. is co-founder of Biobot Analytics; C.D., K.A.M., and M.I. are employees of Biobot Analytics.Funding StatementThis work was supported in part by the Pew Charitable Trusts, the David and Lucile Packard Foundation, NIH NIGMS award R35GM133700, and the Alfred P. Sloan Foundation (J.A.B. and M.B); CTSA Grant Number TL1 TR001864 (M.E.P. and T.A.); Fast Grant from Emergent Ventures at the Mercatus Center at George Mason University (N.D.G.); CDC Contract #75D30120C09570 (N.D.G.); Yale CoReCT pilot award (J.P. and N.D.G.); and NIH NIGMS award U54GM088558 (W.P.H.).Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:The Institutional Review Board from the Yale University Human Research Protection Program determined that the RT-qPCR testing and sequencing of de-identified remnant COVID-19 clinical samples obtained from clinical partners conducted in this study is not research involving human subjects (IRB Protocol ID: 2000028599).All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesThe raw SARS-CoV-2 sequencing data from New Haven wastewater (.fastq files) are available on NCBI SRA under Bioproject PRJNA741211. The clinical sequencing data can be accessed via covidtrackerct.com. The raw SARS-CoV-2 sequencing data from across the U.S. (.fastq files) are available on NCBI SRA under Bioproject PRJNA759260. The simulated wastewater sequencing data (.fastq files) for benchmarking are available on Zenodo (DOI: 10.5281/zenodo.5307070). https://www.ncbi.nlm.nih.gov/bioproject/PRJNA741211 https://www.ncbi.nlm.nih.gov/bioproject/PRJNA759260 https://zenodo.org/record/5307070#.YS6eZNNKjUI