medRxiv

Variant abundance estimation for SARS-CoV-2 in wastewater using RNA-Seq quantification

Jasmijn A. Baaijens, Alessandro Zulli, Isabel M. Ott, Mary E. Petrone, Tara Alpert, Joseph R. Fauver, Chaney C. Kalinich, Chantal B.F. Vogels, Mallery I. Breban, Claire Duvallet, Kyle McElroy, Newsha Ghaeli, Maxim Imakaev, Malaika Mckenzie-Bennett, Keith Robison, Alex Plocik, Rebecca Schilling, Martha Pierson, Rebecca Littlefield, Michelle Spencer, Birgitte B. Simen, Yale SARS-CoV-2 Genomic Surveillance Initiative, William P. Hanage, Nathan D. Grubaugh, Jordan Peccia, Michael Baym
doi: https://doi.org/10.1101/2021.08.31.21262938
Author affiliations

  • 1 Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA: Jasmijn A. Baaijens, Michael Baym
  • 2 Department of Chemical and Environmental Engineering, Yale University, New Haven, CT, USA: Alessandro Zulli, Jordan Peccia
  • 3 Department of Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, CT, USA: Isabel M. Ott, Mary E. Petrone, Tara Alpert, Joseph R. Fauver, Chaney C. Kalinich, Chantal B.F. Vogels, Mallery I. Breban, Nathan D. Grubaugh
  • 4 Biobot Analytics, Inc., Cambridge, MA, USA: Claire Duvallet, Kyle McElroy, Newsha Ghaeli, Maxim Imakaev
  • 5 Ginkgo Bioworks, Inc., Boston, MA, USA: Malaika Mckenzie-Bennett, Keith Robison, Alex Plocik, Rebecca Schilling, Martha Pierson, Rebecca Littlefield, Michelle Spencer, Birgitte B. Simen
  • 6 Center for Communicable Disease Dynamics and Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA: William P. Hanage
  • 7 Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA: Nathan D. Grubaugh
  • For correspondence (Jasmijn A. Baaijens): j.a.baaijens@tudelft.nl

Abstract

Effectively monitoring the spread of SARS-CoV-2 variants is essential to efforts to counter the ongoing pandemic. Wastewater monitoring of SARS-CoV-2 RNA has proven an effective and efficient technique to approximate COVID-19 case rates in the population. Predicting variant abundances from wastewater, however, is technically challenging. Here we show that by sequencing SARS-CoV-2 RNA in wastewater and applying computational techniques originally developed for RNA-Seq quantification, we can estimate the abundance of variants in wastewater samples. By sequencing samples from wastewater and clinical isolates in Connecticut, U.S.A., between January and April 2021, we show that the temporal dynamics of variant strains in the two sources broadly correspond. We further show that this technique can be used with other wastewater sequencing techniques by expanding to samples taken across the United States in a similar timeframe. We find high variability in signal among individual samples, and limited ability to detect the presence of variants with clinical frequencies <10%; nevertheless, the overall trends match what we observed from sequencing clinical samples. Thus, while clinical sequencing remains a more sensitive technique for population surveillance, wastewater sequencing can be used to monitor trends in variant prevalence in situations where clinical sequencing is unavailable or impractical.
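
The abstract does not describe the pipeline itself, so the following is only a rough illustration of the general idea behind RNA-Seq-style quantification, not the authors' implementation. RNA-Seq quantifiers commonly use expectation-maximization (EM) to split ambiguous reads among reference sequences; here the same scheme is applied to a toy read-by-reference compatibility matrix, and the per-reference estimates are then summed per lineage. The function names, the 0/1 compatibility matrix, and the lineage labels are all hypothetical.

```python
import numpy as np


def estimate_abundances(compat, n_iter=200, tol=1e-9):
    """EM estimate of mixture proportions over reference genomes.

    compat[i, j] is 1 if read i is compatible with reference j, else 0
    (a simplifying assumption; real tools use richer likelihoods).
    """
    compat = np.asarray(compat, dtype=float)
    n_refs = compat.shape[1]
    theta = np.full(n_refs, 1.0 / n_refs)  # start from uniform proportions
    for _ in range(n_iter):
        weighted = compat * theta                    # E-step numerator
        row_sums = weighted.sum(axis=1, keepdims=True)
        keep = row_sums[:, 0] > 0                    # drop unassignable reads
        if not keep.any():
            return theta
        resp = weighted[keep] / row_sums[keep]       # read-to-reference weights
        new_theta = resp.sum(axis=0) / resp.shape[0]  # M-step
        converged = np.abs(new_theta - theta).max() < tol
        theta = new_theta
        if converged:
            break
    return theta


def aggregate_by_lineage(theta, lineages):
    """Sum per-reference proportions into per-lineage proportions."""
    totals = {}
    for value, lineage in zip(theta, lineages):
        totals[lineage] = totals.get(lineage, 0.0) + float(value)
    return totals


# Toy data: six reads match both B.1.1.7 references, four match only "other".
compat = [[1, 1, 0]] * 6 + [[0, 0, 1]] * 4
lineages = ["B.1.1.7", "B.1.1.7", "other"]
totals = aggregate_by_lineage(estimate_abundances(compat), lineages)
print(totals)
```

In this toy case the reads that are ambiguous between the two B.1.1.7 references cannot be split meaningfully at the reference level, but the lineage-level total is still well determined, which is why aggregation happens after estimation.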

Competing Interest Statement

N.D.G. is an infectious diseases consultant for Tempus Labs. W.P.H. is a scientific advisory board member to Biobot Analytics and has received compensation for expert witness testimony on the expected course of the pandemic. N.G. is co-founder of Biobot Analytics; C.D., K.A.M., and M.I. are employees of Biobot Analytics.

Funding Statement

This work was supported in part by the Pew Charitable Trusts, the David and Lucile Packard Foundation, NIH NIGMS award R35GM133700, and the Alfred P. Sloan Foundation (J.A.B. and M.B.); CTSA Grant Number TL1 TR001864 (M.E.P. and T.A.); a Fast Grant from Emergent Ventures at the Mercatus Center at George Mason University (N.D.G.); CDC Contract #75D30120C09570 (N.D.G.); a Yale CoReCT pilot award (J.P. and N.D.G.); and NIH NIGMS award U54GM088558 (W.P.H.).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The Institutional Review Board from the Yale University Human Research Protection Program determined that the RT-qPCR testing and sequencing of de-identified remnant COVID-19 clinical samples obtained from clinical partners conducted in this study is not research involving human subjects (IRB Protocol ID: 2000028599).

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • † Denotes co-senior authorship

Data Availability

The raw SARS-CoV-2 sequencing data from New Haven wastewater (.fastq files) are available on NCBI SRA under Bioproject PRJNA741211. The clinical sequencing data can be accessed via covidtrackerct.com. The raw SARS-CoV-2 sequencing data from across the U.S. (.fastq files) are available on NCBI SRA under Bioproject PRJNA759260. The simulated wastewater sequencing data (.fastq files) for benchmarking are available on Zenodo (DOI: 10.5281/zenodo.5307070).

https://www.ncbi.nlm.nih.gov/bioproject/PRJNA741211

https://www.ncbi.nlm.nih.gov/bioproject/PRJNA759260

https://zenodo.org/record/5307070#.YS6eZNNKjUI

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Posted September 02, 2021.
medRxiv 2021.08.31.21262938; doi: https://doi.org/10.1101/2021.08.31.21262938
Subject Area

  • Epidemiology