Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Using viral genomics to estimate undetected infections and extent of superspreading events for COVID-19

View ORCID ProfileLucy M. Li, Patrick Ayscue
doi: https://doi.org/10.1101/2020.05.05.20092098
Lucy M. Li
1Chan Zuckerberg Biohub, San Francisco, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lucy M. Li
  • For correspondence: lucy.li@czbiohub.org
Patrick Ayscue
1Chan Zuckerberg Biohub, San Francisco, CA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Asymptomatic infections and limited testing capacity have led to under-reporting of SARS-CoV-2 cases. This has hampered the ability to ascertain true infection numbers, evaluate the effectiveness of surveillance strategies, determine transmission dynamics, and estimate reproductive numbers. Leveraging both viral genomic and time series case data offers methods to estimate these parameters.

Using a Bayesian inference framework to fit a branching process model to viral phylogeny and time series case data, we estimated time-varying reproductive numbers and their variance, the total numbers of infected individuals, the probability of case detection over time, and the estimated time to detection of an outbreak for 12 locations in Europe, China, and the United States.

The median percentage of undetected infections ranged from 13% in New York to 92% in Shanghai, China, with the length of local transmission prior to two cases being detected ranging from 11 days (95% CI: 4-21) in California to 37 days (9-100) in Minnesota. The probability of detection was as low as 1% at the start of local epidemics, increasing as the number of reported cases increased exponentially. The precision of estimates increased with the number of full-length viral genomes in a location. The viral phylogeny was informative of the variance in the reproductive number with the 32% most infectious individuals contributing 80% of total transmission events.

This is the first study that incorporates both the viral genomes and time series case data in the estimation of undetected COVID-19 infections. Our findings suggest the presence of undetected infections broadly and that superspreading events are contributing less to observed dynamics than during the SARS epidemic in 2003. This genomics-informed modeling approach could estimate in near real-time critical surveillance metrics to inform ongoing COVID-19 response efforts.

Funding AWS provided computational credit via the Diagnostic Development Initiative.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

AWS provided computational credit via the Diagnostic Development Initiative.

Author Declarations

All relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.

Yes

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

Data are provided as supplementary files.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
Posted May 09, 2020.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Using viral genomics to estimate undetected infections and extent of superspreading events for COVID-19
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Using viral genomics to estimate undetected infections and extent of superspreading events for COVID-19
Lucy M. Li, Patrick Ayscue
medRxiv 2020.05.05.20092098; doi: https://doi.org/10.1101/2020.05.05.20092098
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Using viral genomics to estimate undetected infections and extent of superspreading events for COVID-19
Lucy M. Li, Patrick Ayscue
medRxiv 2020.05.05.20092098; doi: https://doi.org/10.1101/2020.05.05.20092098

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Epidemiology
Subject Areas
All Articles
  • Addiction Medicine (214)
  • Allergy and Immunology (495)
  • Anesthesia (106)
  • Cardiovascular Medicine (1091)
  • Dentistry and Oral Medicine (194)
  • Dermatology (141)
  • Emergency Medicine (274)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (497)
  • Epidemiology (9747)
  • Forensic Medicine (5)
  • Gastroenterology (480)
  • Genetic and Genomic Medicine (2298)
  • Geriatric Medicine (221)
  • Health Economics (461)
  • Health Informatics (1548)
  • Health Policy (729)
  • Health Systems and Quality Improvement (600)
  • Hematology (236)
  • HIV/AIDS (500)
  • Infectious Diseases (except HIV/AIDS) (11620)
  • Intensive Care and Critical Care Medicine (615)
  • Medical Education (236)
  • Medical Ethics (67)
  • Nephrology (256)
  • Neurology (2136)
  • Nursing (133)
  • Nutrition (332)
  • Obstetrics and Gynecology (424)
  • Occupational and Environmental Health (516)
  • Oncology (1171)
  • Ophthalmology (363)
  • Orthopedics (128)
  • Otolaryngology (220)
  • Pain Medicine (145)
  • Palliative Medicine (50)
  • Pathology (308)
  • Pediatrics (692)
  • Pharmacology and Therapeutics (298)
  • Primary Care Research (265)
  • Psychiatry and Clinical Psychology (2168)
  • Public and Global Health (4640)
  • Radiology and Imaging (775)
  • Rehabilitation Medicine and Physical Therapy (450)
  • Respiratory Medicine (622)
  • Rheumatology (273)
  • Sexual and Reproductive Health (224)
  • Sports Medicine (208)
  • Surgery (250)
  • Toxicology (42)
  • Transplantation (120)
  • Urology (94)