Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Associations Between Google Search Trends for Symptoms and COVID-19 Confirmed and Death Cases in the United States

Mostafa Abbas, Thomas B. Morland, Eric S. Hall, View ORCID ProfileYasser EL-Manzalawy
doi: https://doi.org/10.1101/2021.02.22.21252254
Mostafa Abbas
1Department of Translational Data Science and Informatics, Geisinger
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Thomas B. Morland
2Department of General Internal Medicine, Geisinger
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Eric S. Hall
1Department of Translational Data Science and Informatics, Geisinger
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yasser EL-Manzalawy
1Department of Translational Data Science and Informatics, Geisinger
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yasser EL-Manzalawy
  • For correspondence: yelmanzalawi@geisinger.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

We utilize functional data analysis techniques to investigate patterns of COVID-19 positivity and mortality in the US and their associations with Google search trends for COVID-19 related symptoms. Specifically, we represent state-level time series data for COVID-19 and Google search trends for symptoms as smoothed functional curves. Given these functional data, we explore the modes of variation in the data using functional principal component analysis (FPCA). We also apply functional clustering analysis to identify patterns of COVID-19 confirmed case and death trajectories across the US. Moreover, we quantify the associations between Google COVID-19 search trends for symptoms and COVID-19 confirmed case and death trajectories using dynamic correlation. Finally, we examine the dynamics of correlations for the top nine Google search trends of symptoms commonly associated with COVID-19 confirmed case and death trajectories. Our results reveal and characterize distinct patterns for COVID-19 spread and mortality across the US. The dynamics of these correlations suggest the feasibility of using Google queries to forecast COVID-19 cases and mortality for up to three weeks in advance. Our results and analysis framework set the stage for the development of predictive models for forecasting COVID-19 confirmed cases and deaths using historical data and Google search trends for nine symptoms associated with both outcomes.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

YE is supported by startup funding from Geisinger Health System. The funder had no role in the design of the study, collection, analysis, or interpretation of data or the writing of the manuscript.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

N/A

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

The numbers of daily COVID-19 confirmed cases and deaths were obtained from the Centers for Disease Control and Prevention (CDC) at https://data.cdc.gov/Case-Surveillance/United-States-COVID-19-Cases-and-Deaths-by-State-o/9mfq-cb36. The Google COVID-19 Search Trends Symptoms dataset is publicly available at https://github.com/google-research/open-covid-19-data/. The 2019 US Census data are available at https://www.census.gov/data.html.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted February 24, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Associations Between Google Search Trends for Symptoms and COVID-19 Confirmed and Death Cases in the United States
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Associations Between Google Search Trends for Symptoms and COVID-19 Confirmed and Death Cases in the United States
Mostafa Abbas, Thomas B. Morland, Eric S. Hall, Yasser EL-Manzalawy
medRxiv 2021.02.22.21252254; doi: https://doi.org/10.1101/2021.02.22.21252254
Digg logo Reddit logo Twitter logo Facebook logo Google logo LinkedIn logo Mendeley logo
Citation Tools
Associations Between Google Search Trends for Symptoms and COVID-19 Confirmed and Death Cases in the United States
Mostafa Abbas, Thomas B. Morland, Eric S. Hall, Yasser EL-Manzalawy
medRxiv 2021.02.22.21252254; doi: https://doi.org/10.1101/2021.02.22.21252254

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Epidemiology
Subject Areas
All Articles
  • Addiction Medicine (164)
  • Allergy and Immunology (417)
  • Anesthesia (93)
  • Cardiovascular Medicine (867)
  • Dentistry and Oral Medicine (159)
  • Dermatology (98)
  • Emergency Medicine (251)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (398)
  • Epidemiology (8597)
  • Forensic Medicine (4)
  • Gastroenterology (391)
  • Genetic and Genomic Medicine (1775)
  • Geriatric Medicine (170)
  • Health Economics (376)
  • Health Informatics (1252)
  • Health Policy (625)
  • Health Systems and Quality Improvement (472)
  • Hematology (198)
  • HIV/AIDS (380)
  • Infectious Diseases (except HIV/AIDS) (10354)
  • Intensive Care and Critical Care Medicine (554)
  • Medical Education (193)
  • Medical Ethics (51)
  • Nephrology (214)
  • Neurology (1692)
  • Nursing (97)
  • Nutrition (252)
  • Obstetrics and Gynecology (330)
  • Occupational and Environmental Health (451)
  • Oncology (934)
  • Ophthalmology (265)
  • Orthopedics (104)
  • Otolaryngology (172)
  • Pain Medicine (115)
  • Palliative Medicine (40)
  • Pathology (256)
  • Pediatrics (541)
  • Pharmacology and Therapeutics (257)
  • Primary Care Research (210)
  • Psychiatry and Clinical Psychology (1788)
  • Public and Global Health (3877)
  • Radiology and Imaging (629)
  • Rehabilitation Medicine and Physical Therapy (324)
  • Respiratory Medicine (525)
  • Rheumatology (208)
  • Sexual and Reproductive Health (171)
  • Sports Medicine (159)
  • Surgery (191)
  • Toxicology (36)
  • Transplantation (101)
  • Urology (76)