Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Unmasking the conversation on masks: Natural language processing for topical sentiment analysis of COVID-19 Twitter discourse

Abraham C. Sanders, Rachael C. White, Lauren S. Severson, Rufeng Ma, Richard McQueen, Haniel C. Alcântara Paulo, Yucheng Zhang, View ORCID ProfileJohn S. Erickson, View ORCID ProfileKristin P. Bennett
doi: https://doi.org/10.1101/2020.08.28.20183863
Abraham C. Sanders
Rensselaer Polytechnic Institute, Troy, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: sandea5@rpi.edu
Rachael C. White
Rensselaer Polytechnic Institute, Troy, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lauren S. Severson
Rensselaer Polytechnic Institute, Troy, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Rufeng Ma
Rensselaer Polytechnic Institute, Troy, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Richard McQueen
Rensselaer Polytechnic Institute, Troy, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Haniel C. Alcântara Paulo
Rensselaer Polytechnic Institute, Troy, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yucheng Zhang
Rensselaer Polytechnic Institute, Troy, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
John S. Erickson
Rensselaer Polytechnic Institute, Troy, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for John S. Erickson
Kristin P. Bennett
Rensselaer Polytechnic Institute, Troy, New York
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Kristin P. Bennett
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

In this exploratory study, we scrutinize a database of over one million tweets collected from March to July 2020 to illustrate public attitudes towards mask usage during the COVID-19 pandemic. We employ natural language processing, clustering and sentiment analysis techniques to organize tweets relating to mask-wearing into high-level themes, then relay narratives for each theme using automatic text summarization. In recent months, a body of literature has highlighted the robustness of trends in online activity as proxies for the sociological impact of COVID-19. We find that topic clustering based on mask-related Twitter data offers revealing insights into societal perceptions of COVID-19 and techniques for its prevention. We observe that the volume and polarity of mask-related tweets has greatly increased. Importantly, the analysis pipeline presented may be leveraged by the health community for qualitative assessment of public response to health intervention techniques in real time.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This work was partially funded by a grant from the United Health Foundation

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

No IRB review required

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • Final revisions incorporating reviewer feedback.

Data Availability

All Tweet IDs associated with this work have been made available via a publicly-accessible repository

https://github.com/TheRensselaerIDEA/COVID-masks-nlp

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
Posted March 20, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Unmasking the conversation on masks: Natural language processing for topical sentiment analysis of COVID-19 Twitter discourse
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Unmasking the conversation on masks: Natural language processing for topical sentiment analysis of COVID-19 Twitter discourse
Abraham C. Sanders, Rachael C. White, Lauren S. Severson, Rufeng Ma, Richard McQueen, Haniel C. Alcântara Paulo, Yucheng Zhang, John S. Erickson, Kristin P. Bennett
medRxiv 2020.08.28.20183863; doi: https://doi.org/10.1101/2020.08.28.20183863
Reddit logo Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Unmasking the conversation on masks: Natural language processing for topical sentiment analysis of COVID-19 Twitter discourse
Abraham C. Sanders, Rachael C. White, Lauren S. Severson, Rufeng Ma, Richard McQueen, Haniel C. Alcântara Paulo, Yucheng Zhang, John S. Erickson, Kristin P. Bennett
medRxiv 2020.08.28.20183863; doi: https://doi.org/10.1101/2020.08.28.20183863

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (228)
  • Allergy and Immunology (504)
  • Anesthesia (110)
  • Cardiovascular Medicine (1238)
  • Dentistry and Oral Medicine (206)
  • Dermatology (147)
  • Emergency Medicine (282)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (531)
  • Epidemiology (10021)
  • Forensic Medicine (5)
  • Gastroenterology (499)
  • Genetic and Genomic Medicine (2453)
  • Geriatric Medicine (238)
  • Health Economics (479)
  • Health Informatics (1643)
  • Health Policy (752)
  • Health Systems and Quality Improvement (636)
  • Hematology (248)
  • HIV/AIDS (533)
  • Infectious Diseases (except HIV/AIDS) (11864)
  • Intensive Care and Critical Care Medicine (626)
  • Medical Education (252)
  • Medical Ethics (75)
  • Nephrology (268)
  • Neurology (2280)
  • Nursing (139)
  • Nutrition (352)
  • Obstetrics and Gynecology (454)
  • Occupational and Environmental Health (536)
  • Oncology (1245)
  • Ophthalmology (377)
  • Orthopedics (134)
  • Otolaryngology (226)
  • Pain Medicine (157)
  • Palliative Medicine (50)
  • Pathology (324)
  • Pediatrics (730)
  • Pharmacology and Therapeutics (313)
  • Primary Care Research (282)
  • Psychiatry and Clinical Psychology (2280)
  • Public and Global Health (4833)
  • Radiology and Imaging (837)
  • Rehabilitation Medicine and Physical Therapy (491)
  • Respiratory Medicine (651)
  • Rheumatology (285)
  • Sexual and Reproductive Health (238)
  • Sports Medicine (227)
  • Surgery (267)
  • Toxicology (44)
  • Transplantation (125)
  • Urology (99)