RT Journal Article SR Electronic T1 Unmasking the conversation on masks: Natural language processing for topical sentiment analysis of COVID-19 Twitter discourse JF medRxiv FD Cold Spring Harbor Laboratory Press SP 2020.08.28.20183863 DO 10.1101/2020.08.28.20183863 A1 Abraham Sanders A1 Rachael White A1 Lauren S. Severson A1 Rufeng Ma A1 Richard McQueen A1 Haniel C. Alcânatara Paulo A1 Yucheng Zhang A1 John S. Erickson A1 Kristin P. Bennett YR 2020 UL http://medrxiv.org/content/early/2020/09/01/2020.08.28.20183863.abstract AB In this exploratory study, we scrutinize a database of over 1 million tweets collected across the first five months of 2020 to draw conclusions about public attitudes towards the preventative measure of mask usage during the COVID-19 pandemic. In recent months, a body of literature has emerged to suggest the robustness of trends in online activity as proxies for the epidemiological and sociological impact of COVID-19. We employ natural language processing, clustering and sentiment analysis techniques to organize tweets relating to mask-wearing into high-level themes, then relay narratives for individual clusters through automatic text summarization. We find that topic clustering and visualization based on mask-related Twitter data offers revealing insights into societal perceptions of COVID-19 and techniques for its prevention. We observe that the volume and polarity of mask related tweets has greatly increased. Importantly, the analysis pipeline presented can be leveraged by the health community for the assessment of public response to health interventions in the ongoing global health crisis.Competing Interest StatementThe authors have declared no competing interest.Funding StatementThis work was partially funded by a grant from the United Health FoundationAuthor DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:No IRB review requiredAll necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesAll Tweet IDs associated with this work have been made available via a publicly-accessible repository https://github.com/TheRensselaerIDEA/COVID-masks-nlp