A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration
Top Cited Papers
Open Access
- 5 August 2021
- journal article
- research article
- Published by MDPI AG in Epidemiologia
- Vol. 2 (3) , 315-324
- https://doi.org/10.3390/epidemiologia2030024
Abstract
As the COVID-19 pandemic continues to spread worldwide, an unprecedented amount of open data is being generated for medical, genetics, and epidemiological research. The unparalleled rate at which many research groups around the world are releasing data and publications on the ongoing pandemic is allowing other scientists to learn from local experiences and data generated on the front lines of the COVID-19 pandemic. However, there is a need to integrate additional data sources that map and measure the role of social dynamics of such a unique worldwide event in biomedical, biological, and epidemiological analyses. For this purpose, we present a large-scale curated dataset of over 1.12 billion tweets, growing daily, related to COVID-19 chatter generated from 1 January 2020 to 27 June 2021 at the time of writing. This data source provides a freely available additional data source for researchers worldwide to conduct a wide and diverse number of research projects, such as epidemiological analyses, emotional and mental responses to social distancing measures, the identification of sources of misinformation, stratified measurement of sentiment towards the pandemic in near real time, among many others.Keywords
This publication has 33 references indexed in Scilit:
- Pathological findings of COVID-19 associated with acute respiratory distress syndromeThe Lancet Respiratory Medicine, 2020
- Breakthrough: Chloroquine phosphate has shown apparent efficacy in treatment of COVID-19 associated pneumonia in clinical studiesBioScience Trends, 2020
- Mining Twitter Data for Improved Understanding of Disaster ResilienceAnnals of the American Association of Geographers, 2018
- Celebrating parasitesNature Genetics, 2017
- Against Dataism and for Data Sharing of Big Biomedical and Clinical Data with Research ParasitesFrontiers in Genetics, 2016
- Strengthening Research through Data SharingNew England Journal of Medicine, 2016
- Crowdsourcing biomedical research: leveraging communities as innovation enginesNature Reviews Genetics, 2016
- The FAIR Guiding Principles for scientific data management and stewardshipScientific Data, 2016
- Tools and methods for capturing Twitter data during natural disastersFirst Monday, 2012
- Earthquake TwitterNature Geoscience, 2010