FakeCovid -- A Multilingual Cross-domain Fact Check News Dataset for COVID-19
June 19, 2020 Β· Declared Dead Β· π ICWSM Workshops
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Gautam Kishore Shahi, Durgesh Nandini
arXiv ID
2006.11343
Category
cs.CY: Computers & Society
Cross-listed
cs.SI
Citations
228
Venue
ICWSM Workshops
Last Checked
4 months ago
Abstract
In this paper, we present a first multilingual cross-domain dataset of 5182 fact-checked news articles for COVID-19, collected from 04/01/2020 to 15/05/2020. We have collected the fact-checked articles from 92 different fact-checking websites after obtaining references from Poynter and Snopes. We have manually annotated articles into 11 different categories of the fact-checked news according to their content. The dataset is in 40 languages from 105 countries. We have built a classifier to detect fake news and present results for the automatic fake news detection and its class. Our model achieves an F1 score of 0.76 to detect the false class and other fact check articles. The FakeCovid dataset is available at Github.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Computers & Society
π
π
The Cartographer
R.I.P.
π»
Ghosted
Artificial Intelligence: the global landscape of ethics guidelines
R.I.P.
π»
Ghosted
The role of artificial intelligence in achieving the Sustainable Development Goals
R.I.P.
π»
Ghosted
Green AI
R.I.P.
π»
Ghosted
Principles alone cannot guarantee ethical AI
R.I.P.
π»
Ghosted
Tackling Climate Change with Machine Learning
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted