Sentiment analysis is not solved! Assessing and probing sentiment classification

June 13, 2019 · Entered Twilight · 🏛 BlackboxNLP@ACL

"Last commit was 6.0 years ago (≥5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: README.md, analyze_predictions.py, annotated.txt, example_pred.txt, test.txt

Authors Jeremy Barnes, Lilja Øvrelid, Erik Velldal arXiv ID 1906.05887 Category cs.CL: Computation & Language Citations 33 Venue BlackboxNLP@ACL Repository https://github.com/ltgoslo/assessing_and_probing_sentiment ⭐ 4 Last Checked 1 month ago

Abstract

Neural methods for SA have led to quantitative improvements over previous approaches, but these advances are not always accompanied with a thorough analysis of the qualitative differences. Therefore, it is not clear what outstanding conceptual challenges for sentiment analysis remain. In this work, we attempt to discover what challenges still prove a problem for sentiment classifiers for English and to provide a challenging dataset. We collect the subset of sentences that an (oracle) ensemble of state-of-the-art sentiment classifiers misclassify and then annotate them for 18 linguistic and paralinguistic phenomena, such as negation, sarcasm, modality, etc. The dataset is available at https://github.com/ltgoslo/assessing_and_probing_sentiment. Finally, we provide a case study that demonstrates the usefulness of the dataset to probe the performance of a given sentiment classifier with respect to linguistic phenomena.