SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
May 12, 2020 · Entered Twilight · Annual Meeting of the Association for Computational Linguistics
"Last commit was 5.0 years ago (โฅ5 year threshold)"
Evidence collected by the PWNC Scanner
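As a rough illustration of the rule quoted in the evidence line, the staleness check might look like the sketch below. Only the ≥5-year threshold comes from the scanner output; the function name, the use of commit timestamps, and everything else here are assumptions, not the scanner's actual implementation.

```python
from datetime import datetime, timezone

# Hypothetical sketch of the scanner's staleness rule. Only the
# >= 5-year threshold comes from the evidence line above; the
# function name and timestamp handling are assumptions.
STALE_YEARS = 5

def entered_twilight(last_commit: datetime, now: datetime | None = None) -> bool:
    """Return True when the most recent commit is at least STALE_YEARS old."""
    now = now or datetime.now(timezone.utc)
    age_years = (now - last_commit).days / 365.25
    return age_years >= STALE_YEARS

# Example: a repo last touched in spring 2020 crosses the threshold by 2025.
print(entered_twilight(datetime(2020, 4, 1, tzinfo=timezone.utc)))
```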
Repo contents: AUTHORS, LICENSE, README.en.md, README.md, config, data, env.sh, infer.py, lanch.py, model_files, pretraining.py, requirements.txt, script, senta, setup.cfg, setup.py, train.py
Authors
Hao Tian, Can Gao, Xinyan Xiao, Hao Liu, Bolei He, Hua Wu, Haifeng Wang, Feng Wu
arXiv ID
2005.05635
Category
cs.CL: Computation & Language
Citations
272
Venue
Annual Meeting of the Association for Computational Linguistics
Repository
https://github.com/baidu/Senta
⭐ 1999
Last Checked
1 month ago
Abstract
Recently, sentiment analysis has seen remarkable advances with the help of pre-training approaches. However, sentiment knowledge, such as sentiment words and aspect-sentiment pairs, is ignored in the process of pre-training, despite being widely used in traditional sentiment analysis approaches. In this paper, we introduce Sentiment Knowledge Enhanced Pre-training (SKEP) in order to learn a unified sentiment representation for multiple sentiment analysis tasks. With the help of automatically-mined knowledge, SKEP conducts sentiment masking and constructs three sentiment knowledge prediction objectives, so as to embed sentiment information at the word, polarity, and aspect levels into the pre-trained sentiment representation. In particular, the prediction of aspect-sentiment pairs is converted into multi-label classification, aiming to capture the dependency between the words in a pair. Experiments on three kinds of sentiment tasks show that SKEP significantly outperforms a strong pre-training baseline, and achieves new state-of-the-art results on most of the test datasets. We release our code at https://github.com/baidu/Senta.
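The sentiment masking the abstract describes can be made concrete with a small sketch. The lexicon, pair set, function name, and masking budget below are all hypothetical placeholders, not the implementation in the Senta repo; the sketch only shows how masking mined sentiment words and aspect-sentiment pairs yields targets for the three objectives (sentiment word prediction, word polarity prediction, and multi-label aspect-sentiment pair prediction).

```python
import random

# Hypothetical mined knowledge: this tiny lexicon and pair set stand in
# for the automatically mined knowledge the paper describes.
SENTIMENT_WORDS = {"great": "positive", "terrible": "negative", "fast": "positive"}
ASPECT_SENTIMENT_PAIRS = {("battery", "great"), ("screen", "terrible")}

MASK = "[MASK]"

def sentiment_masking(tokens, max_mask_ratio=0.1, seed=0):
    """Mask sentiment words (and both words of a detected aspect-sentiment
    pair), returning the corrupted sequence plus prediction targets for the
    three SKEP objectives sketched in the abstract."""
    rng = random.Random(seed)
    masked = list(tokens)
    word_targets = {}      # position -> original sentiment word (word objective)
    polarity_targets = {}  # position -> polarity label (polarity objective)
    pair_targets = set()   # pairs, predicted jointly as multi-label
                           # classification (aspect-sentiment objective)

    # Pair masking first: both members of a detected pair are masked.
    for i, tok in enumerate(tokens[:-1]):
        if (tok, tokens[i + 1]) in ASPECT_SENTIMENT_PAIRS:
            masked[i] = masked[i + 1] = MASK
            pair_targets.add((tok, tokens[i + 1]))

    # Then mask individual sentiment words, up to a small token budget.
    budget = max(1, int(len(tokens) * max_mask_ratio))
    candidates = [i for i, t in enumerate(tokens)
                  if t in SENTIMENT_WORDS and masked[i] != MASK]
    for i in rng.sample(candidates, min(budget, len(candidates))):
        masked[i] = MASK
        word_targets[i] = tokens[i]
        polarity_targets[i] = SENTIMENT_WORDS[tokens[i]]

    return masked, word_targets, polarity_targets, pair_targets

tokens = "the battery great but shipping slow and terrible".split()
print(sentiment_masking(tokens))
```

Running the example masks the pair ("battery", "great") as one unit and the standalone word "terrible" individually, so a single pass over a sentence produces supervision for all three pre-training objectives at once.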
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
Similar Papers
In the same crypt – Computation & Language
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
R.I.P.
👻
Ghosted
Language Models are Few-Shot Learners
R.I.P.
👻
Ghosted
RoBERTa: A Robustly Optimized BERT Pretraining Approach
R.I.P.
👻
Ghosted
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
R.I.P.
👻
Ghosted