CopyMTL: Copy Mechanism for Joint Extraction of Entities and Relations with Multi-Task Learning

November 24, 2019 · Entered Twilight · 🏛 AAAI Conference on Artificial Intelligence

"Last commit was 5.0 years ago (≥5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, README.md, config.json, const.py, data, data_prepare.py, evaluation.py, main.py, model.py, saved_model, tune.sh

Authors: Daojian Zeng, Ranran Haoran Zhang, Qianying Liu
arXiv ID: 1911.10438
Category: cs.CL (Computation & Language)
Cross-listed: cs.LG
Citations: 200
Venue: AAAI Conference on Artificial Intelligence
Repository: https://github.com/WindChimeRan/CopyMTL (⭐ 128)
Last Checked: 1 month ago

Abstract

Joint extraction of entities and relations has received significant attention because of its potential to improve performance on both tasks. Among existing methods, CopyRE is effective and novel: it uses a sequence-to-sequence framework and a copy mechanism to directly generate relation triplets. However, it suffers from two fatal problems. The model is extremely weak at distinguishing the head entity from the tail entity, resulting in inaccurate entity extraction. It also cannot predict multi-token entities (e.g., Steven Jobs). To address these problems, we give a detailed analysis of the reasons behind the inaccurate entity extraction, and then propose a simple but extremely effective model structure to solve it. In addition, we propose a multi-task learning framework equipped with a copy mechanism, called CopyMTL, to allow the model to predict multi-token entities. Experiments reveal the problems of CopyRE and show that our model achieves significant improvement over the current state-of-the-art method, by 9% on NYT and 16% on WebNLG (F1 score). Our code is available at https://github.com/WindChimeRan/CopyMTL
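To make the multi-token entity problem in the abstract concrete, here is a minimal Python sketch. It is not taken from the CopyMTL repository. It assumes, for illustration only, that a copy step selects a single token position per entity and that a separate sequence-labeling task supplies B/I/O tags. The names `expand_to_span` and `decode_triplet`, the tag scheme, and the relation label `founder_of` are all hypothetical.

```python
from typing import List, Tuple


def expand_to_span(tags: List[str], copied_idx: int) -> Tuple[int, int]:
    """Expand one copied token position to the tagged entity span containing it.

    Assumption: tags use a simple B/I/O scheme (hypothetical, for illustration).
    Returns inclusive (start, end) token indices.
    """
    start = end = copied_idx
    if tags[copied_idx] == "O":
        # The tagger found no entity here; fall back to the single token.
        return start, end
    # Walk left over "I" tags until the span's first token is reached.
    while tags[start] == "I" and start > 0 and tags[start - 1] != "O":
        start -= 1
    # Walk right while the following tokens continue the same entity.
    while end + 1 < len(tags) and tags[end + 1] == "I":
        end += 1
    return start, end


def decode_triplet(
    tokens: List[str],
    tags: List[str],
    relation: str,
    head_idx: int,
    tail_idx: int,
) -> Tuple[str, str, str]:
    """Turn (relation, head position, tail position) into a text triplet."""
    h_start, h_end = expand_to_span(tags, head_idx)
    t_start, t_end = expand_to_span(tags, tail_idx)
    head = " ".join(tokens[h_start : h_end + 1])
    tail = " ".join(tokens[t_start : t_end + 1])
    return head, relation, tail


# Hypothetical example sentence and tags.
tokens = ["Steven", "Jobs", "founded", "Apple"]
tags = ["B", "I", "O", "B"]

# Copying only position 1 ("Jobs") would lose "Steven"; the tags recover it.
triplet = decode_triplet(tokens, tags, "founder_of", head_idx=1, tail_idx=3)
```

In this sketch, a copy-only decoder that emits position 1 would yield the head entity "Jobs", while combining that position with the tag sequence yields "Steven Jobs". How CopyMTL actually combines the two tasks is defined by the paper and the repository, not by this example.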