An improved neural network model for joint POS tagging and dependency parsing

July 11, 2018 · Entered Twilight · 🏛 Conference on Computational Natural Language Learning

"Last commit was 6.0 years ago (≥5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: License.txt, README.md, decoder.py, jPTDP.py, learner.py, mnnl.py, sample, utils.py, utils

Authors Dat Quoc Nguyen, Karin Verspoor arXiv ID 1807.03955 Category cs.CL: Computation & Language Citations 66 Venue Conference on Computational Natural Language Learning Repository https://github.com/datquocnguyen/jPTDP ⭐ 156 Last Checked 1 month ago

Abstract

We propose a novel neural network model for joint part-of-speech (POS) tagging and dependency parsing. Our model extends the well-known BIST graph-based dependency parser (Kiperwasser and Goldberg, 2016) by incorporating a BiLSTM-based tagging component to produce automatically predicted POS tags for the parser. On the benchmark English Penn treebank, our model obtains strong UAS and LAS scores at 94.51% and 92.87%, respectively, producing 1.5+% absolute improvements to the BIST graph-based parser, and also obtaining a state-of-the-art POS tagging accuracy at 97.97%. Furthermore, experimental results on parsing 61 "big" Universal Dependencies treebanks from raw texts show that our model outperforms the baseline UDPipe (Straka and Straková, 2017) with 0.8% higher average POS tagging score and 3.6% higher average LAS score. In addition, with our model, we also obtain state-of-the-art downstream task scores for biomedical event extraction and opinion analysis applications. Our code is available together with all pre-trained models at: https://github.com/datquocnguyen/jPTDP