Improving Open Information Extraction via Iterative Rank-Aware Learning

May 31, 2019 · Entered Twilight · 🏛 Annual Meeting of the Association for Computational Linguistics

"Last commit was 6.0 years ago (≥5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: README.md, data, init_iter.sh, iter.sh, openie_extract.py, pretrain, rerank.py, rerank, rerank_to_oie.sh, training_config

Authors: Zhengbao Jiang, Pengcheng Yin, Graham Neubig
arXiv ID: 1905.13413
Category: cs.CL: Computation & Language
Citations: 11
Venue: Annual Meeting of the Association for Computational Linguistics
Repository: https://github.com/jzbjyb/oie_rank (⭐ 30)
Last Checked: 1 month ago

Abstract

Open information extraction (IE) is the task of extracting open-domain assertions from natural language sentences. A key step in open IE is confidence modeling, ranking the extractions based on their estimated quality to adjust precision and recall of extracted assertions. We found that the extraction likelihood, a confidence measure used by current supervised open IE systems, is not well calibrated when comparing the quality of assertions extracted from different sentences. We propose an additional binary classification loss to calibrate the likelihood to make it more globally comparable, and an iterative learning process, where extractions generated by the open IE model are incrementally included as training samples to help the model learn from trial and error. Experiments on OIE2016 demonstrate the effectiveness of our method. Code and data are available at https://github.com/jzbjyb/oie_rank.
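
The two ideas in the abstract, a binary classification loss on extraction confidence and an iterative loop that adds the model's own extractions to the training set, can be illustrated with a minimal Python sketch. This is not the code from the repository. The names `binary_classification_loss`, `iterative_training`, `train_fn`, `extract_fn`, and `label_fn` are hypothetical, and the abstract does not specify how the real-valued confidence score is derived from the extraction likelihood, so the sketch simply assumes one score per extraction and a 0/1 correctness label.

```python
import math


def sigmoid(x):
    """Logistic function mapping a real-valued score to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def binary_classification_loss(scores, labels):
    """Mean binary cross-entropy over extractions.

    Assumption: each score is a real-valued confidence for one extraction,
    and each label is 1 (correct) or 0 (incorrect). Because every extraction
    is pushed toward the same 0/1 targets regardless of its source sentence,
    the scores become comparable across sentences.
    """
    total = 0.0
    for score, label in zip(scores, labels):
        p = sigmoid(score)
        p = min(max(p, 1e-12), 1.0 - 1e-12)  # guard against log(0)
        total -= label * math.log(p) + (1 - label) * math.log(1.0 - p)
    return total / len(labels)


def iterative_training(train_fn, extract_fn, label_fn, gold_samples, num_iters=3):
    """Incrementally add the model's own extractions as training samples.

    Hypothetical callables (assumptions, not the repository's API):
      train_fn(samples)  -> model trained on (extraction, label) pairs
      extract_fn(model)  -> extractions produced by the current model
      label_fn(extraction) -> 1 if judged correct, else 0
    """
    samples = list(gold_samples)
    model = train_fn(samples)
    for _ in range(num_iters):
        extractions = extract_fn(model)
        samples.extend((e, label_fn(e)) for e in extractions)
        model = train_fn(samples)  # retrain on the enlarged sample set
    return model, samples
```

In this sketch, a correct extraction with a high score and an incorrect one with a low score yield a small loss, while the reverse assignment is penalized heavily, which is the sense in which the loss calibrates confidence for ranking.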