Generating More Interesting Responses in Neural Conversation Models with Distributional Constraints

September 04, 2018 · Entered Twilight · 🏛 Conference on Empirical Methods in Natural Language Processing

"Last commit was 7.0 years ago (≥5 year threshold)"

Evidence collected by the PWNC Scanner

Repo contents: .gitignore, CONTRIBUTING.md, LICENSE.md, MTurk Evaluation, README.md, README_OpenNMT.md, data, docs, onmt, opts.py, preprocess.py, requirements.txt, test, tools, train.py, translate.py

Authors Ashutosh Baheti, Alan Ritter, Jiwei Li, Bill Dolan arXiv ID 1809.01215 Category cs.CL: Computation & Language Citations 94 Venue Conference on Empirical Methods in Natural Language Processing Repository https://github.com/abaheti95/DC-NeuralConversation ⭐ 28 Last Checked 1 month ago

Abstract

Neural conversation models tend to generate safe, generic responses for most inputs. This is due to the limitations of likelihood-based decoding objectives in generation tasks with diverse outputs, such as conversation. To address this challenge, we propose a simple yet effective approach for incorporating side information in the form of distributional constraints over the generated responses. We propose two constraints that help generate more content rich responses that are based on a model of syntax and topics (Griffiths et al., 2005) and semantic similarity (Arora et al., 2016). We evaluate our approach against a variety of competitive baselines, using both automatic metrics and human judgments, showing that our proposed approach generates responses that are much less generic without sacrificing plausibility. A working demo of our code can be found at https://github.com/abaheti95/DC-NeuralConversation.