TripleNet: Triple Attention Network for Multi-Turn Response Selection in Retrieval-based Chatbots
September 24, 2019 · Entered Twilight · Conference on Computational Natural Language Learning
"Last commit was 6.0 years ago (≥5 year threshold)"
Evidence collected by the PWNC Scanner
Repo contents: .gitignore, LICENSE, README.md, callback.py, evaluate.py, main.py, preprocess.py, shell, triplenet_model.py
Authors
Wentao Ma, Yiming Cui, Nan Shao, Su He, Wei-Nan Zhang, Ting Liu, Shijin Wang, Guoping Hu
arXiv ID
1909.10666
Category
cs.CL: Computation & Language
Cross-listed
cs.IR
Citations
20
Venue
Conference on Computational Natural Language Learning
Repository
https://github.com/wtma/TripleNet
⭐ 26
Last Checked
1 month ago
Abstract
The importance of different utterances in the context for selecting the response usually depends on the current query. In this paper, we propose TripleNet, which models the task with the triple <context, query, response> instead of the <context, response> pair used in previous work. The heart of TripleNet is a novel attention mechanism, triple attention, which models the relationships within the triple at four levels. The mechanism updates the representation of each element based on its attention with the other two, concurrently and symmetrically. We match the triple <C, Q, R> centered on the response, from the character level up to the context level, for prediction. Experimental results on two large-scale multi-turn response selection datasets show that the proposed model significantly outperforms state-of-the-art methods. The TripleNet source code is available at https://github.com/wtma/TripleNet
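The symmetric update described in the abstract can be pictured with a minimal NumPy sketch: each of C, Q, and R attends over the other two and is refreshed from both results at once. This is an illustrative reconstruction from the abstract only; the function names, the scaled dot-product attention, and the additive fusion are assumptions, not the implementation in triplenet_model.py.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attend(query, memory):
    """Scaled dot-product attention: each query row reads from the memory rows."""
    d = query.shape[-1]
    scores = query @ memory.T / np.sqrt(d)        # (len_q, len_m)
    return softmax(scores, axis=-1) @ memory      # (len_q, d)

def triple_attention(C, Q, R):
    """One symmetric triple-attention step: every element of <C, Q, R> is
    updated from the other two concurrently (hypothetical additive fusion)."""
    C_new = C + attend(C, Q) + attend(C, R)
    Q_new = Q + attend(Q, C) + attend(Q, R)
    R_new = R + attend(R, C) + attend(R, Q)
    return C_new, Q_new, R_new

# Toy usage: context (10 tokens), query (4 tokens), response (6 tokens), dim 8
C, Q, R = (np.random.randn(n, 8) for n in (10, 4, 6))
C2, Q2, R2 = triple_attention(C, Q, R)
```

In the paper this kind of update is applied at several granularities (from character up to context level) before matching centers on the response; the sketch shows only a single step at one level.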
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
Similar Papers
In the same crypt · Computation & Language
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding · R.I.P. 👻 Ghosted
Language Models are Few-Shot Learners · R.I.P. 👻 Ghosted
RoBERTa: A Robustly Optimized BERT Pretraining Approach · R.I.P. 👻 Ghosted
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension · R.I.P. 👻 Ghosted