Compositional Sentence Representation from Character within Large Context Text

May 02, 2016 · Declared Dead · 🏛 International Conference on Neural Information Processing

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Geonmin Kim, Hwaran Lee, Jisu Choi, Soo-young Lee arXiv ID 1605.00482 Category cs.CL: Computation & Language Citations 1 Venue International Conference on Neural Information Processing Last Checked 3 months ago

Abstract

This paper describes a Hierarchical Composition Recurrent Network (HCRN) consisting of a 3-level hierarchy of compositional models: character, word and sentence. This model is designed to overcome two problems of representing a sentence on the basis of a constituent word sequence. The first is a data-sparsity problem in word embedding, and the other is a no usage of inter-sentence dependency. In the HCRN, word representations are built from characters, thus resolving the data-sparsity problem, and inter-sentence dependency is embedded into sentence representation at the level of sentence composition. We adopt a hierarchy-wise learning scheme in order to alleviate the optimization difficulties of learning deep hierarchical recurrent network in end-to-end fashion. The HCRN was quantitatively and qualitatively evaluated on a dialogue act classification task. Especially, sentence representations with an inter-sentence dependency are able to capture both implicit and explicit semantics of sentence, significantly improving performance. In the end, the HCRN achieved state-of-the-art performance with a test error rate of 22.7% for dialogue act classification on the SWBD-DAMSL database.