This is a brief summary, for my own study and organization, of Enhanced LSTM for Natural Language Inference (Chen et al., ACL 2017), a paper I read.

This paper is research on the natural language inference (NLI) task.

The task is to predict whether the relation between two sentences, a premise and a hypothesis, is entailment, contradiction, or neutral. For example, the premise "A soccer game with multiple males playing." entails the hypothesis "Some men are playing a sport." (a standard SNLI example).

Their model is called ESIM (Enhanced Sequential Inference Model), shown on the left of the figure below.

[Figure: ESIM architecture (Chen et al., ACL 2017)]

Here, they also use a tree-LSTM, as follows:

[Figures: tree-LSTM equations (Chen et al., ACL 2017)]
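
As a rough illustration of the idea (a generic sketch, not the paper's exact parameterization, which uses richer gating), a binary tree-LSTM node combines the states of its left and right children with a separate forget gate per child:

```python
import torch
import torch.nn as nn

class BinaryTreeLSTMCell(nn.Module):
    # Generic binary tree-LSTM node update: gates i, o, u plus one
    # forget gate per child, all computed from the children's hidden states.
    def __init__(self, hidden_dim):
        super().__init__()
        self.W = nn.Linear(2 * hidden_dim, 5 * hidden_dim)

    def forward(self, left, right):  # left/right: (h, c) of each child
        (h_l, c_l), (h_r, c_r) = left, right
        i, o, u, f_l, f_r = self.W(torch.cat([h_l, h_r], dim=-1)).chunk(5, dim=-1)
        c = (torch.sigmoid(i) * torch.tanh(u)
             + torch.sigmoid(f_l) * c_l + torch.sigmoid(f_r) * c_r)
        h = torch.sigmoid(o) * torch.tanh(c)
        return h, c
```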

They demonstrate that using syntactic parsing information through the tree-LSTM contributes to their best result.

They use a bidirectional LSTM to encode each word together with its context, and then extract local inference information with an attention mechanism.
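
To make this step concrete, below is a minimal PyTorch sketch of the soft alignment (the function name and shapes are my own; following the paper, the energy is $e_{ij} = \bar{a}_i^\top \bar{b}_j$, and each sentence attends over the other):

```python
import torch
import torch.nn.functional as F

def soft_align(a_bar, b_bar):
    # a_bar: (len_a, d) BiLSTM outputs for the premise
    # b_bar: (len_b, d) BiLSTM outputs for the hypothesis
    e = a_bar @ b_bar.t()                      # energies e_ij = a_bar_i . b_bar_j
    a_tilde = F.softmax(e, dim=1) @ b_bar      # premise attends over hypothesis
    b_tilde = F.softmax(e, dim=0).t() @ a_bar  # hypothesis attends over premise
    return a_tilde, b_tilde
```

Here $\tilde{a}$ and $\tilde{b}$ are the aligned (attention-weighted) counterparts used in the next step.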

Before classifying, in order to enhance the local inference information, they compute the difference and the element-wise product for the tuple $\langle \bar{a}, \tilde{a} \rangle$ as well as for $\langle \bar{b}, \tilde{b} \rangle$, where $\bar{a}$ and $\bar{b}$ are the outputs of the bidirectional LSTMs on the premise and the hypothesis respectively. The enhanced representations are the concatenations $m_a = [\bar{a}; \tilde{a}; \bar{a} - \tilde{a}; \bar{a} \odot \tilde{a}]$ and $m_b = [\bar{b}; \tilde{b}; \bar{b} - \tilde{b}; \bar{b} \odot \tilde{b}]$.
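
As a sketch, the enhancement is just a concatenation along the feature axis (a hypothetical helper, continuing the snippet above):

```python
import torch

def enhance(x_bar, x_tilde):
    # [x; x~; x - x~; x * x~]: encoding, alignment, their difference,
    # and their element-wise product, concatenated on the feature axis
    return torch.cat([x_bar, x_tilde, x_bar - x_tilde, x_bar * x_tilde], dim=-1)

# m_a = enhance(a_bar, a_tilde); m_b = enhance(b_bar, b_tilde)
```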

They then use another bidirectional LSTM to compose the local inference information; average and max pooling over its outputs yield the fixed-length vector fed to the classifier:

[Figure: inference composition equations (Chen et al., ACL 2017)]
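
A minimal PyTorch sketch of this composition stage, assuming the paper's setup of a feed-forward projection followed by a BiLSTM and average/max pooling (class and parameter names are mine):

```python
import torch
import torch.nn as nn

class InferenceComposition(nn.Module):
    def __init__(self, input_dim, hidden_dim):
        super().__init__()
        # 1-layer ReLU projection to reduce the 4x-sized enhanced features
        self.proj = nn.Sequential(nn.Linear(input_dim, hidden_dim), nn.ReLU())
        self.bilstm = nn.LSTM(hidden_dim, hidden_dim,
                              bidirectional=True, batch_first=True)

    def forward(self, m):                 # m: (batch, len, input_dim)
        v, _ = self.bilstm(self.proj(m))  # (batch, len, 2 * hidden_dim)
        v_avg = v.mean(dim=1)             # average pooling over time
        v_max = v.max(dim=1).values       # max pooling over time
        return torch.cat([v_avg, v_max], dim=-1)
```

The pooled vectors for the premise and the hypothesis are concatenated and passed to a final multilayer-perceptron classifier.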

This paper also compares against a variety of sentence-embedding-based models for the natural language inference task, showing that their model outperforms the previous models:

[Table: performance comparison (Chen et al., ACL 2017)]

Reference

Qian Chen, Xiaodan Zhu, Zhen-Hua Ling, Si Wei, Hui Jiang, Diana Inkpen. 2017. Enhanced LSTM for Natural Language Inference. In Proceedings of ACL 2017.