This is a brief summary of paper for me to study and organize it, Siamese Recurrent Architectures for Learning Sentence Similarity (Mueller and Thyagarajan., AAAI 2016) I read and studied.

Thes propose a siamese neural network based on LSTM to compare a pair of sentences.

First, they encode two sentence of different length into fixed-size vectors using an LSTM as follows:

Mueller and Thyagarajan., AAAI 2016

As you can see, they used Manhattan distance as objective function.

In their paper, they use data augmentation for NLP(natural language processing) called synonym augmentation.

synonym augmentation replace random words with one of their synonyms, for example, using Wordnet.

If you want to know the property of simamese network, refer to the lecture below:

Siamese network on face recognition

</div>

Reference