This paper, End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF (Ma and Hovy, ACL 2016), proposes a neural network architecture for sequence labeling tasks such as NER and POS tagging.
Their model is end-to-end, relying on no task-specific resources, feature engineering, or data pre-processing.
They use a convolutional neural network (CNN) to extract morphological information from the characters of each word and encode it into a neural representation, as follows.
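A minimal sketch of such a character-level CNN in PyTorch (the 30-dimensional character embeddings, 30 filters, and window size of 3 follow the paper's reported settings, but the class and argument names here are my own):

```python
import torch
import torch.nn as nn

class CharCNN(nn.Module):
    """Extracts a fixed-size morphological feature vector per word
    from its character embeddings via 1D convolution + max pooling."""
    def __init__(self, char_vocab_size, char_emb_dim=30, num_filters=30, kernel_size=3):
        super().__init__()
        self.char_emb = nn.Embedding(char_vocab_size, char_emb_dim, padding_idx=0)
        self.conv = nn.Conv1d(char_emb_dim, num_filters, kernel_size, padding=1)

    def forward(self, char_ids):
        # char_ids: (num_words, max_word_len) of character indices
        x = self.char_emb(char_ids)       # (num_words, max_word_len, char_emb_dim)
        x = x.transpose(1, 2)             # (num_words, char_emb_dim, max_word_len)
        x = torch.relu(self.conv(x))      # (num_words, num_filters, max_word_len)
        x, _ = x.max(dim=2)               # max-over-time pooling -> (num_words, num_filters)
        return x
```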
The overall model architecture they propose is the following:
As you can see from their model above, they concatenate this character-level representation with the word vector and feed the result into a bidirectional LSTM.
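A hedged sketch of this step, reusing the `CharCNN` output above; the 100-dimensional word embeddings and 200-dimensional LSTM state size follow the paper, but the module layout is illustrative rather than the authors' exact code:

```python
import torch
import torch.nn as nn

class WordEncoder(nn.Module):
    """Concatenates word embeddings with CharCNN features and runs a BiLSTM."""
    def __init__(self, word_vocab_size, word_emb_dim=100, char_feat_dim=30, hidden_dim=200):
        super().__init__()
        self.word_emb = nn.Embedding(word_vocab_size, word_emb_dim, padding_idx=0)
        self.bilstm = nn.LSTM(word_emb_dim + char_feat_dim, hidden_dim,
                              batch_first=True, bidirectional=True)

    def forward(self, word_ids, char_feats):
        # word_ids: (batch, seq_len); char_feats: (batch, seq_len, char_feat_dim)
        w = self.word_emb(word_ids)             # (batch, seq_len, word_emb_dim)
        x = torch.cat([w, char_feats], dim=-1)  # concatenate along the feature axis
        out, _ = self.bilstm(x)                 # (batch, seq_len, 2 * hidden_dim)
        return out
```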
For label decoding, they use a conditional random field (CRF), which is beneficial because it models the correlations between labels in neighborhoods and jointly decodes the best chain of labels for a given input sequence.
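The decoding half of such a layer could look roughly like the sketch below; `LinearChainCRF` and its layout are my own illustration, and training would additionally require the forward algorithm to compute the partition function for the CRF negative log-likelihood:

```python
import torch
import torch.nn as nn

class LinearChainCRF(nn.Module):
    """Minimal linear-chain CRF layer: learns a (num_tags x num_tags)
    transition matrix and decodes the best tag sequence with Viterbi."""
    def __init__(self, num_tags):
        super().__init__()
        self.transitions = nn.Parameter(torch.randn(num_tags, num_tags))

    def viterbi_decode(self, emissions):
        # emissions: (seq_len, num_tags) scores from the BiLSTM for one sentence
        seq_len, num_tags = emissions.shape
        score = emissions[0]                    # best score ending in each tag at t=0
        backpointers = []
        for t in range(1, seq_len):
            # score[i] + transitions[i, j] + emissions[t, j] for all pairs (i, j)
            total = score.unsqueeze(1) + self.transitions + emissions[t].unsqueeze(0)
            score, best_prev = total.max(dim=0)  # best previous tag for each current tag
            backpointers.append(best_prev)
        # Follow backpointers from the best final tag
        best_tag = score.argmax().item()
        best_path = [best_tag]
        for bp in reversed(backpointers):
            best_tag = bp[best_tag].item()
            best_path.append(best_tag)
        return list(reversed(best_path))
```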
They also employ a variety of deep learning techniques such as dropout and pretrained word embeddings.
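For instance, loading pretrained vectors and applying dropout might look like the following sketch; the `pretrained` matrix here is placeholder data standing in for, e.g., GloVe vectors, and the dropout rate of 0.5 follows the paper:

```python
import numpy as np
import torch
import torch.nn as nn

# Placeholder pretrained matrix; in practice this would be loaded from
# GloVe vectors (the paper uses 100-dimensional GloVe embeddings).
pretrained = np.random.randn(10000, 100).astype("float32")

word_emb = nn.Embedding.from_pretrained(torch.from_numpy(pretrained), freeze=False)
dropout = nn.Dropout(p=0.5)                  # dropout rate 0.5, as in the paper

word_ids = torch.randint(0, 10000, (2, 6))   # (batch, seq_len)
x = dropout(word_emb(word_ids))              # dropout on embeddings before the BiLSTM
```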
As future work, they argue that their model could be further improved by exploring multi-task learning approaches to combine more useful and correlated information.
For example, they say the model could be trained jointly on both POS and NER tags to improve the intermediate representation learned by the network.
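A hypothetical sketch of that direction, which is not part of the original paper: a shared BiLSTM encoder with separate projection heads for POS and NER, where each head would feed its own CRF layer:

```python
import torch.nn as nn

class MultiTaskTagger(nn.Module):
    """Hypothetical multi-task variant: one shared BiLSTM encoder with
    separate emission heads for POS and NER (my sketch, not the paper's)."""
    def __init__(self, input_dim, hidden_dim, num_pos_tags, num_ner_tags):
        super().__init__()
        self.shared = nn.LSTM(input_dim, hidden_dim,
                              batch_first=True, bidirectional=True)
        self.pos_head = nn.Linear(2 * hidden_dim, num_pos_tags)
        self.ner_head = nn.Linear(2 * hidden_dim, num_ner_tags)

    def forward(self, x):
        h, _ = self.shared(x)  # shared intermediate representation for both tasks
        return self.pos_head(h), self.ner_head(h)  # per-task emission scores
```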
The paper: End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF (arXiv version)
The paper: End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF (ACL version)
References
- Paper: End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF (Ma and Hovy, ACL 2016)
- How to use HTML for alerts
- Example with TensorFlow
- WildML: Understanding Convolutional Neural Networks for NLP
- Stack Exchange: CNN translation invariance
- Stack Overflow: understanding 1D, 2D, and 3D convolutions in convolutional neural networks
- Stack Exchange: 1D vs. 2D CNNs
- Towards Data Science: Implementing a linear-chain Conditional Random Field (CRF) in PyTorch
- Alibaba Cloud: HMM, MEMM, and CRF: A Comparative Analysis of Statistical Modeling Methods
- David S. Batista's blog: Maximum Entropy Markov Models and Logistic Regression