This is a brief summary of paper for me to study and arrange for Boosting Named Entity Recognition with Neural Character Embeddings (Santos and Guimarães., NEWS-WS 2015) I read and studied.

This paper is a research ralted to NER tagging and focus on not using the handcrafted fetaures and the output of other NLP tasks such as part-of-speech tagging and text chuncking.

Ther model jointly trained word and character embedding to boost the performance of NER task in Portuguese and Spanish.

representing character embedding in a word, they use covoutional neural network with max operation to generate a fix-size vector.

And then they have an assumption that in sequential classification tag of word mainly depends on its neghboring words.

So they joinlty concatenate word and character embedding corresponding to each word in a window which is hyper-parameter as follows:

Santos and Guimarães., NEWS-WS 2015

They use a tag style called IOB2 where: O, means that the word is not a NE; B-X is used for the leftmost word of a NE type X; and I-X means that the word is inside of a NE type X. The IOB2 tagging style is illustrated in the following example.

Santos and Guimarães., NEWS-WS 2015

Note(Abstract): Most state-of-the-art named entity recognition (NER) systems rely on handcrafted features and on the output of other NLP tasks such as part-of-speech (POS) tagging and text chunking. In this work they propose a language-independent NER system that uses automatically learned features only. Their approach is based on the CharWNN deep neural network, which uses word-level and character-level representations (embeddings) to perform sequential classification.

Download URL:
The paper: Boosting Named Entity Recognition with Neural Character Embeddings (Santos and Guimarães., NEWS workshop 2015)

Boosting Named Entity Recognition with Neural Character Embeddings

Title of paper - Boosting Named Entity Recognition with Neural Character Embeddings

Boosting Named Entity Recognition with Neural Character Embeddings

Title of paper - Boosting Named Entity Recognition with Neural Character Embeddings

Reference