This is a brief summary of paper for me to study and organize it, Learned in Translation: Contextualized Word Vectors (McCann et al., NIPS 2017) I read and studied.

They have a focus on transfer-learning from Machine translation task to downstream tasks of NLP.

Reference