Word2vec, GloVe, FastText

Efficient Estimation of Word Representations in Vector Space (2013), T. Mikolov et al. [pdf]
Distributed Representations of Words and Phrases and their Compositionality (2013), T. Mikolov et al. [pdf]
word2vec Parameter Learning Explained (2014), Xin Rong [pdf]
word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method (2014), Yoav Goldberg, Ome