Statistical Language Models Based on Neural Networks

Tomáš Mikolov
Speech@FIT, Brno University of Technology, Czech Republic
Google, Mountain View, 2nd April 2012

Overview

Motivation
Neural Network Based Language Models
Training Algorithm
Recurrent Neural Network
Classes
Maximum Entropy Language Model
Empirical Results:
    Penn Treebank Corpus
    Wall Street Journal Speech Recognition
    NIST RT04