CS 224D: Deep Learning for NLP1 1 Course Instructor: Richard Socher Lecture Notes: Part III2 2 Author: Rohit Mundra, Richard Socher Spring 2015 Keyphrases: Neural networks. Forward computation. Backward propagation. Neuron Units. Max-margin Loss. Gradient checks. Xavier parameter initialization. Learning rates. Adagrad. This set of notes introduces single and multilayer neural networks, and how th