These are my study notes on Recurrent Batch Normalization as preparation for the Deep Learning Study Group (SF) session on April 26, 2016. These notes also contain some info from Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. Recurrent Neural Networks, or RNNs, work great for many tasks. A big downside is that training these deep networks takes a sign