I've found that Imagenet and other large CNN makes use of local response normalization layers. However, I cannot find that much information about them. How important are they and when should they be used? From http://caffe.berkeleyvision.org/tutorial/layers.html#data-layers: "The local response normalization layer performs a kind of “lateral inhibition” by normalizing over local input regions. In