serihiroのブックマーク / 2020年7月7日

Understanding the Disharmony between Dropout and Batch Normalization by Variance Shift
- 4 users
- arxiv.org
- 学び
This paper first answers the question "why do the two most powerful techniques Dropout and Batch Normalization (BN) often lead to a worse performance when they are combined together?" in both theoretical and statistical aspects. Theoretically, we find that Dropout would shift the variance of a specific neural unit when we transfer the state of that network from train to test. However, BN would mai
serihiro 2020/07/07
paper

batch_normalization

dropout
リンク
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates and careful parameter initialization, and makes it notoriously hard to train models with saturating nonlinearities. We refer to this phenomenon as internal covar
serihiro 2020/07/07
paper

batch_normalization
リンク
続・GANによる二次元美少女画像生成 – Lento – Medium
serihiro 2020/07/07
GAN
リンク
AutoAugment: Learning Augmentation Policies from Data
Data augmentation is an effective technique for improving the accuracy of modern image classifiers. However, current data augmentation implementations are manually designed. In this paper, we describe a simple procedure called AutoAugment to automatically search for improved data augmentation policies. In our implementation, we have designed a search space where a policy consists of many sub-polic
serihiro 2020/07/07
Data Augmentation
リンク
- 2020年7月8日
- 2020年7月7日
- 2020年7月6日