How Does Batch Normalization Help Optimization? (No, It Is Not About Internal Covariate Shift

태그
작성일자
1 more property
Jul 11, 2018 deep-learning, deep-learning

WHY?

While the effect of batch normalization was widely proven empirically, the exact mechanism of it is yet been understood. Commonly known explanation for this was internal covariance shift(ICS) meaning the change in the distribution of layer inputs caused by updates to the preceeding layers.

WHAT?

Critic

So?