This line means the BN layers are excluded from weight decay, not from backpropagation entirely. However, I tried the regular way (applying weight decay to all parameters) and got similar results on CIFAR-100. I think the link below explains part of it. Looking forward to the official explanation. https://blog.janestreet.com/l2-regularization-and-batch-norm/
According to the following line, your code does not apply weight decay to the batch-norm parameters. What is the real reason for that?
```python
params_without_bn = [params for name, params in model.named_parameters()
                     if not ('_bn' in name or '.bn' in name)]
```
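For context, a minimal sketch of how such a split is typically wired into the optimizer (the model and hyperparameters here are illustrative, not from this repo). Instead of matching on layer names, this version filters by parameter dimensionality: BN scale/shift parameters (and biases) are 1-D, so they land in a zero-decay group while still being updated by gradient descent.

```python
import torch
import torch.nn as nn

# Illustrative model (an assumption, not the repo's network).
model = nn.Sequential(
    nn.Conv2d(3, 8, 3, padding=1),
    nn.BatchNorm2d(8),
    nn.ReLU(),
)

decay, no_decay = [], []
for name, param in model.named_parameters():
    # BN gamma/beta (and biases) are 1-D tensors; name matching as in the
    # snippet above also works, but dimensionality is a robust alternative.
    if param.ndim == 1:
        no_decay.append(param)
    else:
        decay.append(param)

# Two parameter groups: weight decay on conv/linear weights only.
# BN parameters are still trained; they just receive no L2 penalty.
optimizer = torch.optim.SGD(
    [
        {"params": decay, "weight_decay": 5e-4},
        {"params": no_decay, "weight_decay": 0.0},
    ],
    lr=0.1,
)
```

Note this also exempts conv/linear biases from decay, which is common practice alongside the BN exemption.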