Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Trouble replicating results for ResNet18 on ImageNet 1k #50

Open
dyelax opened this issue Jun 13, 2018 · 1 comment
Open

Trouble replicating results for ResNet18 on ImageNet 1k #50

dyelax opened this issue Jun 13, 2018 · 1 comment

Comments

@dyelax
Copy link

dyelax commented Jun 13, 2018

I'm trying to replicate your results for ResNet18 on ImageNet 1k. I'm using the default hyperparameters in your train_resnet.py and all of the same augmenters (for aug-level=2), but can't achieve the accuracy that you report. The most obvious issue is that if I use your default learning rate of 0.1, the net fails to learn anything, even though I'm using the same batch size (256), momentum (0.9), weight decay (0.0001), etc. I have to set a learning rate to 0.001 for it to start learning well, but even then, the accuracy tops out at about 80% for Top 5 and 55% for Top 1.

  1. I see in the code that these are the default parameters for ResNet50. Did you change any of them for ResNet18?

  2. Is the data that you are using standard ImageNet, or did you modify it before training (by normalizing, etc.)?

@MichaelSolotky
Copy link

Hi, I've recently had a problem with the replication of results for a pre-trained DeiT on the ImageNet validation part and the thing is I've had a slightly different version of ImageNet. Maybe your problems with training are due to a different dataset. So make sure can at least replicate the results for a pre-trained net.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants