Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training MixNet models for Cifar 10 #4

Open
meteozay opened this issue Nov 6, 2019 · 6 comments
Open

Training MixNet models for Cifar 10 #4

meteozay opened this issue Nov 6, 2019 · 6 comments

Comments

@meteozay
Copy link

meteozay commented Nov 6, 2019

Hello,

Have you trained MixNet models for Cifar-10 from scratch (not transfer learning from ImageNet models)?

With default configuration, I obtain about 70% accuracy for MixNet small, and the accuracy decreases as the model size increases. Do I need to change some particular parameters in the configuration while training Cifar-10?

Thank you.

@leaderj1001
Copy link
Owner

leaderj1001 commented Nov 7, 2019 via email

@meteozay
Copy link
Author

meteozay commented Nov 7, 2019 via email

@leaderj1001
Copy link
Owner

leaderj1001 commented Nov 7, 2019 via email

@meteozay
Copy link
Author

meteozay commented Nov 7, 2019 via email

@sporthand
Copy link

sporthand commented Jun 1, 2020

Hi, I also have the same problem which my Mixnet was originally installed from pip install Keras_Mixnet. I wonder if your code is similar to Keras_Mixnet.

So how many epoch did you set to get the over 90% of validation accuracy? I initially set the number of epoch as 10,000 and the number of patient as 50. However, it stopped between from 110th to 120 epoch. So, the result is 73% of validation accuracy. But, I am planning to change the number of patient from 50 to 100 in order to get the same validation accuracy. I am looking forward to have your instruction.
Thank you!

@leaderj1001
Copy link
Owner

The code currently uploaded to github is model architecture in the ImageNet dataset. ImageNet and CIFAR have different input image sizes, so you need to train by modifying the stem layer and model architecture.
Thank you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants