Code/script to reproduce val loss using the shared models #475

Open
Alexey234432 opened this issue Jan 25, 2024 · 3 comments

Comments

@Alexey234432

Hi,

Does anyone know of a script or code to reproduce the validation loss using the provided "*.bin" models? I tried it myself and can't get the reported numbers.

Thank you.

@DavidHerel

Same issue here.

@Alexey234432

In my case the loss values are higher. Is it the same for you? For example, the reported 1.072 for the 15M model is 1.0833 in my case, and the reported 0.760 for the 110M model jumps to 0.8725. @DavidHerel

Thank you
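
To put the numbers above side by side, here is a minimal sketch that computes the absolute and relative gap between the reported and observed validation losses. The loss values are the ones quoted in this thread; the `loss_gap` helper and the dictionary layout are only illustrative and are not part of the repository.

```python
# Reported vs. observed validation losses, copied from the comment above.
losses = {
    "15M": {"reported": 1.072, "observed": 1.0833},
    "110M": {"reported": 0.760, "observed": 0.8725},
}


def loss_gap(reported, observed):
    """Return (absolute gap, relative gap in percent) of observed over reported."""
    absolute = observed - reported
    relative_pct = 100.0 * absolute / reported
    return absolute, relative_pct


for name, values in losses.items():
    absolute, relative_pct = loss_gap(values["reported"], values["observed"])
    print(f"{name}: +{absolute:.4f} ({relative_pct:.1f}% above reported)")
```

The gap is about 1% for the 15M model but close to 15% for the 110M model, so the two discrepancies may not share a single cause.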

@DavidHerel

DavidHerel commented Feb 6, 2024

Yeah, I think I saw something similar.

I did not tune lr, warmup, or dropout, so maybe a more extensive hyperparameter search would get us the reported results?
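
A hyperparameter search like the one suggested above could start from a plain grid. This is only a sketch: the candidate values and the parameter names (`learning_rate`, `warmup_iters`, `dropout`) are hypothetical placeholders, not settings taken from this thread or from the repository's training script.

```python
from itertools import product

# Hypothetical candidate values; none of these come from the thread or the repository.
search_space = {
    "learning_rate": [1e-4, 3e-4, 5e-4],
    "warmup_iters": [500, 1000],
    "dropout": [0.0, 0.1],
}


def grid(space):
    """Yield one dict per combination of the candidate values."""
    keys = list(space)
    for values in product(*(space[k] for k in keys)):
        yield dict(zip(keys, values))


configs = list(grid(search_space))
print(len(configs), "configurations to try")
```

Each resulting dict would then be passed to whatever training entry point is in use, and the final validation loss recorded per configuration.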
