mixtral-8x7b: Reference Implementation Accuracy Failure on H200 #2018
Comments
@pgmpablo157321 @nvzhihanj @arjunsuresh : Any comments?
Hi @mrmhodak we are running the full accuracy run for this. But it won't be finishing until Thursday.
We did the dataset update for Mixtral this round (for the EOS issue). Were you running on the latest dataset and latest settings (i.e. min_output_len=2)?
@nvzhihanj : Yes, all latest, freshly downloaded according to latest instructions using rclone.
@arjunsuresh @nvzhihanj @pgmpablo157321: Any update on this?
I am able to re-run the standalone script and double-check the accuracy of the model.
The bug must be in the reference implementation. FYI @pgmpablo157321, I will check the standalone script into the repo later.
I added the reference standalone scripts in #2029 and formalized the docker workflow. For the reference implementation, @pgmpablo157321 can you help resolve the discrepancy between the standalone script and the existing code?
@nvzhihanj Working on this
When running the reference implementation on H200, I see an accuracy failure: