Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The ckpt of Quantized OPT model is not be found #53

Open
liuxy1103 opened this issue Jan 12, 2024 · 6 comments
Open

The ckpt of Quantized OPT model is not be found #53

liuxy1103 opened this issue Jan 12, 2024 · 6 comments

Comments

@liuxy1103
Copy link

We have tried to reproduce the quantization of OPT-30B, but it is still difficult for us. Would you mind releasing the details of the procedure?

@ChenMnZ
Copy link
Collaborator

ChenMnZ commented Jan 12, 2024

We have released the ckpt of OPT. You can find them at https://huggingface.co/ChenMnZ/OmniQuant/tree/main.
image

@liuxy1103
Copy link
Author

The results by loading the ckpt is not consistent with the reported result in the paper.

@ChenMnZ
Copy link
Collaborator

ChenMnZ commented Jan 12, 2024

Can you reproduce the results of other models except OPT-30B?

@linloong
Copy link

linloong commented Jul 2, 2024

When reproducing evaluation results for OPT-13b-w4a4, I got nan for wikitext2 dataset.
image
image

It seems that the checkpoint has also broken now.

And when I try to train by myself, the ppl is very high.
image

@ChenMnZ
Copy link
Collaborator

ChenMnZ commented Jul 2, 2024

@linloong Can you provide the training script?

@linloong
Copy link

linloong commented Jul 2, 2024

Sure,
Actually, I just use the script you provided in the scripts/opt/opt-13b/w4a4.sh.

CUDA_VISIBLE_DEVICES=0 python main.py \
--model facebook/opt-13b  \
--epochs 20 --output_dir ./log/opt-13b-w4a4 \
--wbits 4 --abits 4 --lwc --let --alpha 0.75

@ChenMnZ

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants