-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Vision_tower is not updated as expected #130
Comments
Have you followed the instructions here? |
@Isaachhh |
You may try to print all the parameters needed to be optimized here? |
@zycoldness @Isaachhh BTW, here is what I got, both the
|
That's pretty weird. As shown by you, the weights of the vision encoder before training and bunny_phi3 are different, which means the vision encoder was tuned during visual instruction tuning. So, the current code works when I trained Bunny? It may be related to the version of the packages. |
It might be, here is my package setting that you may refer to: Docker:
Python:
pip:
ds_report:
|
i am trying to continues fine-tune the model. But I found that the vision_tower is not updated.
So I try to use the "Recipe-2" in Bunny-v1.1-4B.md to fine-tune Bunny with your pretrained mm_projector. I use a large lr and 10 images from "bunny_695k.json":
I added some codes in "train.py" to make save the parameters before and after train:
I also extract the Bunny_v1.1_4B parameters from your weights:
Then, I used the following codes to compare the parameters:
The results shows that seems the vision_tower is not updated even the param.require_grad = True:
I am confused the vision_tower is not updated even I set
--unfreeze_vision_tower True
. Is there anything I missed?The text was updated successfully, but these errors were encountered: