What is the number of visual tokens? #28

Open
mu-cai opened this issue Nov 18, 2024 · 2 comments
Comments

mu-cai commented Nov 18, 2024

Hi Authors,

Thanks for sharing this work! From your description, I understand the number of visual tokens to be 64 × (N + 1), which for N = 6 would be 64 × 7 = 448 visual tokens. Is that correct?

Thanks

mu-cai commented Nov 30, 2024

I just realized that, according to the code, the authors use 144 tokens to represent one sub-image.

guozonghao96 (Collaborator) commented

In our implementation, we use 144 tokens to represent an image slice. We also tried using 64 tokens per image, but saw a drop in performance compared with 144 tokens (although efficiency improves).
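
For reference, the arithmetic discussed in this thread can be sketched as below. This is only an illustration, assuming the total follows the tokens-per-slice × (N + 1) form from the original question, with N = 6 inferred from the 64 × 7 figure. The function name `visual_token_count` is hypothetical and does not come from the repository.

```python
def visual_token_count(tokens_per_slice: int, num_slices: int) -> int:
    """Total visual tokens, assuming tokens_per_slice * (num_slices + 1).

    The "+ 1" mirrors the (N + 1) term in the question above; this helper
    is an illustrative assumption, not code from the repository.
    """
    if tokens_per_slice <= 0 or num_slices < 0:
        raise ValueError("tokens_per_slice must be positive and num_slices non-negative")
    return tokens_per_slice * (num_slices + 1)


# N = 6 is assumed here, matching the 64 * 7 example in the question.
print(visual_token_count(64, 6))   # 64 tokens per slice
print(visual_token_count(144, 6))  # 144 tokens per slice
```

Under these assumptions, 64 tokens per slice gives 448 tokens in total, while 144 tokens per slice gives 1008.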
