In our implementation, we use 144 tokens to represent an image slice. We also tried 64 tokens per image, but saw a drop compared with 144 tokens (although efficiency improves).
Hi Authors,
Thanks for sharing this work! From your description, I think the number of tokens is 64 × (N + 1), which would be 64 × 7 = 448 visual tokens. Is that correct?
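To make the arithmetic explicit, here is a minimal sketch of how I am counting. The helper name `visual_token_count` is hypothetical (it is not from the repository), N = 6 slices is my assumption inferred from the factor of 7, and I am also assuming the "+ 1" stands for one extra image on top of the N slices.

```python
def visual_token_count(tokens_per_image: int, num_slices: int) -> int:
    """Return tokens_per_image * (num_slices + 1).

    Hypothetical helper: the "+ 1" is assumed to account for one
    additional image besides the N slices.
    """
    if tokens_per_image <= 0 or num_slices < 0:
        raise ValueError("tokens_per_image must be > 0 and num_slices >= 0")
    return tokens_per_image * (num_slices + 1)


# Assumed N = 6, so N + 1 = 7.
print(visual_token_count(64, 6))   # 448
print(visual_token_count(144, 6))  # 1008
```

Under the same assumptions, the 144-token setting would give 1008 visual tokens, compared with 448 for the 64-token setting.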
Thanks