Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

JPEG ENCODING #68

Open
edmondja opened this issue Nov 3, 2024 · 0 comments
Open

JPEG ENCODING #68

edmondja opened this issue Nov 3, 2024 · 0 comments

Comments

@edmondja
Copy link

edmondja commented Nov 3, 2024

https://arxiv.org/abs/2408.08459 shows you can avoid using a trained encoder. For trying JPEG codes as input of my VLM I can tell you it works wonderfully well. Besides, unlike in your paper we do not have unstability during training due to different type of modalities.
Please Meta, talk to each other and retrain Chameleon with JPEG codes, because it seems your teams dont know what other teams do.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant