Any plans to use Long-CLIP to extend text input token limit? #53

lennartmoritz · 2024-05-14T12:50:19Z

If I read your paper correctly, you froze the CLIP text encoder and only aligned the other modalities to it.
Do you think a pretrained Long-CLIP model could be used as a drop-in replacement for the text encoder in LanguageBind to extend the token limit?
