Clarification questions about the framework #50
You likely cannot just use the embeddings with an arbitrary pre-trained LLM. The idea of LanguageBind is to create a custom set of modality encoders whose embeddings are aligned to a specific set of text embeddings (from the CLIP text encoder, I think).
@lennartmoritz Hi, I wonder whether, during the pre-training process, the authors only use video-language or audio-language pairs for training, or whether they train jointly with audio-video-depth-infrared-language?
They use x-language training pairs, where x denotes any of the supported modalities. So, e.g., video-language, audio-language, depth-language, etc. are all used during training.
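For reference, here is a minimal sketch of extracting and comparing the aligned embeddings, loosely based on the usage example in the LanguageBind README. The class names and imports follow that README, but the exact checkpoint IDs and the asset paths below are assumptions and may differ from the current repo:

```python
import torch
# Imports follow the LanguageBind repo's README; names may differ by version.
from languagebind import LanguageBind, to_device, transform_dict, LanguageBindImageTokenizer

device = torch.device('cuda:0' if torch.cuda.is_available() else 'cpu')

# Each modality gets its own encoder, but all of them are aligned to the same
# CLIP-style text embedding space during pre-training (x-language pairs).
clip_type = {
    'video': 'LanguageBind_Video_FT',  # fully fine-tuned video encoder
    'audio': 'LanguageBind_Audio_FT',
    'depth': 'LanguageBind_Depth',
}
model = LanguageBind(clip_type=clip_type, cache_dir='./cache_dir').to(device).eval()

tokenizer = LanguageBindImageTokenizer.from_pretrained(
    'LanguageBind/LanguageBind_Image', cache_dir='./cache_dir')
modality_transform = {m: transform_dict[m](model.modality_config[m]) for m in clip_type}

# Hypothetical asset paths and caption, for illustration only.
video = ['assets/video/0.mp4']
audio = ['assets/audio/0.wav']
language = ['A dog barking in the rain.']

inputs = {
    'video': to_device(modality_transform['video'](video), device),
    'audio': to_device(modality_transform['audio'](audio), device),
    'language': to_device(tokenizer(language, max_length=77, padding='max_length',
                                    truncation=True, return_tensors='pt'), device),
}

with torch.no_grad():
    emb = model(inputs)

# Because every modality lives in the shared text-aligned space, any x-language
# similarity is just a dot product of the (already normalized) embeddings.
print('video x text:', (emb['video'] @ emb['language'].T).item())
print('audio x text:', (emb['audio'] @ emb['language'].T).item())
```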
Got it. Thank you for your reply.
I'm trying to understand this in the context of other works in the ecosystem. For example, I'm interested in video. For the video encoder, there are LoRA-tuned and fully fine-tuned checkpoints; can I use the embeddings from these models with an already-trained LLM? Can I use these embeddings with Video-LLaVA? Can I use the LanguageBind encoder as a drop-in replacement for the Video-LLaVA encoder (video tower)?
Also, the Gradio demos only show modality comparisons. I'm also trying to understand how to do zero-shot classification. Thank you -- someone who is confused but excited and thankful for the work done.
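On the zero-shot question: with CLIP-style models like this, zero-shot classification is usually done by embedding one text prompt per class and picking the class whose text embedding is most similar to the video embedding. A hedged sketch under that assumption, reusing `model`, `tokenizer`, `modality_transform`, `to_device`, and `device` from the snippet above; the prompt template, class names, and file path are made up:

```python
import torch

# Hypothetical label set and prompt template.
classes = ['playing guitar', 'riding a bike', 'cooking pasta']
prompts = [f'a video of someone {c}' for c in classes]

inputs = {
    'video': to_device(modality_transform['video'](['assets/video/query.mp4']), device),
    'language': to_device(tokenizer(prompts, max_length=77, padding='max_length',
                                    truncation=True, return_tensors='pt'), device),
}

with torch.no_grad():
    emb = model(inputs)

# Similarity of the one video against all class prompts -> class probabilities.
probs = torch.softmax(emb['video'] @ emb['language'].T, dim=-1).squeeze(0)
for c, p in zip(classes, probs.tolist()):
    print(f'{c}: {p:.3f}')
```

The predicted class is simply the prompt with the highest probability; better prompt templates (or averaging several templates per class) typically improve accuracy.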