Phi 3.5 vision (4B model) #637

Open
2 tasks done
CheeseAndMeat opened this issue Oct 8, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@CheeseAndMeat

Model description

LoRAX's list of officially supported models does not include any vision model. This is a big gap for a very successful product.
LoRAX is a critical component in our tech stack, and having no clear option for image-based language models is a big risk on our end. Could the LoRAX team please prioritize onboarding Phi 3.5 Vision, a state-of-the-art SLM with vision support? Much appreciated.

https://huggingface.co/microsoft/Phi-3.5-vision-instruct

Open source status

  • The model implementation is available
  • The model weights are available

Provide useful links for the implementation

No response

@tgaddair
Contributor

tgaddair commented Oct 8, 2024

Hi @CheeseAndMeat, thanks for raising this issue. There are two things here for us to do:

  1. Add support for Phi 3.5 Vision, which we can certainly do
  2. Update our docs for VLMs, as we do now support both Llava Next and Llama 3.2 Vision models

tgaddair added the enhancement (New feature or request) label on Oct 8, 2024
@CheeseAndMeat
Author

@tgaddair I really appreciate the prompt follow-up :)
1. Phi 3.5 Vision outperformed Llama 3.2 Vision in our testing. We are really impressed with it!
2. The same goes for Phi 3.5 MoE: it is much better than both Mixtral and Llama 3.2, so it would be great to have it on the roadmap as well.
Thanks again!
