[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Port of MiniGPT4 in C++ (4-bit, 5-bit, 6-bit, 8-bit, and 16-bit CPU inference with GGML)
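For readers unfamiliar with GGML's quantized types, the NumPy sketch below illustrates the idea behind block-wise 4-bit quantization, loosely in the spirit of GGML's Q4_0 layout (one scale per block of 32 weights plus a 4-bit code per weight); the function names and details are illustrative, not the port's actual code.

```python
import numpy as np

def quantize_4bit(x: np.ndarray, block_size: int = 32):
    """Illustrative block-wise 4-bit quantization, loosely following
    GGML's Q4_0 layout: one scale per block of 32 weights plus a
    4-bit code per weight. Not the actual GGML implementation."""
    x = x.reshape(-1, block_size)
    amax = np.abs(x).max(axis=1, keepdims=True)
    d = np.where(amax > 0, amax, 1.0) / -8.0                  # per-block scale
    q = np.clip(np.round(x / d) + 8, 0, 15).astype(np.uint8)  # codes in [0, 15]
    return d, q

def dequantize_4bit(d: np.ndarray, q: np.ndarray) -> np.ndarray:
    # Invert the mapping: code 8 decodes to zero.
    return ((q.astype(np.float32) - 8) * d).reshape(-1)

weights = np.random.randn(64).astype(np.float32)
d, q = quantize_4bit(weights)
print("max abs error:", np.abs(weights - dequantize_4bit(d, q)).max())
```

Lower bit-widths shrink the model and speed up CPU inference at the cost of a bounded per-block rounding error, which is why the port offers the 4- through 16-bit variants as a quality/size trade-off.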
Paddle Multimodal Integration and eXploration: supports mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrained models and a diffusion-model toolbox; designed for high performance and flexibility.
Chinese medical multimodal large model: a Large Chinese Language-and-Vision Assistant for BioMedicine
Simplified local Windows setup of MiniGPT-4 running in an Anaconda environment; includes an example local server and client.
Streamlines the creation of supervised datasets for data augmentation in image-captioning models. The framework builds on MiniGPT-4 together with the pre-trained 13-billion-parameter Vicuna model.
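As a sketch of such a dataset-building loop, the snippet below pairs each image with a model-generated caption and writes a JSON-lines file; `caption_image` is a hypothetical stand-in for the actual MiniGPT-4/Vicuna call, not this repository's API.

```python
import json
from pathlib import Path

def caption_image(image_path: str, prompt: str) -> str:
    """Hypothetical stand-in for a MiniGPT-4 + Vicuna-13B call;
    replace with the model's actual chat interface."""
    return f"placeholder caption for {Path(image_path).name}"

def build_caption_dataset(image_dir: str, out_file: str) -> None:
    # Pair every image with a generated caption and write one JSON
    # object per line, a common supervised-captioning format.
    with open(out_file, "w") as f:
        for img in sorted(Path(image_dir).glob("*.jpg")):
            caption = caption_image(str(img), "Describe this image in one sentence.")
            f.write(json.dumps({"image": img.name, "caption": caption}) + "\n")

if __name__ == "__main__":
    build_caption_dataset("images/", "captions.jsonl")
```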
Medical Report Generation And VQA (Adapting XrayGPT to Any Modality)
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
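The core idea behind MiniGPT-4 is to align a frozen visual encoder with a frozen LLM through a single trainable projection layer. The PyTorch sketch below shows that alignment step; the dimensions follow the BLIP-2 Q-Former and Vicuna-13B, but the class itself is an illustration, not the repository's code.

```python
import torch
import torch.nn as nn

class VisionToLLMProjector(nn.Module):
    """Sketch of MiniGPT-4's alignment idea: one trainable linear layer
    maps frozen Q-Former visual tokens into the frozen LLM's embedding
    space. Dimensions are illustrative (Q-Former: 768, Vicuna-13B: 5120)."""
    def __init__(self, vis_dim: int = 768, llm_dim: int = 5120):
        super().__init__()
        self.proj = nn.Linear(vis_dim, llm_dim)

    def forward(self, qformer_tokens: torch.Tensor) -> torch.Tensor:
        # (batch, 32 query tokens, vis_dim) -> (batch, 32, llm_dim);
        # the result is prepended to text embeddings and fed to the LLM.
        return self.proj(qformer_tokens)

tokens = torch.randn(1, 32, 768)             # mock Q-Former output
print(VisionToLLMProjector()(tokens).shape)  # torch.Size([1, 32, 5120])
```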
Implementation of the paper "InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4" (https://arxiv.org/abs/2308.12067)