A project showcasing image captioning using two approaches: a custom CNN-based model for feature extraction and BLIP for state-of-the-art multimodal caption generation. Includes model comparisons, results, and code for training, inference, and evaluation on custom datasets.
-
Notifications
You must be signed in to change notification settings - Fork 0
A project showcasing image captioning using two approaches: a custom CNN-based model for feature extraction and BLIP for state-of-the-art multimodal caption generation. Includes model comparisons, results, and code for training, inference, and evaluation on custom datasets.
pratikkmane/ImageCaptioning-CNN-BLIP
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
A project showcasing image captioning using two approaches: a custom CNN-based model for feature extraction and BLIP for state-of-the-art multimodal caption generation. Includes model comparisons, results, and code for training, inference, and evaluation on custom datasets.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published