This project develops image captioning models using vanilla recurrent neural networks (RNNs) and an LSTM augmented with an attention mechanism. The models are trained on the COCO dataset, a large-scale dataset of images with corresponding captions.
The project consists of the following components:
- Data downloading and preprocessing
- Minibatch visualization
- Overfitting test
- RNN model training
- Attention LSTM model training (see the decoder sketch after this list)
- Result visualization
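As a reference for the attention component, here is a minimal sketch of a single attention-LSTM decoding step, assuming PyTorch and precomputed spatial image features of shape (batch, regions, feat_dim). The class and parameter names (`AttentionDecoderStep`, `feat_dim`, etc.) are illustrative placeholders, not the repository's actual code.

```python
# Minimal sketch of one attention-LSTM decoding step (illustrative, not the
# repository's actual implementation). Assumes precomputed spatial image
# features of shape (batch, num_regions, feat_dim).
import torch
import torch.nn as nn
import torch.nn.functional as F

class AttentionDecoderStep(nn.Module):
    def __init__(self, feat_dim, embed_dim, hidden_dim, vocab_size):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        # Additive (Bahdanau-style) attention over image regions.
        self.att_feat = nn.Linear(feat_dim, hidden_dim)
        self.att_hid = nn.Linear(hidden_dim, hidden_dim)
        self.att_score = nn.Linear(hidden_dim, 1)
        # The LSTM consumes the word embedding concatenated with the context.
        self.lstm = nn.LSTMCell(embed_dim + feat_dim, hidden_dim)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, word_ids, features, state):
        h, c = state
        # Attention weights over regions, conditioned on the previous hidden state.
        scores = self.att_score(torch.tanh(
            self.att_feat(features) + self.att_hid(h).unsqueeze(1)))  # (B, R, 1)
        alpha = F.softmax(scores, dim=1)
        context = (alpha * features).sum(dim=1)                        # (B, feat_dim)
        # One LSTM step on [embedding; context], then project to vocabulary logits.
        h, c = self.lstm(torch.cat([self.embed(word_ids), context], dim=1), (h, c))
        return self.out(h), (h, c), alpha.squeeze(-1)
```

At each timestep the decoder re-weights the image regions before predicting the next word; the returned attention weights `alpha` are also what make the result visualization step possible.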
To run the project, follow these steps:
- Install the required dependencies.
- Clone the repository.
- Download `coco.py` and save it in the `datasets` directory.
- Open the .ipynb notebook to train the models and visualize the results.
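Before launching full training, the overfitting test checks that the training loop can drive the loss toward zero on a single fixed minibatch. Below is a self-contained sketch of that idea; the toy decoder and the random token tensor are stand-ins for the project's real model and a real COCO minibatch.

```python
# Self-contained sketch of an overfitting sanity check: memorize one fixed
# minibatch. The toy decoder and fake captions stand in for the real model
# and COCO data used in the notebook.
import torch
import torch.nn as nn

torch.manual_seed(0)
B, T, V, H = 4, 10, 50, 64                      # batch, length, vocab, hidden
captions = torch.randint(1, V, (B, T))          # fake caption token ids

class ToyDecoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(V, H)
        self.lstm = nn.LSTM(H, H, batch_first=True)
        self.out = nn.Linear(H, V)

    def forward(self, tokens):
        h, _ = self.lstm(self.embed(tokens))
        return self.out(h)

model = ToyDecoder()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-2)
criterion = nn.CrossEntropyLoss()
for step in range(300):
    optimizer.zero_grad()
    logits = model(captions[:, :-1])            # teacher forcing
    loss = criterion(logits.reshape(-1, V), captions[:, 1:].reshape(-1))
    loss.backward()
    optimizer.step()
print(f"final loss: {loss.item():.4f}")         # should be close to zero
```

If the loss does not approach zero on a single batch, the model or data pipeline likely has a bug, and full training is not worth starting.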
The results on the training data are clearly much better than on the validation data, which indicates that the models overfit the training set.
This project provides a comprehensive overview of image captioning models and their implementation, offering hands-on experience with data preprocessing, model training, and result visualization. It also highlights the importance of attention mechanisms in improving the performance of image captioning models.