Udacity Computer Vision Nanodegree

Image Captioning

Combine convolutional and recurrent neural networks to build an automatic image captioning application.

Clone the project repository

git clone http://github.com/sdonatti/nd891-project-image-captioning

Install required Python packages

cd nd891-project-image-captioning
conda install --file requirements.txt -c pytorch

git clone http://github.com/cocodataset/cocoapi.git
cd cocoapi/PythonAPI
python setup.py install

Under Annotations, download:
- 2014 Train/Val annotations [241MB] (extract captions_train2014.json and captions_val2014.json at cocoapi/annotations/)
- 2014 Testing Image info [1MB] (extract image_info_test2014.json at cocoapi/annotations/)
Under Images, download:
- 2014 Train images [83K/13GB] (extract train2014 folder at cocoapi/images/)
- 2014 Val images [41K/6GB] (extract val2014 folder at cocoapi/images/)
- 2014 Test images [41K/6GB] (extract test2014 folder at cocoapi/images/)

Launch the project Jupyter Notebooks

cd ../../
jupyter notebook

This project is licensed under the MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
fonts		fonts
images		images
.gitignore		.gitignore
0_Dataset.ipynb		0_Dataset.ipynb
1_Preliminaries.ipynb		1_Preliminaries.ipynb
2_Training.ipynb		2_Training.ipynb
3_Inference.ipynb		3_Inference.ipynb
LICENSE		LICENSE
README.md		README.md
data_loader.py		data_loader.py
model.py		model.py
renovate.json		renovate.json
requirements.txt		requirements.txt
vocabulary.py		vocabulary.py