This repository contains Docker File Image source for the use of image and video object identification and detection and image and video captioning using Tensorflow and the Apache Tika toolkit. Combined with our Image Captioning Project by USC Data Science we provide a full implementation of the Show and Tell: A Neural Image Caption Generator paper. To our knowledge there is no other implementation of this paper available in the public domain.
docker build -f InceptionRestDockerfile -t uscdatascience/inception-rest-tika .
- Builds the Inception v4 model Image detection capability.docker build -f Im2txtRestDockerfile -t uscdatascience/im2txt-rest-tika .
- Builds the Show and Tell model Image Text Captioning capability.docker build -f InceptionVideoRestDockerfile -t uscdatascience/inception-video-rest-tika .
- Builds a Docker with OpenCV and Tensorflow that can be used to idenitfy objects in Videos.
docker run -it -p 8764:8764 uscdatascience/inception-rest-tika
- then run the tests in ObjectRecognitionParserTest classdocker run -it -p 8764:8764 uscdatascience/im2txt-rest-tika
- then run the tests in ObjectRecognitionParserTest classdocker run -it -p 8764:8764 uscdatascience/inception-video-rest-tika
- then run the tests in TensorflowVideoRecParserTest class
If you use this work, please cite first USC Data Science and then:
@article{DBLP:journals/corr/VinyalsTBE14,
author = {Oriol Vinyals and
Alexander Toshev and
Samy Bengio and
Dumitru Erhan},
title = {Show and Tell: {A} Neural Image Caption Generator},
journal = {CoRR},
volume = {abs/1411.4555},
year = {2014},
url = {http://arxiv.org/abs/1411.4555},
archivePrefix = {arXiv},
eprint = {1411.4555},
timestamp = {Wed, 07 Jun 2017 14:41:10 +0200},
biburl = {http://dblp.org/rec/bib/journals/corr/VinyalsTBE14},
bibsource = {dblp computer science bibliography, http://dblp.org}
}
The Tika Advanced Object Recognition Page for more advanced tutorials on how to use the Dockers with Tika.
Send them to Chris A. Mattmann or Thejan Wijesinghe.
- Chris A. Mattmann, JPL & USC
- Thejan Wijesinghe, University of Moratuwa
This project is licensed under the Apache License, version 2.0.