arXiv:1805.09137 [cs.CV]

Image Captioning

Vikram Mullachery, Vishal Motwani

Published 2018-05-13, Version 1

This paper reports the results of our experiments on image captioning. Image captioning is considerably more involved than image recognition or classification, because it additionally requires recognizing the interdependence between the objects and concepts in an image and producing a succinct sentence-level description. Experiments on several labeled datasets show the accuracy of the model and the fluency of the language it learns solely from image descriptions. As a toy application, we apply image captioning to generate video captions, and we advance a few hypotheses about the challenges we encountered.

Comments: arXiv admin note: text overlap with arXiv:1609.06647 by other authors
Categories: cs.CV, cs.AI
Related articles:
arXiv:2202.10492 [cs.CV] (Published 2022-02-21)
CaMEL: Mean Teacher Learning for Image Captioning
arXiv:2203.15350 [cs.CV] (Published 2022-03-29)
End-to-End Transformer Based Model for Image Captioning
arXiv:1707.07998 [cs.CV] (Published 2017-07-25)
Bottom-Up and Top-Down Attention for Image Captioning and VQA