arXiv:1805.09137 Abstract | arXiv Analytics

arXiv:1805.09137 [cs.CV]Abstract References Reviews Resources

Image Captioning

Published 2018-05-13Version 1

This paper discusses and demonstrates the outcomes from our experimentation on Image Captioning. Image captioning is a much more involved task than image recognition or classification, because of the additional challenge of recognizing the interdependence between the objects/concepts in the image and the creation of a succinct sentential narration. Experiments on several labeled datasets show the accuracy of the model and the fluency of the language it learns solely from image descriptions. As a toy application, we apply image captioning to create video captions, and we advance a few hypotheses on the challenges we encountered.

Comments: arXiv admin note: text overlap with arXiv:1609.06647 by other authors

Categories: cs.CV, cs.AI

Keywords: image captioning, create video captions, succinct sentential narration, toy application, image descriptions

Related articles: Most relevant | Search more

arXiv:2202.10492 [cs.CV] (Published 2022-02-21)

CaMEL: Mean Teacher Learning for Image Captioning

Manuele Barraco, Matteo Stefanini, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara

arXiv:2203.15350 [cs.CV] (Published 2022-03-29)

End-to-End Transformer Based Model for Image Captioning

Yiyu Wang, Jungang Xu, Yingfei Sun

arXiv:1707.07998 [cs.CV] (Published 2017-07-25)

Bottom-Up and Top-Down Attention for Image Captioning and VQA