arXiv:1805.09137 [cs.CV]AbstractReferencesReviewsResources
Image Captioning
Vikram Mullachery, Vishal Motwani
Published 2018-05-13Version 1
This paper discusses and demonstrates the outcomes from our experimentation on Image Captioning. Image captioning is a much more involved task than image recognition or classification, because of the additional challenge of recognizing the interdependence between the objects/concepts in the image and the creation of a succinct sentential narration. Experiments on several labeled datasets show the accuracy of the model and the fluency of the language it learns solely from image descriptions. As a toy application, we apply image captioning to create video captions, and we advance a few hypotheses on the challenges we encountered.
Comments: arXiv admin note: text overlap with arXiv:1609.06647 by other authors
Related articles: Most relevant | Search more
arXiv:2202.10492 [cs.CV] (Published 2022-02-21)
CaMEL: Mean Teacher Learning for Image Captioning
Manuele Barraco, Matteo Stefanini, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara
arXiv:2203.15350 [cs.CV] (Published 2022-03-29)
End-to-End Transformer Based Model for Image Captioning
arXiv:1707.07998 [cs.CV] (Published 2017-07-25)
Bottom-Up and Top-Down Attention for Image Captioning and VQA