arXiv Analytics

Sign in

arXiv:1905.08110 [cs.CV]AbstractReferencesReviewsResources

Image Captioning based on Deep Learning Methods: A Survey

Yiyu Wang, Jungang Xu, Yingfei Sun, Ben He

Published 2019-05-20Version 1

Image captioning is a challenging task and attracting more and more attention in the field of Artificial Intelligence, and which can be applied to efficient image retrieval, intelligent blind guidance and human-computer interaction, etc. In this paper, we present a survey on advances in image captioning based on Deep Learning methods, including Encoder-Decoder structure, improved methods in Encoder, improved methods in Decoder, and other improvements. Furthermore, we discussed future research directions.

Related articles: Most relevant | Search more
arXiv:1708.05271 [cs.CV] (Published 2017-08-17)
Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects
arXiv:1604.00790 [cs.CV] (Published 2016-04-04)
Image Captioning with Deep Bidirectional LSTMs
arXiv:2010.06034 [cs.CV] (Published 2020-10-12)
A translational pathway of deep learning methods in GastroIntestinal Endoscopy
Sharib Ali et al.