arXiv:1905.08110 Abstract | arXiv Analytics

arXiv:1905.08110 [cs.CV]

Image Captioning based on Deep Learning Methods: A Survey

Yiyu Wang, Jungang Xu, Yingfei Sun, Ben He

Published 2019-05-20, Version 1

Image captioning is a challenging task that is attracting increasing attention in the field of Artificial Intelligence. It can be applied to efficient image retrieval, intelligent blind guidance, human-computer interaction, and similar uses. In this paper, we present a survey of advances in image captioning based on Deep Learning methods, covering the Encoder-Decoder structure, improved methods in the Encoder, improved methods in the Decoder, and other improvements. Furthermore, we discuss future research directions.

Categories: cs.CV, cs.CL, cs.LG

Keywords: deep learning methods, image captioning, efficient image retrieval, intelligent blind guidance, research directions

Related articles:

arXiv:1708.05271 [cs.CV] (Published 2017-08-17)

Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects

Ting Yao, Yingwei Pan, Yehao Li, Tao Mei

arXiv:1604.00790 [cs.CV] (Published 2016-04-04)

Image Captioning with Deep Bidirectional LSTMs

Cheng Wang, Haojin Yang, Christian Bartz, Christoph Meinel

arXiv:2010.06034 [cs.CV] (Published 2020-10-12)

A translational pathway of deep learning methods in GastroIntestinal Endoscopy

Sharib Ali et al.
