English

Image Captioning

Computer Vision and Pattern Recognition 2018-05-24 v1 Artificial Intelligence

Abstract

This paper discusses and demonstrates the outcomes from our experimentation on Image Captioning. Image captioning is a much more involved task than image recognition or classification, because of the additional challenge of recognizing the interdependence between the objects/concepts in the image and the creation of a succinct sentential narration. Experiments on several labeled datasets show the accuracy of the model and the fluency of the language it learns solely from image descriptions. As a toy application, we apply image captioning to create video captions, and we advance a few hypotheses on the challenges we encountered.

Keywords

Cite

@article{arxiv.1805.09137,
  title  = {Image Captioning},
  author = {Vikram Mullachery and Vishal Motwani},
  journal= {arXiv preprint arXiv:1805.09137},
  year   = {2018}
}

Comments

arXiv admin note: text overlap with arXiv:1609.06647 by other authors

R2 v1 2026-06-23T02:05:40.433Z