Image Captioning Using Deep Learning

Mr. P Rajasekhar Reddy; Sainath Omdas; Venkatesh Nangi; Neelam Chavada

Abstract

In artificial intelligence, caption generation is the challenging task of producing a textual description for a given image. It requires both computer vision methods, to understand the objects and actions in the image, and a natural language processing model, to generate the caption. Image captioning has many applications, including virtual assistants, recommendations in editing applications, image indexing, social media, assistance for visually impaired persons, and other natural language processing applications. We propose a hybrid system that uses a multilayer Convolutional Neural Network (CNN) to generate vocabulary describing the image and a Long Short-Term Memory (LSTM) network to structure meaningful sentences from the generated keywords. The CNN compares the given image against a large dataset of training images, and the system then generates a description based on the captions it was trained on.
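
The data flow of such a CNN and LSTM hybrid can be sketched in a few lines of plain Python. This is only a toy illustration under stated assumptions, not the system proposed here: the abstract does not specify layer sizes, how image features reach the LSTM, or the decoding strategy, so the single convolution layer, the use of image features as the initial hidden state, the greedy word choice, and every name below (VOCAB, HIDDEN, EMBED, random_weights, generate_caption) are hypothetical. The weights are random and untrained, so the words it emits carry no meaning.

```python
import math
import random

# Hypothetical toy vocabulary and sizes, chosen only for illustration.
VOCAB = ["<start>", "<end>", "a", "dog", "runs", "on", "grass"]
HIDDEN = 8   # LSTM state size, also the number of convolution kernels
EMBED = 4    # word embedding size


def conv2d_valid(image, kernel):
    """Valid 2D cross-correlation of a grayscale image with one kernel, then ReLU."""
    kh, kw = len(kernel), len(kernel[0])
    out = []
    for i in range(len(image) - kh + 1):
        row = []
        for j in range(len(image[0]) - kw + 1):
            s = sum(image[i + a][j + b] * kernel[a][b]
                    for a in range(kh) for b in range(kw))
            row.append(max(0.0, s))
        out.append(row)
    return out


def encode_image(image, kernels):
    """CNN stand-in: one feature map per kernel, global-average-pooled to a vector."""
    feats = []
    for kernel in kernels:
        fmap = conv2d_valid(image, kernel)
        feats.append(sum(sum(r) for r in fmap) / (len(fmap) * len(fmap[0])))
    return feats


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(m, v):
    return [sum(w * x for w, x in zip(row, v)) for row in m]


def lstm_step(x, h, c, params):
    """One standard LSTM cell step; each gate's W acts on the concatenation [x, h]."""
    z = x + h  # list concatenation
    pre = {name: [p + b for p, b in zip(matvec(W, z), bias)]
           for name, (W, bias) in params.items()}
    i = [sigmoid(v) for v in pre["i"]]    # input gate
    f = [sigmoid(v) for v in pre["f"]]    # forget gate
    o = [sigmoid(v) for v in pre["o"]]    # output gate
    g = [math.tanh(v) for v in pre["g"]]  # candidate cell values
    c_new = [fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c, i, g)]
    h_new = [ov * math.tanh(cv) for ov, cv in zip(o, c_new)]
    return h_new, c_new


def random_weights(seed=0):
    """Untrained random parameters; a real system would learn these from captioned images."""
    rng = random.Random(seed)

    def mat(rows, cols):
        return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

    return {
        "kernels": [mat(3, 3) for _ in range(HIDDEN)],
        "embed": mat(len(VOCAB), EMBED),
        "lstm": {g: (mat(HIDDEN, EMBED + HIDDEN), [0.0] * HIDDEN) for g in "ifog"},
        "out": mat(len(VOCAB), HIDDEN),
    }


def generate_caption(image, weights, max_len=10):
    """Greedy decoding: image features seed the LSTM, which emits one word per step."""
    h = encode_image(image, weights["kernels"])  # assumption: features initialise h
    c = [0.0] * HIDDEN
    word = VOCAB.index("<start>")
    caption = []
    for _ in range(max_len):
        h, c = lstm_step(weights["embed"][word], h, c, weights["lstm"])
        scores = matvec(weights["out"], h)
        word = max(range(len(VOCAB)), key=scores.__getitem__)
        if VOCAB[word] == "<end>":
            break
        caption.append(VOCAB[word])
    return caption


# Usage: a synthetic 6x6 grayscale "image" with pixel values in [0, 1].
image = [[((i * j) % 7) / 6.0 for j in range(6)] for i in range(6)]
print(generate_caption(image, random_weights(seed=0), max_len=5))
```

In practice the encoder would typically be a deep CNN and both parts would be trained jointly on image and caption pairs; the sketch only shows how an image vector can condition a word-by-word sentence generator.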

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

All published Articles are Open Access at https://journals.pen2print.org/index.php/ijr/

International Journal of Research
