IJNRD Research Journal

WhatsApp
Click Here

WhatsApp editor@ijnrd.org
IJNRD
INTERNATIONAL JOURNAL OF NOVEL RESEARCH AND DEVELOPMENT
International Peer Reviewed & Refereed Journals, Open Access Journal
ISSN Approved Journal No: 2456-4184 | Impact factor: 8.76 | ESTD Year: 2016
Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 8.76 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)

Call For Paper

For Authors

Forms / Download

Published Issue Details

Editorial Board

Other IMP Links

Facts & Figure

Impact Factor : 8.76

Issue per Year : 12

Volume Published : 9

Issue Published : 96

Article Submitted :

Article Published :

Total Authors :

Total Reviewer :

Total Countries :

Indexing Partner

Join RMS/Earn 300

Licence

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
Published Paper Details
Paper Title: IMAGE CAPTION GENERATION USING LSTMS AND CONVOLUTIONAL NEURAL NETWORKS
Authors Name: D Kanishya Gayathri , L Prinslin
Download E-Certificate: Download
Author Reg. ID:
IJNRD_205981
Published Paper Id: IJNRD2309391
Published In: Volume 8 Issue 9, September-2023
DOI: http://doi.one/10.1729/Journal.36336
Abstract: Image caption generation is a captivating intersection of computer vision and natural language processing, with applications spanning assistive technology, content retrieval, and human-computer interaction. In this project, we delve into the fusion of Convolutional Neural Networks (CNNs) and Long Short-Term Memory networks (LSTMs) to address this intriguing challenge. Our project centres on the holistic development of an image captioning system. We kick-started the journey with meticulous data collection and pre-processing, leveraging established datasets like MSCOCO. The images underwent resizing, normalization, and feature extraction using a pre-trained CNN, serving as our image encoder. The nucleus of our innovation resides in the model architecture. We meticulously designed a two-tiered structure, comprising a CNN as the image encoder and an LSTM as the text decoder. The CNN's role is to extract salient image features, while the LSTM excels at generating coherent and contextually relevant captions, guided by a cross-entropy loss function during training. Teacher forcing further stabilizes our model's convergence. Our system underwent rigorous evaluation employing a battery of metrics, including BLEU, METEOR, CIDEr, and ROUGE, benchmarking the generated captions against human-annotated references. This quantitative analysis provides an in-depth perspective on the system's performance and underscores areas for enhancement. As we conclude this report, we reflect on the challenges encountered throughout our project's lifecycle and propose avenues for future research in the captivating realm of image caption generation. Our aspiration is to refine and enhance our system, enabling it to generate not just accurate but also contextually rich captions—an endeavor that advances the frontiers of multimodal AI applications. In summary, this project showcases the symbiotic relationship between computer vision and natural language processing, underscoring the potential of CNN-LSTM architectures in the captivating domain of image captioning. Our work contributes to the burgeoning field of multimodal AI, fostering innovative applications across diverse domains.
Keywords: Image Captioning, Convolutional Neural Networks, LSTMs, Deep Learning, Computer Vision, Natural Language Processing, MSCOCO Dataset, Data Preprocessing, Model Architecture, Training, Evaluation Metrics, Encoder-Decoder, Teacher Forcing, Cross-Entropy Loss, Metric-Based Evaluation, BLEU, METEOR, CIDEr, ROUGE, Multimodal AI, Visual Understanding, Text Generation, Image Description, Machine Learning, Deep Neural Networks.
Cite Article: "IMAGE CAPTION GENERATION USING LSTMS AND CONVOLUTIONAL NEURAL NETWORKS", International Journal of Novel Research and Development (www.ijnrd.org), ISSN:2456-4184, Vol.8, Issue 9, page no.d775-d781, September-2023, Available :http://www.ijnrd.org/papers/IJNRD2309391.pdf
Downloads: 000118772
ISSN: 2456-4184 | IMPACT FACTOR: 8.76 Calculated By Google Scholar| ESTD YEAR: 2016
An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 8.76 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator
Publication Details: Published Paper ID:IJNRD2309391
Registration ID: 205981
Published In: Volume 8 Issue 9, September-2023
DOI (Digital Object Identifier): http://doi.one/10.1729/Journal.36336
Page No: d775-d781
Country: Chennai, Tamilnadu, India
Research Area: Engineering
Publisher : IJ Publication
Published Paper URL : https://www.ijnrd.org/viewpaperforall?paper=IJNRD2309391
Published Paper PDF: https://www.ijnrd.org/papers/IJNRD2309391
Share Article:
Share

Click Here to Download This Article

Article Preview
Click Here to Download This Article

Major Indexing from www.ijnrd.org
Semantic Scholar Microsaoft Academic ORCID Zenodo
Google Scholar ResearcherID Thomson Reuters Mendeley : reference manager Academia.edu
arXiv.org : cornell university library Research Gate CiteSeerX PUBLON
DRJI SSRN Scribd DocStoc

ISSN Details

ISSN: 2456-4184
Impact Factor: 8.76 and ISSN APPROVED
Journal Starting Year (ESTD) : 2016

DOI (A digital object identifier)


Providing A digital object identifier by DOI
How to Get DOI? DOI

Conference

Open Access License Policy

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License

Creative Commons License This material is Open Knowledge This material is Open Data This material is Open Content

Important Details

Social Media

Licence

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License

Join RMS/Earn 300

IJNRD