Open Access
Research Paper
Peer Reviewed

Paper Title

"VISION TO TEXT: ADVANCED IMAGE CAPTIONING WITH TRANSFORMER MODELS"

Article Identifiers

Registration ID: IJNRD_300461

Published ID: IJNRD2409170

DOI: http://doi.one/10.1729/Journal.41517

Keywords

Image captioning, InceptionV3, Transformer architecture, Attention Mechanisms, Neural network architectures

Abstract

This project introduces an approach to image captioning based on a Transformer architecture trained on the COCO 2017 dataset. The goal is to combine natural language processing with computer vision so that descriptive captions can be generated for a wide variety of visual content. The dataset is first preprocessed, and a curated subset of 70,000 image-caption pairs is selected. The architecture comprises an InceptionV3-based CNN encoder followed by Transformer encoder and decoder layers. Training uses a custom loss function and early stopping, and yields a well-performing model after five epochs. Experimental results demonstrate the model's ability to generate consistent, contextually appropriate captions for different images. Beyond dataset images, the model can also caption external images supplied through URLs, which points to potential real-world applications outside the training dataset.
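
The abstract mentions a custom loss function and early stopping without defining either. The sketch below shows one common way these two pieces are built for captioning models; it is not the authors' implementation. The padding id `PAD_ID`, the idea that the custom loss masks padded caption positions, and the `patience` and `min_delta` parameters are all assumptions made for illustration.

```python
import math

PAD_ID = 0  # assumption: token id 0 marks caption padding


def masked_cross_entropy(probs, targets, pad_id=PAD_ID):
    """Mean negative log-likelihood over non-padding positions.

    probs:   list of per-position probability lists over the vocabulary.
    targets: list of target token ids, same length as probs.
    """
    total, count = 0.0, 0
    for dist, tok in zip(probs, targets):
        if tok == pad_id:
            continue  # padded positions contribute nothing to the loss
        total += -math.log(max(dist[tok], 1e-12))  # clamp to avoid log(0)
        count += 1
    return total / count if count else 0.0


class EarlyStopping:
    """Signal a stop once validation loss fails to improve for `patience` epochs."""

    def __init__(self, patience=3, min_delta=0.0):
        self.patience = patience    # hypothetical default, not from the paper
        self.min_delta = min_delta  # smallest decrease that counts as improvement
        self.best = math.inf
        self.bad_epochs = 0

    def step(self, val_loss):
        """Record one epoch's validation loss; return True when training should stop."""
        if val_loss < self.best - self.min_delta:
            self.best = val_loss
            self.bad_epochs = 0
        else:
            self.bad_epochs += 1
        return self.bad_epochs >= self.patience
```

In a training loop, `masked_cross_entropy` would be evaluated on each caption batch, and `EarlyStopping.step` would be called once per epoch with the validation loss, ending training when it returns `True`.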

How To Cite (APA)

Chinthaparthi Sridhar & Pavani Kotha (September-2024). "VISION TO TEXT: ADVANCED IMAGE CAPTIONING WITH TRANSFORMER MODELS". INTERNATIONAL JOURNAL OF NOVEL RESEARCH AND DEVELOPMENT, 9(9), b593-b602. http://doi.one/10.1729/Journal.41517

Other Publication Details

Research Area: Science and Technology

Author Type: Indian Author

Country: Tirupathi, Andhra Pradesh, India

Published Paper PDF: https://ijnrd.org/papers/IJNRD2409170.pdf

Published Paper URL: https://ijnrd.org/viewpaperforall?paper=IJNRD2409170

Crossref DOI: http://doi.one/10.1729/Journal.41517

About Publisher

Journal Name: INTERNATIONAL JOURNAL OF NOVEL RESEARCH AND DEVELOPMENT (IJNRD)

UGC CARE JOURNAL PUBLICATION | ISSN: 2456-4184 | IMPACT FACTOR: 8.76 Calculated By Google Scholar | ESTD YEAR: 2016

Publisher: IJNRD (IJ Publication) Janvi Wave | IJNRD.ORG | IJNRD.COM | IJPUB.ORG

Copyright & License

© 2025 Authors hold the copyright of this article. This work is licensed under a Creative Commons Attribution 4.0 International License and The Open Definition.

You are free to share, adapt, and redistribute the material, provided proper credit is given to the original author.

Disclaimer: The content, data, and findings in this article are based on the authors' research and have been peer-reviewed for academic purposes only. Readers are advised to verify all information before practical or commercial use. The journal and its editorial board are not liable for any errors, losses, or consequences arising from its use.

Peer Review: Through Scholar9.com Platform
