IJNRD Research Journal

WhatsApp
Click Here

WhatsApp editor@ijnrd.org
IJNRD
INTERNATIONAL JOURNAL OF NOVEL RESEARCH AND DEVELOPMENT
International Peer Reviewed & Refereed Journals, Open Access Journal
ISSN Approved Journal No: 2456-4184 | Impact factor: 8.76 | ESTD Year: 2016
Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 8.76 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)

Call For Paper

For Authors

Forms / Download

Published Issue Details

Editorial Board

Other IMP Links

Facts & Figure

Impact Factor : 8.76

Issue per Year : 12

Volume Published : 9

Issue Published : 96

Article Submitted :

Article Published :

Total Authors :

Total Reviewer :

Total Countries :

Indexing Partner

Join RMS/Earn 300

Licence

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License
Published Paper Details
Paper Title: SPEECH TO IMAGE TRANSLATION
Authors Name: VIKAS YADAV , TUSHAR SRIVASTAVA
Download E-Certificate: Download
Author Reg. ID:
IJNRD_208085
Published Paper Id: IJNRD2312023
Published In: Volume 8 Issue 11, November-2023
DOI: http://doi.one/10.1729/Journal.37051
Abstract: Today we often see people speak and the computer/mobile types the words exactly as spoken by the operator. This direct speech- to- text conversion has led to a curiosity to develop a method or program through which we could directly convert our speech to an image. If we could convert any speech to image directly then it would pave a path due to the vast potential applications it would have in man- machine interaction, art creation, computer- aided design, etc. The generation of realistic images from text automatically is an intriguing and valuable concept, but current AI systems have not yet achieved this objective. Nevertheless, for learning discriminatory text feature representations, recent years have seen the development of general and strong recurrent neural network designs. Deep GANs lead to generate the images related to certain kinds, like faces, folder covers, etc. The goal of this work is that these developments in text and image modeling must be associated by converting graphic notions from characters to pixels using an original deep structure and GAN formulation. The potential of our model is proved by creating realistic pictures of birds and flowers from thorough text descriptions.
Keywords: Speech-to-image translation, cross-modal generation, generative adversarial network, teacher-student learning.
Cite Article: "SPEECH TO IMAGE TRANSLATION", International Journal of Novel Research and Development (www.ijnrd.org), ISSN:2456-4184, Vol.8, Issue 11, page no.a157-a170, November-2023, Available :http://www.ijnrd.org/papers/IJNRD2312023.pdf
Downloads: 000118767
ISSN: 2456-4184 | IMPACT FACTOR: 8.76 Calculated By Google Scholar| ESTD YEAR: 2016
An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 8.76 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator
Publication Details: Published Paper ID:IJNRD2312023
Registration ID: 208085
Published In: Volume 8 Issue 11, November-2023
DOI (Digital Object Identifier): http://doi.one/10.1729/Journal.37051
Page No: a157-a170
Country: RAEBARELI, UTTAR PRADESH, India
Research Area: Electronics & Communication Engg. 
Publisher : IJ Publication
Published Paper URL : https://www.ijnrd.org/viewpaperforall?paper=IJNRD2312023
Published Paper PDF: https://www.ijnrd.org/papers/IJNRD2312023
Share Article:
Share

Click Here to Download This Article

Article Preview
Click Here to Download This Article

Major Indexing from www.ijnrd.org
Semantic Scholar Microsaoft Academic ORCID Zenodo
Google Scholar ResearcherID Thomson Reuters Mendeley : reference manager Academia.edu
arXiv.org : cornell university library Research Gate CiteSeerX PUBLON
DRJI SSRN Scribd DocStoc

ISSN Details

ISSN: 2456-4184
Impact Factor: 8.76 and ISSN APPROVED
Journal Starting Year (ESTD) : 2016

DOI (A digital object identifier)


Providing A digital object identifier by DOI
How to Get DOI? DOI

Conference

Open Access License Policy

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License

Creative Commons License This material is Open Knowledge This material is Open Data This material is Open Content

Important Details

Social Media

Licence

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License

Join RMS/Earn 300

IJNRD