A804113010524 - Indian Journal of Image Processing and Recognition (IJIPR)

An Overview of Text to Visual Generation Using GAN
Sibi Mathew
Sibi Mathew, MTech Scholar, Department of CSE TKM College of Engineering Kollam, Kerala, India.

Abstract: Text-to-visual generation was once a cumbersome task until the advent of deep learning networks. With the introduction of deep learning, both images and videos can now be generated from textual descriptions. Deep learning networks have revolutionized various fields, including computer vision and natural language processing, with the emergence of Generative Adversarial Networks (GANs). GANs have played a significant role in advancing these domains. A GAN typically comprises multiple deep networks combined with other machine learning techniques. In the context of text-to-visual generation, GANs have enabled the synthesis of images and videos based on textual input. This work aims to explore different variations of GANs for image and video synthesis and propose a general architecture for text-to-visual generation using GANs. Additionally, this study delves into the challenges associated with this task and discusses ongoing research and future prospects. By leveraging the power of deep learning networks and GANs, the process of generating visual content from text has become more accessible and efficient. This work will contribute to the understanding and advancement of text-to-visual generation, paving the way for numerous applica-tions across various industries.

Keywords: Computer Vision, Image Synthesis, Natural Lan- Guage Processing, Video Synthesis, Filler Images.
Scope of the Article: Communication of Visual Data

Download PDF

JOURNAL

REQUIREMENTS

PRODUCT

CONTACT US