Related papers: Comixify: Transform video into a comics

ComicGAN: Text-to-Comic Generative Adversarial Network

Drawing and annotating comic illustrations is a complex and difficult process. No existing machine learning algorithms have been developed to create comic illustrations based on descriptions of illustrations, or the dialogue in comics.…

Computer Vision and Pattern Recognition · Computer Science 2021-09-21 Ben Proven-Bessel, Zilong Zhao, Lydia Chen

Neural Comic Style Transfer: Case Study

The work by Gatys et al. [1] recently showed a neural style algorithm that can produce an image in the style of another image. Some further works introduced various improvements regarding generalization, quality and efficiency, but each of…

Computer Vision and Pattern Recognition · Computer Science 2018-09-12 Maciej Pęśko, Tomasz Trzciński

Video-to-Video Translation for Visual Speech Synthesis

Despite remarkable success in image-to-image translation that celebrates the advancements of generative adversarial networks (GANs), very limited attempts are known for video domain translation. We study the task of video-to-video…

Computer Vision and Pattern Recognition · Computer Science 2019-05-30 Michail C. Doukas, Viktoriia Sharmanska, Stefanos Zafeiriou

Cartoon-to-real: An Approach to Translate Cartoon to Realistic Images using GAN

We propose a method to translate cartoon images to real world images using Generative Adversarial Network (GAN). Existing GAN-based image-to-image translation methods which are trained on paired datasets are impractical as the data is…

Computer Vision and Pattern Recognition · Computer Science 2019-03-25 K M Arefeen Sultan, Labiba Kanij Rupty, Nahidul Islam Pranto, Sayed Khan Shuvo, Mohammad Imrul Jubair

CariGANs: Unpaired Photo-to-Caricature Translation

Facial caricature is an art form of drawing faces in an exaggerated way to convey humor or sarcasm. In this paper, we propose the first Generative Adversarial Network (GAN) for unpaired photo-to-caricature translation, which we call…

Computer Vision and Pattern Recognition · Computer Science 2018-11-05 Kaidi Cao, Jing Liao, Lu Yuan

cGANs for Cartoon to Real-life Images

The image-to-image translation is a learning task to establish a visual mapping between an input and output image. The task has several variations differentiated based on the purpose of the translation, such as synthetic to real…

Computer Vision and Pattern Recognition · Computer Science 2021-01-26 Pranjal Singh Rajput, Kanya Satis, Sonnya Dellarosa, Wenxuan Huang, Obinna Agba

WAIT: Feature Warping for Animation to Illustration video Translation using GANs

In this paper, we explore a new domain for video-to-video translation. Motivated by the availability of animation movies that are adapted from illustrated books for children, we aim to stylize these videos with the style of the original…

Computer Vision and Pattern Recognition · Computer Science 2025-03-24 Samet Hicsonmez, Nermin Samet, Fidan Samet, Oguz Bakir, Emre Akbas, Pinar Duygulu

Anime-to-Real Clothing: Cosplay Costume Generation via Image-to-Image Translation

Cosplay has grown from its origins at fan conventions into a billion-dollar global dress phenomenon. To facilitate imagination and reinterpretation from animated images to real garments, this paper presents an automatic costume image…

Computer Vision and Pattern Recognition · Computer Science 2020-08-27 Koya Tango, Marie Katsurai, Hayato Maki, Ryosuke Goto

Generative Adversarial Networks for Video-to-Video Domain Adaptation

Endoscopic videos from multicentres often have different imaging conditions, e.g., color and illumination, which make the models trained on one domain usually fail to generalize well to another. Domain adaptation is one of the potential…

Computer Vision and Pattern Recognition · Computer Science 2020-04-20 Jiawei Chen, Yuexiang Li, Kai Ma, Yefeng Zheng

AI-based System for Transforming text and sound to Educational Videos

Technological developments have produced methods that can generate educational videos from input text or sound. Recently, the use of deep learning techniques for image and video generation has been widely explored, particularly in…

Multimedia · Computer Science 2026-01-27 M. E. ElAlami, S. M. Khater, M. El. R. Rehan

Emotion-Aware Speech Generation with Character-Specific Voices for Comics

This paper presents an end-to-end pipeline for generating character-specific, emotion-aware speech from comics. The proposed system takes full comic volumes as input and produces speech aligned with each character's dialogue and emotional…

Sound · Computer Science 2025-09-22 Zhiwen Qian, Jinhua Liang, Huan Zhang

Collaborative Comic Generation: Integrating Visual Narrative Theories with AI Models for Enhanced Creativity

This study presents a theory-inspired visual narrative generative system that integrates conceptual principles (comic authoring idioms) with generative and language models to enhance the comic creation process. Our system combines human…

Artificial Intelligence · Computer Science 2024-09-27 Yi-Chun Chen, Arnav Jhala

Automatic Comic Generation with Stylistic Multi-page Layouts and Emotion-driven Text Balloon Generation

In this paper, we propose a fully automatic system for generating comic books from videos without any human intervention. Given an input video along with its subtitles, our approach first extracts informative keyframes by analyzing the…

Computer Vision and Pattern Recognition · Computer Science 2021-01-28 Xin Yang, Zongliang Ma, Letian Yu, Ying Cao, Baocai Yin, Xiaopeng Wei, Qiang Zhang, Rynson W. H. Lau

GANime: Generating Anime and Manga Character Drawings from Sketches with Deep Learning

The process of generating fully colorized drawings from sketches is a large, usually costly bottleneck in the manga and anime industry. In this study, we examine multiple models for image-to-image translation between anime characters and…

Computer Vision and Pattern Recognition · Computer Science 2025-08-14 Tai Vu, Robert Yang

Style Transfer for Anime Sketches with Enhanced Residual U-net and Auxiliary Classifier GAN

Recently, with the revolutionary neural style transferring methods, creditable paintings can be synthesized automatically from content images and style images. However, when it comes to the task of applying a painting's style to an anime…

Computer Vision and Pattern Recognition · Computer Science 2017-06-14 Lvmin Zhang, Yi Ji, Xin Lin

CariGAN: Caricature Generation through Weakly Paired Adversarial Learning

Caricature generation is an interesting yet challenging task. The primary goal is to generate plausible caricatures with reasonable exaggerations given face images. Conventional caricature generation approaches mainly use low-level…

Computer Vision and Pattern Recognition · Computer Science 2018-11-22 Wenbin Li, Wei Xiong, Haofu Liao, Jing Huo, Yang Gao, Jiebo Luo

Artistic style transfer for videos and spherical images

Manually re-drawing an image in a certain artistic style takes a professional artist a long time. Doing this for a video sequence single-handedly is beyond imagination. We present two computational approaches that transfer the style from…

Computer Vision and Pattern Recognition · Computer Science 2018-08-07 Manuel Ruder, Alexey Dosovitskiy, Thomas Brox

AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation

In this paper, we propose a novel framework to translate a portrait photo-face into an anime appearance. Our aim is to synthesize anime-faces which are style-consistent with a given reference anime-face. However, unlike typical translation…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Bing Li, Yuanlue Zhu, Yitong Wang, Chia-Wen Lin, Bernard Ghanem, Linlin Shen

Semantic Draw Engineering for Text-to-Image Creation

Text-to-image generation is conducted through Generative Adversarial Networks (GANs) or transformer models. However, the current challenge lies in accurately generating images based on textual descriptions, especially in scenarios where the…

Human-Computer Interaction · Computer Science 2024-01-10 Yang Li, Huaqiang Jiang, Yangkai Wu

Video Content Swapping Using GAN

Video generation is an interesting problem in computer vision. It is quite popular for data augmentation, special effects in movies, AR/VR and so on. With the advances of deep learning, many deep generative models have been proposed to solve…

Computer Vision and Pattern Recognition · Computer Science 2021-11-23 Tingfung Lau, Sailun Xu, Xinze Wang