English
Related papers

Related papers: Comixify: Transform video into a comics

200 papers

Drawing and annotating comic illustrations is a complex and difficult process. No existing machine learning algorithms have been developed to create comic illustrations based on descriptions of illustrations, or the dialogue in comics.…

Computer Vision and Pattern Recognition · Computer Science 2021-09-21 Ben Proven-Bessel , Zilong Zhao , Lydia Chen

The work by Gatys et al. [1] recently showed a neural style algorithm that can produce an image in the style of another image. Some further works introduced various improvements regarding generalization, quality and efficiency, but each of…

Computer Vision and Pattern Recognition · Computer Science 2018-09-12 Maciej Pęśko , Tomasz Trzciński

Despite remarkable success in image-to-image translation that celebrates the advancements of generative adversarial networks (GANs), very limited attempts are known for video domain translation. We study the task of video-to-video…

Computer Vision and Pattern Recognition · Computer Science 2019-05-30 Michail C. Doukas , Viktoriia Sharmanska , Stefanos Zafeiriou

We propose a method to translate cartoon images to real world images using Generative Aderserial Network (GAN). Existing GAN-based image-to-image translation methods which are trained on paired datasets are impractical as the data is…

Computer Vision and Pattern Recognition · Computer Science 2019-03-25 K M Arefeen Sultan , Labiba Kanij Rupty , Nahidul Islam Pranto , Sayed Khan Shuvo , Mohammad Imrul Jubair

Facial caricature is an art form of drawing faces in an exaggerated way to convey humor or sarcasm. In this paper, we propose the first Generative Adversarial Network (GAN) for unpaired photo-to-caricature translation, which we call…

Computer Vision and Pattern Recognition · Computer Science 2018-11-05 Kaidi Cao , Jing Liao , Lu Yuan

The image-to-image translation is a learning task to establish a visual mapping between an input and output image. The task has several variations differentiated based on the purpose of the translation, such as synthetic to real…

Computer Vision and Pattern Recognition · Computer Science 2021-01-26 Pranjal Singh Rajput , Kanya Satis , Sonnya Dellarosa , Wenxuan Huang , Obinna Agba

In this paper, we explore a new domain for video-to-video translation. Motivated by the availability of animation movies that are adopted from illustrated books for children, we aim to stylize these videos with the style of the original…

Computer Vision and Pattern Recognition · Computer Science 2025-03-24 Samet Hicsonmez , Nermin Samet , Fidan Samet , Oguz Bakir , Emre Akbas , Pinar Duygulu

Cosplay has grown from its origins at fan conventions into a billion-dollar global dress phenomenon. To facilitate imagination and reinterpretation from animated images to real garments, this paper presents an automatic costume image…

Computer Vision and Pattern Recognition · Computer Science 2020-08-27 Koya Tango , Marie Katsurai , Hayato Maki , Ryosuke Goto

Endoscopic videos from multicentres often have different imaging conditions, e.g., color and illumination, which make the models trained on one domain usually fail to generalize well to another. Domain adaptation is one of the potential…

Computer Vision and Pattern Recognition · Computer Science 2020-04-20 Jiawei Chen , Yuexiang Li , Kai Ma , Yefeng Zheng

Technological developments have produced methods that can generate educational videos from input text or sound. Recently, the use of deep learning techniques for image and video generation has been widely explored, particularly in…

Multimedia · Computer Science 2026-01-27 M. E. ElAlami , S. M. Khater , M. El. R. Rehan

This paper presents an end-to-end pipeline for generating character-specific, emotion-aware speech from comics. The proposed system takes full comic volumes as input and produces speech aligned with each character's dialogue and emotional…

Sound · Computer Science 2025-09-22 Zhiwen Qian , Jinhua Liang , Huan Zhang

This study presents a theory-inspired visual narrative generative system that integrates conceptual principles-comic authoring idioms-with generative and language models to enhance the comic creation process. Our system combines human…

Artificial Intelligence · Computer Science 2024-09-27 Yi-Chun Chen , Arnav Jhala

In this paper, we propose a fully automatic system for generating comic books from videos without any human intervention. Given an input video along with its subtitles, our approach first extracts informative keyframes by analyzing the…

Computer Vision and Pattern Recognition · Computer Science 2021-01-28 Xin Yang , Zongliang Ma , Letian Yu , Ying Cao , Baocai Yin , Xiaopeng Wei , Qiang Zhang , Rynson W. H. Lau

The process of generating fully colorized drawings from sketches is a large, usually costly bottleneck in the manga and anime industry. In this study, we examine multiple models for image-to-image translation between anime characters and…

Computer Vision and Pattern Recognition · Computer Science 2025-08-14 Tai Vu , Robert Yang

Recently, with the revolutionary neural style transferring methods, creditable paintings can be synthesized automatically from content images and style images. However, when it comes to the task of applying a painting's style to an anime…

Computer Vision and Pattern Recognition · Computer Science 2017-06-14 Lvmin Zhang , Yi Ji , Xin Lin

Caricature generation is an interesting yet challenging task. The primary goal is to generate plausible caricatures with reasonable exaggerations given face images. Conventional caricature generation approaches mainly use low-level…

Computer Vision and Pattern Recognition · Computer Science 2018-11-22 Wenbin Li , Wei Xiong , Haofu Liao , Jing Huo , Yang Gao , Jiebo Luo

Manually re-drawing an image in a certain artistic style takes a professional artist a long time. Doing this for a video sequence single-handedly is beyond imagination. We present two computational approaches that transfer the style from…

Computer Vision and Pattern Recognition · Computer Science 2018-08-07 Manuel Ruder , Alexey Dosovitskiy , Thomas Brox

In this paper, we propose a novel framework to translate a portrait photo-face into an anime appearance. Our aim is to synthesize anime-faces which are style-consistent with a given reference anime-face. However, unlike typical translation…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Bing Li , Yuanlue Zhu , Yitong Wang , Chia-Wen Lin , Bernard Ghanem , Linlin Shen

Text-to-image generation is conducted through Generative Adversarial Networks (GANs) or transformer models. However, the current challenge lies in accurately generating images based on textual descriptions, especially in scenarios where the…

Human-Computer Interaction · Computer Science 2024-01-10 Yang Li , Huaqiang Jiang , Yangkai Wu

Video generation is an interesting problem in computer vision. It is quite popular for data augmentation, special effect in move, AR/VR and so on. With the advances of deep learning, many deep generative models have been proposed to solve…

Computer Vision and Pattern Recognition · Computer Science 2021-11-23 Tingfung Lau , Sailun Xu , Xinze Wang
‹ Prev 1 2 3 10 Next ›