Related papers: LoopAnimate: Loopable Salient Object Animation

200 papers

Recent diffusion-based human image animation techniques have demonstrated impressive success in synthesizing videos that faithfully follow a given reference identity and a sequence of desired movement poses. Despite this, there are still…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Xiang Wang, Shiwei Zhang, Changxin Gao, Jiayu Wang, Xiaoqiang Zhou, Yingya Zhang, Luxin Yan, Nong Sang

This paper studies the human image animation task, which aims to generate a video of a certain reference identity following a particular motion sequence. Existing animation works typically employ the frame-warping technique to animate the…

Computer Vision and Pattern Recognition · Computer Science 2023-11-29 Zhongcong Xu, Jianfeng Zhang, Jun Hao Liew, Hanshu Yan, Jia-Wei Liu, Chenxu Zhang, Jiashi Feng, Mike Zheng Shou

We propose EverAnimate, an efficient post-training method for long-horizon animated video generation that preserves visual quality and character identity. Long-form animation remains challenging because highly dynamic human motion must be…

Computer Vision and Pattern Recognition · Computer Science 2026-05-15 Wuyang Li, Yang Gao, Mariam Hassan, Lan Feng, Wentao Pan, Po-Chien Luan, Alexandre Alahi

With the impressive progress in diffusion-based text-to-image generation, extending such powerful generative ability to text-to-video raises enormous attention. Existing methods either require large-scale text-video pairs and a large number…

Computer Vision and Pattern Recognition · Computer Science 2023-10-18 Ruiqi Wu, Liangyu Chen, Tong Yang, Chunle Guo, Chongyi Li, Xiangyu Zhang

We present Mobius, a novel method to generate seamlessly looping videos from text descriptions directly without any user annotations, thereby creating new visual materials for the multi-media presentation. Our method repurposes the…

Computer Vision and Pattern Recognition · Computer Science 2025-02-28 Xiuli Bi, Jianfei Yuan, Bo Liu, Yong Zhang, Xiaodong Cun, Chi-Man Pun, Bin Xiao

Traditional animation production decomposes visual elements into discrete layers to enable independent processing for sketching, refining, coloring, and in-betweening. Existing anime generation video methods typically treat animation as a…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Yuxue Yang, Lue Fan, Zuzeng Lin, Feng Wang, Zhaoxiang Zhang

We introduce Lumiere -- a text-to-video diffusion model designed for synthesizing videos that portray realistic, diverse and coherent motion -- a pivotal challenge in video synthesis. To this end, we introduce a Space-Time U-Net…

Latent Diffusion Models (LDMs) enable high-quality image synthesis while avoiding excessive compute demands by training a diffusion model in a compressed lower-dimensional latent space. Here, we apply the LDM paradigm to high-resolution…

Computer Vision and Pattern Recognition · Computer Science 2023-12-29 Andreas Blattmann, Robin Rombach, Huan Ling, Tim Dockhorn, Seung Wook Kim, Sanja Fidler, Karsten Kreis

While recent years have witnessed great progress on using diffusion models for video generation, most of them are simple extensions of image generation frameworks, which fail to explicitly consider one of the key differences between videos…

Computer Vision and Pattern Recognition · Computer Science 2024-07-31 Jingyun Liang, Yuchen Fan, Kai Zhang, Radu Timofte, Luc Van Gool, Rakesh Ranjan

Generating temporally coherent high fidelity video is an important milestone in generative modeling research. We make progress towards this milestone by proposing a diffusion model for video generation that shows very promising initial…

Computer Vision and Pattern Recognition · Computer Science 2022-06-24 Jonathan Ho, Tim Salimans, Alexey Gritsenko, William Chan, Mohammad Norouzi, David J. Fleet

The video generation field has witnessed rapid improvements with the introduction of recent diffusion models. While these models have successfully enhanced appearance quality, they still face challenges in generating coherent and natural…

Computer Vision and Pattern Recognition · Computer Science 2025-04-21 Yaosi Hu, Zhenzhong Chen, Chong Luo

Generating realistic animated videos from static images is an important area of research in computer vision. Methods based on physical simulation and motion prediction have achieved notable advances, but they are often limited to specific…

Computer Vision and Pattern Recognition · Computer Science 2024-05-29 Qiang Wang, Minghua Liu, Junjun Hu, Fan Jiang, Mu Xu

We present a video generation model that accurately reproduces object motion, changes in camera viewpoint, and new content that arises over time. Existing video generation methods often fail to produce new content as a function of time…

Computer Vision and Pattern Recognition · Computer Science 2022-06-10 Tim Brooks, Janne Hellsten, Miika Aittala, Ting-Chun Wang, Timo Aila, Jaakko Lehtinen, Ming-Yu Liu, Alexei A. Efros, Tero Karras

This paper introduces EasyAnimate, an efficient and high quality video generation framework that leverages diffusion transformers to achieve high-quality video production, encompassing data processing, model training, and end-to-end…

Computer Vision and Pattern Recognition · Computer Science 2026-03-06 Jiaqi Xu, Kunzhe Huang, Xinyi Zou, Yunkuo Chen, Bo Liu, MengLi Cheng, Jun Huang, Xing Shi

Long video generation remains a challenging and compelling topic in computer vision. Diffusion-based models, among the various approaches to video generation, have achieved state-of-the-art quality with their iterative denoising procedures.…

Computer Vision and Pattern Recognition · Computer Science 2025-03-14 Siyang Zhang, Harry Yang, Ser-Nam Lim

Conditional human animation traditionally animates static reference images using pose-based motion cues extracted from video data. However, these video-derived cues often suffer from low temporal resolution, motion blur, and unreliable…

Computer Vision and Pattern Recognition · Computer Science 2025-05-27 Qiang Qu, Ming Li, Xiaoming Chen, Tongliang Liu

We propose a zero-shot approach for generating consistent videos of animated characters based on Text-to-Image (T2I) diffusion models. Existing Text-to-Video (T2V) methods are expensive to train and require large-scale video datasets to…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Abdelrahman Eldesokey, Peter Wonka

With the introduction of diffusion-based video generation techniques, audio-conditioned human video generation has recently achieved significant breakthroughs in both the naturalness of motion and the synthesis of portrait details. Due to…

Computer Vision and Pattern Recognition · Computer Science 2025-04-07 Jianwen Jiang, Chao Liang, Jiaqi Yang, Gaojie Lin, Tianyun Zhong, Yanbo Zheng

Contemporary models for generating images show remarkable quality and versatility. Swayed by these advantages, the research community repurposes them to generate videos. Since video content is highly redundant, we argue that naively…

Computer Vision and Pattern Recognition · Computer Science 2024-02-23 Willi Menapace, Aliaksandr Siarohin, Ivan Skorokhodov, Ekaterina Deyneka, Tsai-Shien Chen, Anil Kag, Yuwei Fang, Aleksei Stoliar, Elisa Ricci, Jian Ren, Sergey Tulyakov

Text-to-video diffusion models are notoriously limited in their ability to model temporal aspects such as motion, physics, and dynamic interactions. Existing approaches address this limitation by retraining the model or introducing external…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Ariel Shaulov, Itay Hazan, Lior Wolf, Hila Chefer