Related papers: LayerAnimate: Layer-level Control for Animation

Workflow-Aware Structured Layer Decomposition for Illustration Production

Recent generative image editing methods adopt layered representations to mitigate the entangled nature of raster images and improve controllability, typically relying on object-based segmentation. However, such strategies may fail to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Tianyu Zhang , Dongchi Li , Keiichi Sawada , Haoran Xie

LoopAnimate: Loopable Salient Object Animation

Research on diffusion model-based video generation has advanced rapidly. However, limitations in object fidelity and generation length hinder its practical applications. Additionally, specific domains like animated wallpapers require…

Computer Vision and Pattern Recognition · Computer Science 2024-04-17 Fanyi Wang , Peng Liu , Haotian Hu , Dan Meng , Jingwen Su , Jinjin Xu , Yanhao Zhang , Xiaoming Ren , Zhiwang Zhang

PhysLayer: Language-Guided Layered Animation with Depth-Aware Physics

Existing image-to-video generation methods often produce physically implausible motions and lack precise control over object dynamics. While prior approaches have incorporated physics simulators, they remain confined to 2D planar motions…

Computer Vision and Pattern Recognition · Computer Science 2026-04-28 Tianyidan Xie , Zhentao Huang , Mingjie Wang , Xin Huang , Jun Zhou , Minglun Gong , Zili Yi

TransAnimate: Taming Layer Diffusion to Generate RGBA Video

Text-to-video generative models have made remarkable advancements in recent years. However, generating RGBA videos with alpha channels for transparency and visual effects remains a significant challenge due to the scarcity of suitable…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Xuewei Chen , Zhimin Chen , Yiren Song

See-through: Single-image Layer Decomposition for Anime Characters

We introduce a framework that automates the transformation of static anime illustrations into manipulatable 2.5D models. Current professional workflows require tedious manual segmentation and the artistic ``hallucination'' of occluded…

Computer Vision and Pattern Recognition · Computer Science 2026-02-04 Jian Lin , Chengze Li , Haoyun Qin , Kwun Wang Chan , Yanghua Jin , Hanyuan Liu , Stephen Chun Wang Choy , Xueting Liu

LayerFlow: A Unified Model for Layer-aware Video Generation

We present LayerFlow, a unified solution for layer-aware video generation. Given per-layer prompts, LayerFlow generates videos for the transparent foreground, clean background, and blended scene. It also supports versatile variants like…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Sihui Ji , Hao Luo , Xi Chen , Yuanpeng Tu , Yiyang Wang , Hengshuang Zhao

InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation

Recent video generation research has focused heavily on isolated actions, leaving interactive motions-such as hand-face interactions-largely unexamined. These interactions are essential for emerging biometric authentication systems, which…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Yukang Lin , Yan Hong , Zunnan Xu , Xindi Li , Chao Xu , Chuanbiao Song , Ronghui Li , Haoxing Chen , Jun Lan , Huijia Zhu , Weiqiang Wang , Jianfu Zhang , Xiu Li

AnimateAnything: Fine-Grained Open Domain Image Animation with Motion Guidance

Image animation is a key task in computer vision which aims to generate dynamic visual content from static image. Recent image animation methods employ neural based rendering technique to generate realistic animations. Despite these…

Computer Vision and Pattern Recognition · Computer Science 2023-12-06 Zuozhuo Dai , Zhenghao Zhang , Yao Yao , Bingxue Qiu , Siyu Zhu , Long Qin , Weizhi Wang

Layered Controllable Video Generation

We introduce layered controllable video generation, where we, without any supervision, decompose the initial frame of a video into foreground and background layers, with which the user can control the video generation process by simply…

Computer Vision and Pattern Recognition · Computer Science 2022-10-05 Jiahui Huang , Yuhe Jin , Kwang Moo Yi , Leonid Sigal

Towards Multi-Layered 3D Garments Animation

Mimicking realistic dynamics in 3D garment animations is a challenging task due to the complex nature of multi-layered garments and the variety of outer forces involved. Existing approaches mostly focus on single-layered garments driven by…

Computer Vision and Pattern Recognition · Computer Science 2023-05-18 Yidi Shao , Chen Change Loy , Bo Dai

AniME: Adaptive Multi-Agent Planning for Long Animation Generation

We present AniME, a director-oriented multi-agent system for automated long-form anime production, covering the full workflow from a story to the final video. The director agent keeps a global memory for the whole workflow, and coordinates…

Artificial Intelligence · Computer Science 2025-10-13 Lisai Zhang , Baohan Xu , Siqian Yang , Mingyu Yin , Jing Liu , Chao Xu , Siqi Wang , Yidi Wu , Yuxin Hong , Zihao Zhang , Yanzhang Liang , Yudong Jiang

AnimAgents: Coordinating Multi-Stage Animation Pre-Production with Human-Multi-Agent Collaboration

Animation pre-production lays the foundation of an animated film by transforming initial concepts into a coherent blueprint across interdependent stages such as ideation, scripting, design, and storyboarding. While generative AI tools are…

Human-Computer Interaction · Computer Science 2025-11-25 Wen-Fan Wang , Chien-Ting Lu , Jin Ping Ng , Yi-Ting Chiu , Ting-Ying Lee , Miaosen Wang , Bing-Yu Chen , Xiang 'Anthony' Chen

UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation

Recent diffusion-based human image animation techniques have demonstrated impressive success in synthesizing videos that faithfully follow a given reference identity and a sequence of desired movement poses. Despite this, there are still…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Xiang Wang , Shiwei Zhang , Changxin Gao , Jiayu Wang , Xiaoqiang Zhou , Yingya Zhang , Luxin Yan , Nong Sang

AnimateAnything: Consistent and Controllable Animation for Video Generation

We present a unified controllable video generation approach AnimateAnything that facilitates precise and consistent video manipulation across various conditions, including camera trajectories, text prompts, and user motion annotations.…

Computer Vision and Pattern Recognition · Computer Science 2024-11-19 Guojun Lei , Chi Wang , Hong Li , Rong Zhang , Yikai Wang , Weiwei Xu

AnimeColor: Reference-based Animation Colorization with Diffusion Transformers

Animation colorization plays a vital role in animation production, yet existing methods struggle to achieve color accuracy and temporal consistency. To address these challenges, we propose \textbf{AnimeColor}, a novel reference-based…

Computer Vision and Pattern Recognition · Computer Science 2025-07-29 Yuhong Zhang , Liyao Wang , Han Wang , Danni Wu , Zuzeng Lin , Feng Wang , Li Song

Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation

Traditional animation generation methods depend on training generative models with human-labelled data, entailing a sophisticated multi-stage pipeline that demands substantial human effort and incurs high training costs. Due to limited…

Computation and Language · Computer Science 2024-08-20 Yunxin Li , Haoyuan Shi , Baotian Hu , Longyue Wang , Jiashun Zhu , Jinyi Xu , Zhen Zhao , Min Zhang

MVAnimate: Enhancing Character Animation with Multi-View Optimization

The demand for realistic and versatile character animation has surged, driven by its wide-ranging applications in various domains. However, the animation generation algorithms modeling human pose with 2D or 3D structures all face various…

Computer Vision and Pattern Recognition · Computer Science 2026-02-10 Tianyu Sun , Zhoujie Fu , Bang Zhang , Guosheng Lin

LayerBuilder: Layer Decomposition for Interactive Image and Video Color Editing

Exploring and editing colors in images is a common task in graphic design and photography. However, allowing for interactive recoloring while preserving smooth color blends in the image remains a challenging problem. We present…

Graphics · Computer Science 2017-01-18 Sharon Lin , Matthew Fisher , Angela Dai , Pat Hanrahan

Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion

Recent diffusion-based talking face generation models have demonstrated impressive potential in synthesizing videos that accurately match a speech audio clip with a given reference identity. However, existing approaches still encounter…

Computer Vision and Pattern Recognition · Computer Science 2025-10-16 Xingpei Ma , Jiaran Cai , Yuansheng Guan , Shenneng Huang , Qiang Zhang , Shunsi Zhang

Generative Image Layer Decomposition with Visual Effects

Recent advancements in large generative models, particularly diffusion-based methods, have significantly enhanced the capabilities of image editing. However, achieving precise control over image composition tasks remains a challenge.…

Computer Vision and Pattern Recognition · Computer Science 2024-11-28 Jinrui Yang , Qing Liu , Yijun Li , Soo Ye Kim , Daniil Pakhomov , Mengwei Ren , Jianming Zhang , Zhe Lin , Cihang Xie , Yuyin Zhou