Related papers: Animating Arbitrary Objects via Deep Motion Transf…

Image Animation with Keypoint Mask

Motion transfer is the task of synthesizing future video frames of a single source image according to the motion from a given driving video. In order to solve it, we face the challenging complexity of motion representation and the unknown…

Computer Vision and Pattern Recognition · Computer Science 2021-12-23 Or Toledano , Yanir Marmor , Dov Gertz

First Order Motion Model for Image Animation

Image animation consists of generating a video sequence so that an object in a source image is animated according to the motion of a driving video. Our framework addresses this problem without using any annotation or prior information about…

Computer Vision and Pattern Recognition · Computer Science 2020-10-02 Aliaksandr Siarohin , Stéphane Lathuilière , Sergey Tulyakov , Elisa Ricci , Nicu Sebe

Implicit Warping for Animation with Image Sets

We present a new implicit warping framework for image animation using sets of source images through the transfer of the motion of a driving video. A single cross- modal attention layer is used to find correspondences between the source…

Computer Vision and Pattern Recognition · Computer Science 2022-10-05 Arun Mallya , Ting-Chun Wang , Ming-Yu Liu

Cross-Identity Motion Transfer for Arbitrary Objects through Pose-Attentive Video Reassembling

We propose an attention-based networks for transferring motions between arbitrary objects. Given a source image(s) and a driving video, our networks animate the subject in the source images according to the motion in the driving video. In…

Computer Vision and Pattern Recognition · Computer Science 2020-07-20 Subin Jeon , Seonghyeon Nam , Seoung Wug Oh , Seon Joo Kim

Sparse to Dense Motion Transfer for Face Image Animation

Face image animation from a single image has achieved remarkable progress. However, it remains challenging when only sparse landmarks are available as the driving signal. Given a source face image and a sequence of sparse face landmarks,…

Computer Vision and Pattern Recognition · Computer Science 2021-09-06 Ruiqi Zhao , Tianyi Wu , Guodong Guo

Understanding image motion with group representations

Motion is an important signal for agents in dynamic environments, but learning to represent motion from unlabeled video is a difficult and underconstrained problem. We propose a model of motion based on elementary group properties of…

Computer Vision and Pattern Recognition · Computer Science 2018-02-27 Andrew Jaegle , Stephen Phillips , Daphne Ippolito , Kostas Daniilidis

Motion Representations for Articulated Animation

We propose novel motion representations for animating articulated objects consisting of distinct parts. In a completely unsupervised manner, our method identifies object parts, tracks them in a driving video, and infers their motions by…

Computer Vision and Pattern Recognition · Computer Science 2021-04-26 Aliaksandr Siarohin , Oliver J. Woodford , Jian Ren , Menglei Chai , Sergey Tulyakov

Convolutional Humanoid Animation via Deformation

In this paper we present a new deep learning-driven approach to image-based synthesis of animations involving humanoid characters. Unlike previous deep approaches to image-based animation our method makes no assumptions on the type of…

Graphics · Computer Science 2019-08-14 John Kanji , David I. W. Levin

Learning a perceptual manifold with deep features for animation video resequencing

We propose a novel deep learning framework for animation video resequencing. Our system produces new video sequences by minimizing a perceptual distance of images from an existing animation video clip. To measure perceptual distance, we…

Graphics · Computer Science 2021-11-03 Charles C. Morace , Thi-Ngoc-Hanh Le , Sheng-Yi Yao , Shang-Wei Zhang , Tong-Yee Lee

Differential Motion Evolution for Fine-Grained Motion Deformation in Unsupervised Image Animation

Image animation is the task of transferring the motion of a driving video to a given object in a source image. While great progress has recently been made in unsupervised motion transfer, requiring no labeled data or domain priors, many…

Computer Vision and Pattern Recognition · Computer Science 2023-11-21 Peirong Liu , Rui Wang , Xuefei Cao , Yipin Zhou , Ashish Shah , Ser-Nam Lim

MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation

Existing text-to-video methods struggle to transfer motion smoothly from a reference object to a target object with significant differences in appearance or structure between them. To address this challenge, we introduce MotionShot, a…

Computer Vision and Pattern Recognition · Computer Science 2025-07-23 Yanchen Liu , Yanan Sun , Zhening Xing , Junyao Gao , Kai Chen , Wenjie Pei

Motion Transformer for Unsupervised Image Animation

Image animation aims to animate a source image by using motion learned from a driving video. Current state-of-the-art methods typically use convolutional neural networks (CNNs) to predict motion information, such as motion keypoints and…

Computer Vision and Pattern Recognition · Computer Science 2022-09-29 Jiale Tao , Biao Wang , Tiezheng Ge , Yuning Jiang , Wen Li , Lixin Duan

Deep Spatial Transformation for Pose-Guided Person Image Generation and Animation

Pose-guided person image generation and animation aim to transform a source person image to target poses. These tasks require spatial manipulation of source data. However, Convolutional Neural Networks are limited by the lack of ability to…

Computer Vision and Pattern Recognition · Computer Science 2021-12-01 Yurui Ren , Ge Li , Shan Liu , Thomas H. Li

MotionAdapter: Video Motion Transfer via Content-Aware Attention Customization

Recent advances in diffusion-based text-to-video models, particularly those built on the diffusion transformer architecture, have achieved remarkable progress in generating high-quality and temporally coherent videos. However, transferring…

Computer Vision and Pattern Recognition · Computer Science 2026-04-08 Zhexin Zhang , Yangyang Xu , Yifeng Zhu , Long Chen , Yong Du , Shengfeng He , Jun Yu

Unsupervised Keypoint Learning for Guiding Class-Conditional Video Prediction

We propose a deep video prediction model conditioned on a single image and an action class. To generate future frames, we first detect keypoints of a moving object and predict future motion as a sequence of keypoints. The input image is…

Computer Vision and Pattern Recognition · Computer Science 2019-10-07 Yunji Kim , Seonghyeon Nam , In Cho , Seon Joo Kim

Motion-based Object Segmentation based on Dense RGB-D Scene Flow

Given two consecutive RGB-D images, we propose a model that estimates a dense 3D motion field, also known as scene flow. We take advantage of the fact that in robot manipulation scenarios, scenes often consist of a set of rigidly moving…

Robotics · Computer Science 2018-07-25 Lin Shao , Parth Shah , Vikranth Dwaracherla , Jeannette Bohg

Towards Efficient Real-Time Video Motion Transfer via Generative Time Series Modeling

Motion Transfer is a technique that synthesizes videos by transferring motion dynamics from a driving video to a source image. In this work we propose a deep learning-based framework to enable real-time video motion transfer which is…

Computer Vision and Pattern Recognition · Computer Science 2025-12-12 Tasmiah Haque , Md. Asif Bin Syed , Byungheon Jeong , Xue Bai , Sumit Mohan , Somdyuti Paul , Imtiaz Ahmed , Srinjoy Das

DwNet: Dense warp-based network for pose-guided human video generation

Generation of realistic high-resolution videos of human subjects is a challenging and important task in computer vision. In this paper, we focus on human motion transfer - generation of a video depicting a particular subject, observed in a…

Computer Vision and Pattern Recognition · Computer Science 2019-10-22 Polina Zablotskaia , Aliaksandr Siarohin , Bo Zhao , Leonid Sigal

Learning to Segment Moving Objects

We study the problem of segmenting moving objects in unconstrained videos. Given a video, the task is to segment all the objects that exhibit independent motion in at least one frame. We formulate this as a learning problem and design our…

Computer Vision and Pattern Recognition · Computer Science 2017-12-05 Pavel Tokmakov , Cordelia Schmid , Karteek Alahari

AniFormer: Data-driven 3D Animation with Transformer

We present a novel task, i.e., animating a target 3D object through the motion of a raw driving sequence. In previous works, extra auxiliary correlations between source and target meshes or intermedia factors are inevitable to capture the…

Computer Vision and Pattern Recognition · Computer Science 2021-10-22 Haoyu Chen , Hao Tang , Nicu Sebe , Guoying Zhao