Related papers: Generative Image Dynamics

Tokenizing Motion: A Generative Approach for Scene Dynamics Compression

This paper proposes a novel generative video compression framework that leverages motion pattern priors, derived from subtle dynamics in common scenes (e.g., swaying flowers or a boat drifting on water), rather than relying on video content…

Computer Vision and Pattern Recognition · Computer Science 2025-12-11 Shanzhi Yin , Zihan Zhang , Bolin Chen , Shiqi Wang , Yan Ye

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Animating a still image offers an engaging visual experience. Traditional image animation techniques mainly focus on animating natural scenes with stochastic dynamics (e.g. clouds and fluid) or domain-specific motions (e.g. human hair or…

Computer Vision and Pattern Recognition · Computer Science 2023-11-28 Jinbo Xing , Menghan Xia , Yong Zhang , Haoxin Chen , Wangbo Yu , Hanyuan Liu , Xintao Wang , Tien-Tsin Wong , Ying Shan

Spectral Progressive Diffusion for Efficient Image and Video Generation

Diffusion models have been shown to implicitly generate visual content autoregressively in the frequency domain, where low-frequency components are generated earlier in the denoising process while high-frequency details emerge only in later…

Computer Vision and Pattern Recognition · Computer Science 2026-05-21 Howard Xiao , Brian Chao , Lior Yariv , Gordon Wetzstein

Generating Videos with Scene Dynamics

We capitalize on large amounts of unlabeled video in order to learn a model of scene dynamics for both video recognition tasks (e.g. action classification) and video generation tasks (e.g. future prediction). We propose a generative…

Computer Vision and Pattern Recognition · Computer Science 2016-10-27 Carl Vondrick , Hamed Pirsiavash , Antonio Torralba

First Order Motion Model for Image Animation

Image animation consists of generating a video sequence so that an object in a source image is animated according to the motion of a driving video. Our framework addresses this problem without using any annotation or prior information about…

Computer Vision and Pattern Recognition · Computer Science 2020-10-02 Aliaksandr Siarohin , Stéphane Lathuilière , Sergey Tulyakov , Elisa Ricci , Nicu Sebe

Motion Modes: What Could Happen Next?

Predicting diverse object motions from a single static image remains challenging, as current video generation models often entangle object movement with camera motion and other scene changes. While recent methods can predict specific…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Karran Pandey , Matheus Gadelha , Yannick Hold-Geoffroy , Karan Singh , Niloy J. Mitra , Paul Guerrero

What Happens Next? Anticipating Future Motion by Generating Point Trajectories

We consider the problem of forecasting motion from a single image, i.e., predicting how objects in the world are likely to move, without the ability to observe other parameters such as the object velocities or the forces applied to them. We…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Gabrijel Boduljak , Laurynas Karazija , Iro Laina , Christian Rupprecht , Andrea Vedaldi

Learning a Generative Motion Model from Image Sequences based on a Latent Motion Matrix

We propose to learn a probabilistic motion model from a sequence of images for spatio-temporal registration. Our model encodes motion in a low-dimensional probabilistic space - the motion matrix - which enables various motion analysis tasks…

Computer Vision and Pattern Recognition · Computer Science 2021-02-02 Julian Krebs , Hervé Delingette , Nicholas Ayache , Tommaso Mansi

Transforming Static Images Using Generative Models for Video Salient Object Detection

In many video processing tasks, leveraging large-scale image datasets is a common strategy, as image data is more abundant and facilitates comprehensive knowledge transfer. A typical approach for simulating video from static images involves…

Computer Vision and Pattern Recognition · Computer Science 2024-11-22 Suhwan Cho , Minhyeok Lee , Jungho Lee , Sangyoun Lee

Image inpainting using frequency domain priors

In this paper, we present a novel image inpainting technique using frequency domain information. Prior works on image inpainting predict the missing pixels by training neural networks using only the spatial domain information. However,…

Computer Vision and Pattern Recognition · Computer Science 2021-07-14 Hiya Roy , Subhajit Chaudhury , Toshihiko Yamasaki , Tatsuaki Hashimoto

E-Motion: Future Motion Simulation via Event Sequence Diffusion

Forecasting a typical object's future motion is a critical task for interpreting and interacting with dynamic environments in computer vision. Event-based sensors, which could capture changes in the scene with exceptional temporal…

Computer Vision and Pattern Recognition · Computer Science 2024-10-14 Song Wu , Zhiyu Zhu , Junhui Hou , Guangming Shi , Jinjian Wu

AnimateAnything: Fine-Grained Open Domain Image Animation with Motion Guidance

Image animation is a key task in computer vision which aims to generate dynamic visual content from static image. Recent image animation methods employ neural based rendering technique to generate realistic animations. Despite these…

Computer Vision and Pattern Recognition · Computer Science 2023-12-06 Zuozhuo Dai , Zhenghao Zhang , Yao Yao , Bingxue Qiu , Siyu Zhu , Long Qin , Weizhi Wang

Fourier-based Video Prediction through Relational Object Motion

The ability to predict future outcomes conditioned on observed video frames is crucial for intelligent decision-making in autonomous systems. Recently, deep recurrent architectures have been applied to the task of video prediction. However,…

Computer Vision and Pattern Recognition · Computer Science 2021-10-13 Malte Mosbach , Sven Behnke

MoVideo: Motion-Aware Video Generation with Diffusion Models

While recent years have witnessed great progress on using diffusion models for video generation, most of them are simple extensions of image generation frameworks, which fail to explicitly consider one of the key differences between videos…

Computer Vision and Pattern Recognition · Computer Science 2024-07-31 Jingyun Liang , Yuchen Fan , Kai Zhang , Radu Timofte , Luc Van Gool , Rakesh Ranjan

MotionFlow:Learning Implicit Motion Flow for Complex Camera Trajectory Control in Video Generation

Generating videos guided by camera trajectories poses significant challenges in achieving consistency and generalizability, particularly when both camera and object motions are present. Existing approaches often attempt to learn these…

Computer Vision and Pattern Recognition · Computer Science 2025-09-26 Guojun Lei , Chi Wang , Yikai Wang , Hong Li , Ying Song , Weiwei Xu

Diffusion Models Generate Images Like Painters: an Analytical Theory of Outline First, Details Later

How do diffusion generative models convert pure noise into meaningful images? In a variety of pretrained diffusion models (including conditional latent space models like Stable Diffusion), we observe that the reverse diffusion process that…

Computer Vision and Pattern Recognition · Computer Science 2024-03-27 Binxu Wang , John J. Vastola

Controllable Longer Image Animation with Diffusion Models

Generating realistic animated videos from static images is an important area of research in computer vision. Methods based on physical simulation and motion prediction have achieved notable advances, but they are often limited to specific…

Computer Vision and Pattern Recognition · Computer Science 2024-05-29 Qiang Wang , Minghua Liu , Junjun Hu , Fan Jiang , Mu Xu

Future Video Synthesis with Object Motion Prediction

We present an approach to predict future video frames given a sequence of continuous video frames in the past. Instead of synthesizing images directly, our approach is designed to understand the complex scene dynamics by decoupling the…

Computer Vision and Pattern Recognition · Computer Science 2020-04-16 Yue Wu , Rongrong Gao , Jaesik Park , Qifeng Chen

Antagonising explanation and revealing bias directly through sequencing and multimodal inference

Deep generative models produce data according to a learned representation, e.g. diffusion models, through a process of approximation computing possible samples. Approximation can be understood as reconstruction and the large datasets used…

Human-Computer Interaction · Computer Science 2023-09-25 Luís Arandas , Mick Grierson , Miguel Carvalhais

Spectral Dictionary Learning for Generative Image Modeling

We propose a novel spectral generative model for image synthesis that departs radically from the common variational, adversarial, and diffusion paradigms. In our approach, images, after being flattened into one-dimensional signals, are…

Computer Vision and Pattern Recognition · Computer Science 2025-04-28 Andrew Kiruluta