Related papers: Compositional Video Prediction

Stochastic Variational Video Prediction

Predicting the future in real-world settings, particularly from raw sensory observations such as images, is exceptionally challenging. Real-world events can be stochastic and unpredictable, and the high dimensionality and complexity of…

Computer Vision and Pattern Recognition · Computer Science 2018-03-07 Mohammad Babaeizadeh , Chelsea Finn , Dumitru Erhan , Roy H. Campbell , Sergey Levine

Future Video Synthesis with Object Motion Prediction

We present an approach to predict future video frames given a sequence of continuous video frames in the past. Instead of synthesizing images directly, our approach is designed to understand the complex scene dynamics by decoupling the…

Computer Vision and Pattern Recognition · Computer Science 2020-04-16 Yue Wu , Rongrong Gao , Jaesik Park , Qifeng Chen

Probabilistic Future Prediction for Video Scene Understanding

We present a novel deep learning architecture for probabilistic future prediction from video. We predict the future semantics, geometry and motion of complex real-world urban scenes and use this representation to control an autonomous…

Computer Vision and Pattern Recognition · Computer Science 2020-07-20 Anthony Hu , Fergal Cotter , Nikhil Mohan , Corina Gurau , Alex Kendall

Video Prediction via Example Guidance

In video prediction tasks, one major challenge is to capture the multi-modal nature of future contents and dynamics. In this work, we propose a simple yet effective framework that can efficiently predict plausible future states. The key…

Computer Vision and Pattern Recognition · Computer Science 2020-07-06 Jingwei Xu , Huazhe Xu , Bingbing Ni , Xiaokang Yang , Trevor Darrell

A unified model for continuous conditional video prediction

Different conditional video prediction tasks, like video future frame prediction and video frame interpolation, are normally solved by task-related models even though they share many common underlying characteristics. Furthermore, almost…

Computer Vision and Pattern Recognition · Computer Science 2023-04-10 Xi Ye , Guillaume-Alexandre Bilodeau

Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction

Learning to predict the long-term future of video frames is notoriously challenging due to inherent ambiguities in the distant future and dramatic amplifications of prediction error through time. Despite the recent advances in the…

Computer Vision and Pattern Recognition · Computer Science 2021-04-15 Wonkwang Lee , Whie Jung , Han Zhang , Ting Chen , Jing Yu Koh , Thomas Huang , Hyungsuk Yoon , Honglak Lee , Seunghoon Hong

An Uncertain Future: Forecasting from Static Images using Variational Autoencoders

In a given scene, humans can often easily predict a set of immediate future events that might happen. However, generalized pixel-level anticipation in computer vision systems is difficult because machine learning struggles with the…

Computer Vision and Pattern Recognition · Computer Science 2016-06-28 Jacob Walker , Carl Doersch , Abhinav Gupta , Martial Hebert

Motion and Context-Aware Audio-Visual Conditioned Video Prediction

The existing state-of-the-art method for audio-visual conditioned video prediction uses the latent codes of the audio-visual frames from a multimodal stochastic network and a frame encoder to predict the next visual frame. However, a direct…

Computer Vision and Pattern Recognition · Computer Science 2023-09-21 Yating Xu , Conghui Hu , Gim Hee Lee

Stochastic Adversarial Video Prediction

Being able to predict what may happen in the future requires an in-depth understanding of the physical and causal rules that govern the world. A model that is able to do so has a number of appealing applications, from robotic planning to…

Computer Vision and Pattern Recognition · Computer Science 2018-04-05 Alex X. Lee , Richard Zhang , Frederik Ebert , Pieter Abbeel , Chelsea Finn , Sergey Levine

Stochastic Video Prediction with Structure and Motion

While stochastic video prediction models enable future prediction under uncertainty, they mostly fail to model the complex dynamics of real-world scenes. For example, they cannot provide reliable predictions for scenes with a moving camera…

Computer Vision and Pattern Recognition · Computer Science 2022-05-02 Adil Kaan Akan , Sadra Safadoust , Fatma Güney

State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend

Stochastic video prediction enables the consideration of uncertainty in future motion, thereby providing a better reflection of the dynamic nature of the environment. Stochastic video prediction methods based on image auto-regressive…

Computer Vision and Pattern Recognition · Computer Science 2024-04-18 Fei Cui , Jiaojiao Fang , Xiaojiang Wu , Zelong Lai , Mengke Yang , Menghan Jia , Guizhong Liu

Future Frame Prediction of a Video Sequence

Predicting future frames of a video sequence has been a problem of high interest in the field of Computer Vision as it caters to a multitude of applications. The ability to predict, anticipate and reason about future events is the essence…

Computer Vision and Pattern Recognition · Computer Science 2020-09-04 Jasmeen Kaur , Sukhendu Das

Contextually Plausible and Diverse 3D Human Motion Prediction

We tackle the task of diverse 3D human motion prediction, that is, forecasting multiple plausible future 3D poses given a sequence of observed 3D poses. In this context, a popular approach consists of using a Conditional Variational…

Machine Learning · Computer Science 2020-12-08 Sadegh Aliakbarian , Fatemeh Sadat Saleh , Lars Petersson , Stephen Gould , Mathieu Salzmann

Long-Term Human Video Generation of Multiple Futures Using Poses

Predicting future human behavior from an input human video is a useful task for applications such as autonomous driving and robotics. While most previous works predict a single future, multiple futures with different behavior can…

Computer Vision and Pattern Recognition · Computer Science 2021-06-02 Naoya Fushishita , Antonio Tejero-de-Pablos , Yusuke Mukuta , Tatsuya Harada

Deep Sequence Learning for Video Anticipation: From Discrete and Deterministic to Continuous and Stochastic

Video anticipation is the task of predicting one/multiple future representation(s) given limited, partial observation. This is a challenging task due to the fact that given limited observation, the future representation can be highly…

Computer Vision and Pattern Recognition · Computer Science 2020-10-12 Sadegh Aliakbarian

Visual Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks

We study the problem of synthesizing a number of likely future frames from a single input image. In contrast to traditional methods that have tackled this problem in a deterministic or non-parametric way, we propose to model future frames…

Computer Vision and Pattern Recognition · Computer Science 2019-08-13 Tianfan Xue , Jiajun Wu , Katherine L. Bouman , William T. Freeman

What Happens Next? Anticipating Future Motion by Generating Point Trajectories

We consider the problem of forecasting motion from a single image, i.e., predicting how objects in the world are likely to move, without the ability to observe other parameters such as the object velocities or the forces applied to them. We…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Gabrijel Boduljak , Laurynas Karazija , Iro Laina , Christian Rupprecht , Andrea Vedaldi

Motion Prediction Under Multimodality with Conditional Stochastic Networks

Given a visual history, multiple future outcomes for a video scene are equally probable, in other words, the distribution of future outcomes has multiple modes. Multimodality is notoriously hard to handle by standard regressors or…

Computer Vision and Pattern Recognition · Computer Science 2017-05-08 Katerina Fragkiadaki , Jonathan Huang , Alex Alemi , Sudheendra Vijayanarasimhan , Susanna Ricco , Rahul Sukthankar

VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation

Generative models that can model and predict sequences of future events can, in principle, learn to capture complex real-world phenomena, such as physical interactions. However, a central challenge in video prediction is that the future is…

Computer Vision and Pattern Recognition · Computer Science 2020-02-13 Manoj Kumar , Mohammad Babaeizadeh , Dumitru Erhan , Chelsea Finn , Sergey Levine , Laurent Dinh , Durk Kingma

STDiff: Spatio-temporal Diffusion for Continuous Stochastic Video Prediction

Predicting future frames of a video is challenging because it is difficult to learn the uncertainty of the underlying factors influencing their contents. In this paper, we propose a novel video prediction model, which has…

Computer Vision and Pattern Recognition · Computer Science 2024-02-20 Xi Ye , Guillaume-Alexandre Bilodeau