Related papers: Learning Energy-based Spatial-Temporal Generative …

Synthesizing Dynamic Patterns by Spatial-Temporal Generative ConvNet

Video sequences contain rich dynamic patterns, such as dynamic texture patterns that exhibit stationarity in the temporal domain, and action patterns that are non-stationary in either spatial or temporal domain. We show that a…

Machine Learning · Statistics 2017-05-31 Jianwen Xie, Song-Chun Zhu, Ying Nian Wu

Learning Energy-Based Models as Generative ConvNets via Multi-grid Modeling and Sampling

This paper proposes a multi-grid method for learning energy-based generative ConvNet models of images. For each grid, we learn an energy-based probabilistic model where the energy function is defined by a bottom-up convolutional neural…

Machine Learning · Statistics 2020-10-16 Ruiqi Gao, Yang Lu, Junpei Zhou, Song-Chun Zhu, Ying Nian Wu

Learning a Generative Motion Model from Image Sequences based on a Latent Motion Matrix

We propose to learn a probabilistic motion model from a sequence of images for spatio-temporal registration. Our model encodes motion in a low-dimensional probabilistic space - the motion matrix - which enables various motion analysis tasks…

Computer Vision and Pattern Recognition · Computer Science 2021-02-02 Julian Krebs, Hervé Delingette, Nicholas Ayache, Tommaso Mansi

Learning Dynamic Generator Model by Alternating Back-Propagation Through Time

This paper studies the dynamic generator model for spatial-temporal processes such as dynamic textures and action sequences in video data. In this model, each time frame of the video sequence is generated by a generator model, which is a…

Machine Learning · Statistics 2018-12-31 Jianwen Xie, Ruiqi Gao, Zilong Zheng, Song-Chun Zhu, Ying Nian Wu

Cooperative Training of Descriptor and Generator Networks

This paper studies the cooperative training of two generative models for image modeling and synthesis. Both models are parametrized by convolutional neural networks (ConvNets). The first model is a deep energy-based model, whose energy…

Machine Learning · Statistics 2018-10-31 Jianwen Xie, Yang Lu, Ruiqi Gao, Song-Chun Zhu, Ying Nian Wu

StNet: Local and Global Spatial-Temporal Modeling for Action Recognition

Despite the success of deep learning for static image understanding, it remains unclear what are the most effective network architectures for the spatial-temporal modeling in videos. In this paper, in contrast to the existing CNN+RNN or…

Computer Vision and Pattern Recognition · Computer Science 2018-12-12 Dongliang He, Zhichao Zhou, Chuang Gan, Fu Li, Xiao Liu, Yandong Li, Limin Wang, Shilei Wen

StyleVideoGAN: A Temporal Generative Model using a Pretrained StyleGAN

Generative adversarial models (GANs) continue to produce advances in terms of the visual quality of still images, as well as the learning of temporal correlations. However, few works manage to combine these two interesting capabilities for…

Computer Vision and Pattern Recognition · Computer Science 2021-12-01 Gereon Fox, Ayush Tewari, Mohamed Elgharib, Christian Theobalt

Consistent Generative Query Networks

Stochastic video prediction models take in a sequence of image frames, and generate a sequence of consecutive future image frames. These models typically generate future frames in an autoregressive fashion, which is slow and requires the…

Computer Vision and Pattern Recognition · Computer Science 2019-04-23 Ananya Kumar, S. M. Ali Eslami, Danilo J. Rezende, Marta Garnelo, Fabio Viola, Edward Lockhart, Murray Shanahan

Generating Videos with Scene Dynamics

We capitalize on large amounts of unlabeled video in order to learn a model of scene dynamics for both video recognition tasks (e.g. action classification) and video generation tasks (e.g. future prediction). We propose a generative…

Computer Vision and Pattern Recognition · Computer Science 2016-10-27 Carl Vondrick, Hamed Pirsiavash, Antonio Torralba

Comparison of Spatiotemporal Networks for Learning Video Related Tasks

Many methods for learning from video sequences involve temporally processing 2D CNN features from the individual frames or directly utilizing 3D convolutions within high-performing 2D CNN architectures. The focus typically remains on how to…

Computer Vision and Pattern Recognition · Computer Science 2020-09-17 Logan Courtney, Ramavarapu Sreenivas

Spatiotemporal Residual Networks for Video Action Recognition

Two-stream Convolutional Networks (ConvNets) have shown strong performance for human action recognition in videos. Recently, Residual Networks (ResNets) have arisen as a new technique to train extremely deep architectures. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2016-11-08 Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes

Learning Sequential Latent Variable Models from Multimodal Time Series Data

Sequential modelling of high-dimensional data is an important problem that appears in many domains including model-based reinforcement learning and dynamics identification for control. Latent variable models applied to sequential data…

Machine Learning · Computer Science 2023-01-23 Oliver Limoyo, Trevor Ablett, Jonathan Kelly

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

Deep convolutional networks have achieved great success for visual recognition in still images. However, for action recognition in videos, the advantage over traditional methods is not so evident. This paper aims to discover the principles…

Computer Vision and Pattern Recognition · Computer Science 2016-08-03 Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc Van Gool

Learning Localized Spatio-Temporal Models From Streaming Data

We address the problem of predicting spatio-temporal processes with temporal patterns that vary across spatial regions, when data is obtained as a stream. That is, when the training dataset is augmented sequentially. Specifically, we…

Machine Learning · Statistics 2018-06-25 Muhammad Osama, Dave Zachariah, Thomas B. Schön

Learning Long-term Motion Embeddings for Efficient Kinematics Generation

Understanding and predicting motion is a fundamental component of visual intelligence. Although modern video models exhibit strong comprehension of scene dynamics, exploring multiple possible futures through full video synthesis remains…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Nick Stracke, Kolja Bauer, Stefan Andreas Baumann, Miguel Angel Bautista, Josh Susskind, Björn Ommer

Generative VoxelNet: Learning Energy-Based Models for 3D Shape Synthesis and Analysis

3D data that contains rich geometry information of objects and scenes is valuable for understanding 3D physical world. With the recent emergence of large-scale 3D datasets, it becomes increasingly crucial to have a powerful 3D generative…

Computer Vision and Pattern Recognition · Computer Science 2020-12-29 Jianwen Xie, Zilong Zheng, Ruiqi Gao, Wenguan Wang, Song-Chun Zhu, Ying Nian Wu

Learning to encode motion using spatio-temporal synchrony

We consider the task of learning to extract motion from videos. To this end, we show that the detection of spatial transformations can be viewed as the detection of synchrony between the image sequence and a sequence of features undergoing…

Computer Vision and Pattern Recognition · Computer Science 2014-02-11 Kishore Reddy Konda, Roland Memisevic, Vincent Michalski

Deep Video-Based Performance Cloning

We present a new video-based performance cloning technique. After training a deep generative network using a reference video capturing the appearance and dynamics of a target actor, we are able to generate videos where this actor reenacts…

Computer Vision and Pattern Recognition · Computer Science 2018-08-22 Kfir Aberman, Mingyi Shi, Jing Liao, Dani Lischinski, Baoquan Chen, Daniel Cohen-Or

Demand Forecasting from Spatiotemporal Data with Graph Networks and Temporal-Guided Embedding

Short-term demand forecasting models commonly combine convolutional and recurrent layers to extract complex spatiotemporal patterns in data. Long-term histories are also used to consider periodicity and seasonality patterns as time series…

Machine Learning · Computer Science 2019-10-15 Doyup Lee, Suehun Jung, Yeongjae Cheon, Dongil Kim, Seungil You

Temporal Embedding in Convolutional Neural Networks for Robust Learning of Abstract Snippets

The prediction of periodical time-series remains challenging due to various types of data distortions and misalignments. Here, we propose a novel model called Temporal embedding-enhanced convolutional neural Network (TeNet) to learn…

Machine Learning · Computer Science 2022-02-09 Jiajun Liu, Kun Zhao, Brano Kusy, Ji-rong Wen, Raja Jurdak