Related papers: Predictive Coding Based Multiscale Network with En…

Pyramidal Predictive Network: A Model for Visual-frame Prediction Based on Predictive Coding Theory

Visual-frame prediction is a pixel-dense prediction task that infers future frames from past frames. Lacking of appearance details, low prediction accuracy and high computational overhead are still major problems with current models or…

Computer Vision and Pattern Recognition · Computer Science 2022-11-16 Chaofan Ling , Junpei Zhong , Weihua Li

Anti-aliasing Predictive Coding Network for Future Video Frame Prediction

We introduce here a predictive coding based model that aims to generate accurate and sharp future frames. Inspired by the predictive coding hypothesis and related works, the total model is updated through a combination of bottom-up and…

Computer Vision and Pattern Recognition · Computer Science 2023-05-12 Chaofan Ling , Weihua Li , Junpei Zhong

Learning to Generate Long-term Future via Hierarchical Prediction

We propose a hierarchical approach for making long-term predictions of future frames. To avoid inherent compounding errors in recursive pixel-level prediction, we propose to first estimate high-level structure in the input frames, then…

Computer Vision and Pattern Recognition · Computer Science 2018-01-09 Ruben Villegas , Jimei Yang , Yuliang Zou , Sungryull Sohn , Xunyu Lin , Honglak Lee

Multi Resolution LSTM For Long Term Prediction In Neural Activity Video

Epileptic seizures are caused by abnormal, overly syn- chronized, electrical activity in the brain. The abnor- mal electrical activity manifests as waves, propagating across the brain. Accurate prediction of the propagation velocity and…

Computer Vision and Pattern Recognition · Computer Science 2018-07-04 Yilin Song , Jonathan Viventi , Yao Wang

Sequence-to-Sequence Prediction of Vehicle Trajectory via LSTM Encoder-Decoder Architecture

In this paper, we propose a deep learning based vehicle trajectory prediction technique which can generate the future trajectory sequence of surrounding vehicles in real time. We employ the encoder-decoder architecture which analyzes the…

Machine Learning · Computer Science 2018-10-23 Seong Hyeon Park , ByeongDo Kim , Chang Mook Kang , Chung Choo Chung , Jun Won Choi

Unsupervised Learning of Video Representations using LSTMs

We use multilayer Long Short Term Memory (LSTM) networks to learn representations of video sequences. Our model uses an encoder LSTM to map an input sequence into a fixed length representation. This representation is decoded using single or…

Machine Learning · Computer Science 2016-01-05 Nitish Srivastava , Elman Mansimov , Ruslan Salakhutdinov

Deep Predictive Coding Networks for Video Prediction and Unsupervised Learning

While great strides have been made in using deep learning algorithms to solve supervised learning tasks, the problem of unsupervised learning - leveraging unlabeled examples to learn about the structure of a domain - remains a difficult…

Machine Learning · Computer Science 2017-03-02 William Lotter , Gabriel Kreiman , David Cox

Hierarchical Boundary-Aware Neural Encoder for Video Captioning

The use of Recurrent Neural Networks for video captioning has recently gained a lot of attention, since they can be used both to encode the input video and to generate the corresponding description. In this paper, we present a recurrent…

Computer Vision and Pattern Recognition · Computer Science 2018-11-26 Lorenzo Baraldi , Costantino Grana , Rita Cucchiara

STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction

Although many video prediction methods have obtained good performance in low-resolution (64$\sim$128) videos, predictive models for high-resolution (512$\sim$4K) videos have not been fully explored yet, which are more meaningful due to the…

Computer Vision and Pattern Recognition · Computer Science 2022-03-31 Zheng Chang , Xinfeng Zhang , Shanshe Wang , Siwei Ma , Wen Gao

Predicting Long-horizon Futures by Conditioning on Geometry and Time

Our work explores the task of generating future sensor observations conditioned on the past. We are motivated by `predictive coding' concepts from neuroscience as well as robotic applications such as self-driving vehicles. Predictive video…

Computer Vision and Pattern Recognition · Computer Science 2024-04-18 Tarasha Khurana , Deva Ramanan

Local Frequency Domain Transformer Networks for Video Prediction

Video prediction is commonly referred to as forecasting future frames of a video sequence provided several past frames thereof. It remains a challenging domain as visual scenes evolve according to complex underlying dynamics, such as the…

Computer Vision and Pattern Recognition · Computer Science 2021-05-12 Hafez Farazi , Jan Nogga , Sven Behnke

Sequential Learning of Movement Prediction in Dynamic Environments using LSTM Autoencoder

Predicting movement of objects while the action of learning agent interacts with the dynamics of the scene still remains a key challenge in robotics. We propose a multi-layer Long Short Term Memory (LSTM) autoendocer network that predicts…

Machine Learning · Computer Science 2018-10-15 Meenakshi Sarkar , Debasish Ghose

PreCNet: Next-Frame Video Prediction Based on Predictive Coding

Predictive coding, currently a highly influential theory in neuroscience, has not been widely adopted in machine learning yet. In this work, we transform the seminal model of Rao and Ballard (1999) into a modern deep learning framework…

Computer Vision and Pattern Recognition · Computer Science 2023-02-09 Zdenek Straka , Tomas Svoboda , Matej Hoffmann

E-LSTM-D: A Deep Learning Framework for Dynamic Network Link Prediction

Predicting the potential relations between nodes in networks, known as link prediction, has long been a challenge in network science. However, most studies just focused on link prediction of static network, while real-world networks always…

Social and Information Networks · Computer Science 2019-02-25 Jinyin Chen , Jian Zhang , Xuanheng Xu , Chengbo Fu , Dan Zhang , Qingpeng Zhang , Qi Xuan

Predictive Encoding of Contextual Relationships for Perceptual Inference, Interpolation and Prediction

We propose a new neurally-inspired model that can learn to encode the global relationship context of visual events across time and space and to use the contextual information to modulate the analysis by synthesis process in a predictive…

Machine Learning · Computer Science 2015-04-17 Mingmin Zhao , Chengxu Zhuang , Yizhou Wang , Tai Sing Lee

Convolutional Tensor-Train LSTM for Spatio-temporal Learning

Learning from spatio-temporal data has numerous applications such as human-behavior analysis, object tracking, video compression, and physics simulation.However, existing methods still perform poorly on challenging video tasks such as…

Machine Learning · Computer Science 2020-10-06 Jiahao Su , Wonmin Byeon , Jean Kossaifi , Furong Huang , Jan Kautz , Animashree Anandkumar

Encoder-Decoder Based Long Short-Term Memory (LSTM) Model for Video Captioning

This work demonstrates the implementation and use of an encoder-decoder model to perform a many-to-many mapping of video data to text captions. The many-to-many mapping occurs via an input temporal sequence of video frames to an output…

Computer Vision and Pattern Recognition · Computer Science 2024-01-05 Sikiru Adewale , Tosin Ige , Bolanle Hafiz Matti

Predictive Coding for Dynamic Visual Processing: Development of Functional Hierarchy in a Multiple Spatio-Temporal Scales RNN Model

The current paper proposes a novel predictive coding type neural network model, the predictive multiple spatio-temporal scales recurrent neural network (P-MSTRNN). The P-MSTRNN learns to predict visually perceived human whole-body cyclic…

Computer Vision and Pattern Recognition · Computer Science 2017-09-07 Minkyu Choi , Jun Tani

Decomposing Motion and Content for Natural Video Sequence Prediction

We propose a deep neural network for the prediction of future frames in natural video sequences. To effectively handle complex evolution of pixels in videos, we propose to decompose the motion and content, two key components generating…

Computer Vision and Pattern Recognition · Computer Science 2018-01-09 Ruben Villegas , Jimei Yang , Seunghoon Hong , Xunyu Lin , Honglak Lee

Inception-inspired LSTM for Next-frame Video Prediction

The problem of video frame prediction has received much interest due to its relevance to many computer vision applications such as autonomous vehicles or robotics. Supervised methods for video frame prediction rely on labeled data, which…

Computer Vision and Pattern Recognition · Computer Science 2020-04-27 Matin Hosseini , Anthony S. Maida , Majid Hosseini , Gottumukkala Raju