English
Related papers

Related papers: Predictive Coding Based Multiscale Network with En…

200 papers

Visual-frame prediction is a pixel-dense prediction task that infers future frames from past frames. Lacking of appearance details, low prediction accuracy and high computational overhead are still major problems with current models or…

Computer Vision and Pattern Recognition · Computer Science 2022-11-16 Chaofan Ling , Junpei Zhong , Weihua Li

We introduce here a predictive coding based model that aims to generate accurate and sharp future frames. Inspired by the predictive coding hypothesis and related works, the total model is updated through a combination of bottom-up and…

Computer Vision and Pattern Recognition · Computer Science 2023-05-12 Chaofan Ling , Weihua Li , Junpei Zhong

We propose a hierarchical approach for making long-term predictions of future frames. To avoid inherent compounding errors in recursive pixel-level prediction, we propose to first estimate high-level structure in the input frames, then…

Computer Vision and Pattern Recognition · Computer Science 2018-01-09 Ruben Villegas , Jimei Yang , Yuliang Zou , Sungryull Sohn , Xunyu Lin , Honglak Lee

Epileptic seizures are caused by abnormal, overly syn- chronized, electrical activity in the brain. The abnor- mal electrical activity manifests as waves, propagating across the brain. Accurate prediction of the propagation velocity and…

Computer Vision and Pattern Recognition · Computer Science 2018-07-04 Yilin Song , Jonathan Viventi , Yao Wang

In this paper, we propose a deep learning based vehicle trajectory prediction technique which can generate the future trajectory sequence of surrounding vehicles in real time. We employ the encoder-decoder architecture which analyzes the…

Machine Learning · Computer Science 2018-10-23 Seong Hyeon Park , ByeongDo Kim , Chang Mook Kang , Chung Choo Chung , Jun Won Choi

We use multilayer Long Short Term Memory (LSTM) networks to learn representations of video sequences. Our model uses an encoder LSTM to map an input sequence into a fixed length representation. This representation is decoded using single or…

Machine Learning · Computer Science 2016-01-05 Nitish Srivastava , Elman Mansimov , Ruslan Salakhutdinov

While great strides have been made in using deep learning algorithms to solve supervised learning tasks, the problem of unsupervised learning - leveraging unlabeled examples to learn about the structure of a domain - remains a difficult…

Machine Learning · Computer Science 2017-03-02 William Lotter , Gabriel Kreiman , David Cox

The use of Recurrent Neural Networks for video captioning has recently gained a lot of attention, since they can be used both to encode the input video and to generate the corresponding description. In this paper, we present a recurrent…

Computer Vision and Pattern Recognition · Computer Science 2018-11-26 Lorenzo Baraldi , Costantino Grana , Rita Cucchiara

Although many video prediction methods have obtained good performance in low-resolution (64$\sim$128) videos, predictive models for high-resolution (512$\sim$4K) videos have not been fully explored yet, which are more meaningful due to the…

Computer Vision and Pattern Recognition · Computer Science 2022-03-31 Zheng Chang , Xinfeng Zhang , Shanshe Wang , Siwei Ma , Wen Gao

Our work explores the task of generating future sensor observations conditioned on the past. We are motivated by `predictive coding' concepts from neuroscience as well as robotic applications such as self-driving vehicles. Predictive video…

Computer Vision and Pattern Recognition · Computer Science 2024-04-18 Tarasha Khurana , Deva Ramanan

Video prediction is commonly referred to as forecasting future frames of a video sequence provided several past frames thereof. It remains a challenging domain as visual scenes evolve according to complex underlying dynamics, such as the…

Computer Vision and Pattern Recognition · Computer Science 2021-05-12 Hafez Farazi , Jan Nogga , Sven Behnke

Predicting movement of objects while the action of learning agent interacts with the dynamics of the scene still remains a key challenge in robotics. We propose a multi-layer Long Short Term Memory (LSTM) autoendocer network that predicts…

Machine Learning · Computer Science 2018-10-15 Meenakshi Sarkar , Debasish Ghose

Predictive coding, currently a highly influential theory in neuroscience, has not been widely adopted in machine learning yet. In this work, we transform the seminal model of Rao and Ballard (1999) into a modern deep learning framework…

Computer Vision and Pattern Recognition · Computer Science 2023-02-09 Zdenek Straka , Tomas Svoboda , Matej Hoffmann

Predicting the potential relations between nodes in networks, known as link prediction, has long been a challenge in network science. However, most studies just focused on link prediction of static network, while real-world networks always…

Social and Information Networks · Computer Science 2019-02-25 Jinyin Chen , Jian Zhang , Xuanheng Xu , Chengbo Fu , Dan Zhang , Qingpeng Zhang , Qi Xuan

We propose a new neurally-inspired model that can learn to encode the global relationship context of visual events across time and space and to use the contextual information to modulate the analysis by synthesis process in a predictive…

Machine Learning · Computer Science 2015-04-17 Mingmin Zhao , Chengxu Zhuang , Yizhou Wang , Tai Sing Lee

Learning from spatio-temporal data has numerous applications such as human-behavior analysis, object tracking, video compression, and physics simulation.However, existing methods still perform poorly on challenging video tasks such as…

Machine Learning · Computer Science 2020-10-06 Jiahao Su , Wonmin Byeon , Jean Kossaifi , Furong Huang , Jan Kautz , Animashree Anandkumar

This work demonstrates the implementation and use of an encoder-decoder model to perform a many-to-many mapping of video data to text captions. The many-to-many mapping occurs via an input temporal sequence of video frames to an output…

Computer Vision and Pattern Recognition · Computer Science 2024-01-05 Sikiru Adewale , Tosin Ige , Bolanle Hafiz Matti

The current paper proposes a novel predictive coding type neural network model, the predictive multiple spatio-temporal scales recurrent neural network (P-MSTRNN). The P-MSTRNN learns to predict visually perceived human whole-body cyclic…

Computer Vision and Pattern Recognition · Computer Science 2017-09-07 Minkyu Choi , Jun Tani

We propose a deep neural network for the prediction of future frames in natural video sequences. To effectively handle complex evolution of pixels in videos, we propose to decompose the motion and content, two key components generating…

Computer Vision and Pattern Recognition · Computer Science 2018-01-09 Ruben Villegas , Jimei Yang , Seunghoon Hong , Xunyu Lin , Honglak Lee

The problem of video frame prediction has received much interest due to its relevance to many computer vision applications such as autonomous vehicles or robotics. Supervised methods for video frame prediction rely on labeled data, which…

Computer Vision and Pattern Recognition · Computer Science 2020-04-27 Matin Hosseini , Anthony S. Maida , Majid Hosseini , Gottumukkala Raju
‹ Prev 1 2 3 10 Next ›