Related papers: Deep Learning Driven Visual Path Prediction from a…

A Review on Deep Learning Techniques for Video Prediction

The ability to predict, anticipate and reason about future outcomes is a key component of intelligent decision-making systems. In light of the success of deep learning in computer vision, deep-learning-based video prediction emerged as a…

Computer Vision and Pattern Recognition · Computer Science 2021-04-22 Sergiu Oprea, Pablo Martinez-Gonzalez, Alberto Garcia-Garcia, John Alejandro Castro-Vargas, Sergio Orts-Escolano, Jose Garcia-Rodriguez, Antonis Argyros

Deep learning for scene recognition from visual data: a survey

The use of deep learning techniques has exploded during the last few years, resulting in a direct contribution to the field of artificial intelligence. This work aims to be a review of the state-of-the-art in scene recognition with deep…

Computer Vision and Pattern Recognition · Computer Science 2020-07-06 Alina Matei, Andreea Glavan, Estefania Talavera

Video Scene Parsing with Predictive Feature Learning

In this work, we address the challenging video scene parsing problem by developing effective representation learning methods given limited parsing annotations. In particular, we contribute two novel methods that constitute a unified parsing…

Computer Vision and Pattern Recognition · Computer Science 2016-12-14 Xiaojie Jin, Xin Li, Huaxin Xiao, Xiaohui Shen, Zhe Lin, Jimei Yang, Yunpeng Chen, Jian Dong, Luoqi Liu, Zequn Jie, Jiashi Feng, Shuicheng Yan

Visuomotor Understanding for Representation Learning of Driving Scenes

Dashboard cameras capture a tremendous amount of driving scene video each day. These videos are purposefully coupled with vehicle sensing data, such as from the speedometer and inertial sensors, providing an additional sensing modality for…

Computer Vision and Pattern Recognition · Computer Science 2019-09-17 Seokju Lee, Junsik Kim, Tae-Hyun Oh, Yongseop Jeong, Donggeun Yoo, Stephen Lin, In So Kweon

From Recognition to Prediction: Analysis of Human Action and Trajectory Prediction in Video

With the advancement in computer vision deep learning, systems now are able to analyze an unprecedented amount of rich visual information from videos to enable applications such as autonomous driving, socially-aware robot assistant and…

Computer Vision and Pattern Recognition · Computer Science 2021-07-19 Junwei Liang

Automatic Robot Path Planning for Visual Inspection from Object Shape

Visual inspection is a crucial yet time-consuming task across various industries. Numerous established methods employ machine learning in inspection tasks, necessitating specific training data that includes predefined inspection poses and…

Robotics · Computer Science 2023-12-06 O. Tasneem, R. Pieters

Deep Learning for Scene Classification: A Survey

Scene classification, aiming at classifying a scene image to one of the predefined scene categories by comprehending the entire image, is a longstanding, fundamental and challenging problem in computer vision. The rise of large-scale…

Computer Vision and Pattern Recognition · Computer Science 2021-02-23 Delu Zeng, Minyu Liao, Mohammad Tavakolian, Yulan Guo, Bolei Zhou, Dewen Hu, Matti Pietikäinen, Li Liu

Deep Predictive Coding Networks for Video Prediction and Unsupervised Learning

While great strides have been made in using deep learning algorithms to solve supervised learning tasks, the problem of unsupervised learning - leveraging unlabeled examples to learn about the structure of a domain - remains a difficult…

Machine Learning · Computer Science 2017-03-02 William Lotter, Gabriel Kreiman, David Cox

Survey on Vision-based Path Prediction

Path prediction is a fundamental task for estimating how pedestrians or vehicles are going to move in a scene. Because path prediction as a task of computer vision uses video as input, various information used for prediction, such as the…

Computer Vision and Pattern Recognition · Computer Science 2018-11-02 Tsubasa Hirakawa, Takayoshi Yamashita, Toru Tamaki, Hironobu Fujiyoshi

On the difficulty of learning and predicting the long-term dynamics of bouncing objects

The ability to accurately predict the surrounding environment is a foundational principle of intelligence in biological and artificial agents. In recent years, a variety of approaches have been proposed for learning to predict the physical…

Computer Vision and Pattern Recognition · Computer Science 2019-08-01 Alberto Cenzato, Alberto Testolin, Marco Zorzi

One-Shot Learning of Visual Path Navigation for Autonomous Vehicles

Autonomous driving presents many challenges due to the large number of scenarios the autonomous vehicle (AV) may encounter. End-to-end deep learning models are comparatively simplistic models that can handle a broad set of scenarios.…

Computer Vision and Pattern Recognition · Computer Science 2023-06-16 Zhongying CuiZhu, Francois Charette, Amin Ghafourian, Debo Shi, Matthew Cui, Anjali Krishnamachar, Iman Soltani

Real-Time Forecasting of Driver-Vehicle Dynamics on 3D Roads: a Deep-Learning Framework Leveraging Bayesian Optimisation

Most state-of-the-art works in trajectory forecasting for automotive target predicting the pose and orientation of the agents in the scene. This represents a particularly useful problem, for instance in autonomous driving, but it does not…

Robotics · Computer Science 2024-10-28 Luca Paparusso, Stefano Melzi, Francesco Braghin

Visual Semantic Planning using Deep Successor Representations

A crucial capability of real-world intelligent agents is their ability to plan a sequence of actions to achieve their goals in the visual world. In this work, we address the problem of visual semantic planning: the task of predicting a…

Computer Vision and Pattern Recognition · Computer Science 2017-08-17 Yuke Zhu, Daniel Gordon, Eric Kolve, Dieter Fox, Li Fei-Fei, Abhinav Gupta, Roozbeh Mottaghi, Ali Farhadi

Wide and Narrow: Video Prediction from Context and Motion

Video prediction, forecasting the future frames from a sequence of input frames, is a challenging task since the view changes are influenced by various factors, such as the global context surrounding the scene and local motion dynamics. In…

Computer Vision and Pattern Recognition · Computer Science 2021-10-25 Jaehoon Cho, Jiyoung Lee, Changjae Oh, Wonil Song, Kwanghoon Sohn

Future Frame Prediction of a Video Sequence

Predicting future frames of a video sequence has been a problem of high interest in the field of Computer Vision as it caters to a multitude of applications. The ability to predict, anticipate and reason about future events is the essence…

Computer Vision and Pattern Recognition · Computer Science 2020-09-04 Jasmeen Kaur, Sukhendu Das

Unsupervised learning from video to detect foreground objects in single images

Unsupervised learning from visual data is one of the most difficult challenges in computer vision, being a fundamental task for understanding how visual recognition works. From a practical point of view, learning from unsupervised visual…

Computer Vision and Pattern Recognition · Computer Science 2017-04-03 Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu

Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition

With the explosive growth of video data in real-world applications, a comprehensive representation of videos becomes increasingly important. In this paper, we address the problem of video scene recognition, whose goal is to learn a…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Xuzheng Yu, Chen Jiang, Wei Zhang, Tian Gan, Linlin Chao, Jianan Zhao, Yuan Cheng, Qingpei Guo, Wei Chu

Guiding Video Prediction with Explicit Procedural Knowledge

We propose a general way to integrate procedural knowledge of a domain into deep learning models. We apply it to the case of video prediction, building on top of object-centric deep models and show that this leads to a better performance…

Computer Vision and Pattern Recognition · Computer Science 2024-06-27 Patrick Takenaka, Johannes Maucher, Marco F. Huber

To Learn or Not to Learn: Visual Localization from Essential Matrices

Visual localization is the problem of estimating a camera within a scene and a key component in computer vision applications such as self-driving cars and Mixed Reality. State-of-the-art approaches for accurate visual localization use…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Qunjie Zhou, Torsten Sattler, Marc Pollefeys, Laura Leal-Taixe

DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

While deep neural networks have led to human-level performance on computer vision tasks, they have yet to demonstrate similar gains for holistic scene understanding. In particular, 3D context has been shown to be an extremely important cue…

Computer Vision and Pattern Recognition · Computer Science 2017-08-17 Yinda Zhang, Mingru Bai, Pushmeet Kohli, Shahram Izadi, Jianxiong Xiao