Related papers: Convolutional Tensor-Train LSTM for Spatio-tempora…

Comparison of Spatiotemporal Networks for Learning Video Related Tasks

Many methods for learning from video sequences involve temporally processing 2D CNN features from the individual frames or directly utilizing 3D convolutions within high-performing 2D CNN architectures. The focus typically remains on how to…

Computer Vision and Pattern Recognition · Computer Science · 2020-09-17 · Logan Courtney, Ramavarapu Sreenivas

MS-LSTM: Exploring Spatiotemporal Multiscale Representations in Video Prediction Domain

The drastic variation of motion in spatial and temporal dimensions makes the video prediction task extremely challenging. Existing RNN models obtain higher performance by deepening or widening the model. They obtain the multi-scale features…

Computer Vision and Pattern Recognition · Computer Science · 2024-02-19 · Zhifeng Ma, Hao Zhang, Jie Liu

Unsupervised Learning of Video Representations using LSTMs

We use multilayer Long Short Term Memory (LSTM) networks to learn representations of video sequences. Our model uses an encoder LSTM to map an input sequence into a fixed length representation. This representation is decoded using single or…

Machine Learning · Computer Science · 2016-01-05 · Nitish Srivastava, Elman Mansimov, Ruslan Salakhutdinov

Exploring Transformer-Augmented LSTM for Temporal and Spatial Feature Learning in Trajectory Prediction

Accurate vehicle trajectory prediction is crucial for ensuring safe and efficient autonomous driving. This work explores the integration of Transformer based model with Long Short-Term Memory (LSTM) based technique to enhance spatial and…

Robotics · Computer Science · 2024-12-19 · Chandra Raskoti, Weizi Li

STSM: Spatio-Temporal Shift Module for Efficient Action Recognition

The modeling, computational cost, and accuracy of traditional Spatio-temporal networks are the three most concentrated research topics in video action recognition. The traditional 2D convolution has a low computational cost, but it cannot…

Computer Vision and Pattern Recognition · Computer Science · 2021-12-07 · Zhaoqilin Yang, Gaoyun An

Deep Learning Based Motion Planning For Autonomous Vehicle Using Spatiotemporal LSTM Network

Motion Planning, as a fundamental technology of automatic navigation for the autonomous vehicle, is still an open challenging issue in the real-life traffic situation and is mostly applied by the model-based approaches. However, due to the…

Computer Vision and Pattern Recognition · Computer Science · 2019-03-06 · Zhengwei Bai, Baigen Cai, Wei Shangguan, Linguo Chai

Spatially Supervised Recurrent Convolutional Neural Networks for Visual Object Tracking

In this paper, we develop a new approach of spatially supervised recurrent convolutional neural networks for visual object tracking. Our recurrent convolutional network exploits the history of locations as well as the distinctive visual…

Computer Vision and Pattern Recognition · Computer Science · 2016-07-21 · Guanghan Ning, Zhi Zhang, Chen Huang, Zhihai He, Xiaobo Ren, Haohong Wang

Future Semantic Segmentation with Convolutional LSTM

We consider the problem of predicting semantic segmentation of future frames in a video. Given several observed frames in a video, our goal is to predict the semantic segmentation map of future frames that are not yet observed. A reliable…

Computer Vision and Pattern Recognition · Computer Science · 2018-07-23 · Seyed Shahabeddin Nabavi, Mrigank Rochan, Yang Wang

Compressing Recurrent Neural Networks with Tensor Ring for Action Recognition

Recurrent Neural Networks (RNNs) and their variants, such as Long-Short Term Memory (LSTM) networks, and Gated Recurrent Unit (GRU) networks, have achieved promising performance in sequential data modeling. The hidden layers in RNNs can be…

Computer Vision and Pattern Recognition · Computer Science · 2018-11-20 · Yu Pan, Jing Xu, Maolin Wang, Jinmian Ye, Fei Wang, Kun Bai, Zenglin Xu

Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation

Semantic video segmentation is a key challenge for various applications. This paper presents a new model named Noisy-LSTM, which is trainable in an end-to-end manner, with convolutional LSTMs (ConvLSTMs) to leverage the temporal coherency…

Computer Vision and Pattern Recognition · Computer Science · 2020-10-20 · Bowen Wang, Liangzhi Li, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara, Yasushi Yagi

Relational Long Short-Term Memory for Video Action Recognition

Spatial and temporal relationships, both short-range and long-range, between objects in videos, are key cues for recognizing actions. It is a challenging problem to model them jointly. In this paper, we first present a new variant of Long…

Computer Vision and Pattern Recognition · Computer Science · 2020-04-28 · Zexi Chen, Bharathkumar Ramachandra, Tianfu Wu, Ranga Raju Vatsavai

Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos

Action recognition greatly benefits motion understanding in video analysis. Recurrent networks such as long short-term memory (LSTM) networks are a popular choice for motion-aware sequence learning tasks. Recently, a convolutional extension…

Computer Vision and Pattern Recognition · Computer Science · 2019-08-27 · Sebastian Agethen, Winston H. Hsu

Sparse and Low-Rank High-Order Tensor Regression via Parallel Proximal Method

Recently, tensor data (or multidimensional array) have been generated in many modern applications, such as functional magnetic resonance imaging (fMRI) in neuroscience and videos in video analysis. Many efforts are made in recent years to…

Machine Learning · Computer Science · 2023-08-10 · Jiaqi Zhang, Yinghao Cai, Zhaoyang Wang, Beilun Wang

Low-Latency Human Action Recognition with Weighted Multi-Region Convolutional Neural Network

Spatio-temporal contexts are crucial in understanding human actions in videos. Recent state-of-the-art Convolutional Neural Network (ConvNet) based action recognition systems frequently involve 3D spatio-temporal ConvNet filters, chunking…

Computer Vision and Pattern Recognition · Computer Science · 2018-05-09 · Yunfeng Wang, Wengang Zhou, Qilin Zhang, Xiaotian Zhu, Houqiang Li

Temporal Convolutional Networks for Action Segmentation and Detection

The ability to identify and temporally segment fine-grained human actions throughout a video is crucial for robotics, surveillance, education, and beyond. Typical approaches decouple this problem by first extracting local spatiotemporal…

Computer Vision and Pattern Recognition · Computer Science · 2016-11-17 · Colin Lea, Michael D. Flynn, Rene Vidal, Austin Reiter, Gregory D. Hager

Temporal-Spatial Mapping for Action Recognition

Deep learning models have enjoyed great success for image related computer vision tasks like image classification and object detection. For video related tasks like human action recognition, however, the advancements are not as significant…

Computer Vision and Pattern Recognition · Computer Science · 2018-09-12 · Xiaolin Song, Cuiling Lan, Wenjun Zeng, Junliang Xing, Jingyu Yang, Xiaoyan Sun

Lattice Long Short-Term Memory for Human Action Recognition

Human actions captured in video sequences are three-dimensional signals characterizing visual appearance and motion dynamics. To learn action patterns, existing methods adopt Convolutional and/or Recurrent Neural Networks (CNNs and RNNs).…

Computer Vision and Pattern Recognition · Computer Science · 2017-08-15 · Lin Sun, Kui Jia, Kevin Chen, Dit Yan Yeung, Bertram E. Shi, Silvio Savarese

Deep Learning with Long Short-Term Memory for Time Series Prediction

Time series prediction can be generalized as a process that extracts useful information from historical records and then determines future values. Learning long-range dependencies that are embedded in time series is often an obstacle for…

Neural and Evolutionary Computing · Computer Science · 2018-10-25 · Yuxiu Hua, Zhifeng Zhao, Rongpeng Li, Xianfu Chen, Zhiming Liu, Honggang Zhang

Wider and Deeper, Cheaper and Faster: Tensorized LSTMs for Sequence Learning

Long Short-Term Memory (LSTM) is a popular approach to boosting the ability of Recurrent Neural Networks to store longer term temporal information. The capacity of an LSTM network can be increased by widening and adding layers. However,…

Machine Learning · Statistics · 2017-12-14 · Zhen He, Shaobing Gao, Liang Xiao, Daxue Liu, Hangen He, David Barber

Learning from Videos with Deep Convolutional LSTM Networks

This paper explores the use of convolution LSTMs to simultaneously learn spatial- and temporal-information in videos. A deep network of convolutional LSTMs allows the model to access the entire range of temporal information at all spatial…

Computer Vision and Pattern Recognition · Computer Science · 2019-04-10 · Logan Courtney, Ramavarapu Sreenivas