Related papers: Deep Spatio-Temporal Random Fields for Efficient V…

Fast, Exact and Multi-Scale Inference for Semantic Image Segmentation with Deep Gaussian CRFs

In this work we propose a structured prediction technique that combines the virtues of Gaussian Conditional Random Fields (G-CRF) with Deep Learning: (a) our structured prediction task has a unique global optimum that is obtained exactly…

Computer Vision and Pattern Recognition · Computer Science 2016-11-30 Siddhartha Chandra , Iasonas Kokkinos

Deep Gaussian Markov Random Fields for Graph-Structured Dynamical Systems

Probabilistic inference in high-dimensional state-space models is computationally challenging. For many spatiotemporal systems, however, prior knowledge about the dependency structure of state variables is available. We leverage this…

Machine Learning · Computer Science 2024-08-09 Fiona Lippert , Bart Kranstauber , E. Emiel van Loon , Patrick Forré

Streaming Gaussian Dirichlet Random Fields for Spatial Predictions of High Dimensional Categorical Observations

We present the Streaming Gaussian Dirichlet Random Field (S-GDRF) model, a novel approach for modeling a stream of spatiotemporally distributed, sparse, high-dimensional categorical observations. The proposed approach efficiently learns…

Robotics · Computer Science 2024-02-26 J. E. San Soucie , H. M. Sosik , Y. Girdhar

Asynchronous Temporal Fields for Action Recognition

Actions are more than just movements and trajectories: we cook to eat and we hold a cup to drink from it. A thorough understanding of videos requires going beyond appearance modeling and necessitates reasoning about the sequence of…

Computer Vision and Pattern Recognition · Computer Science 2017-07-25 Gunnar A. Sigurdsson , Santosh Divvala , Ali Farhadi , Abhinav Gupta

Semantic Video Segmentation : Exploring Inference Efficiency

We explore the efficiency of the CRF inference beyond image level semantic segmentation and perform joint inference in video frames. The key idea is to combine best of two worlds: semantic co-labeling and more expressive models. Our…

Computer Vision and Pattern Recognition · Computer Science 2015-09-09 Subarna Tripathi , Serge Belongie , Youngbae Hwang , Truong Nguyen

Efficient Video Segmentation Models with Per-frame Inference

Most existing real-time deep models trained with each frame independently may produce inconsistent results across the temporal axis when tested on a video sequence. A few methods take the correlations in the video sequence into…

Computer Vision and Pattern Recognition · Computer Science 2022-02-28 Yifan Liu , Chunhua Shen , Changqian Yu , Jingdong Wang

Efficient Global-Local Memory for Real-time Instrument Segmentation of Robotic Surgical Video

Performing a real-time and accurate instrument segmentation from videos is of great significance for improving the performance of robotic-assisted surgery. We identify two important clues for surgical instrument perception, including local…

Computer Vision and Pattern Recognition · Computer Science 2021-09-29 Jiacheng Wang , Yueming Jin , Liansheng Wang , Shuntian Cai , Pheng-Ann Heng , Jing Qin

Deep Gaussian Conditional Random Field Network: A Model-based Deep Network for Discriminative Denoising

We propose a novel deep network architecture for image\\ denoising based on a Gaussian Conditional Random Field (GCRF) model. In contrast to the existing discriminative denoising methods that train a separate model for each noise level, the…

Computer Vision and Pattern Recognition · Computer Science 2015-11-13 Raviteja Vemulapalli , Oncel Tuzel , Ming-Yu Liu

Spatial-Temporal DAG Convolutional Networks for End-to-End Joint Effective Connectivity Learning and Resting-State fMRI Classification

Building comprehensive brain connectomes has proved of fundamental importance in resting-state fMRI (rs-fMRI) analysis. Based on the foundation of brain network, spatial-temporal-based graph convolutional networks have dramatically improved…

Machine Learning · Computer Science 2023-12-19 Rui Yang , Wenrui Dai , Huajun She , Yiping P. Du , Dapeng Wu , Hongkai Xiong

On Joint Estimation of Gaussian Graphical Models for Spatial and Temporal Data

In this paper, we first propose a Bayesian neighborhood selection method to estimate Gaussian Graphical Models (GGMs). We show the graph selection consistency of this method in the sense that the posterior probability of the true model…

Applications · Statistics 2015-07-08 Zhixiang Lin , Tao Wang , Can Yang , Hongyu Zhao

Conditional Random Field and Deep Feature Learning for Hyperspectral Image Segmentation

Image segmentation is considered to be one of the critical tasks in hyperspectral remote sensing image processing. Recently, convolutional neural network (CNN) has established itself as a powerful model in segmentation and classification by…

Computer Vision and Pattern Recognition · Computer Science 2017-12-29 Fahim Irfan Alam , Jun Zhou , Alan Wee-Chung Liew , Xiuping Jia , Jocelyn Chanussot , Yongsheng Gao

Efficient Semantic Video Segmentation with Per-frame Inference

For semantic segmentation, most existing real-time deep models trained with each frame independently may produce inconsistent results for a video sequence. Advanced methods take into considerations the correlations in the video sequence,…

Computer Vision and Pattern Recognition · Computer Science 2020-07-20 Yifan Liu , Chunhua Shen , Changqian Yu , Jingdong Wang

Spatially Encoding Temporal Correlations to Classify Temporal Data Using Convolutional Neural Networks

We propose an off-line approach to explicitly encode temporal patterns spatially as different types of images, namely, Gramian Angular Fields and Markov Transition Fields. This enables the use of techniques from computer vision for feature…

Machine Learning · Computer Science 2015-09-25 Zhiguang Wang , Tim Oates

Unified Graph Structured Models for Video Understanding

Accurate video understanding involves reasoning about the relationships between actors, objects and their environment, often over long temporal intervals. In this paper, we propose a message passing graph neural network that explicitly…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Anurag Arnab , Chen Sun , Cordelia Schmid

A Projected Gradient Descent Method for CRF Inference allowing End-To-End Training of Arbitrary Pairwise Potentials

Are we using the right potential functions in the Conditional Random Field models that are popular in the Vision community? Semantic segmentation and other pixel-level labelling tasks have made significant progress recently due to the deep…

Computer Vision and Pattern Recognition · Computer Science 2018-01-03 Måns Larsson , Anurag Arnab , Fredrik Kahl , Shuai Zheng , Philip Torr

Coarse-Fine Networks for Temporal Activity Detection in Videos

In this paper, we introduce Coarse-Fine Networks, a two-stream architecture which benefits from different abstractions of temporal resolution to learn better video representations for long-term motion. Traditional Video models process…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Kumara Kahatapitiya , Michael S. Ryoo

Finding Temporally Consistent Occlusion Boundaries in Videos using Geometric Context

We present an algorithm for finding temporally consistent occlusion boundaries in videos to support segmentation of dynamic scenes. We learn occlusion boundaries in a pairwise Markov random field (MRF) framework. We first estimate the…

Computer Vision and Pattern Recognition · Computer Science 2016-11-17 S. Hussain Raza , Ahmad Humayun , Matthias Grundmann , David Anderson , Irfan Essa

Exploring Temporal Information for Improved Video Understanding

In this dissertation, I present my work towards exploring temporal information for better video understanding. Specifically, I have worked on two problems: action recognition and semantic segmentation. For action recognition, I have…

Computer Vision and Pattern Recognition · Computer Science 2019-05-28 Yi Zhu

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

Deep convolutional networks have achieved great success for visual recognition in still images. However, for action recognition in videos, the advantage over traditional methods is not so evident. This paper aims to discover the principles…

Computer Vision and Pattern Recognition · Computer Science 2016-08-03 Limin Wang , Yuanjun Xiong , Zhe Wang , Yu Qiao , Dahua Lin , Xiaoou Tang , Luc Van Gool

Structural-RNN: Deep Learning on Spatio-Temporal Graphs

Deep Recurrent Neural Network architectures, though remarkably capable at modeling sequences, lack an intuitive high-level spatio-temporal structure. That is while many problems in computer vision inherently have an underlying high-level…

Computer Vision and Pattern Recognition · Computer Science 2016-04-12 Ashesh Jain , Amir R. Zamir , Silvio Savarese , Ashutosh Saxena