Related papers: Unsupervised Feature Learning from Temporal Data

Unsupervised Learning of Spatiotemporally Coherent Metrics

Current state-of-the-art classification and detection algorithms rely on supervised training. In this work we study unsupervised feature learning in the context of temporally coherent video data. We focus on feature learning from unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2015-09-09 Ross Goroshin, Joan Bruna, Jonathan Tompson, David Eigen, Yann LeCun

Unsupervised learning from videos using temporal coherency deep networks

In this work we address the challenging problem of unsupervised learning from videos. Existing methods utilize the spatio-temporal continuity in contiguous video frames as regularization for the learning process. Typically, this temporal…

Computer Vision and Pattern Recognition · Computer Science 2018-10-12 Carolina Redondo-Cabrera, Roberto J. López-Sastre

Slow and steady feature analysis: higher order temporal coherence in video

How can unlabeled video augment visual learning? Existing methods perform "slow" feature analysis, encouraging the representations of temporally close frames to exhibit only small differences. While this standard approach captures the fact…

Computer Vision and Pattern Recognition · Computer Science 2016-04-15 Dinesh Jayaraman, Kristen Grauman

Incorporating Scalability in Unsupervised Spatio-Temporal Feature Learning

Deep neural networks are efficient learning machines which leverage upon a large amount of manually labeled data for learning discriminative features. However, acquiring substantial amount of supervised data, especially for videos can be a…

Computer Vision and Pattern Recognition · Computer Science 2018-08-16 Sujoy Paul, Sourya Roy, Amit K. Roy-Chowdhury

Object-Centric Representation Learning from Unlabeled Videos

Supervised (pre-)training currently yields state-of-the-art performance for representation learning for visual recognition, yet it comes at the cost of (1) intensive manual annotations and (2) an inherent restriction in the scope of data…

Computer Vision and Pattern Recognition · Computer Science 2016-12-05 Ruohan Gao, Dinesh Jayaraman, Kristen Grauman

Unsupervised Feature Learning in Remote Sensing

The need for labeled data is among the most common and well-known practical obstacles to deploying deep learning algorithms to solve real-world problems. The current generation of learning algorithms requires a large volume of data labeled…

Computer Vision and Pattern Recognition · Computer Science 2019-09-24 Aaron Reite, Scott Kangas, Zackery Steck, Steven Goley, Jonathan Von Stroh, Steven Forsyth

Unsupervised Deep Representation Learning for Real-Time Tracking

The advancement of visual tracking has continuously been brought by deep learning models. Typically, supervised learning is employed to train these models with expensive labeled data. In order to reduce the workload of manual annotations…

Computer Vision and Pattern Recognition · Computer Science 2020-07-24 Ning Wang, Wengang Zhou, Yibing Song, Chao Ma, Wei Liu, Houqiang Li

Unsupervised Deep Tracking

We propose an unsupervised visual tracking method in this paper. Different from existing approaches using extensive annotated data for supervised learning, our CNN model is trained on large-scale unlabeled videos in an unsupervised manner.…

Computer Vision and Pattern Recognition · Computer Science 2019-04-04 Ning Wang, Yibing Song, Chao Ma, Wengang Zhou, Wei Liu, Houqiang Li

Semi-supervised Tuning from Temporal Coherence

Recent works demonstrated the usefulness of temporal coherence to regularize supervised training or to learn invariant features with deep architectures. In particular, enforcing smooth output changes while presenting temporally-closed…

Machine Learning · Computer Science 2016-01-05 Davide Maltoni, Vincenzo Lomonaco

Unsupervised learning of action classes with continuous temporal embedding

The task of temporally detecting and segmenting actions in untrimmed videos has seen an increased attention recently. One problem in this context arises from the need to define and label action boundaries to create annotations for training…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Anna Kukleva, Hilde Kuehne, Fadime Sener, Juergen Gall

Exploiting Temporality for Semi-Supervised Video Segmentation

In recent years, there has been remarkable progress in supervised image segmentation. Video segmentation is less explored, despite the temporal dimension being highly informative. Semantic labels, e.g. that cannot be accurately detected in…

Computer Vision and Pattern Recognition · Computer Science 2019-08-30 Radu Sibechi, Olaf Booij, Nora Baka, Peter Bloem

Joint Visual-Temporal Embedding for Unsupervised Learning of Actions in Untrimmed Sequences

Understanding the structure of complex activities in untrimmed videos is a challenging task in the area of action recognition. One problem here is that this task usually requires a large amount of hand-annotated minute- or even hour-long…

Computer Vision and Pattern Recognition · Computer Science 2020-10-01 Rosaura G. VidalMata, Walter J. Scheirer, Anna Kukleva, David Cox, Hilde Kuehne

Temporally Coherent Embeddings for Self-Supervised Video Representation Learning

This paper presents TCE: Temporally Coherent Embeddings for self-supervised video representation learning. The proposed method exploits inherent structure of unlabeled video data to explicitly enforce temporal coherency in the embedding…

Computer Vision and Pattern Recognition · Computer Science 2020-11-18 Joshua Knights, Ben Harwood, Daniel Ward, Anthony Vanderkop, Olivia Mackenzie-Ross, Peyman Moghadam

Unsupervised feature learning with discriminative encoder

In recent years, deep discriminative models have achieved extraordinary performance on supervised learning tasks, significantly outperforming their generative counterparts. However, their success relies on the presence of a large amount of…

Computer Vision and Pattern Recognition · Computer Science 2017-09-05 Gaurav Pandey, Ambedkar Dukkipati

Exploiting Temporal Coherence for Self-Supervised One-shot Video Re-identification

While supervised techniques in re-identification are extremely effective, the need for large amounts of annotations makes them impractical for large camera networks. One-shot re-identification, which uses a singular labeled tracklet for…

Computer Vision and Pattern Recognition · Computer Science 2020-07-23 Dripta S. Raychaudhuri, Amit K. Roy-Chowdhury

Learning Correspondence from the Cycle-Consistency of Time

We introduce a self-supervised method for learning visual correspondence from unlabeled video. The main idea is to use cycle-consistency in time as free supervisory signal for learning visual representations from scratch. At training time,…

Computer Vision and Pattern Recognition · Computer Science 2019-04-03 Xiaolong Wang, Allan Jabri, Alexei A. Efros

Unsupervised Representation Learning by Sorting Sequences

We present an unsupervised representation learning approach using videos without semantic labels. We leverage the temporal coherence as a supervisory signal by formulating representation learning as a sequence sorting task. We take…

Computer Vision and Pattern Recognition · Computer Science 2017-08-04 Hsin-Ying Lee, Jia-Bin Huang, Maneesh Singh, Ming-Hsuan Yang

Unsupervised Spatio-temporal Latent Feature Clustering for Multiple-object Tracking and Segmentation

Assigning consistent temporal identifiers to multiple moving objects in a video sequence is a challenging problem. A solution to that problem would have immediate ramifications in multiple object tracking and segmentation problems. We…

Computer Vision and Pattern Recognition · Computer Science 2021-11-08 Abubakar Siddique, Reza Jalil Mozhdehi, Henry Medeiros

Spatio-Temporal Action Localization in a Weakly Supervised Setting

Enabling computational systems with the ability to localize actions in video-based content has manifold applications. Traditionally, such a problem is approached in a fully-supervised setting where video-clips with complete frame-by-frame…

Computer Vision and Pattern Recognition · Computer Science 2019-05-07 Kurt Degiorgio, Fabio Cuzzolin

Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey

Large-scale labeled data are generally required to train deep neural networks in order to obtain better performance in visual feature learning from images or videos for computer vision applications. To avoid extensive cost of collecting and…

Computer Vision and Pattern Recognition · Computer Science 2019-02-19 Longlong Jing, Yingli Tian