Related papers: Unsupervised Deep Tracking

Unsupervised Deep Representation Learning for Real-Time Tracking

The advancement of visual tracking has continuously been brought by deep learning models. Typically, supervised learning is employed to train these models with expensive labeled data. In order to reduce the workload of manual annotations…

Computer Vision and Pattern Recognition · Computer Science · 2020-07-24 · Ning Wang, Wengang Zhou, Yibing Song, Chao Ma, Wei Liu, Houqiang Li

Unsupervised Learning of Accurate Siamese Tracking

Unsupervised learning has been popular in various computer vision tasks, including visual object tracking. However, prior unsupervised tracking approaches rely heavily on spatial supervision from template-search pairs and are still unable…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-05 · Qiuhong Shen, Lei Qiao, Jinyang Guo, Peixia Li, Xin Li, Bo Li, Weitao Feng, Weihao Gan, Wei Wu, Wanli Ouyang

Unsupervised Learning of Visual Representations using Videos

Is strong supervision necessary for learning a good visual representation? Do we really need millions of semantically-labeled images to train a Convolutional Neural Network (CNN)? In this paper, we present a simple yet surprisingly powerful…

Computer Vision and Pattern Recognition · Computer Science · 2015-10-07 · Xiaolong Wang, Abhinav Gupta

Object-Centric Representation Learning from Unlabeled Videos

Supervised (pre-)training currently yields state-of-the-art performance for representation learning for visual recognition, yet it comes at the cost of (1) intensive manual annotations and (2) an inherent restriction in the scope of data…

Computer Vision and Pattern Recognition · Computer Science · 2016-12-05 · Ruohan Gao, Dinesh Jayaraman, Kristen Grauman

Self-supervised Object Tracking with Cycle-consistent Siamese Networks

Self-supervised learning for visual object tracking possesses valuable advantages compared to supervised learning, such as the non-necessity of laborious human annotations and online training. In this work, we exploit an end-to-end Siamese…

Computer Vision and Pattern Recognition · Computer Science · 2020-08-04 · Weihao Yuan, Michael Yu Wang, Qifeng Chen

Incorporating Scalability in Unsupervised Spatio-Temporal Feature Learning

Deep neural networks are efficient learning machines which leverage upon a large amount of manually labeled data for learning discriminative features. However, acquiring substantial amount of supervised data, especially for videos can be a…

Computer Vision and Pattern Recognition · Computer Science · 2018-08-16 · Sujoy Paul, Sourya Roy, Amit K. Roy-Chowdhury

Self-supervised Visual Feature Learning with Deep Neural Networks: A Survey

Large-scale labeled data are generally required to train deep neural networks in order to obtain better performance in visual feature learning from images or videos for computer vision applications. To avoid extensive cost of collecting and…

Computer Vision and Pattern Recognition · Computer Science · 2019-02-19 · Longlong Jing, Yingli Tian

Unsupervised learning from videos using temporal coherency deep networks

In this work we address the challenging problem of unsupervised learning from videos. Existing methods utilize the spatio-temporal continuity in contiguous video frames as regularization for the learning process. Typically, this temporal…

Computer Vision and Pattern Recognition · Computer Science · 2018-10-12 · Carolina Redondo-Cabrera, Roberto J. López-Sastre

Unsupervised Deep Learning by Neighbourhood Discovery

Deep convolutional neural networks (CNNs) have demonstrated remarkable success in computer vision by supervisedly learning strong visual feature representations. However, training CNNs relies heavily on the availability of exhaustive…

Computer Vision and Pattern Recognition · Computer Science · 2019-05-31 · Jiabo Huang, Qi Dong, Shaogang Gong, Xiatian Zhu

Deep Meta Learning for Real-Time Target-Aware Visual Tracking

In this paper, we propose a novel on-line visual tracking framework based on the Siamese matching network and meta-learner network, which run at real-time speeds. Conventional deep convolutional feature-based discriminative visual tracking…

Computer Vision and Pattern Recognition · Computer Science · 2019-08-19 · Janghoon Choi, Junseok Kwon, Kyoung Mu Lee

Unsupervised Learning of View-invariant Action Representations

The recent success in human action recognition with deep learning methods mostly adopt the supervised learning paradigm, which requires significant amount of manually labeled data to achieve good performance. However, label collection is an…

Computer Vision and Pattern Recognition · Computer Science · 2018-09-07 · Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli

Unsupervised learning of foreground object detection

Unsupervised learning poses one of the most difficult challenges in computer vision today. The task has an immense practical value with many applications in artificial intelligence and emerging technologies, as large quantities of unlabeled…

Computer Vision and Pattern Recognition · Computer Science · 2019-05-28 · Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu

Unsupervised learning from video to detect foreground objects in single images

Unsupervised learning from visual data is one of the most difficult challenges in computer vision, being a fundamental task for understanding how visual recognition works. From a practical point of view, learning from unsupervised visual…

Computer Vision and Pattern Recognition · Computer Science · 2017-04-03 · Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu

Unsupervised Keypoint Learning for Guiding Class-Conditional Video Prediction

We propose a deep video prediction model conditioned on a single image and an action class. To generate future frames, we first detect keypoints of a moving object and predict future motion as a sequence of keypoints. The input image is…

Computer Vision and Pattern Recognition · Computer Science · 2019-10-07 · Yunji Kim, Seonghyeon Nam, In Cho, Seon Joo Kim

Unsupervised Video Continual Learning via Non-Parametric Deep Embedded Clustering

We propose a realistic scenario for the unsupervised video learning where neither task boundaries nor labels are provided when learning a succession of tasks. We also provide a non-parametric learning solution for the under-explored problem…

Computer Vision and Pattern Recognition · Computer Science · 2025-09-01 · Nattapong Kurpukdee, Adrian G. Bors

Learning to Track Objects from Unlabeled Videos

In this paper, we propose to learn an Unsupervised Single Object Tracker (USOT) from scratch. We identify that three major challenges, i.e., moving object discovery, rich temporal variation exploitation, and online update, are the central…

Computer Vision and Pattern Recognition · Computer Science · 2021-08-31 · Jilai Zheng, Chao Ma, Houwen Peng, Xiaokang Yang

A Survey on Self-supervised Pre-training for Sequential Transfer Learning in Neural Networks

Deep neural networks are typically trained under a supervised learning framework where a model learns a single task using labeled data. Instead of relying solely on labeled data, practitioners can harness unlabeled or related data to…

Machine Learning · Computer Science · 2020-07-03 · Huanru Henry Mao

Unsupervised Feature Learning from Temporal Data

Current state-of-the-art classification and detection algorithms rely on supervised training. In this work we study unsupervised feature learning in the context of temporally coherent video data. We focus on feature learning from unlabeled…

Computer Vision and Pattern Recognition · Computer Science · 2015-04-17 · Ross Goroshin, Joan Bruna, Jonathan Tompson, David Eigen, Yann LeCun

Self-Supervised Learning of Face Representations for Video Face Clustering

Analyzing the story behind TV series and movies often requires understanding who the characters are and what they are doing. With improving deep face models, this may seem like a solved problem. However, as face detectors get better,…

Computer Vision and Pattern Recognition · Computer Science · 2019-03-05 · Vivek Sharma, Makarand Tapaswi, M. Saquib Sarfraz, Rainer Stiefelhagen

Multiview Supervision By Registration

This paper presents a semi-supervised learning framework to train a keypoint detector using multiview image streams given the limited labeled data (typically <4%). We leverage the complementary relationship between multiview geometry and…

Computer Vision and Pattern Recognition · Computer Science · 2019-03-26 · Yilun Zhang, Hyun Soo Park