Related papers: MAST: A Memory-Augmented Self-supervised Tracker

Self-supervised Video Object Segmentation

The objective of this paper is self-supervised representation learning, with the goal of solving semi-supervised video object segmentation (a.k.a. dense tracking). We make the following contributions: (i) we propose to improve the existing…

Computer Vision and Pattern Recognition · Computer Science 2020-06-23 Fangrui Zhu , Li Zhang , Yanwei Fu , Guodong Guo , Weidi Xie

Multi-object tracking with self-supervised associating network

Multi-Object Tracking (MOT) is the task that has a lot of potential for development, and there are still many problems to be solved. In the traditional tracking by detection paradigm, There has been a lot of work on feature based object…

Computer Vision and Pattern Recognition · Computer Science 2020-10-27 Tae-young Chung , Heansung Lee , Myeong Ah Cho , Suhwan Cho , Sangyoun Lee

Video Diffusion Models Excel at Tracking Similar-Looking Objects Without Supervision

Distinguishing visually similar objects by their motion remains a critical challenge in computer vision. Although supervised trackers show promise, contemporary self-supervised trackers struggle when visual cues become ambiguous, limiting…

Computer Vision and Pattern Recognition · Computer Science 2025-12-03 Chenshuang Zhang , Kang Zhang , Joon Son Chung , In So Kweon , Junmo Kim , Chengzhi Mao

Semi-supervised learning of deep metrics for stereo reconstruction

Deep-learning metrics have recently demonstrated extremely good performance to match image patches for stereo reconstruction. However, training such metrics requires large amount of labeled stereo images, which can be difficult or costly to…

Computer Vision and Pattern Recognition · Computer Science 2016-12-06 Stepan Tulyakov , Anton Ivanov , Francois Fleuret

Unsupervised Deep Tracking

We propose an unsupervised visual tracking method in this paper. Different from existing approaches using extensive annotated data for supervised learning, our CNN model is trained on large-scale unlabeled videos in an unsupervised manner.…

Computer Vision and Pattern Recognition · Computer Science 2019-04-04 Ning Wang , Yibing Song , Chao Ma , Wengang Zhou , Wei Liu , Houqiang Li

Unsupervised Deep Representation Learning for Real-Time Tracking

The advancement of visual tracking has continuously been brought by deep learning models. Typically, supervised learning is employed to train these models with expensive labeled data. In order to reduce the workload of manual annotations…

Computer Vision and Pattern Recognition · Computer Science 2020-07-24 Ning Wang , Wengang Zhou , Yibing Song , Chao Ma , Wei Liu , Houqiang Li

Self-supervised Auxiliary Loss for Metric Learning in Music Similarity-based Retrieval and Auto-tagging

In the realm of music information retrieval, similarity-based retrieval and auto-tagging serve as essential components. Given the limitations and non-scalability of human supervision signals, it becomes crucial for models to learn from…

Sound · Computer Science 2023-04-18 Taketo Akama , Hiroaki Kitano , Katsuhiro Takematsu , Yasushi Miyajima , Natalia Polouliakh

Self-Supervised Fast Adaptation for Denoising via Meta-Learning

Under certain statistical assumptions of noise, recent self-supervised approaches for denoising have been introduced to learn network parameters without true clean images, and these methods can restore an image by exploiting information…

Computer Vision and Pattern Recognition · Computer Science 2020-01-10 Seunghwan Lee , Donghyeon Cho , Jiwon Kim , Tae Hyun Kim

Unsupervised Learning of Accurate Siamese Tracking

Unsupervised learning has been popular in various computer vision tasks, including visual object tracking. However, prior unsupervised tracking approaches rely heavily on spatial supervision from template-search pairs and are still unable…

Computer Vision and Pattern Recognition · Computer Science 2022-04-05 Qiuhong Shen , Lei Qiao , Jinyang Guo , Peixia Li , Xin Li , Bo Li , Weitao Feng , Weihao Gan , Wei Wu , Wanli Ouyang

DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video

We present DINO-Tracker -- a new framework for long-term dense tracking in video. The pillar of our approach is combining test-time training on a single video, with the powerful localized semantic features learned by a pre-trained DINO-ViT…

Computer Vision and Pattern Recognition · Computer Science 2024-07-12 Narek Tumanyan , Assaf Singer , Shai Bagon , Tali Dekel

Self-Supervised Multi-Object Tracking For Autonomous Driving From Consistency Across Timescales

Self-supervised multi-object trackers have tremendous potential as they enable learning from raw domain-specific data. However, their re-identification accuracy still falls short compared to their supervised counterparts. We hypothesize…

Computer Vision and Pattern Recognition · Computer Science 2023-09-22 Christopher Lang , Alexander Braun , Lars Schillingmann , Abhinav Valada

Tracking by Animation: Unsupervised Learning of Multi-Object Attentive Trackers

Online Multi-Object Tracking (MOT) from videos is a challenging computer vision task which has been extensively studied for decades. Most of the existing MOT algorithms are based on the Tracking-by-Detection (TBD) paradigm combined with…

Computer Vision and Pattern Recognition · Computer Science 2019-04-10 Zhen He , Jian Li , Daxue Liu , Hangen He , David Barber

Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations

Spatially dense self-supervised learning is a rapidly growing problem domain with promising applications for unsupervised segmentation and pretraining for dense downstream tasks. Despite the abundance of temporal data in the form of videos,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-24 Mohammadreza Salehi , Efstratios Gavves , Cees G. M. Snoek , Yuki M. Asano

Simple Unsupervised Multi-Object Tracking

Multi-object tracking has seen a lot of progress recently, albeit with substantial annotation costs for developing better and larger labeled datasets. In this work, we remove the need for annotated datasets by proposing an unsupervised…

Computer Vision and Pattern Recognition · Computer Science 2020-06-05 Shyamgopal Karthik , Ameya Prabhu , Vineet Gandhi

A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning

Self-supervised learning has emerged as a powerful paradigm for label-free model pretraining, particularly in the video domain, where manual annotation is costly and time-intensive. However, existing self-supervised approaches employ…

Computer Vision and Pattern Recognition · Computer Science 2025-04-09 Akash Kumar , Ashlesha Kumar , Vibhav Vineet , Yogesh S Rawat

Refining Pre-Trained Motion Models

Given the difficulty of manually annotating motion in video, the current best motion estimation methods are trained with synthetic data, and therefore struggle somewhat due to a train/test gap. Self-supervised methods hold the promise of…

Computer Vision and Pattern Recognition · Computer Science 2024-02-20 Xinglong Sun , Adam W. Harley , Leonidas J. Guibas

Digging Into Self-Supervised Monocular Depth Estimation

Per-pixel ground-truth depth data is challenging to acquire at scale. To overcome this limitation, self-supervised learning has emerged as a promising alternative for training models to perform monocular depth estimation. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2019-08-20 Clément Godard , Oisin Mac Aodha , Michael Firman , Gabriel Brostow

Analyzing the Sample Complexity of Self-Supervised Image Reconstruction Methods

Supervised training of deep neural networks on pairs of clean image and noisy measurement achieves state-of-the-art performance for many image reconstruction tasks, but such training pairs are difficult to collect. Self-supervised methods…

Image and Video Processing · Electrical Eng. & Systems 2023-10-30 Tobit Klug , Dogukan Atik , Reinhard Heckel

Training Self-Supervised Depth Completion Using Sparse Measurements and a Single Image

Depth completion is an important vision task, and many efforts have been made to enhance the quality of depth maps from sparse depth measurements. Despite significant advances, training these models to recover dense depth from sparse…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Rizhao Fan , Zhigen Li , Heping Li , Ning An

Semi-supervised Learning for Dense Object Detection in Retail Scenes

Retail scenes usually contain densely packed high number of objects in each image. Standard object detection techniques use fully supervised training methodology. This is highly costly as annotating a large dense retail object detection…

Computer Vision and Pattern Recognition · Computer Science 2021-07-06 Jaydeep Chauhan , Srikrishna Varadarajan , Muktabh Mayank Srivastava