Related papers: MAST: A Memory-Augmented Self-supervised Tracker

200 papers

The objective of this paper is self-supervised representation learning, with the goal of solving semi-supervised video object segmentation (a.k.a. dense tracking). We make the following contributions: (i) we propose to improve the existing…

Computer Vision and Pattern Recognition · Computer Science · 2020-06-23 · Fangrui Zhu, Li Zhang, Yanwei Fu, Guodong Guo, Weidi Xie

Multi-Object Tracking (MOT) is a task that has a lot of potential for development, and there are still many problems to be solved. In the traditional tracking-by-detection paradigm, there has been a lot of work on feature based object…

Computer Vision and Pattern Recognition · Computer Science · 2020-10-27 · Tae-young Chung, Heansung Lee, Myeong Ah Cho, Suhwan Cho, Sangyoun Lee

Distinguishing visually similar objects by their motion remains a critical challenge in computer vision. Although supervised trackers show promise, contemporary self-supervised trackers struggle when visual cues become ambiguous, limiting…

Computer Vision and Pattern Recognition · Computer Science · 2025-12-03 · Chenshuang Zhang, Kang Zhang, Joon Son Chung, In So Kweon, Junmo Kim, Chengzhi Mao

Deep-learning metrics have recently demonstrated extremely good performance in matching image patches for stereo reconstruction. However, training such metrics requires a large amount of labeled stereo images, which can be difficult or costly to…

Computer Vision and Pattern Recognition · Computer Science · 2016-12-06 · Stepan Tulyakov, Anton Ivanov, Francois Fleuret

We propose an unsupervised visual tracking method in this paper. Different from existing approaches using extensive annotated data for supervised learning, our CNN model is trained on large-scale unlabeled videos in an unsupervised manner.…

Computer Vision and Pattern Recognition · Computer Science · 2019-04-04 · Ning Wang, Yibing Song, Chao Ma, Wengang Zhou, Wei Liu, Houqiang Li

The advancement of visual tracking has continuously been brought by deep learning models. Typically, supervised learning is employed to train these models with expensive labeled data. In order to reduce the workload of manual annotations…

Computer Vision and Pattern Recognition · Computer Science · 2020-07-24 · Ning Wang, Wengang Zhou, Yibing Song, Chao Ma, Wei Liu, Houqiang Li

In the realm of music information retrieval, similarity-based retrieval and auto-tagging serve as essential components. Given the limitations and non-scalability of human supervision signals, it becomes crucial for models to learn from…

Under certain statistical assumptions of noise, recent self-supervised approaches for denoising have been introduced to learn network parameters without true clean images, and these methods can restore an image by exploiting information…

Computer Vision and Pattern Recognition · Computer Science · 2020-01-10 · Seunghwan Lee, Donghyeon Cho, Jiwon Kim, Tae Hyun Kim

Unsupervised learning has been popular in various computer vision tasks, including visual object tracking. However, prior unsupervised tracking approaches rely heavily on spatial supervision from template-search pairs and are still unable…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-05 · Qiuhong Shen, Lei Qiao, Jinyang Guo, Peixia Li, Xin Li, Bo Li, Weitao Feng, Weihao Gan, Wei Wu, Wanli Ouyang

We present DINO-Tracker -- a new framework for long-term dense tracking in video. The pillar of our approach is combining test-time training on a single video, with the powerful localized semantic features learned by a pre-trained DINO-ViT…

Computer Vision and Pattern Recognition · Computer Science · 2024-07-12 · Narek Tumanyan, Assaf Singer, Shai Bagon, Tali Dekel

Self-supervised multi-object trackers have tremendous potential as they enable learning from raw domain-specific data. However, their re-identification accuracy still falls short compared to their supervised counterparts. We hypothesize…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-22 · Christopher Lang, Alexander Braun, Lars Schillingmann, Abhinav Valada

Online Multi-Object Tracking (MOT) from videos is a challenging computer vision task which has been extensively studied for decades. Most of the existing MOT algorithms are based on the Tracking-by-Detection (TBD) paradigm combined with…

Computer Vision and Pattern Recognition · Computer Science · 2019-04-10 · Zhen He, Jian Li, Daxue Liu, Hangen He, David Barber

Spatially dense self-supervised learning is a rapidly growing problem domain with promising applications for unsupervised segmentation and pretraining for dense downstream tasks. Despite the abundance of temporal data in the form of videos,…

Computer Vision and Pattern Recognition · Computer Science · 2023-08-24 · Mohammadreza Salehi, Efstratios Gavves, Cees G. M. Snoek, Yuki M. Asano

Multi-object tracking has seen a lot of progress recently, albeit with substantial annotation costs for developing better and larger labeled datasets. In this work, we remove the need for annotated datasets by proposing an unsupervised…

Computer Vision and Pattern Recognition · Computer Science · 2020-06-05 · Shyamgopal Karthik, Ameya Prabhu, Vineet Gandhi

Self-supervised learning has emerged as a powerful paradigm for label-free model pretraining, particularly in the video domain, where manual annotation is costly and time-intensive. However, existing self-supervised approaches employ…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-09 · Akash Kumar, Ashlesha Kumar, Vibhav Vineet, Yogesh S Rawat

Given the difficulty of manually annotating motion in video, the current best motion estimation methods are trained with synthetic data, and therefore struggle somewhat due to a train/test gap. Self-supervised methods hold the promise of…

Computer Vision and Pattern Recognition · Computer Science · 2024-02-20 · Xinglong Sun, Adam W. Harley, Leonidas J. Guibas

Per-pixel ground-truth depth data is challenging to acquire at scale. To overcome this limitation, self-supervised learning has emerged as a promising alternative for training models to perform monocular depth estimation. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science · 2019-08-20 · Clément Godard, Oisin Mac Aodha, Michael Firman, Gabriel Brostow

Supervised training of deep neural networks on pairs of clean images and noisy measurements achieves state-of-the-art performance for many image reconstruction tasks, but such training pairs are difficult to collect. Self-supervised methods…

Image and Video Processing · Electrical Eng. & Systems · 2023-10-30 · Tobit Klug, Dogukan Atik, Reinhard Heckel

Depth completion is an important vision task, and many efforts have been made to enhance the quality of depth maps from sparse depth measurements. Despite significant advances, training these models to recover dense depth from sparse…

Computer Vision and Pattern Recognition · Computer Science · 2025-07-22 · Rizhao Fan, Zhigen Li, Heping Li, Ning An

Retail scenes usually contain a high number of densely packed objects in each image. Standard object detection techniques use a fully supervised training methodology. This is highly costly as annotating a large dense retail object detection…

Computer Vision and Pattern Recognition · Computer Science · 2021-07-06 · Jaydeep Chauhan, Srikrishna Varadarajan, Muktabh Mayank Srivastava