Related papers: Learning Tracking Representations from Single Poin…

RTrack: Accelerating Convergence for Visual Object Tracking via Pseudo-Boxes Exploration

Single object tracking (SOT) heavily relies on the representation of the target object as a bounding box. However, due to the potential deformation and rotation experienced by the tracked targets, the genuine bounding box fails to capture…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Guotian Zeng, Bi Zeng, Hong Zhang, Jianqi Liu, Qingmao Wei

Decoupled Spatio-Temporal Consistency Learning for Self-Supervised Tracking

The success of visual tracking has been largely driven by datasets with manual box annotations. However, these box annotations require tremendous human effort, limiting the scale and diversity of existing tracking datasets. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2025-07-30 Yaozong Zheng, Bineng Zhong, Qihua Liang, Ning Li, Shuxiang Song

An Adaptive Supervision Framework for Active Learning in Object Detection

Active learning approaches in computer vision generally involve querying strong labels for data. However, previous works have shown that weak supervision can be effective in training models for vision tasks while greatly reducing annotation…

Computer Vision and Pattern Recognition · Computer Science 2019-10-16 Sai Vikas Desai, Akshay L Chandra, Wei Guo, Seishi Ninomiya, Vineeth N Balasubramanian

Training object class detectors with click supervision

Training object class detectors typically requires a large set of images with objects annotated by bounding boxes. However, manually drawing bounding boxes is very time consuming. In this paper we greatly reduce annotation time by proposing…

Computer Vision and Pattern Recognition · Computer Science 2017-05-22 Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari

Accelerated Video Annotation driven by Deep Detector and Tracker

Annotating object ground truth in videos is vital for several downstream tasks in robot perception and machine learning, such as for evaluating the performance of an object tracker or training an image-based object detector. The accuracy of…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Eric Price, Aamir Ahmad

Improving Weakly-Supervised Object Localization By Micro-Annotation

Weakly-supervised object localization methods tend to fail for object classes that consistently co-occur with the same background elements, e.g. trains on tracks. We propose a method to overcome these failures by adding a very small amount…

Computer Vision and Pattern Recognition · Computer Science 2016-05-19 Alexander Kolesnikov, Christoph H. Lampert

Semi-TCL: Semi-Supervised Track Contrastive Representation Learning

Online tracking of multiple objects in videos requires a strong capacity for modeling and matching object appearances. Previous methods for learning appearance embeddings mostly rely on instance-level matching without considering the temporal…

Computer Vision and Pattern Recognition · Computer Science 2021-07-07 Wei Li, Yuanjun Xiong, Shuo Yang, Mingze Xu, Yongxin Wang, Wei Xia

Dynamic Supervisor for Cross-dataset Object Detection

The application of cross-dataset training in object detection tasks is complicated because the inconsistency in the category range across datasets transforms fully supervised learning into semi-supervised learning. To address this problem,…

Computer Vision and Pattern Recognition · Computer Science 2022-04-04 Ze Chen, Zhihang Fu, Jianqiang Huang, Mingyuan Tao, Shengyu Li, Rongxin Jiang, Xiang Tian, Yaowu Chen, Xian-sheng Hua

Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos

We propose a unified point cloud video self-supervised learning framework for object-centric and scene-centric data. Previous methods commonly conduct representation learning at the clip or frame level and cannot well capture fine-grained…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Xiaoxiao Sheng, Zhiqiang Shen, Gang Xiao, Longguang Wang, Yulan Guo, Hehe Fan

Towards Noise-resistant Object Detection with Noisy Annotations

Training deep object detectors requires a significant amount of human-annotated images with accurate object labels and bounding box coordinates, which are extremely expensive to acquire. Noisy annotations are much more easily accessible, but…

Computer Vision and Pattern Recognition · Computer Science 2020-03-04 Junnan Li, Caiming Xiong, Richard Socher, Steven Hoi

Trustable Co-label Learning from Multiple Noisy Annotators

Supervised deep learning depends on massive accurately annotated examples, which is usually impractical in many real-world scenarios. A typical alternative is learning from multiple noisy annotators. Numerous earlier works assume that all…

Machine Learning · Computer Science 2022-03-09 Shikun Li, Tongliang Liu, Jiyong Tan, Dan Zeng, Shiming Ge

Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection

The status quo approach to training object detectors requires expensive bounding box annotations. Our framework takes a markedly different direction: we transfer tracked object boxes from weakly-labeled videos to weakly-labeled images to…

Computer Vision and Pattern Recognition · Computer Science 2016-04-21 Krishna Kumar Singh, Fanyi Xiao, Yong Jae Lee

Point-Teaching: Weakly Semi-Supervised Object Detection with Point Annotations

Point annotations are considerably more time-efficient than bounding box annotations. However, how to use cheap point annotations to boost the performance of semi-supervised object detection remains largely unsolved. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Yongtao Ge, Qiang Zhou, Xinlong Wang, Zhibin Wang, Hao Li, Chunhua Shen

Semi-Automatic Annotation For Visual Object Tracking

We propose a semi-automatic bounding box annotation method for visual object tracking by utilizing temporal information with a tracking-by-detection approach. For detection, we use an off-the-shelf object detector which is trained…

Computer Vision and Pattern Recognition · Computer Science 2022-03-01 Kutalmis Gokalp Ince, Aybora Koksal, Arda Fazla, A. Aydin Alatan

Sparse3DTrack: Monocular 3D Object Tracking Using Sparse Supervision

Monocular 3D object tracking aims to estimate temporally consistent 3D object poses across video frames, enabling autonomous agents to reason about scene dynamics. However, existing state-of-the-art approaches are fully supervised and rely…

Robotics · Computer Science 2026-03-20 Nikhil Gosala, B. Ravi Kiran, Senthil Yogamani, Abhinav Valada

Self-taught learning of a deep invariant representation for visual tracking via temporal slowness principle

Visual representation is crucial for a visual tracking method's performance. Conventionally, visual representations adopted in visual tracking rely on hand-crafted computer vision descriptors. These descriptors were developed generically…

Computer Vision and Pattern Recognition · Computer Science 2016-04-15 Jason Kuen, Kian Ming Lim, Chin Poo Lee

A Simple Framework for Contrastive Learning of Visual Representations

This paper presents SimCLR: a simple framework for contrastive learning of visual representations. We simplify recently proposed contrastive self-supervised learning algorithms without requiring specialized architectures or a memory bank.…

Machine Learning · Computer Science 2020-07-02 Ting Chen, Simon Kornblith, Mohammad Norouzi, Geoffrey Hinton

Adaptive Soft Contrastive Learning

Self-supervised learning has recently achieved great success in representation learning without human annotations. The dominant method, contrastive learning, is generally based on instance discrimination tasks, i.e., individual…

Computer Vision and Pattern Recognition · Computer Science 2022-12-06 Chen Feng, Ioannis Patras

Noisy Annotation Refinement for Object Detection

Supervised training of object detectors requires well-annotated large-scale datasets, whose production is costly. Therefore, some efforts have been made to obtain annotations in economical ways, such as crowdsourcing. However, datasets…

Computer Vision and Pattern Recognition · Computer Science 2021-12-08 Jiafeng Mao, Qing Yu, Yoko Yamakata, Kiyoharu Aizawa

SFTrack++: A Fast Learnable Spectral Segmentation Approach for Space-Time Consistent Tracking

We propose an object tracking method, SFTrack++, that smoothly learns to preserve the tracked object consistency over space and time dimensions by taking a spectral clustering approach over the graph of pixels from the video, using a fast…

Computer Vision and Pattern Recognition · Computer Science 2021-11-05 Elena Burceanu