Related papers: Instance Localization for Self-supervised Detectio…

A Study on Self-Supervised Object Detection Pretraining

In this work, we study different approaches to self-supervised pretraining of object detection models. We first design a general framework to learn a spatially consistent dense representation from an image, by randomly sampling and…

Computer Vision and Pattern Recognition · Computer Science 2022-08-12 Trung Dang , Simon Kornblith , Huy Thong Nguyen , Peter Chin , Maryam Khademi

Aligning Pretraining for Detection via Object-Level Contrastive Learning

Image-level contrastive representation learning has proven to be highly effective as a generic model for transfer learning. Such generality for transfer learning, however, sacrifices specificity if we are interested in a certain downstream…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Fangyun Wei , Yue Gao , Zhirong Wu , Han Hu , Stephen Lin

CoDo: Contrastive Learning with Downstream Background Invariance for Detection

The prior self-supervised learning researches mainly select image-level instance discrimination as pretext task. It achieves a fantastic classification performance that is comparable to supervised learning methods. However, with degraded…

Computer Vision and Pattern Recognition · Computer Science 2022-05-11 Bing Zhao , Jun Li , Hong Zhu

Self-taught Object Localization with Deep Networks

This paper introduces self-taught object localization, a novel approach that leverages deep convolutional networks trained for whole-image recognition to localize objects in images without additional human supervision, i.e., without using…

Computer Vision and Pattern Recognition · Computer Science 2016-02-03 Loris Bazzani , Alessandro Bergamo , Dragomir Anguelov , Lorenzo Torresani

On the Importance of Visual Context for Data Augmentation in Scene Understanding

Performing data augmentation for learning deep neural networks is known to be important for training visual recognition systems. By artificially increasing the number of training examples, it helps reducing overfitting and improves…

Computer Vision and Pattern Recognition · Computer Science 2019-09-23 Nikita Dvornik , Julien Mairal , Cordelia Schmid

Learning to detect and localize many objects from few examples

The current trend in object detection and localization is to learn predictions with high capacity deep neural networks trained on a very large amount of annotated data and using a high amount of processing power. In this work, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2016-11-18 Bastien Moysset , Christoper Kermorvant , Christian Wolf

Distilling Localization for Self-Supervised Representation Learning

Recent progress in contrastive learning has revolutionized unsupervised representation learning. Concretely, multiple views (augmentations) from the same image are encouraged to map to the similar embeddings, while views from different…

Computer Vision and Pattern Recognition · Computer Science 2021-01-20 Nanxuan Zhao , Zhirong Wu , Rynson W. H. Lau , Stephen Lin

Weakly Supervised Object Localization with Multi-fold Multiple Instance Learning

Object category localization is a challenging problem in computer vision. Standard supervised training requires bounding box annotations of object instances. This time-consuming annotation process is sidestepped in weakly supervised…

Computer Vision and Pattern Recognition · Computer Science 2016-05-30 Ramazan Gokberk Cinbis , Jakob Verbeek , Cordelia Schmid

What makes instance discrimination good for transfer learning?

Contrastive visual pretraining based on the instance discrimination pretext task has made significant progress. Notably, recent work on unsupervised pretraining has shown to surpass the supervised counterpart for finetuning downstream…

Computer Vision and Pattern Recognition · Computer Science 2021-01-20 Nanxuan Zhao , Zhirong Wu , Rynson W. H. Lau , Stephen Lin

Towards Accurate Localization by Instance Search

Visual object localization is the key step in a series of object detection tasks. In the literature, high localization accuracy is achieved with the mainstream strongly supervised frameworks. However, such methods require object-level…

Computer Vision and Pattern Recognition · Computer Science 2021-08-10 Yi-Geng Hong , Hui-Chu Xiao , Wan-Lei Zhao

Efficient Visual Pretraining with Contrastive Detection

Self-supervised pretraining has been shown to yield powerful representations for transfer learning. These performance gains come at a large computational cost however, with state-of-the-art methods requiring an order of magnitude more…

Computer Vision and Pattern Recognition · Computer Science 2021-08-06 Olivier J. Hénaff , Skanda Koppula , Jean-Baptiste Alayrac , Aaron van den Oord , Oriol Vinyals , João Carreira

BOOTPLACE: Bootstrapped Object Placement with Detection Transformers

In this paper, we tackle the copy-paste image-to-image composition problem with a focus on object placement learning. Prior methods have leveraged generative models to reduce the reliance for dense supervision. However, this often limits…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 Hang Zhou , Xinxin Zuo , Rui Ma , Li Cheng

Improving Open-World Object Localization by Discovering Background

Our work addresses the problem of learning to localize objects in an open-world setting, i.e., given the bounding box information of a limited number of object classes during training, the goal is to localize all objects, belonging to both…

Computer Vision and Pattern Recognition · Computer Science 2025-04-25 Ashish Singh , Michael J. Jones , Kuan-Chuan Peng , Anoop Cherian , Moitreya Chatterjee , Erik Learned-Miller

MOST: Multiple Object localization with Self-supervised Transformers for object discovery

We tackle the challenging task of unsupervised object localization in this work. Recently, transformers trained with self-supervised learning have been shown to exhibit object localization properties without being trained for this task. In…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Sai Saketh Rambhatla , Ishan Misra , Rama Chellappa , Abhinav Shrivastava

Learning to Perceive "Where": Spatial Pretext Tasks for Robust Self-Supervised Learning

Existing self-supervised learning (SSL) methods primarily learn object-invariant representations but often neglect the spatial structure and relationships among object parts. To address this limitation, we introduce Spatial Prediction (SP),…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Yang Shen , Yusen Cai , Weronika Hryniewska-Guzik , Qing Lin , Mengmi Zhang

Self-supervised Learning of 3D Object Understanding by Data Association and Landmark Estimation for Image Sequence

In this paper, we propose a self-supervised learningmethod for multi-object pose estimation. 3D object under-standing from 2D image is a challenging task that infers ad-ditional dimension from reduced-dimensional information.In particular,…

Computer Vision and Pattern Recognition · Computer Science 2021-04-16 Hyeonwoo Yu , Jean Oh

Progressive Representation Adaptation for Weakly Supervised Object Localization

We address the problem of weakly supervised object localization where only image-level annotations are available for training object detectors. Numerous methods have been proposed to tackle this problem through mining object proposals.…

Computer Vision and Pattern Recognition · Computer Science 2017-10-13 Dong Li , Jia-Bin Huang , Yali Li , Shengjin Wang , Ming-Hsuan Yang

Boundary Distribution Estimation for Precise Object Detection

In the field of state-of-the-art object detection, the task of object localization is typically accomplished through a dedicated subnet that emphasizes bounding box regression. This subnet traditionally predicts the object's position by…

Computer Vision and Pattern Recognition · Computer Science 2023-07-20 Peng Zhi , Haoran Zhou , Hang Huang , Rui Zhao , Rui Zhou , Qingguo Zhou

Improving Localization for Semi-Supervised Object Detection

Nowadays, Semi-Supervised Object Detection (SSOD) is a hot topic, since, while it is rather easy to collect images for creating a new dataset, labeling them is still an expensive and time-consuming task. One of the successful methods to…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Leonardo Rossi , Akbar Karimi , Andrea Prati

Aligned Unsupervised Pretraining of Object Detectors with Self-training

The unsupervised pretraining of object detectors has recently become a key component of object detector training, as it leads to improved performance and faster convergence during the supervised fine-tuning stage. Existing unsupervised…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Ioannis Maniadis Metaxas , Adrian Bulat , Ioannis Patras , Brais Martinez , Georgios Tzimiropoulos