Related papers: Improving Object Detection with Selective Self-sup…

Exploiting Web Images for Weakly Supervised Object Detection

In recent years, the performance of object detection has advanced significantly with the evolving deep convolutional neural networks. However, the state-of-the-art object detection methods still rely on accurate bounding box annotations…

Computer Vision and Pattern Recognition · Computer Science 2017-07-31 Qingyi Tao, Hao Yang, Jianfei Cai

A Study on Self-Supervised Object Detection Pretraining

In this work, we study different approaches to self-supervised pretraining of object detection models. We first design a general framework to learn a spatially consistent dense representation from an image, by randomly sampling and…

Computer Vision and Pattern Recognition · Computer Science 2022-08-12 Trung Dang, Simon Kornblith, Huy Thong Nguyen, Peter Chin, Maryam Khademi

Data Augmentation for Object Detection via Differentiable Neural Rendering

It is challenging to train a robust object detector under the supervised learning setting when the annotated data are scarce. Thus, previous approaches tackling this problem are in two categories: semi-supervised learning models that…

Computer Vision and Pattern Recognition · Computer Science 2021-04-06 Guanghan Ning, Guang Chen, Chaowei Tan, Si Luo, Liefeng Bo, Heng Huang

Unsupervised learning based object detection using Contrastive Learning

Training image-based object detectors presents formidable challenges, as it entails not only the complexities of object detection but also the added intricacies of precisely localizing objects within potentially diverse and noisy…

Computer Vision and Pattern Recognition · Computer Science 2024-02-22 Chandan Kumar, Jansel Herrera-Gerena, John Just, Matthew Darr, Ali Jannesari

Self-Supervised Viewpoint Learning From Image Collections

Training deep neural networks to estimate the viewpoint of objects requires large labeled training datasets. However, manually labeling viewpoints is notoriously hard, error-prone, and time-consuming. On the other hand, it is relatively…

Computer Vision and Pattern Recognition · Computer Science 2020-04-07 Siva Karthik Mustikovela, Varun Jampani, Shalini De Mello, Sifei Liu, Umar Iqbal, Carsten Rother, Jan Kautz

A Self-supervised Learning System for Object Detection using Physics Simulation and Multi-view Pose Estimation

Progress has been achieved recently in object detection given advancements in deep learning. Nevertheless, such tools typically require a large amount of training data and significant manual effort to label objects. This limits their…

Robotics · Computer Science 2017-08-04 Chaitanya Mitash, Kostas E. Bekris, Abdeslam Boularias

Unsupervised learning from video to detect foreground objects in single images

Unsupervised learning from visual data is one of the most difficult challenges in computer vision, being a fundamental task for understanding how visual recognition works. From a practical point of view, learning from unsupervised visual…

Computer Vision and Pattern Recognition · Computer Science 2017-04-03 Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu

Learning from Web Data: the Benefit of Unsupervised Object Localization

Annotating a large number of training images is very time-consuming. In this background, this paper focuses on learning from easy-to-acquire web data and utilizes the learned model for fine-grained image classification in labeled datasets.…

Computer Vision and Pattern Recognition · Computer Science 2018-12-24 Xiaoxiao Sun, Liang Zheng, Yu-Kun Lai, Jufeng Yang

Learning to discover and localize visual objects with open vocabulary

To alleviate the cost of obtaining accurate bounding boxes for training today's state-of-the-art object detection models, recent weakly supervised detection work has proposed techniques to learn from image-level labels. However, requiring…

Computer Vision and Pattern Recognition · Computer Science 2018-11-27 Keren Ye, Mingda Zhang, Wei Li, Danfeng Qin, Adriana Kovashka, Jesse Berent

Shelf-Supervised Cross-Modal Pre-Training for 3D Object Detection

State-of-the-art 3D object detectors are often trained on massive labeled datasets. However, annotating 3D bounding boxes remains prohibitively expensive and time-consuming, particularly for LiDAR. Instead, recent works demonstrate that…

Computer Vision and Pattern Recognition · Computer Science 2024-10-16 Mehar Khurana, Neehar Peri, James Hays, Deva Ramanan

Weakly Supervised Object Segmentation by Background Conditional Divergence

As a computer vision task, automatic object segmentation remains challenging in specialized image domains without massive labeled data, such as synthetic aperture sonar images, remote sensing, biomedical imaging, etc. In any domain,…

Computer Vision and Pattern Recognition · Computer Science 2025-11-05 Hassan Baker, Matthew S. Emigh, Austin J. Brockmeier

Self-Supervised Learning Across Domains

Human adaptability relies crucially on learning and merging knowledge from both supervised and unsupervised tasks: the parents point out few important concepts, but then the children fill in the gaps on their own. This is particularly…

Computer Vision and Pattern Recognition · Computer Science 2021-04-01 Silvia Bucci, Antonio D'Innocente, Yujun Liao, Fabio Maria Carlucci, Barbara Caputo, Tatiana Tommasi

A Self-Supervised Approach for Enhanced Feature Representations in Object Detection Tasks

In the fast-evolving field of artificial intelligence, where models are increasingly growing in complexity and size, the availability of labeled data for training deep learning models has become a significant challenge. Addressing complex…

Computer Vision and Pattern Recognition · Computer Science 2026-02-19 Santiago C. Vilabella, Pablo Pérez-Núñez, Beatriz Remeseiro

Self-Supervised Learning of Object Segmentation from Unlabeled RGB-D Videos

This work proposes a self-supervised learning system for segmenting rigid objects in RGB images. The proposed pipeline is trained on unlabeled RGB-D videos of static objects, which can be captured with a camera carried by a mobile robot. A…

Computer Vision and Pattern Recognition · Computer Science 2023-04-11 Shiyang Lu, Yunfu Deng, Abdeslam Boularias, Kostas Bekris

Self-taught Object Localization with Deep Networks

This paper introduces self-taught object localization, a novel approach that leverages deep convolutional networks trained for whole-image recognition to localize objects in images without additional human supervision, i.e., without using…

Computer Vision and Pattern Recognition · Computer Science 2016-02-03 Loris Bazzani, Alessandro Bergamo, Dragomir Anguelov, Lorenzo Torresani

Unsupervised Image Matching and Object Discovery as Optimization

Learning with complete or partial supervision is powerful but relies on ever-growing human annotation efforts. As a way to mitigate this serious problem, as well as to serve specific applications, unsupervised learning has emerged as an…

Computer Vision and Pattern Recognition · Computer Science 2019-04-08 Huy V. Vo, Francis Bach, Minsu Cho, Kai Han, Yann LeCun, Patrick Perez, Jean Ponce

Training Object Detectors from Few Weakly-Labeled and Many Unlabeled Images

Weakly-supervised object detection attempts to limit the amount of supervision by dispensing the need for bounding boxes, but still assumes image-level labels on the entire training set. In this work, we study the problem of training an…

Computer Vision and Pattern Recognition · Computer Science 2021-07-22 Zhaohui Yang, Miaojing Shi, Chao Xu, Vittorio Ferrari, Yannis Avrithis

Saliency Guided Contrastive Learning on Scene Images

Self-supervised learning holds promise in leveraging large numbers of unlabeled data. However, its success heavily relies on the highly-curated dataset, e.g., ImageNet, which still needs human cleaning. Directly learning representations…

Computer Vision and Pattern Recognition · Computer Science 2023-02-24 Meilin Chen, Yizhou Wang, Shixiang Tang, Feng Zhu, Haiyang Yang, Lei Bai, Rui Zhao, Donglian Qi, Wanli Ouyang

Self-Supervised Contrastive Learning for Multi-Label Images

Self-supervised learning (SSL) has demonstrated its effectiveness in learning representations through comparison methods that align with human intuition. However, mainstream SSL methods heavily rely on high body datasets with single label,…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Jiale Chen

A Survey on Self-supervised Contrastive Learning for Multimodal Text-Image Analysis

Self-supervised learning is a machine learning approach that generates implicit labels by learning underlined patterns and extracting discriminative features from unlabeled data without manual labelling. Contrastive learning introduces the…

Computer Vision and Pattern Recognition · Computer Science 2025-10-13 Asifullah Khan, Laiba Asmatullah, Anza Malik, Shahzaib Khan, Hamna Asif