Related papers: Look-into-Object: Self-supervised Structure Modeli…

Deep Template-based Object Instance Detection

Much of the focus in the object detection literature has been on the problem of identifying the bounding box of a particular class of object in an image. Yet, in contexts such as robotics and augmented reality, it is often necessary to find…

Computer Vision and Pattern Recognition · Computer Science 2020-11-17 Jean-Philippe Mercier , Mathieu Garon , Philippe Giguère , Jean-François Lalonde

Look Around and Learn: Self-Training Object Detection by Exploration

When an object detector is deployed in a novel setting it often experiences a drop in performance. This paper studies how an embodied agent can automatically fine-tune a pre-existing object detector while exploring and acquiring images in a…

Computer Vision and Pattern Recognition · Computer Science 2024-07-31 Gianluca Scarpellini , Stefano Rosa , Pietro Morerio , Lorenzo Natale , Alessio Del Bue

Self-Supervision by Prediction for Object Discovery in Videos

Despite their irresistible success, deep learning algorithms still heavily rely on annotated data. On the other hand, unsupervised settings pose many challenges, especially about determining the right inductive bias in diverse scenarios.…

Computer Vision and Pattern Recognition · Computer Science 2021-03-11 Beril Besbinar , Pascal Frossard

Oh-A-DINO: Understanding and Enhancing Attribute-Level Information in Self-Supervised Object-Centric Representations

Object-centric understanding is fundamental to human vision and required for complex reasoning. Traditional methods define slot-based bottlenecks to learn object properties explicitly, while recent self-supervised vision models like DINO…

Computer Vision and Pattern Recognition · Computer Science 2025-10-03 Stefan Sylvius Wagner , Stefan Harmeling

Improving Open-World Object Localization by Discovering Background

Our work addresses the problem of learning to localize objects in an open-world setting, i.e., given the bounding box information of a limited number of object classes during training, the goal is to localize all objects, belonging to both…

Computer Vision and Pattern Recognition · Computer Science 2025-04-25 Ashish Singh , Michael J. Jones , Kuan-Chuan Peng , Anoop Cherian , Moitreya Chatterjee , Erik Learned-Miller

re-OBJ: Jointly Learning the Foreground and Background for Object Instance Re-identification

Conventional approaches to object instance re-identification rely on matching appearances of the target objects among a set of frames. However, learning appearances of the objects alone might fail when there are multiple objects with…

Computer Vision and Pattern Recognition · Computer Science 2019-09-24 Vaibhav Bansal , Stuart James , Alessio Del Bue

Object Topological Character Acquisition by Inductive Learning

Understanding the shape and structure of objects is undoubtedly extremely important for object recognition, but the most common pattern recognition method currently used is machine learning, which often requires a large number of training…

Computer Vision and Pattern Recognition · Computer Science 2023-06-21 Wei Hui , Liping Yu , Yiran Wei

Towards Accurate Localization by Instance Search

Visual object localization is the key step in a series of object detection tasks. In the literature, high localization accuracy is achieved with the mainstream strongly supervised frameworks. However, such methods require object-level…

Computer Vision and Pattern Recognition · Computer Science 2021-08-10 Yi-Geng Hong , Hui-Chu Xiao , Wan-Lei Zhao

Relation Networks for Object Detection

Although it is well believed for years that modeling relations between objects would help object recognition, there has not been evidence that the idea is working in the deep learning era. All state-of-the-art object detection systems still…

Computer Vision and Pattern Recognition · Computer Science 2018-06-15 Han Hu , Jiayuan Gu , Zheng Zhang , Jifeng Dai , Yichen Wei

A Survey of Modern Deep Learning based Object Detection Models

Object Detection is the task of classification and localization of objects in an image or video. It has gained prominence in recent years due to its widespread applications. This article surveys recent developments in deep learning based…

Computer Vision and Pattern Recognition · Computer Science 2021-05-13 Syed Sahil Abbas Zaidi , Mohammad Samar Ansari , Asra Aslam , Nadia Kanwal , Mamoona Asghar , Brian Lee

Learning Reinforced Attentional Representation for End-to-End Visual Tracking

Although numerous recent tracking approaches have made tremendous advances in the last decade, achieving high-performance visual tracking remains a challenge. In this paper, we propose an end-to-end network model to learn reinforced…

Computer Vision and Pattern Recognition · Computer Science 2020-01-03 Peng Gao , Qiquan Zhang , Fei Wang , Liyi Xiao , Hamido Fujita , Yan Zhang

Self-supervised structured object representation learning

Self-supervised learning (SSL) has emerged as a powerful technique for learning visual representations. While recent SSL approaches achieve strong results in global image understanding, they are limited in capturing the structured…

Computer Vision and Pattern Recognition · Computer Science 2025-08-28 Oussama Hadjerci , Antoine Letienne , Mohamed Abbas Hedjazi , Adel Hafiane

Towards Self-Supervised Learning of Global and Object-Centric Representations

Self-supervision allows learning meaningful representations of natural images, which usually contain one central object. How well does it transfer to multi-entity scenes? We discuss key aspects of learning structured object-centric…

Computer Vision and Pattern Recognition · Computer Science 2022-04-15 Federico Baldassarre , Hossein Azizpour

Self-taught Object Localization with Deep Networks

This paper introduces self-taught object localization, a novel approach that leverages deep convolutional networks trained for whole-image recognition to localize objects in images without additional human supervision, i.e., without using…

Computer Vision and Pattern Recognition · Computer Science 2016-02-03 Loris Bazzani , Alessandro Bergamo , Dragomir Anguelov , Lorenzo Torresani

Object Based Attention Through Internal Gating

Object-based attention is a key component of the visual system, relevant for perception, learning, and memory. Neurons tuned to features of attended objects tend to be more active than those associated with non-attended objects. There is a…

Neurons and Cognition · Quantitative Biology 2021-06-09 Jordan Lei , Ari S. Benjamin , Konrad P. Kording

Learning to Track with Object Permanence

Tracking by detection, the dominant approach for online multi-object tracking, alternates between localization and association steps. As a result, it strongly depends on the quality of instantaneous observations, often failing when objects…

Computer Vision and Pattern Recognition · Computer Science 2021-10-04 Pavel Tokmakov , Jie Li , Wolfram Burgard , Adrien Gaidon

Instance Localization for Self-supervised Detection Pretraining

Prior research on self-supervised learning has led to considerable progress on image classification, but often with degraded transfer performance on object detection. The objective of this paper is to advance self-supervised pretrained…

Computer Vision and Pattern Recognition · Computer Science 2021-04-07 Ceyuan Yang , Zhirong Wu , Bolei Zhou , Stephen Lin

ALSO: Automotive Lidar Self-supervision by Occupancy estimation

We propose a new self-supervised method for pre-training the backbone of deep perception models operating on point clouds. The core idea is to train the model on a pretext task which is the reconstruction of the surface on which the 3D…

Computer Vision and Pattern Recognition · Computer Science 2023-04-05 Alexandre Boulch , Corentin Sautier , Björn Michele , Gilles Puy , Renaud Marlet

Describe and Attend to Track: Learning Natural Language guided Structural Representation and Visual Attention for Object Tracking

The tracking-by-detection framework requires a set of positive and negative training samples to learn robust tracking models for precise localization of target objects. However, existing tracking models mostly treat different samples…

Computer Vision and Pattern Recognition · Computer Science 2018-11-28 Xiao Wang , Chenglong Li , Rui Yang , Tianzhu Zhang , Jin Tang , Bin Luo

Learning to Compose: Improving Object Centric Learning by Injecting Compositionality

Learning compositional representation is a key aspect of object-centric learning as it enables flexible systematic generalization and supports complex visual reasoning. However, most of the existing approaches rely on auto-encoding…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Whie Jung , Jaehoon Yoo , Sungjin Ahn , Seunghoon Hong