Related papers: Object-Category Aware Reinforcement Learning

Beyond Object Recognition: A New Benchmark towards Object Concept Learning

Understanding objects is a central building block of artificial intelligence, especially for embodied AI. Even though object recognition excels with deep learning, current machines still struggle to learn higher-level knowledge, e.g., what…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Yong-Lu Li , Yue Xu , Xinyu Xu , Xiaohan Mao , Yuan Yao , Siqi Liu , Cewu Lu

ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning

Current image-based reinforcement learning (RL) algorithms typically operate on the whole image without performing object-level reasoning. This leads to inefficient goal sampling and ineffective reward functions. In this paper, we improve…

Machine Learning · Computer Science 2020-11-16 Yufei Wang , Gautham Narayan Narasimhan , Xingyu Lin , Brian Okorn , David Held

Language-Mediated, Object-Centric Representation Learning

We present Language-mediated, Object-centric Representation Learning (LORL), a paradigm for learning disentangled, object-centric scene representations from vision and language. LORL builds upon recent advances in unsupervised object…

Machine Learning · Computer Science 2021-06-09 Ruocheng Wang , Jiayuan Mao , Samuel J. Gershman , Jiajun Wu

ObjectRL: An Object-Oriented Reinforcement Learning Codebase

ObjectRL is an open-source Python codebase for deep reinforcement learning (RL), designed for research-oriented prototyping with minimal programming effort. Unlike existing codebases, ObjectRL is built on Object-Oriented Programming (OOP)…

Machine Learning · Computer Science 2025-07-08 Gulcin Baykal , Abdullah Akgül , Manuel Haussmann , Bahareh Tasdighi , Nicklas Werge , Yi-Shan Wu , Melih Kandemir

An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning

Unsupervised object-centric representation (OCR) learning has recently drawn attention as a new paradigm of visual representation. This is because of its potential of being an effective pre-training technique for various downstream tasks in…

Machine Learning · Computer Science 2024-02-27 Jaesik Yoon , Yi-Fu Wu , Heechul Bae , Sungjin Ahn

Are We Done with Object-Centric Learning?

Object-centric learning (OCL) seeks to learn representations that only encode an object, isolated from other objects or background cues in a scene. This approach underpins various aims, including out-of-distribution (OOD) generalization,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-14 Alexander Rubinstein , Ameya Prabhu , Matthias Bethge , Seong Joon Oh

Reinforcement Learning for Improving Object Detection

The performance of a trained object detection neural network depends a lot on the image quality. Generally, images are pre-processed before feeding them into the neural network and domain knowledge about the image dataset is used to choose…

Computer Vision and Pattern Recognition · Computer Science 2020-08-19 Siddharth Nayak , Balaraman Ravindran

Unsupervised Object-Level Representation Learning from Scene Images

Contrastive self-supervised learning has largely narrowed the gap to supervised pre-training on ImageNet. However, its success highly relies on the object-centric priors of ImageNet, i.e., different augmented views of the same image…

Computer Vision and Pattern Recognition · Computer Science 2021-12-06 Jiahao Xie , Xiaohang Zhan , Ziwei Liu , Yew Soon Ong , Chen Change Loy

From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects

Traditional object detection methods operate under the closed-set assumption, where models can only detect a fixed number of objects predefined in the training set. Recent works on open vocabulary object detection (OVD) enable the detection…

Computer Vision and Pattern Recognition · Computer Science 2026-02-27 Zizhao Li , Zhengkang Xiang , Joseph West , Kourosh Khoshelham

Object-Centric World Models from Few-Shot Annotations for Sample-Efficient Reinforcement Learning

While deep reinforcement learning (RL) from pixels has achieved remarkable success, its sample inefficiency remains a critical limitation for real-world applications. Model-based RL (MBRL) addresses this by learning a world model to…

Machine Learning · Computer Science 2026-02-26 Weipu Zhang , Adam Jelley , Trevor McInroe , Amos Storkey , Gang Wang

Shepherding Slots to Objects: Towards Stable and Robust Object-Centric Learning

Object-centric learning (OCL) aspires general and compositional understanding of scenes by representing a scene as a collection of object-centric representations. OCL has also been extended to multi-view image and video datasets to apply…

Computer Vision and Pattern Recognition · Computer Science 2023-04-03 Jinwoo Kim , Janghyuk Choi , Ho-Jin Choi , Seon Joo Kim

Tree-Structured Reinforcement Learning for Sequential Object Localization

Existing object proposal algorithms usually search for possible object regions over multiple locations and scales separately, which ignore the interdependency among different objects and deviate from the human perception procedure. To…

Computer Vision and Pattern Recognition · Computer Science 2017-03-09 Zequn Jie , Xiaodan Liang , Jiashi Feng , Xiaojie Jin , Wen Feng Lu , Shuicheng Yan

Open-World Object Detection via Discriminative Class Prototype Learning

Open-world object detection (OWOD) is a challenging problem that combines object detection with incremental learning and open-set learning. Compared to standard object detection, the OWOD setting is task to: 1) detect objects seen during…

Computer Vision and Pattern Recognition · Computer Science 2023-02-24 Jinan Yu , Liyan Ma , Zhenglin Li , Yan Peng , Shaorong Xie

Reasoner for Real-World Event Detection: Scaling Reinforcement Learning via Adaptive Perplexity-Aware Sampling Strategy

Detecting abnormal events in real-world customer service dialogues is highly challenging due to the complexity of business data and the dynamic nature of customer interactions. Moreover, models must demonstrate strong out-of-domain (OOD)…

Machine Learning · Computer Science 2025-07-03 Xiaoyun Zhang , Jingqing Ruan , Xing Ma , Yawen Zhu , Jiansong Chen , Ke Zeng , Xunliang Cai

Aligning Data Selection with Performance: Performance-driven Reinforcement Learning for Active Learning in Object Detection

Active learning strategies aim to train high-performance models with minimal labeled data by selecting the most informative instances for labeling. However, existing methods for assessing data informativeness often fail to align directly…

Computer Vision and Pattern Recognition · Computer Science 2025-05-07 Zhixuan Liang , Xingyu Zeng , Rui Zhao , Ping Luo

Ranking-aware Reinforcement Learning for Ordinal Ranking

Ordinal regression and ranking are challenging due to inherent ordinal dependencies that conventional methods struggle to model. We propose Ranking-Aware Reinforcement Learning (RARL), a novel RL framework that explicitly learns these…

Machine Learning · Computer Science 2026-01-29 Aiming Hao , Chen Zhu , Jiashu Zhu , Jiahong Wu , Xiangxiang Chu

Provably Improved Context-Based Offline Meta-RL with Attention and Contrastive Learning

Meta-learning for offline reinforcement learning (OMRL) is an understudied problem with tremendous potential impact by enabling RL algorithms in many real-world applications. A popular solution to the problem is to infer task identity as…

Machine Learning · Computer Science 2021-10-18 Lanqing Li , Yuanhao Huang , Mingzhe Chen , Siteng Luo , Dijun Luo , Junzhou Huang

Semantic Representation and Dependency Learning for Multi-Label Image Recognition

Recently many multi-label image recognition (MLR) works have made significant progress by introducing pre-trained object detection models to generate lots of proposals or utilizing statistical label co-occurrence enhance the correlation…

Computer Vision and Pattern Recognition · Computer Science 2023-01-10 Tao Pu , Mingzhan Sun , Hefeng Wu , Tianshui Chen , Ling Tian , Liang Lin

Performance-guided Reinforced Active Learning for Object Detection

Active learning (AL) strategies aim to train high-performance models with minimal labeling efforts, only selecting the most informative instances for annotation. Current approaches to evaluating data informativeness predominantly focus on…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Zhixuan Liang , Xingyu Zeng , Rui Zhao , Ping Luo

One-Shot Open Affordance Learning with Foundation Models

We introduce One-shot Open Affordance Learning (OOAL), where a model is trained with just one example per base object category, but is expected to identify novel objects and affordances. While vision-language models excel at recognizing…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Gen Li , Deqing Sun , Laura Sevilla-Lara , Varun Jampani