Related papers: Tree-Structured Reinforcement Learning for Sequent…

Deep Variation-structured Reinforcement Learning for Visual Relationship and Attribute Detection

Despite progress in visual perception tasks such as image classification and detection, computers still struggle to understand the interdependency of objects in the scene as a whole, e.g., relations between objects or their attributes.…

Computer Vision and Pattern Recognition · Computer Science 2017-03-10 Xiaodan Liang , Lisa Lee , Eric P. Xing

TreeRL: LLM Reinforcement Learning with On-Policy Tree Search

Reinforcement learning (RL) with tree search has demonstrated superior performance in traditional reasoning tasks. Compared to conventional independent chain sampling strategies with outcome supervision, tree search enables better…

Machine Learning · Computer Science 2025-06-16 Zhenyu Hou , Ziniu Hu , Yujiang Li , Rui Lu , Jie Tang , Yuxiao Dong

Object-Centric World Models Meet Monte Carlo Tree Search

In this paper, we introduce ObjectZero, a novel reinforcement learning (RL) algorithm that leverages the power of object-level representations to model dynamic environments more effectively. Unlike traditional approaches that process the…

Artificial Intelligence · Computer Science 2026-01-13 Rodion Vakhitov , Leonid Ugadiarov , Aleksandr Panov

Human-Allied Relational Reinforcement Learning

Reinforcement learning (RL) has experienced a second wind in the past decade. While incredibly successful in images and videos, these systems still operate within the realm of propositional tasks ignoring the inherent structure that exists…

Machine Learning · Computer Science 2025-10-21 Fateme Golivand Darvishvand , Hikaru Shindo , Sahil Sidheekh , Kristian Kersting , Sriraam Natarajan

ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning

Current image-based reinforcement learning (RL) algorithms typically operate on the whole image without performing object-level reasoning. This leads to inefficient goal sampling and ineffective reward functions. In this paper, we improve…

Machine Learning · Computer Science 2020-11-16 Yufei Wang , Gautham Narayan Narasimhan , Xingyu Lin , Brian Okorn , David Held

Tree Search for LLM Agent Reinforcement Learning

Recent advances in reinforcement learning (RL) have significantly enhanced the agentic capabilities of large language models (LLMs). In long-term and multi-turn agent tasks, existing approaches driven solely by outcome rewards often suffer…

Machine Learning · Computer Science 2026-03-19 Yuxiang Ji , Ziyu Ma , Yong Wang , Guanhua Chen , Xiangxiang Chu , Liaoni Wu

Interpretable Preference-based Reinforcement Learning with Tree-Structured Reward Functions

The potential of reinforcement learning (RL) to deliver aligned and performant agents is partially bottlenecked by the reward engineering problem. One alternative to heuristic trial-and-error is preference-based RL (PbRL), where a reward…

Machine Learning · Computer Science 2021-12-22 Tom Bewley , Freddy Lecue

Object-Category Aware Reinforcement Learning

Object-oriented reinforcement learning (OORL) is a promising way to improve the sample efficiency and generalization ability over standard RL. Recent works that try to solve OORL tasks without additional feature engineering mainly focus on…

Machine Learning · Computer Science 2022-10-17 Qi Yi , Rui Zhang , Shaohui Peng , Jiaming Guo , Xing Hu , Zidong Du , Xishan Zhang , Qi Guo , Yunji Chen

TSLM: Tree-Structured Language Modeling for Divergent Thinking

Language models generate reasoning sequentially, preventing them from decoupling irrelevant exploration paths during search. We introduce Tree-Structured Language Modeling (TSLM), which uses special tokens to encode branching structure,…

Computation and Language · Computer Science 2026-02-02 Doyoung Kim , Jaehyeok Doo , Minjoon Seo

Learning to Segment Object Candidates via Recursive Neural Networks

To avoid the exhaustive search over locations and scales, current state-of-the-art object detection systems usually involve a crucial component generating a batch of candidate object proposals from images. In this paper, we present a simple…

Computer Vision and Pattern Recognition · Computer Science 2018-07-31 Tianshui Chen , Liang Lin , Xian Wu , Nong Xiao , Xiaonan Luo

Structured Reinforcement Learning for Combinatorial Decision-Making

Reinforcement learning (RL) is increasingly applied to real-world problems involving complex and structured decisions, such as routing, scheduling, and assortment planning. These settings challenge standard RL algorithms, which struggle to…

Machine Learning · Computer Science 2025-10-29 Heiko Hoppe , Léo Baty , Louis Bouvier , Axel Parmentier , Maximilian Schiffer

Learning Hierarchical Features for Visual Object Tracking with Recursive Neural Networks

Recently, deep learning has achieved very promising results in visual object tracking. Deep neural networks in existing tracking methods require a lot of training data to learn a large number of parameters. However, training data is not…

Computer Vision and Pattern Recognition · Computer Science 2018-01-09 Li Wang , Ting Liu , Bing Wang , Xulei Yang , Gang Wang

PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning

Inspired by the impressive reasoning capabilities demonstrated by reinforcement learning approaches like DeepSeek-R1, recent emerging research has begun exploring the use of reinforcement learning (RL) to enhance vision-language models…

Computer Vision and Pattern Recognition · Computer Science 2025-06-19 Yizhen Zhang , Yang Ding , Shuoshuo Zhang , Xinchen Zhang , Haoling Li , Zhong-zhi Li , Peijie Wang , Jie Wu , Lei Ji , Yelong Shen , Yujiu Yang , Yeyun Gong

Robotic Search & Rescue via Online Multi-task Reinforcement Learning

Reinforcement learning (RL) is a general and well-known method that a robot can use to learn an optimal control policy to solve a particular task. We would like to build a versatile robot that can learn multiple tasks, but using RL for each…

Artificial Intelligence · Computer Science 2015-12-01 Lisa Lee

Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video

Temporally language grounding in untrimmed videos is a newly-raised task in video understanding. Most of the existing methods suffer from inferior efficiency, lacking interpretability, and deviating from the human perception mechanism.…

Computer Vision and Pattern Recognition · Computer Science 2020-01-22 Jie Wu , Guanbin Li , Si Liu , Liang Lin

A Reward-Free Viewpoint on Multi-Objective Reinforcement Learning

Many sequential decision-making tasks involve optimizing multiple conflicting objectives, requiring policies that adapt to different user preferences. In multi-objective reinforcement learning (MORL), one widely studied approach} addresses…

Machine Learning · Computer Science 2026-04-28 Ying-Tu Chen , Wei Hung , Bing-Shu Wu , Zhang-Wei Hong , Ping-Chun Hsieh

A Survey of Continual Reinforcement Learning

Reinforcement Learning (RL) is an important machine learning paradigm for solving sequential decision-making problems. Recent years have witnessed remarkable progress in this field due to the rapid development of deep neural networks.…

Machine Learning · Computer Science 2026-04-08 Chaofan Pan , Xin Yang , Yanhua Li , Wei Wei , Tianrui Li , Bo An , Jiye Liang

Entity-Centric Reinforcement Learning for Object Manipulation from Pixels

Manipulating objects is a hallmark of human intelligence, and an important task in domains such as robotics. In principle, Reinforcement Learning (RL) offers a general approach to learn object manipulation. In practice, however, domains…

Robotics · Computer Science 2024-04-02 Dan Haramati , Tal Daniel , Aviv Tamar

One-Step Distributional Reinforcement Learning

Reinforcement learning (RL) allows an agent interacting sequentially with an environment to maximize its long-term expected return. In the distributional RL (DistrRL) paradigm, the agent goes beyond the limit of the expected value, to…

Machine Learning · Computer Science 2023-05-01 Mastane Achab , Reda Alami , Yasser Abdelaziz Dahou Djilali , Kirill Fedyanin , Eric Moulines

An End-to-End Approach to Natural Language Object Retrieval via Context-Aware Deep Reinforcement Learning

We propose an end-to-end approach to the natural language object retrieval task, which localizes an object within an image according to a natural language description, i.e., referring expression. Previous works divide this problem into two…

Computer Vision and Pattern Recognition · Computer Science 2017-03-23 Fan Wu , Zhongwen Xu , Yi Yang