Related papers: Flexible Option Learning

Autonomous Option Invention for Continual Hierarchical Reinforcement Learning and Planning

Abstraction is key to scaling up reinforcement learning (RL). However, autonomously learning abstract state and action representations to enable transfer and generalization remains a challenging open problem. This paper presents a novel…

Artificial Intelligence · Computer Science 2024-12-24 Rashmeet Kaur Nayyar , Siddharth Srivastava

Temporal Abstraction in Reinforcement Learning with Offline Data

Standard reinforcement learning algorithms with a single policy perform poorly on tasks in complex environments involving sparse rewards, diverse behaviors, or long-term planning. This led to the study of algorithms that incorporate…

Machine Learning · Computer Science 2024-07-23 Ranga Shaarad Ayyagari , Anurita Ghosh , Ambedkar Dukkipati

Diversity-Enriched Option-Critic

Temporal abstraction allows reinforcement learning agents to represent knowledge and develop strategies over different temporal scales. The option-critic framework has been demonstrated to learn temporally extended actions, represented as…

Machine Learning · Computer Science 2025-11-21 Anand Kamat , Doina Precup

LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework

In this paper, a unified framework for exploration in reinforcement learning (RL) is proposed based on an option-critic model. The proposed framework learns to integrate a set of diverse exploration strategies so that the agent can…

Machine Learning · Computer Science 2024-09-10 Woojun Kim , Jeonghye Kim , Youngchul Sung

Attention Option-Critic

Temporal abstraction in reinforcement learning is the ability of an agent to learn and use high-level behaviors, called options. The option-critic architecture provides a gradient-based end-to-end learning method to construct options. We…

Machine Learning · Computer Science 2022-01-11 Raviteja Chunduru , Doina Precup

Learning Abstract Options

Building systems that autonomously create temporal abstractions from data is a key challenge in scaling learning and planning in reinforcement learning. One popular approach for addressing this challenge is the options framework (Sutton et…

Machine Learning · Computer Science 2020-01-01 Matthew Riemer , Miao Liu , Gerald Tesauro

The Option-Critic Architecture

Temporal abstraction is key to scaling up learning and planning in reinforcement learning. While planning with temporally extended actions is well understood, creating such abstractions autonomously from data has remained challenging. We…

Artificial Intelligence · Computer Science 2016-12-06 Pierre-Luc Bacon , Jean Harb , Doina Precup

Hierarchical Reinforcement Learning with Hindsight

Reinforcement Learning (RL) algorithms can suffer from poor sample efficiency when rewards are delayed and sparse. We introduce a solution that enables agents to learn temporally extended actions at multiple levels of abstraction in a…

Machine Learning · Computer Science 2019-03-11 Andrew Levy , Robert Platt , Kate Saenko

Learning to Interrupt: A Hierarchical Deep Reinforcement Learning Framework for Efficient Exploration

To achieve scenario intelligence, humans must transfer knowledge to robots by developing goal-oriented algorithms, which are sometimes insensitive to dynamically changing environments. While deep reinforcement learning achieves significant…

Artificial Intelligence · Computer Science 2018-07-31 Tingguang Li , Jin Pan , Delong Zhu , Max Q. -H. Meng

Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation

Modern listwise recommendation systems need to consider both long-term user perceptions and short-term interest shifts. Reinforcement learning can be applied on recommendation to study such a problem but is also subject to large search…

Information Retrieval · Computer Science 2025-07-22 Luo Ji , Gao Liu , Mingyang Yin , Hongxia Yang , Jingren Zhou

A Hierarchical Reinforcement Learning Method for Persistent Time-Sensitive Tasks

Reinforcement learning has been applied to many interesting problems such as the famous TD-gammon and the inverted helicopter flight. However, little effort has been put into developing methods to learn policies for complex persistent tasks…

Artificial Intelligence · Computer Science 2016-06-22 Xiao Li , Calin Belta

Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization

Reinforcement learning (RL) is a powerful machine learning technique that enables an intelligent agent to learn an optimal policy that maximizes the cumulative rewards in sequential decision making. Most of methods in the existing…

Machine Learning · Statistics 2023-01-06 Chengchun Shi , Zhengling Qi , Jianing Wang , Fan Zhou

Can Learned Optimization Make Reinforcement Learning Less Difficult?

While reinforcement learning (RL) holds great potential for decision making in the real world, it suffers from a number of unique difficulties which often need specific consideration. In particular: it is highly non-stationary; suffers from…

Machine Learning · Computer Science 2025-04-16 Alexander David Goldie , Chris Lu , Matthew Thomas Jackson , Shimon Whiteson , Jakob Nicolaus Foerster

Learning Reusable Options for Multi-Task Reinforcement Learning

Reinforcement learning (RL) has become an increasingly active area of research in recent years. Although there are many algorithms that allow an agent to solve tasks efficiently, they often ignore the possibility that prior experience…

Artificial Intelligence · Computer Science 2020-01-07 Francisco M. Garcia , Chris Nota , Philip S. Thomas

Temporal Abstraction in Reinforcement Learning with the Successor Representation

Reasoning at multiple levels of temporal abstraction is one of the key attributes of intelligence. In reinforcement learning, this is often modeled through temporally extended courses of actions called options. Options allow agents to make…

Machine Learning · Computer Science 2023-04-13 Marlos C. Machado , Andre Barreto , Doina Precup , Michael Bowling

Intrinsically motivated option learning: a comparative study of recent methods

Options represent a framework for reasoning across multiple time scales in reinforcement learning (RL). With the recent active interest in the unsupervised learning paradigm in the RL research community, the option framework was adapted to…

Artificial Intelligence · Computer Science 2022-06-14 Djordje Božić , Predrag Tadić , Mladen Nikolić

Reinforcement Learning with Lookahead Information

We study reinforcement learning (RL) problems in which agents observe the reward or transition realizations at their current state before deciding which action to take. Such observations are available in many applications, including…

Machine Learning · Computer Science 2024-10-22 Nadav Merlis

Adaptive Reinforcement Learning for Dynamic Configuration Allocation in Pre-Production Testing

Ensuring reliability in modern software systems requires rigorous pre-production testing across highly heterogeneous and evolving environments. Because exhaustive evaluation is infeasible, practitioners must decide how to allocate limited…

Software Engineering · Computer Science 2025-10-08 Yu Zhu

Reusable Options through Gradient-based Meta Learning

Hierarchical methods in reinforcement learning have the potential to reduce the amount of decisions that the agent needs to perform when learning new tasks. However, finding reusable useful temporal abstractions that facilitate fast…

Machine Learning · Computer Science 2023-04-05 David Kuric , Herke van Hoof

Hierarchical Reinforcement Learning with Uncertainty-Guided Diffusional Subgoals

Hierarchical reinforcement learning (HRL) learns to make decisions on multiple levels of temporal abstraction. A key challenge in HRL is that the low-level policy changes over time, making it difficult for the high-level policy to generate…

Machine Learning · Computer Science 2025-05-29 Vivienne Huiling Wang , Tinghuai Wang , Joni Pajarinen