Related papers: The Eigenoption-Critic Framework

Deep Laplacian-based Options for Temporally-Extended Exploration

Selecting exploratory actions that generate a rich stream of experience for better learning is a fundamental challenge in reinforcement learning (RL). An approach to tackle this problem consists in selecting actions according to specific…

Machine Learning · Computer Science 2023-06-12 Martin Klissarov , Marlos C. Machado

SOAC: The Soft Option Actor-Critic Architecture

The option framework has shown great promise by automatically extracting temporally-extended sub-tasks from a long-horizon task. Methods have been proposed for concurrently learning low-level intra-option policies and high-level option…

Artificial Intelligence · Computer Science 2020-06-26 Chenghao Li , Xiaoteng Ma , Chongjie Zhang , Jun Yang , Li Xia , Qianchuan Zhao

IPO: Interior-point Policy Optimization under Constraints

In this paper, we study reinforcement learning (RL) algorithms to solve real-world decision problems with the objective of maximizing the long-term reward as well as satisfying cumulative constraints. We propose a novel first-order policy…

Machine Learning · Computer Science 2019-10-23 Yongshuai Liu , Jiaxin Ding , Xin Liu

A Study of Value-Aware Eigenoptions

Options, which impose an inductive bias toward temporal and hierarchical structure, offer a powerful framework for reinforcement learning (RL). While effective in sequential decision-making, they are often handcrafted rather than learned.…

Machine Learning · Computer Science 2025-07-15 Harshil Kotamreddy , Marlos C. Machado

Context-Specific Representation Abstraction for Deep Option Learning

Hierarchical reinforcement learning has focused on discovering temporally extended actions, such as options, that can provide benefits in problems requiring extensive exploration. One promising approach that learns these options end-to-end…

Machine Learning · Computer Science 2022-04-26 Marwa Abdulhai , Dong-Ki Kim , Matthew Riemer , Miao Liu , Gerald Tesauro , Jonathan P. How

Learning Abstract Options

Building systems that autonomously create temporal abstractions from data is a key challenge in scaling learning and planning in reinforcement learning. One popular approach for addressing this challenge is the options framework (Sutton et…

Machine Learning · Computer Science 2020-01-01 Matthew Riemer , Miao Liu , Gerald Tesauro

An Efficient Continuous Control Perspective for Reinforcement-Learning-based Sequential Recommendation

Sequential recommendation, where user preference is dynamically inferred from sequential historical behaviors, is a critical task in recommender systems (RSs). To further optimize long-term user engagement, offline…

Machine Learning · Computer Science 2024-08-16 Jun Wang , Likang Wu , Qi Liu , Yu Yang

Diversity-Enriched Option-Critic

Temporal abstraction allows reinforcement learning agents to represent knowledge and develop strategies over different temporal scales. The option-critic framework has been demonstrated to learn temporally extended actions, represented as…

Machine Learning · Computer Science 2025-11-21 Anand Kamat , Doina Precup

Hierarchical Average Reward Policy Gradient Algorithms

Option-critic learning is a general-purpose reinforcement learning (RL) framework that aims to address the issue of long term credit assignment by leveraging temporal abstractions. However, when dealing with extended timescales, discounting…

Machine Learning · Computer Science 2019-11-21 Akshay Dharmavaram , Matthew Riemer , Shalabh Bhatnagar

Hierarchical Reinforcement Learning By Discovering Intrinsic Options

We propose a hierarchical reinforcement learning method, HIDIO, that can learn task-agnostic options in a self-supervised manner while jointly learning to utilize them to solve sparse-reward tasks. Unlike current hierarchical RL approaches…

Machine Learning · Computer Science 2022-08-10 Jesse Zhang , Haonan Yu , Wei Xu

Can Learned Optimization Make Reinforcement Learning Less Difficult?

While reinforcement learning (RL) holds great potential for decision making in the real world, it suffers from a number of unique difficulties which often need specific consideration. In particular: it is highly non-stationary; suffers from…

Machine Learning · Computer Science 2025-04-16 Alexander David Goldie , Chris Lu , Matthew Thomas Jackson , Shimon Whiteson , Jakob Nicolaus Foerster

The Option-Critic Architecture

Temporal abstraction is key to scaling up learning and planning in reinforcement learning. While planning with temporally extended actions is well understood, creating such abstractions autonomously from data has remained challenging. We…

Artificial Intelligence · Computer Science 2016-12-06 Pierre-Luc Bacon , Jean Harb , Doina Precup

Eigenoption Discovery through the Deep Successor Representation

Options in reinforcement learning allow agents to hierarchically decompose a task into subtasks, having the potential to speed up learning and planning. However, autonomously learning effective sets of options is still a major challenge in…

Machine Learning · Computer Science 2018-02-27 Marlos C. Machado , Clemens Rosenbaum , Xiaoxiao Guo , Miao Liu , Gerald Tesauro , Murray Campbell

A Provably Efficient Option-Based Algorithm for both High-Level and Low-Level Learning

Hierarchical Reinforcement Learning (HRL) approaches have shown successful results in solving a large variety of complex, structured, long-horizon problems. Nevertheless, a full theoretical understanding of this empirical evidence is…

Machine Learning · Computer Science 2025-02-05 Gianluca Drappo , Alberto Maria Metelli , Marcello Restelli

Autonomous Option Invention for Continual Hierarchical Reinforcement Learning and Planning

Abstraction is key to scaling up reinforcement learning (RL). However, autonomously learning abstract state and action representations to enable transfer and generalization remains a challenging open problem. This paper presents a novel…

Artificial Intelligence · Computer Science 2024-12-24 Rashmeet Kaur Nayyar , Siddharth Srivastava

Matching options to tasks using Option-Indexed Hierarchical Reinforcement Learning

The options framework in Hierarchical Reinforcement Learning breaks down overall goals into a combination of options or simpler tasks and associated policies, allowing for abstraction in the action space. Ideally, these options can be…

Machine Learning · Computer Science 2022-06-14 Kushal Chauhan , Soumya Chatterjee , Akash Reddy , Balaraman Ravindran , Pradeep Shenoy

Evolutionary Policy Optimization

On-policy reinforcement learning (RL) algorithms are widely used for their strong asymptotic performance and training stability, but they struggle to scale with larger batch sizes, as additional parallel environments yield redundant data…

Machine Learning · Computer Science 2025-11-13 Jianren Wang , Yifan Su , Abhinav Gupta , Deepak Pathak

Option Encoder: A Framework for Discovering a Policy Basis in Reinforcement Learning

Option discovery and skill acquisition frameworks are integral to the functioning of a Hierarchically organized Reinforcement learning agent. However, such techniques often yield a large number of options or skills, which can potentially be…

Machine Learning · Computer Science 2020-07-06 Arjun Manoharan , Rahul Ramesh , Balaraman Ravindran

e-COP : Episodic Constrained Optimization of Policies

In this paper, we present the $\texttt{e-COP}$ algorithm, the first policy optimization algorithm for constrained Reinforcement Learning (RL) in episodic (finite horizon) settings. Such formulations are applicable when there are separate…

Machine Learning · Computer Science 2024-12-19 Akhil Agnihotri , Rahul Jain , Deepak Ramachandran , Sahil Singla

LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework

In this paper, a unified framework for exploration in reinforcement learning (RL) is proposed based on an option-critic model. The proposed framework learns to integrate a set of diverse exploration strategies so that the agent can…

Machine Learning · Computer Science 2024-09-10 Woojun Kim , Jeonghye Kim , Youngchul Sung