Related papers: Learning Abstract Options

The Option-Critic Architecture

Temporal abstraction is key to scaling up learning and planning in reinforcement learning. While planning with temporally extended actions is well understood, creating such abstractions autonomously from data has remained challenging. We…

Artificial Intelligence · Computer Science 2016-12-06 Pierre-Luc Bacon , Jean Harb , Doina Precup

Diversity-Enriched Option-Critic

Temporal abstraction allows reinforcement learning agents to represent knowledge and develop strategies over different temporal scales. The option-critic framework has been demonstrated to learn temporally extended actions, represented as…

Machine Learning · Computer Science 2025-11-21 Anand Kamat , Doina Precup

Reusable Options through Gradient-based Meta Learning

Hierarchical methods in reinforcement learning have the potential to reduce the amount of decisions that the agent needs to perform when learning new tasks. However, finding reusable useful temporal abstractions that facilitate fast…

Machine Learning · Computer Science 2023-04-05 David Kuric , Herke van Hoof

Temporal Abstraction in Reinforcement Learning with Offline Data

Standard reinforcement learning algorithms with a single policy perform poorly on tasks in complex environments involving sparse rewards, diverse behaviors, or long-term planning. This led to the study of algorithms that incorporate…

Machine Learning · Computer Science 2024-07-23 Ranga Shaarad Ayyagari , Anurita Ghosh , Ambedkar Dukkipati

Attention Option-Critic

Temporal abstraction in reinforcement learning is the ability of an agent to learn and use high-level behaviors, called options. The option-critic architecture provides a gradient-based end-to-end learning method to construct options. We…

Machine Learning · Computer Science 2022-01-11 Raviteja Chunduru , Doina Precup

Multi-Timescale, Gradient Descent, Temporal Difference Learning with Linear Options

Deliberating on large or continuous state spaces have been long standing challenges in reinforcement learning. Temporal Abstraction have somewhat made this possible, but efficiently planing using temporal abstraction still remains an issue.…

Artificial Intelligence · Computer Science 2017-03-21 Peeyush Kumar , Doina Precup

Autonomous Option Invention for Continual Hierarchical Reinforcement Learning and Planning

Abstraction is key to scaling up reinforcement learning (RL). However, autonomously learning abstract state and action representations to enable transfer and generalization remains a challenging open problem. This paper presents a novel…

Artificial Intelligence · Computer Science 2024-12-24 Rashmeet Kaur Nayyar , Siddharth Srivastava

On the Role of Weight Sharing During Deep Option Learning

The options framework is a popular approach for building temporally extended actions in reinforcement learning. In particular, the option-critic architecture provides general purpose policy gradient theorems for learning actions from…

Machine Learning · Computer Science 2020-02-07 Matthew Riemer , Ignacio Cases , Clemens Rosenbaum , Miao Liu , Gerald Tesauro

Hierarchical Average Reward Policy Gradient Algorithms

Option-critic learning is a general-purpose reinforcement learning (RL) framework that aims to address the issue of long term credit assignment by leveraging temporal abstractions. However, when dealing with extended timescales, discounting…

Machine Learning · Computer Science 2019-11-21 Akshay Dharmavaram , Matthew Riemer , Shalabh Bhatnagar

Flexible Option Learning

Temporal abstraction in reinforcement learning (RL), offers the promise of improving generalization and knowledge transfer in complex environments, by propagating information more efficiently over time. Although option learning was…

Machine Learning · Computer Science 2021-12-07 Martin Klissarov , Doina Precup

Context-Specific Representation Abstraction for Deep Option Learning

Hierarchical reinforcement learning has focused on discovering temporally extended actions, such as options, that can provide benefits in problems requiring extensive exploration. One promising approach that learns these options end-to-end…

Machine Learning · Computer Science 2022-04-26 Marwa Abdulhai , Dong-Ki Kim , Matthew Riemer , Miao Liu , Gerald Tesauro , Jonathan P. How

Abstraction for Offline Goal-Conditioned Reinforcement Learning

Markov Decision Processes (MDPs) often exhibit significant redundancy due to symmetries and shared structure across state-goal pairs in real-world Goal-Conditioned Reinforcement Learning (GCRL). While hierarchical policies have been…

Machine Learning · Computer Science 2026-05-22 Clarisse Wibault , Alexander Goldie , Antonio Villares , Maike Osborne , Jakob Foerster

Abstract Value Iteration for Hierarchical Reinforcement Learning

We propose a novel hierarchical reinforcement learning framework for control with continuous state and action spaces. In our framework, the user specifies subgoal regions which are subsets of states; then, we (i) learn options that serve as…

Machine Learning · Computer Science 2021-02-26 Kishor Jothimurugan , Osbert Bastani , Rajeev Alur

Learning with Expert Abstractions for Efficient Multi-Task Continuous Control

Decision-making in complex, continuous multi-task environments is often hindered by the difficulty of obtaining accurate models for planning and the inefficiency of learning purely from trial and error. While precise environment dynamics…

Machine Learning · Computer Science 2025-03-20 Jeff Jewett , Sandhya Saisubramanian

Learning to Interrupt: A Hierarchical Deep Reinforcement Learning Framework for Efficient Exploration

To achieve scenario intelligence, humans must transfer knowledge to robots by developing goal-oriented algorithms, which are sometimes insensitive to dynamically changing environments. While deep reinforcement learning achieves significant…

Artificial Intelligence · Computer Science 2018-07-31 Tingguang Li , Jin Pan , Delong Zhu , Max Q. -H. Meng

A Provably Efficient Option-Based Algorithm for both High-Level and Low-Level Learning

Hierarchical Reinforcement Learning (HRL) approaches have shown successful results in solving a large variety of complex, structured, long-horizon problems. Nevertheless, a full theoretical understanding of this empirical evidence is…

Machine Learning · Computer Science 2025-02-05 Gianluca Drappo , Alberto Maria Metelli , Marcello Restelli

Options of Interest: Temporal Abstraction with Interest Functions

Temporal abstraction refers to the ability of an agent to use behaviours of controllers which act for a limited, variable amount of time. The options framework describes such behaviours as consisting of a subset of states in which they can…

Machine Learning · Computer Science 2020-01-03 Khimya Khetarpal , Martin Klissarov , Maxime Chevalier-Boisvert , Pierre-Luc Bacon , Doina Precup

Learnings Options End-to-End for Continuous Action Tasks

We present new results on learning temporally extended actions for continuoustasks, using the options framework (Suttonet al.[1999b], Precup [2000]). In orderto achieve this goal we work with the option-critic architecture (Baconet…

Machine Learning · Computer Science 2017-12-04 Martin Klissarov , Pierre-Luc Bacon , Jean Harb , Doina Precup

Reinforcement Learning with Options and State Representation

The current thesis aims to explore the reinforcement learning field and build on existing methods to produce improved ones to tackle the problem of learning in high-dimensional and complex environments. It addresses such goals by…

Machine Learning · Computer Science 2024-03-26 Ayoub Ghriss , Masashi Sugiyama , Alessandro Lazaric

Learning Temporal Abstractions via Variational Homomorphisms in Option-Induced Abstract MDPs

Large Language Models (LLMs) have shown remarkable reasoning ability through explicit Chain-of-Thought (CoT) prompting, but generating these step-by-step textual explanations is computationally expensive and slow. To overcome this, we aim…

Artificial Intelligence · Computer Science 2025-07-25 Chang Li , Yaren Zhang , Haoran Lv , Qiong Cao , Chao Xue , Xiaodong He