Related papers: Learning Options via Compression

Reinforcement Learning with Convex Constraints

In standard reinforcement learning (RL), a learning agent seeks to optimize the overall reward. However, many key aspects of a desired behavior are more naturally expressed as constraints. For instance, the designer may want to limit the…

Machine Learning · Computer Science · 2021-01-29 · Sobhan Miryoosefi, Kianté Brantley, Hal Daumé, Miroslav Dudik, Robert Schapire

Compression-Based Regularization with an Application to Multi-Task Learning

This paper investigates, from information theoretic grounds, a learning problem based on the principle that any regularity in a given dataset can be exploited to extract compact features from data, i.e., using fewer bits than needed to…

Machine Learning · Statistics · 2018-11-14 · Matías Vera, Leonardo Rey Vega, Pablo Piantanida

Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning

Reinforcement learning (RL) with diverse offline datasets can have the advantage of leveraging the relation of multiple tasks and the common skills learned across those tasks, hence allowing us to deal with real-world complex problems…

Machine Learning · Computer Science · 2024-08-29 · Minjong Yoo, Sangwoo Cho, Honguk Woo

Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context Variables

Deep reinforcement learning algorithms require large amounts of experience to learn an individual task. While in principle meta-reinforcement learning (meta-RL) algorithms enable agents to learn new skills from small amounts of experience,…

Machine Learning · Computer Science · 2019-03-21 · Kate Rakelly, Aurick Zhou, Deirdre Quillen, Chelsea Finn, Sergey Levine

Behavior Contrastive Learning for Unsupervised Skill Discovery

In reinforcement learning, unsupervised skill discovery aims to learn diverse skills without extrinsic rewards. Previous methods discover skills by maximizing the mutual information (MI) between states and skills. However, such an MI…

Machine Learning · Computer Science · 2023-05-09 · Rushuai Yang, Chenjia Bai, Hongyi Guo, Siyuan Li, Bin Zhao, Zhen Wang, Peng Liu, Xuelong Li

SLIM: Skill Learning with Multiple Critics

Self-supervised skill learning aims to acquire useful behaviors that leverage the underlying dynamics of the environment. Latent variable models, based on mutual information maximization, have been successful in this task but still struggle…

Machine Learning · Computer Science · 2024-03-22 · David Emukpere, Bingbing Wu, Julien Perez, Jean-Michel Renders

Leveraging convergence behavior to balance conflicting tasks in multi-task learning

Multi-Task Learning is a learning paradigm that uses correlated tasks to improve performance generalization. A common way to learn multiple tasks is through the hard parameter sharing approach, in which a single architecture is used to…

Machine Learning · Computer Science · 2022-04-15 · Angelica Tiemi Mizuno Nakamura, Denis Fernando Wolf, Valdir Grassi

Focused Skill Discovery: Learning to Control Specific State Variables while Minimizing Side Effects

Skills are essential for unlocking higher levels of problem solving. A common approach to discovering these skills is to learn ones that reliably reach different states, thus empowering the agent to control its environment. However,…

Machine Learning · Computer Science · 2025-10-07 · Jonathan Colaço Carr, Qinyi Sun, Cameron Allen

The Option Keyboard: Combining Skills in Reinforcement Learning

The ability to combine known skills to create new ones may be crucial in the solution of complex reinforcement learning problems that unfold over extended periods. We argue that a robust way of combining skills is to define and manipulate…

Artificial Intelligence · Computer Science · 2021-06-25 · André Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan Hunt, Shibl Mourad, David Silver, Doina Precup

Can Learned Optimization Make Reinforcement Learning Less Difficult?

While reinforcement learning (RL) holds great potential for decision making in the real world, it suffers from a number of unique difficulties which often need specific consideration. In particular: it is highly non-stationary; suffers from…

Machine Learning · Computer Science · 2025-04-16 · Alexander David Goldie, Chris Lu, Matthew Thomas Jackson, Shimon Whiteson, Jakob Nicolaus Foerster

Learning from Multiple Outlooks

We propose a novel problem formulation of learning a single task when the data are provided in different feature spaces. Each such space is called an outlook, and is assumed to contain both labeled and unlabeled data. The objective is to…

Machine Learning · Computer Science · 2011-06-15 · Maayan Harel, Shie Mannor

Optimal Options for Multi-Task Reinforcement Learning Under Time Constraints

Reinforcement learning can greatly benefit from the use of options as a way of encoding recurring behaviours and to foster exploration. An important open problem is how can an agent autonomously learn useful options when solving particular…

Machine Learning · Computer Science · 2020-01-07 · Manuel Del Verme, Bruno Castro da Silva, Gianluca Baldassarre

Efficient Skill Discovery via Regret-Aware Optimization

Unsupervised skill discovery aims to learn diverse and distinguishable behaviors in open-ended reinforcement learning. For existing methods, they focus on improving diversity through pure exploration, mutual information optimization, and…

Machine Learning · Computer Science · 2025-06-27 · He Zhang, Ming Zhou, Shaopeng Zhai, Ying Sun, Hui Xiong

The Information Geometry of Unsupervised Reinforcement Learning

How can a reinforcement learning (RL) agent prepare to solve downstream tasks if those tasks are not known a priori? One approach is unsupervised skill discovery, a class of algorithms that learn a set of policies without access to a reward…

Machine Learning · Computer Science · 2021-10-07 · Benjamin Eysenbach, Ruslan Salakhutdinov, Sergey Levine

Learning, compression, and leakage: Minimising classification error via meta-universal compression principles

Learning and compression are driven by the common aim of identifying and exploiting statistical regularities in data, which opens the door for fertile collaboration between these areas. A promising group of compression techniques for…

Machine Learning · Computer Science · 2021-02-02 · Fernando E. Rosas, Pedro A. M. Mediano, Michael Gastpar

Complex Skill Acquisition Through Simple Skill Imitation Learning

Humans often think of complex tasks as combinations of simpler subtasks in order to learn those complex tasks more efficiently. For example, a backflip could be considered a combination of four subskills: jumping, tucking knees, rolling…

Machine Learning · Computer Science · 2020-10-21 · Pranay Pasula

Unsupervised Skill Discovery through Skill Regions Differentiation

Unsupervised Reinforcement Learning (RL) aims to discover diverse behaviors that can accelerate the learning of downstream tasks. Previous methods typically focus on entropy-based exploration or empowerment-driven skill learning. However,…

Machine Learning · Computer Science · 2025-06-18 · Ting Xiao, Jiakun Zheng, Rushuai Yang, Kang Xu, Qiaosheng Zhang, Peng Liu, Chenjia Bai

Conditional Meta-Learning of Linear Representations

Standard meta-learning for representation learning aims to find a common representation to be shared across multiple tasks. The effectiveness of these methods is often limited when the nuances of the tasks' distribution cannot be captured…

Machine Learning · Computer Science · 2021-03-31 · Giulia Denevi, Massimiliano Pontil, Carlo Ciliberto

Probabilistic Active Meta-Learning

Data-efficient learning algorithms are essential in many practical applications where data collection is expensive, e.g., in robotics due to the wear and tear. To address this problem, meta-learning algorithms use prior experience about…

Machine Learning · Computer Science · 2020-10-26 · Jean Kaddour, Steindór Sæmundsson, Marc Peter Deisenroth

Resilient Constrained Learning

When deploying machine learning solutions, they must satisfy multiple requirements beyond accuracy, such as fairness, robustness, or safety. These requirements are imposed during training either implicitly, using penalties, or explicitly,…

Machine Learning · Computer Science · 2024-01-12 · Ignacio Hounie, Alejandro Ribeiro, Luiz F. O. Chamon