Related papers: Learning telic-controllable state representations

Goals and the Structure of Experience

Purposeful behavior is a hallmark of natural and artificial intelligence. Its acquisition is often believed to rely on world models, comprising both descriptive (what is) and prescriptive (what is desirable) aspects that identify and…

Artificial Intelligence · Computer Science 2025-08-22 Nadav Amir, Stas Tiomkin, Angela Langdon

States as goal-directed concepts: an epistemic approach to state-representation learning

Our goals fundamentally shape how we experience the world. For example, when we are hungry, we tend to view objects in our environment according to whether or not they are edible (or tasty). Alternatively, when we are cold, we may view the…

Neurons and Cognition · Quantitative Biology 2024-01-05 Nadav Amir, Yael Niv, Angela Langdon

PcLast: Discovering Plannable Continuous Latent States

Goal-conditioned planning benefits from learned low-dimensional representations of rich observations. While compact latent representations typically learned from variational autoencoders or inverse dynamics enable goal-conditioned decision…

Machine Learning · Computer Science 2024-06-12 Anurag Koul, Shivakanth Sujit, Shaoru Chen, Ben Evans, Lili Wu, Byron Xu, Rajan Chari, Riashat Islam, Raihan Seraj, Yonathan Efroni, Lekan Molu, Miro Dudik, John Langford, Alex Lamb

Mutual Information-based State-Control for Intrinsically Motivated Reinforcement Learning

In reinforcement learning, an agent learns to reach a set of goals by means of an external reward signal. In the natural world, intelligent organisms learn from internal drives, bypassing the need for external signals, which is beneficial…

Machine Learning · Computer Science 2020-06-16 Rui Zhao, Yang Gao, Pieter Abbeel, Volker Tresp, Wei Xu

Signal Temporal Logic-Guided Apprenticeship Learning

Apprenticeship learning crucially depends on effectively learning rewards, and hence control policies from user demonstrations. Of particular difficulty is the setting where the desired task consists of a number of sub-goals with temporal…

Robotics · Computer Science 2023-11-10 Aniruddh G. Puranic, Jyotirmoy V. Deshmukh, Stefanos Nikolaidis

Reward prediction for representation learning and reward shaping

One of the fundamental challenges in reinforcement learning (RL) is the one of data efficiency: modern algorithms require a very large number of training samples, especially compared to humans, for solving environments with high-dimensional…

Machine Learning · Computer Science 2021-05-10 Hlynur Davíð Hlynsson, Laurenz Wiskott

Learning Task Agnostic Skills with Data-driven Guidance

To increase autonomy in reinforcement learning, agents need to learn useful behaviours without reliance on manually designed reward functions. To that end, skill discovery methods have been used to learn the intrinsic options available to…

Artificial Intelligence · Computer Science 2021-08-05 Even Klemsdal, Sverre Herland, Abdulmajid Murad

Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey

Building autonomous machines that can explore open-ended environments, discover possible interactions and build repertoires of skills is a general objective of artificial intelligence. Developmental approaches argue that this can only be…

Machine Learning · Computer Science 2026-01-30 Cédric Colas, Tristan Karch, Olivier Sigaud, Pierre-Yves Oudeyer

Goal-Conditioned Reinforcement Learning with Imagined Subgoals

Goal-conditioned reinforcement learning endows an agent with a large variety of skills, but it often struggles to solve tasks that require more temporally extended reasoning. In this work, we propose to incorporate imagined subgoals into…

Machine Learning · Computer Science 2021-07-02 Elliot Chane-Sane, Cordelia Schmid, Ivan Laptev

Explainable robotic systems: Understanding goal-driven actions in a reinforcement learning scenario

Robotic systems are more present in our society every day. In human-robot environments, it is crucial that end-users may correctly understand their robotic team-partners, in order to collaboratively complete a task. To increase action…

Artificial Intelligence · Computer Science 2021-09-03 Francisco Cruz, Richard Dazeley, Peter Vamplew, Ithan Moreira

Collaborative learning model predictive control for repetitive tasks

This paper presents a cloud-based learning model predictive controller that integrates three interacting components: a set of agents, which must learn to perform a finite set of tasks with the minimum possible local cost; a coordinator,…

Systems and Control · Electrical Eng. & Systems 2022-12-01 Paula Chanfreut, José María Maestre, Eduardo F. Camacho, Francesco Borrelli

State Representation Learning for Goal-Conditioned Reinforcement Learning

This paper presents a novel state representation for reward-free Markov decision processes. The idea is to learn, in a self-supervised manner, an embedding space where distances between pairs of embedded states correspond to the minimum…

Machine Learning · Computer Science 2022-05-05 Lorenzo Steccanella, Anders Jonsson

Relate to Predict: Towards Task-Independent Knowledge Representations for Reinforcement Learning

Reinforcement Learning (RL) can enable agents to learn complex tasks. However, it is difficult to interpret the knowledge and reuse it across tasks. Inductive biases can address such issues by explicitly providing generic yet useful…

Artificial Intelligence · Computer Science 2022-12-13 Thomas Schnürer, Malte Probst, Horst-Michael Gross

Neural Distillation as a State Representation Bottleneck in Reinforcement Learning

Learning a good state representation is a critical skill when dealing with multiple tasks in Reinforcement Learning as it allows for transfer and better generalization between tasks. However, defining what constitutes a useful representation…

Machine Learning · Computer Science 2022-10-06 Valentin Guillet, Dennis G. Wilson, Carlos Aguilar-Melchor, Emmanuel Rachelson

Belief States for Cooperative Multi-Agent Reinforcement Learning under Partial Observability

Reinforcement learning in partially observable environments is typically challenging, as it requires agents to learn an estimate of the underlying system state. These challenges are exacerbated in multi-agent settings, where agents learn…

Artificial Intelligence · Computer Science 2025-04-14 Paul J. Pritz, Kin K. Leung

C-Learning: Learning to Achieve Goals via Recursive Classification

We study the problem of predicting and controlling the future state distribution of an autonomous agent. This problem, which can be viewed as a reframing of goal-conditioned reinforcement learning (RL), is centered around learning a…

Machine Learning · Computer Science 2021-04-21 Benjamin Eysenbach, Ruslan Salakhutdinov, Sergey Levine

Towards Governing Agent's Efficacy: Action-Conditional $\beta$-VAE for Deep Transparent Reinforcement Learning

We tackle the blackbox issue of deep neural networks in the settings of reinforcement learning (RL) where neural agents learn towards maximizing reward gains in an uncontrollable way. Such learning approach is risky when the interacting…

Machine Learning · Computer Science 2018-11-13 John Yang, Gyujeong Lee, Minsung Hyun, Simyung Chang, Nojun Kwak

Modeling the effects of environmental and perceptual uncertainty using deterministic reinforcement learning dynamics with partial observability

Assessing the systemic effects of uncertainty that arises from agents' partial observation of the true states of the world is critical for understanding a wide range of scenarios. Yet, previous modeling work on agent learning and…

Adaptation and Self-Organizing Systems · Physics 2022-04-15 Wolfram Barfuss, Richard P. Mann

Transferable Delay-Aware Reinforcement Learning via Implicit Causal Graph Modeling

Random delays weaken the temporal correspondence between actions and subsequent state feedback, making it difficult for agents to identify the true propagation process of action effects. In cross-task scenarios, changes in task objectives…

Machine Learning · Computer Science 2026-05-13 Chenran Zhao, Dianxi Shi, Yaowen Zhang, Chunping Qiu, Shaowu Yang

Predictive Control Using Learned State Space Models via Rolling Horizon Evolution

A large part of the interest in model-based reinforcement learning derives from the potential utility to acquire a forward model capable of strategic long term decision making. Assuming that an agent succeeds in learning a useful predictive…

Machine Learning · Computer Science 2021-06-29 Alvaro Ovalle, Simon M. Lucas