Related papers: Evaluating Agents without Rewards

Learning To Explore With Predictive World Model Via Self-Supervised Learning

Autonomous artificial agents must be able to learn behaviors in complex environments without humans to design tasks and rewards. Designing these functions for each environment is not feasible, thus motivating the development of intrinsic…

Machine Learning · Computer Science 2025-02-20 Alana Santana, Paula P. Costa, Esther L. Colombini

Intrinsically-Motivated Humans and Agents in Open-World Exploration

What drives exploration? Understanding intrinsic motivation is a long-standing challenge in both cognitive science and artificial intelligence; numerous objectives have been proposed and used to train agents, yet there remains a gap between…

Artificial Intelligence · Computer Science 2025-05-29 Aly Lidayan, Yuqing Du, Eliza Kosoy, Maria Rufova, Pieter Abbeel, Alison Gopnik

Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning

Solving tasks with sparse rewards is one of the most important challenges in reinforcement learning. In the single-agent setting, this challenge is addressed by introducing intrinsic rewards that motivate agents to explore unseen regions of…

Machine Learning · Computer Science 2021-05-25 Shariq Iqbal, Fei Sha

Minding Motivation: The Effect of Intrinsic Motivation on Agent Behaviors

Games are challenging for Reinforcement Learning (RL) agents due to their reward-sparsity, as rewards are only obtainable after long sequences of deliberate actions. Intrinsic Motivation (IM) methods -- which introduce exploration rewards…

Artificial Intelligence · Computer Science 2025-07-29 Leonardo Villalobos-Arias, Grant Forbes, Jianxun Wang, David L Roberts, Arnav Jhala

Large-Scale Study of Curiosity-Driven Learning

Reinforcement learning algorithms rely on carefully engineering environment rewards that are extrinsic to the agent. However, annotating each environment with hand-designed, dense rewards is not scalable, motivating the need for developing…

Machine Learning · Computer Science 2018-08-14 Yuri Burda, Harri Edwards, Deepak Pathak, Amos Storkey, Trevor Darrell, Alexei A. Efros

Automatic Reward Design via Learning Motivation-Consistent Intrinsic Rewards

Reward design is a critical part of the application of reinforcement learning, the performance of which strongly depends on how well the reward signal frames the goal of the designer and how well the signal assesses progress in reaching…

Machine Learning · Computer Science 2022-08-01 Yixiang Wang, Yujing Hu, Feng Wu, Yingfeng Chen

Towards Improving Exploration in Self-Imitation Learning using Intrinsic Motivation

Reinforcement Learning has emerged as a strong alternative to solve optimization tasks efficiently. The use of these algorithms highly depends on the feedback signals provided by the environment in charge of informing about how good (or…

Machine Learning · Computer Science 2022-12-01 Alain Andres, Esther Villar-Rodriguez, Javier Del Ser

How Should an Agent Practice?

We present a method for learning intrinsic reward functions to drive the learning of an agent during periods of practice in which extrinsic task rewards are not available. During practice, the environment may differ from the one available…

Artificial Intelligence · Computer Science 2019-12-17 Janarthanan Rajendran, Richard Lewis, Vivek Veeriah, Honglak Lee, Satinder Singh

Information is Power: Intrinsic Control via Information Capture

Humans and animals explore their environment and acquire useful skills even in the absence of clear goals, exhibiting intrinsic motivation. The study of intrinsic motivation in artificial agents is concerned with the following question:…

Machine Learning · Computer Science 2021-12-08 Nicholas Rhinehart, Jenny Wang, Glen Berseth, John D. Co-Reyes, Danijar Hafner, Chelsea Finn, Sergey Levine

On Learning Intrinsic Rewards for Policy Gradient Methods

In many sequential decision making tasks, it is challenging to design reward functions that help an RL agent efficiently learn behavior that is considered good by the agent designer. A number of different formulations of the reward-design…

Artificial Intelligence · Computer Science 2018-06-25 Zeyu Zheng, Junhyuk Oh, Satinder Singh

Reinforcement Learning with Unsupervised Auxiliary Tasks

Deep reinforcement learning agents have achieved state-of-the-art results by directly maximising cumulative reward. However, environments contain a much wider variety of possible training signals. In this paper, we introduce an agent that…

Machine Learning · Computer Science 2016-11-17 Max Jaderberg, Volodymyr Mnih, Wojciech Marian Czarnecki, Tom Schaul, Joel Z Leibo, David Silver, Koray Kavukcuoglu

ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward

Modern multi-agent reinforcement learning frameworks rely on centralized training and reward shaping to perform well. However, centralized training and dense rewards are not readily available in the real world. Current multi-agent…

Multiagent Systems · Computer Science 2022-11-11 Zixian Ma, Rose Wang, Li Fei-Fei, Michael Bernstein, Ranjay Krishna

What Can Learned Intrinsic Rewards Capture?

The objective of a reinforcement learning agent is to behave so as to maximise the sum of a suitable scalar function of state: the reward. These rewards are typically given and immutable. In this paper, we instead consider the proposition…

Artificial Intelligence · Computer Science 2020-08-25 Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh

Reward learning from human preferences and demonstrations in Atari

To solve complex real-world problems with reinforcement learning, we cannot rely on manually specified reward functions. Instead, we can have humans communicate an objective to the agent directly. In this work, we combine two approaches to…

Machine Learning · Computer Science 2018-11-16 Borja Ibarz, Jan Leike, Tobias Pohlen, Geoffrey Irving, Shane Legg, Dario Amodei

Adapting Behaviour via Intrinsic Reward: A Survey and Empirical Study

Learning about many things can provide numerous benefits to a reinforcement learning system. For example, learning many auxiliary value functions, in addition to optimizing the environmental reward, appears to improve both exploration and…

Machine Learning · Computer Science 2020-08-25 Cam Linke, Nadia M. Ady, Martha White, Thomas Degris, Adam White

Learning to Incentivize Other Learning Agents

The challenge of developing powerful and general Reinforcement Learning (RL) agents has received increasing attention in recent years. Much of this effort has focused on the single-agent setting, in which an agent maximizes a predefined…

Machine Learning · Computer Science 2020-10-21 Jiachen Yang, Ang Li, Mehrdad Farajtabar, Peter Sunehag, Edward Hughes, Hongyuan Zha

Mutual Information State Intrinsic Control

Reinforcement learning has been shown to be highly successful at many challenging tasks. However, success heavily relies on well-shaped rewards. Intrinsically motivated RL attempts to remove this constraint by defining an intrinsic reward…

Machine Learning · Computer Science 2021-03-16 Rui Zhao, Yang Gao, Pieter Abbeel, Volker Tresp, Wei Xu

Learning Human Rewards by Inferring Their Latent Intelligence Levels in Multi-Agent Games: A Theory-of-Mind Approach with Application to Driving Data

Reward function, as an incentive representation that recognizes humans' agency and rationalizes humans' actions, is particularly appealing for modeling human behavior in human-robot interaction. Inverse Reinforcement Learning is an…

Artificial Intelligence · Computer Science 2021-03-09 Ran Tian, Masayoshi Tomizuka, Liting Sun

Unsupervised Control Through Non-Parametric Discriminative Rewards

Learning to control an environment without hand-crafted rewards or expert data remains challenging and is at the frontier of reinforcement learning research. We present an unsupervised learning algorithm to train agents to achieve…

Machine Learning · Computer Science 2018-11-29 David Warde-Farley, Tom Van de Wiele, Tejas Kulkarni, Catalin Ionescu, Steven Hansen, Volodymyr Mnih

Experimental Evidence that Empowerment May Drive Exploration in Sparse-Reward Environments

Reinforcement Learning (RL) is known to be often unsuccessful in environments with sparse extrinsic rewards. A possible countermeasure is to endow RL agents with an intrinsic reward function, or 'intrinsic motivation', which rewards the…

Artificial Intelligence · Computer Science 2021-07-16 Francesco Massari, Martin Biehl, Lisa Meeden, Ryota Kanai