Related papers: Exploration Unbound

Exploration Potential

We introduce exploration potential, a quantity that measures how much a reinforcement learning agent has explored its environment class. In contrast to information gain, exploration potential takes the problem's reward structure into…

Machine Learning · Computer Science 2016-11-21 Jan Leike

Bayesian Exploration: Incentivizing Exploration in Bayesian Games

We consider a ubiquitous scenario in the Internet economy when individual decision-makers (henceforth, agents) both produce and consume information as they make strategic choices in an uncertain environment. This creates a three-way…

Computer Science and Game Theory · Computer Science 2021-04-09 Yishay Mansour , Aleksandrs Slivkins , Vasilis Syrgkanis , Zhiwei Steven Wu

Exploration and Persuasion

How to incentivize self-interested agents to explore when they prefer to exploit? Consider a population of self-interested agents that make decisions under uncertainty. They "explore" to acquire new information and "exploit" this…

Computer Science and Game Theory · Computer Science 2024-10-23 Aleksandrs Slivkins

Exploitation Is All You Need... for Exploration

Ensuring sufficient exploration is a central challenge when training meta-reinforcement learning (meta-RL) agents to solve novel environments. Conventional solutions to the exploration-exploitation dilemma inject explicit incentives such as…

Machine Learning · Computer Science 2025-08-05 Micah Rentschler , Jesse Roberts

Bounded Rational Decision-Making in Changing Environments

A perfectly rational decision-maker chooses the best action with the highest utility gain from a set of possible actions. The optimality principles that describe such decision processes do not take into account the computational costs of…

Artificial Intelligence · Computer Science 2013-12-25 Jordi Grau-Moya , Daniel A. Braun

A Short Survey on Probabilistic Reinforcement Learning

A reinforcement learning agent tries to maximize its cumulative payoff by interacting in an unknown environment. It is important for the agent to explore suboptimal actions as well as to pick actions with highest known rewards. Yet, in…

Machine Learning · Computer Science 2019-01-23 Reazul Hasan Russel

Fast active learning for pure exploration in reinforcement learning

Realistic environments often provide agents with very limited feedback. When the environment is initially unknown, the feedback, in the beginning, can be completely absent, and the agents may first choose to devote all their effort on…

Machine Learning · Computer Science 2020-10-13 Pierre Ménard , Omar Darwiche Domingues , Anders Jonsson , Emilie Kaufmann , Edouard Leurent , Michal Valko

Reasoning about Unforeseen Possibilities During Policy Learning

Methods for learning optimal policies in autonomous agents often assume that the way the domain is conceptualised---its possible states and actions and their causal structure---is known in advance and does not change during learning. This…

Artificial Intelligence · Computer Science 2018-01-11 Craig Innes , Alex Lascarides , Stefano V Albrecht , Subramanian Ramamoorthy , Benjamin Rosman

Meta-Learning to Explore via Memory Density Feedback

Exploration algorithms for reinforcement learning typically replace or augment the reward function with an additional ``intrinsic'' reward that trains the agent to seek previously unseen states of the environment. Here, we consider an…

Machine Learning · Computer Science 2025-09-30 Kevin McKee , Eric Alt , Andrew Grebenisan , Mick van Gelderen , Gary Miguel

Satisficing Exploration for Deep Reinforcement Learning

A default assumption in the design of reinforcement-learning algorithms is that a decision-making agent always explores to learn optimal behavior. In sufficiently complex environments that approach the vastness and scale of the real world,…

Machine Learning · Computer Science 2024-07-23 Dilip Arumugam , Saurabh Kumar , Ramki Gummadi , Benjamin Van Roy

Incentivizing Exploration with Selective Data Disclosure

We propose and design recommendation systems that incentivize efficient exploration. Agents arrive sequentially, choose actions and receive rewards, drawn from fixed but unknown action-specific distributions. The recommendation system…

Computer Science and Game Theory · Computer Science 2026-04-02 Nicole Immorlica , Jieming Mao , Aleksandrs Slivkins , Zhiwei Steven Wu

MULEX: Disentangling Exploitation from Exploration in Deep RL

An agent learning through interactions should balance its action selection process between probing the environment to discover new rewards and using the information acquired in the past to adopt useful behaviour. This trade-off is usually…

Machine Learning · Computer Science 2019-07-02 Lucas Beyer , Damien Vincent , Olivier Teboul , Sylvain Gelly , Matthieu Geist , Olivier Pietquin

Active Learning for Autonomous Intelligent Agents: Exploration, Curiosity, and Interaction

In this survey we present different approaches that allow an intelligent agent to explore autonomous its environment to gather information and learn multiple tasks. Different communities proposed different solutions, that are in many cases,…

Artificial Intelligence · Computer Science 2014-03-07 Manuel Lopes , Luis Montesano

Sparse Reward Processes

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained during execution of one task has value for the execution of…

Machine Learning · Computer Science 2012-09-06 Christos Dimitrakakis

Information-Seeking Decision Strategies Mitigate Risk in Dynamic, Uncertain Environments

To survive in dynamic and uncertain environments, individuals must develop effective decision strategies that balance information gathering and decision commitment. Models of such strategies often prioritize either optimizing tangible…

Artificial Intelligence · Computer Science 2025-03-26 Nicholas W. Barendregt , Joshua I. Gold , Krešimir Josić , Zachary P. Kilpatrick

Exploration Conscious Reinforcement Learning Revisited

The Exploration-Exploitation tradeoff arises in Reinforcement Learning when one cannot tell if a policy is optimal. Then, there is a constant need to explore new actions instead of exploiting past experience. In practice, it is common to…

Machine Learning · Computer Science 2019-09-10 Lior Shani , Yonathan Efroni , Shie Mannor

Active Learning within Constrained Environments through Imitation of an Expert Questioner

Active learning agents typically employ a query selection algorithm which solely considers the agent's learning objectives. However, this may be insufficient in more realistic human domains. This work uses imitation learning to enable an…

Machine Learning · Computer Science 2019-07-02 Kalesha Bullard , Yannick Schroecker , Sonia Chernova

Interesting Object, Curious Agent: Learning Task-Agnostic Exploration

Common approaches for task-agnostic exploration learn tabula-rasa --the agent assumes isolated environments and no prior knowledge or experience. However, in the real world, agents learn in many environments and always come with prior…

Machine Learning · Computer Science 2021-11-29 Simone Parisi , Victoria Dean , Deepak Pathak , Abhinav Gupta

Model-Based Bayesian Exploration

Reinforcement learning systems are often concerned with balancing exploration of untested actions against exploitation of actions that are known to be good. The benefit of exploration can be estimated using the classical notion of Value of…

Artificial Intelligence · Computer Science 2013-01-30 Richard Dearden , Nir Friedman , David Andre

Planning to Be Surprised: Optimal Bayesian Exploration in Dynamic Environments

To maximize its success, an AGI typically needs to explore its initially unknown world. Is there an optimal way of doing so? Here we derive an affirmative answer for a broad class of environments.

Artificial Intelligence · Computer Science 2011-03-30 Yi Sun , Faustino Gomez , Juergen Schmidhuber