English
Related papers

Related papers: Exploration Unbound

200 papers

We introduce exploration potential, a quantity that measures how much a reinforcement learning agent has explored its environment class. In contrast to information gain, exploration potential takes the problem's reward structure into…

Machine Learning · Computer Science 2016-11-21 Jan Leike

We consider a ubiquitous scenario in the Internet economy when individual decision-makers (henceforth, agents) both produce and consume information as they make strategic choices in an uncertain environment. This creates a three-way…

Computer Science and Game Theory · Computer Science 2021-04-09 Yishay Mansour , Aleksandrs Slivkins , Vasilis Syrgkanis , Zhiwei Steven Wu

How to incentivize self-interested agents to explore when they prefer to exploit? Consider a population of self-interested agents that make decisions under uncertainty. They "explore" to acquire new information and "exploit" this…

Computer Science and Game Theory · Computer Science 2024-10-23 Aleksandrs Slivkins

Ensuring sufficient exploration is a central challenge when training meta-reinforcement learning (meta-RL) agents to solve novel environments. Conventional solutions to the exploration-exploitation dilemma inject explicit incentives such as…

Machine Learning · Computer Science 2025-08-05 Micah Rentschler , Jesse Roberts

A perfectly rational decision-maker chooses the best action with the highest utility gain from a set of possible actions. The optimality principles that describe such decision processes do not take into account the computational costs of…

Artificial Intelligence · Computer Science 2013-12-25 Jordi Grau-Moya , Daniel A. Braun

A reinforcement learning agent tries to maximize its cumulative payoff by interacting in an unknown environment. It is important for the agent to explore suboptimal actions as well as to pick actions with highest known rewards. Yet, in…

Machine Learning · Computer Science 2019-01-23 Reazul Hasan Russel

Realistic environments often provide agents with very limited feedback. When the environment is initially unknown, the feedback, in the beginning, can be completely absent, and the agents may first choose to devote all their effort on…

Machine Learning · Computer Science 2020-10-13 Pierre Ménard , Omar Darwiche Domingues , Anders Jonsson , Emilie Kaufmann , Edouard Leurent , Michal Valko

Methods for learning optimal policies in autonomous agents often assume that the way the domain is conceptualised---its possible states and actions and their causal structure---is known in advance and does not change during learning. This…

Artificial Intelligence · Computer Science 2018-01-11 Craig Innes , Alex Lascarides , Stefano V Albrecht , Subramanian Ramamoorthy , Benjamin Rosman

Exploration algorithms for reinforcement learning typically replace or augment the reward function with an additional ``intrinsic'' reward that trains the agent to seek previously unseen states of the environment. Here, we consider an…

Machine Learning · Computer Science 2025-09-30 Kevin McKee , Eric Alt , Andrew Grebenisan , Mick van Gelderen , Gary Miguel

A default assumption in the design of reinforcement-learning algorithms is that a decision-making agent always explores to learn optimal behavior. In sufficiently complex environments that approach the vastness and scale of the real world,…

Machine Learning · Computer Science 2024-07-23 Dilip Arumugam , Saurabh Kumar , Ramki Gummadi , Benjamin Van Roy

We propose and design recommendation systems that incentivize efficient exploration. Agents arrive sequentially, choose actions and receive rewards, drawn from fixed but unknown action-specific distributions. The recommendation system…

Computer Science and Game Theory · Computer Science 2026-04-02 Nicole Immorlica , Jieming Mao , Aleksandrs Slivkins , Zhiwei Steven Wu

An agent learning through interactions should balance its action selection process between probing the environment to discover new rewards and using the information acquired in the past to adopt useful behaviour. This trade-off is usually…

Machine Learning · Computer Science 2019-07-02 Lucas Beyer , Damien Vincent , Olivier Teboul , Sylvain Gelly , Matthieu Geist , Olivier Pietquin

In this survey we present different approaches that allow an intelligent agent to explore autonomous its environment to gather information and learn multiple tasks. Different communities proposed different solutions, that are in many cases,…

Artificial Intelligence · Computer Science 2014-03-07 Manuel Lopes , Luis Montesano

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained during execution of one task has value for the execution of…

Machine Learning · Computer Science 2012-09-06 Christos Dimitrakakis

To survive in dynamic and uncertain environments, individuals must develop effective decision strategies that balance information gathering and decision commitment. Models of such strategies often prioritize either optimizing tangible…

Artificial Intelligence · Computer Science 2025-03-26 Nicholas W. Barendregt , Joshua I. Gold , Krešimir Josić , Zachary P. Kilpatrick

The Exploration-Exploitation tradeoff arises in Reinforcement Learning when one cannot tell if a policy is optimal. Then, there is a constant need to explore new actions instead of exploiting past experience. In practice, it is common to…

Machine Learning · Computer Science 2019-09-10 Lior Shani , Yonathan Efroni , Shie Mannor

Active learning agents typically employ a query selection algorithm which solely considers the agent's learning objectives. However, this may be insufficient in more realistic human domains. This work uses imitation learning to enable an…

Machine Learning · Computer Science 2019-07-02 Kalesha Bullard , Yannick Schroecker , Sonia Chernova

Common approaches for task-agnostic exploration learn tabula-rasa --the agent assumes isolated environments and no prior knowledge or experience. However, in the real world, agents learn in many environments and always come with prior…

Machine Learning · Computer Science 2021-11-29 Simone Parisi , Victoria Dean , Deepak Pathak , Abhinav Gupta

Reinforcement learning systems are often concerned with balancing exploration of untested actions against exploitation of actions that are known to be good. The benefit of exploration can be estimated using the classical notion of Value of…

Artificial Intelligence · Computer Science 2013-01-30 Richard Dearden , Nir Friedman , David Andre

To maximize its success, an AGI typically needs to explore its initially unknown world. Is there an optimal way of doing so? Here we derive an affirmative answer for a broad class of environments.

Artificial Intelligence · Computer Science 2011-03-30 Yi Sun , Faustino Gomez , Juergen Schmidhuber
‹ Prev 1 2 3 10 Next ›