Related papers: Model-Based Bayesian Exploration

Information Maximizing Exploration with a Latent Dynamics Model

All reinforcement learning algorithms must handle the trade-off between exploration and exploitation. Many state-of-the-art deep reinforcement learning methods use noise in the action selection, such as Gaussian noise in policy gradient…

Machine Learning · Computer Science 2018-04-05 Trevor Barron , Oliver Obst , Heni Ben Amor

A Short Survey on Probabilistic Reinforcement Learning

A reinforcement learning agent tries to maximize its cumulative payoff by interacting in an unknown environment. It is important for the agent to explore suboptimal actions as well as to pick actions with highest known rewards. Yet, in…

Machine Learning · Computer Science 2019-01-23 Reazul Hasan Russel

Temporal Difference Uncertainties as a Signal for Exploration

An effective approach to exploration in reinforcement learning is to rely on an agent's uncertainty over the optimal policy, which can yield near-optimal exploration strategies in tabular settings. However, in non-tabular settings that…

Artificial Intelligence · Computer Science 2021-07-02 Sebastian Flennerhag , Jane X. Wang , Pablo Sprechmann , Francesco Visin , Alexandre Galashov , Steven Kapturowski , Diana L. Borsa , Nicolas Heess , Andre Barreto , Razvan Pascanu

Variance-Based Rewards for Approximate Bayesian Reinforcement Learning

The explore{exploit dilemma is one of the central challenges in Reinforcement Learning (RL). Bayesian RL solves the dilemma by providing the agent with information in the form of a prior distribution over environments; however, full…

Machine Learning · Computer Science 2012-03-19 Jonathan Sorg , Satinder Singh , Richard L. Lewis

Direct Uncertainty Estimation in Reinforcement Learning

Optimal probabilistic approach in reinforcement learning is computationally infeasible. Its simplification consisting in neglecting difference between true environment and its model estimated using limited number of observations causes…

Artificial Intelligence · Computer Science 2013-06-26 Sergey Rodionov , Alexey Potapov , Yurii Vinogradov

Bayesian Exploration: Incentivizing Exploration in Bayesian Games

We consider a ubiquitous scenario in the Internet economy when individual decision-makers (henceforth, agents) both produce and consume information as they make strategic choices in an uncertain environment. This creates a three-way…

Computer Science and Game Theory · Computer Science 2021-04-09 Yishay Mansour , Aleksandrs Slivkins , Vasilis Syrgkanis , Zhiwei Steven Wu

Exploration Potential

We introduce exploration potential, a quantity that measures how much a reinforcement learning agent has explored its environment class. In contrast to information gain, exploration potential takes the problem's reward structure into…

Machine Learning · Computer Science 2016-11-21 Jan Leike

Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation

Efficiently tackling multiple tasks within complex environment, such as those found in robot manipulation, remains an ongoing challenge in robotics and an opportunity for data-driven solutions, such as reinforcement learning (RL).…

Robotics · Computer Science 2024-04-03 Carlos Plou , Ana C. Murillo , Ruben Martinez-Cantin

Tree Exploration for Bayesian RL Exploration

Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The first employs a Bayesian framework, where optimality improves with increased computational time.…

Machine Learning · Statistics 2011-09-22 Christos Dimitrakakis

A Bayesian Sampling Approach to Exploration in Reinforcement Learning

We present a modular approach to reinforcement learning that uses a Bayesian representation of the uncertainty over models. The approach, BOSS (Best of Sampled Set), drives exploration by sampling multiple models from the posterior and…

Machine Learning · Computer Science 2012-05-14 John Asmuth , Lihong Li , Michael L. Littman , Ali Nouri , David Wingate

Probabilistic Exploration in Planning while Learning

Sequential decision tasks with incomplete information are characterized by the exploration problem; namely the trade-off between further exploration for learning more about the environment and immediate exploitation of the accrued…

Artificial Intelligence · Computer Science 2013-02-21 Grigoris I. Karakoulas

Empowerment-driven Exploration using Mutual Information Estimation

Exploration is a difficult challenge in reinforcement learning and is of prime importance in sparse reward environments. However, many of the state of the art deep reinforcement learning algorithms, that rely on epsilon-greedy, fail on…

Machine Learning · Computer Science 2018-10-15 Navneet Madhu Kumar

On Efficient Bayesian Exploration in Model-Based Reinforcement Learning

In this work, we address the challenge of data-efficient exploration in reinforcement learning by examining existing principled, information-theoretic approaches to intrinsic motivation. Specifically, we focus on a class of exploration…

Machine Learning · Computer Science 2025-07-04 Alberto Caron , Chris Hicks , Vasilios Mavroudis

Implicit Generative Modeling for Efficient Exploration

Efficient exploration remains a challenging problem in reinforcement learning, especially for those tasks where rewards from environments are sparse. A commonly used approach for exploring such environments is to introduce some "intrinsic"…

Machine Learning · Computer Science 2020-07-16 Neale Ratzlaff , Qinxun Bai , Li Fuxin , Wei Xu

Dealing with uncertainty: balancing exploration and exploitation in deep recurrent reinforcement learning

Incomplete knowledge of the environment leads an agent to make decisions under uncertainty. One of the major dilemmas in Reinforcement Learning (RL) where an autonomous agent has to balance two contrasting needs in making its decisions is:…

Machine Learning · Statistics 2024-02-21 Valentina Zangirolami , Matteo Borrotti

Modeling Human Exploration Through Resource-Rational Reinforcement Learning

Equipping artificial agents with useful exploration mechanisms remains a challenge to this day. Humans, on the other hand, seem to manage the trade-off between exploration and exploitation effortlessly. In the present article, we put…

Machine Learning · Computer Science 2022-11-15 Marcel Binz , Eric Schulz

Model-Based Active Exploration

Efficient exploration is an unsolved problem in Reinforcement Learning which is usually addressed by reactively rewarding the agent for fortuitously encountering novel situations. This paper introduces an efficient active exploration…

Machine Learning · Computer Science 2019-06-17 Pranav Shyam , Wojciech Jaśkowski , Faustino Gomez

Exploration Conscious Reinforcement Learning Revisited

The Exploration-Exploitation tradeoff arises in Reinforcement Learning when one cannot tell if a policy is optimal. Then, there is a constant need to explore new actions instead of exploiting past experience. In practice, it is common to…

Machine Learning · Computer Science 2019-09-10 Lior Shani , Yonathan Efroni , Shie Mannor

MEET: A Monte Carlo Exploration-Exploitation Trade-off for Buffer Sampling

Data selection is essential for any data-based optimization technique, such as Reinforcement Learning. State-of-the-art sampling strategies for the experience replay buffer improve the performance of the Reinforcement Learning agent.…

Machine Learning · Computer Science 2023-11-28 Julius Ott , Lorenzo Servadei , Jose Arjona-Medina , Enrico Rinaldi , Gianfranco Mauro , Daniela Sánchez Lopera , Michael Stephan , Thomas Stadelmayer , Avik Santra , Robert Wille

Model-Free Active Exploration in Reinforcement Learning

We study the problem of exploration in Reinforcement Learning and present a novel model-free solution. We adopt an information-theoretical viewpoint and start from the instance-specific lower bound of the number of samples that have to be…

Machine Learning · Computer Science 2024-07-02 Alessio Russo , Alexandre Proutiere