Related papers: Hyper: Hyperparameter Robust Efficient Exploration…

Reward-Free Exploration for Reinforcement Learning

Exploration is widely regarded as one of the most challenging aspects of reinforcement learning (RL), with many naive approaches succumbing to exponential sample complexity. To isolate the challenges of exploration, we propose a new…

Machine Learning · Computer Science 2020-02-10 Chi Jin , Akshay Krishnamurthy , Max Simchowitz , Tiancheng Yu

Overcoming Exploration in Reinforcement Learning with Demonstrations

Exploration in environments with sparse rewards has been a persistent problem in reinforcement learning (RL). Many tasks are natural to specify with a sparse reward, and manually shaping a reward function can result in suboptimal…

Machine Learning · Computer Science 2018-02-27 Ashvin Nair , Bob McGrew , Marcin Andrychowicz , Wojciech Zaremba , Pieter Abbeel

Fast Rates for Maximum Entropy Exploration

We address the challenge of exploration in reinforcement learning (RL) when the agent operates in an unknown environment with sparse or no rewards. In this work, we study the maximum entropy exploration problem of two different types. The…

Machine Learning · Statistics 2023-06-07 Daniil Tiapkin , Denis Belomestny , Daniele Calandriello , Eric Moulines , Remi Munos , Alexey Naumov , Pierre Perrault , Yunhao Tang , Michal Valko , Pierre Menard

Hyperparameter Optimization for Multi-Objective Reinforcement Learning

Reinforcement learning (RL) has emerged as a powerful approach for tackling complex problems. The recent introduction of multi-objective reinforcement learning (MORL) has further expanded the scope of RL by enabling agents to make…

Machine Learning · Computer Science 2023-10-26 Florian Felten , Daniel Gareev , El-Ghazali Talbi , Grégoire Danoy

Auto-exploration for online reinforcement learning

The exploration-exploitation dilemma in reinforcement learning (RL) is a fundamental challenge to efficient RL algorithms. Existing algorithms for finite state and action discounted RL problems address this by assuming sufficient…

Machine Learning · Computer Science 2025-12-09 Caleb Ju , Guanghui Lan

Modeling Human Exploration Through Resource-Rational Reinforcement Learning

Equipping artificial agents with useful exploration mechanisms remains a challenge to this day. Humans, on the other hand, seem to manage the trade-off between exploration and exploitation effortlessly. In the present article, we put…

Machine Learning · Computer Science 2022-11-15 Marcel Binz , Eric Schulz

Satisficing Exploration for Deep Reinforcement Learning

A default assumption in the design of reinforcement-learning algorithms is that a decision-making agent always explores to learn optimal behavior. In sufficiently complex environments that approach the vastness and scale of the real world,…

Machine Learning · Computer Science 2024-07-23 Dilip Arumugam , Saurabh Kumar , Ramki Gummadi , Benjamin Van Roy

A Short Survey on Probabilistic Reinforcement Learning

A reinforcement learning agent tries to maximize its cumulative payoff by interacting in an unknown environment. It is important for the agent to explore suboptimal actions as well as to pick actions with highest known rewards. Yet, in…

Machine Learning · Computer Science 2019-01-23 Reazul Hasan Russel

Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets

The objective of a reinforcement learning agent is to discover better actions through exploration. However, typical exploration techniques aim to maximize rewards, often incurring high costs in both exploration and learning processes. We…

Machine Learning · Computer Science 2024-12-24 Akane Tsuboya , Yu Kono , Tatsuji Takahashi

Efficient Exploration in Resource-Restricted Reinforcement Learning

In many real-world applications of reinforcement learning (RL), performing actions requires consuming certain types of resources that are non-replenishable in each episode. Typical applications include robotic control with limited energy…

Machine Learning · Computer Science 2022-12-15 Zhihai Wang , Taoxing Pan , Qi Zhou , Jie Wang

Reinforcement Learning with Probabilistically Complete Exploration

Balancing exploration and exploitation remains a key challenge in reinforcement learning (RL). State-of-the-art RL algorithms suffer from high sample complexity, particularly in the sparse reward case, where they can do no better than to…

Machine Learning · Computer Science 2020-01-22 Philippe Morere , Gilad Francis , Tom Blau , Fabio Ramos

Reward Uncertainty for Exploration in Preference-based Reinforcement Learning

Conveying complex objectives to reinforcement learning (RL) agents often requires meticulous reward engineering. Preference-based RL methods are able to learn a more flexible reward model based on human preferences by actively incorporating…

Machine Learning · Computer Science 2022-05-26 Xinran Liang , Katherine Shu , Kimin Lee , Pieter Abbeel

Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning

To rapidly learn a new task, it is often essential for agents to explore efficiently -- especially when performance matters from the first timestep. One way to learn such behaviour is via meta-learning. Many existing methods however rely on…

Machine Learning · Computer Science 2021-06-11 Luisa Zintgraf , Leo Feng , Cong Lu , Maximilian Igl , Kristian Hartikainen , Katja Hofmann , Shimon Whiteson

On Hard Exploration for Reinforcement Learning: a Case Study in Pommerman

How to best explore in domains with sparse, delayed, and deceptive rewards is an important open problem for reinforcement learning (RL). This paper considers one such domain, the recently-proposed multi-agent benchmark of Pommerman. This…

Machine Learning · Computer Science 2019-07-30 Chao Gao , Bilal Kartal , Pablo Hernandez-Leal , Matthew E. Taylor

A Framework for History-Aware Hyperparameter Optimisation in Reinforcement Learning

A Reinforcement Learning (RL) system depends on a set of initial conditions (hyperparameters) that affect the system's performance. However, defining a good choice of hyperparameters is a challenging problem. Hyperparameter tuning often…

Machine Learning · Computer Science 2023-03-10 Juan Marcelo Parra-Ullauri , Chen Zhen , Antonio García-Domínguez , Nelly Bencomo , Changgang Zheng , Juan Boubeta-Puig , Guadalupe Ortiz , Shufan Yang

Smart Exploration in Reinforcement Learning using Bounded Uncertainty Models

Reinforcement learning (RL) is a powerful framework for decision-making in uncertain environments, but it often requires large amounts of data to learn an optimal policy. We address this challenge by incorporating prior model knowledge to…

Machine Learning · Computer Science 2026-01-29 J. S. van Hulst , W. P. M. H. Heemels , D. J. Antunes

A Survey of Exploration Methods in Reinforcement Learning

Exploration is an essential component of reinforcement learning algorithms, where agents need to learn how to predict and control unknown and often stochastic environments. Reinforcement learning agents depend crucially on exploration to…

Machine Learning · Computer Science 2021-09-03 Susan Amin , Maziar Gomrokchi , Harsh Satija , Herke van Hoof , Doina Precup

Diversity-Driven Exploration Strategy for Deep Reinforcement Learning

Efficient exploration remains a challenging research problem in reinforcement learning, especially when an environment contains large state spaces, deceptive local optima, or sparse rewards. To tackle this problem, we present a…

Artificial Intelligence · Computer Science 2018-10-30 Zhang-Wei Hong , Tzu-Yun Shann , Shih-Yang Su , Yi-Hsiang Chang , Chun-Yi Lee

Constrained Exploration and Recovery from Experience Shaping

We consider the problem of reinforcement learning under safety requirements, in which an agent is trained to complete a given task, typically formalized as the maximization of a reward signal over time, while concurrently avoiding…

Machine Learning · Computer Science 2018-09-25 Tu-Hoa Pham , Giovanni De Magistris , Don Joven Agravante , Subhajit Chaudhury , Asim Munawar , Ryuki Tachibana

Safe Exploration Method for Reinforcement Learning under Existence of Disturbance

Recent rapid developments in reinforcement learning algorithms have been giving us novel possibilities in many fields. However, due to their exploring property, we have to take the risk into consideration when we apply those algorithms to…

Machine Learning · Computer Science 2023-03-21 Yoshihiro Okawa , Tomotake Sasaki , Hitoshi Yanami , Toru Namerikawa