Related papers: When should agents explore?

An Autonomous Non-monolithic Agent with Multi-mode Exploration based on Options Framework

Most exploration research on reinforcement learning (RL) has paid attention to 'the way of exploration', which is 'how to explore'. The other exploration research, 'when to explore', has not been the main focus of RL exploration research.…

Artificial Intelligence · Computer Science 2024-05-07 JaeYoon Kim , Junyu Xuan , Christy Liang , Farookh Hussain

LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework

In this paper, a unified framework for exploration in reinforcement learning (RL) is proposed based on an option-critic model. The proposed framework learns to integrate a set of diverse exploration strategies so that the agent can…

Machine Learning · Computer Science 2024-09-10 Woojun Kim , Jeonghye Kim , Youngchul Sung

Learning to Explore: Scaling Agentic Reasoning via Exploration-Aware Policy Optimization

Recent advancements in agentic test-time scaling allow models to gather environmental feedback before committing to final actions. A key limitation of existing methods is that they typically employ undifferentiated exploration strategies,…

Artificial Intelligence · Computer Science 2026-05-13 Xingyuan Hua , Sheng Yue , Ju Ren

Evolutionary Multi-agent Reinforcement Learning in Group Social Dilemmas

Reinforcement learning (RL) is a powerful machine learning technique that has been successfully applied to a wide variety of problems. However, it can be unpredictable and produce suboptimal results in complicated learning environments.…

Multiagent Systems · Computer Science 2024-11-19 Brian Mintz , Feng Fu

Dealing with uncertainty: balancing exploration and exploitation in deep recurrent reinforcement learning

Incomplete knowledge of the environment leads an agent to make decisions under uncertainty. One of the major dilemmas in Reinforcement Learning (RL) where an autonomous agent has to balance two contrasting needs in making its decisions is:…

Machine Learning · Statistics 2024-02-21 Valentina Zangirolami , Matteo Borrotti

Exploration Hacking: Can LLMs Learn to Resist RL Training?

Reinforcement learning (RL) has become essential to the post-training of large language models (LLMs) for reasoning, agentic capabilities and alignment. Successful RL relies on sufficient exploration of diverse actions by the model during…

Machine Learning · Computer Science 2026-05-01 Eyon Jang , Damon Falck , Joschka Braun , Nathalie Kirch , Achu Menon , Perusha Moodley , Scott Emmons , Roland S. Zimmermann , David Lindner

Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework

Exploration is essential for reinforcement learning (RL). To face the challenges of exploration, we consider a reward-free RL framework that completely separates exploration from exploitation and brings new challenges for exploration…

Machine Learning · Computer Science 2020-12-11 Chuheng Zhang , Yuanying Cai , Longbo Huang , Jian Li

Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning

In lifelong learning, an agent learns throughout its entire life without resets, in a constantly changing environment, as we humans do. Consequently, lifelong learning comes with a plethora of research problems such as continual domain…

Machine Learning · Computer Science 2022-09-23 Christian Steinparz , Thomas Schmied , Fabian Paischer , Marius-Constantin Dinu , Vihang Patil , Angela Bitto-Nemling , Hamid Eghbal-zadeh , Sepp Hochreiter

Reasoning with Exploration: An Entropy Perspective

Balancing exploration and exploitation is a central goal in reinforcement learning (RL). Despite recent advances in enhancing large language model (LLM) reasoning, most methods lean toward exploitation, and increasingly encounter…

Computation and Language · Computer Science 2025-11-11 Daixuan Cheng , Shaohan Huang , Xuekai Zhu , Bo Dai , Wayne Xin Zhao , Zhenliang Zhang , Furu Wei

Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning

In deep reinforcement learning (RL) research, there has been a concerted effort to design more efficient and productive exploration methods while solving sparse-reward problems. These exploration methods often share common principles (e.g.,…

Machine Learning · Computer Science 2024-04-04 Jonathan C. Balloch , Rishav Bhagat , Geigh Zollicoffer , Ruoran Jia , Julia Kim , Mark O. Riedl

Beyond Optimism: Exploration With Partially Observable Rewards

Exploration in reinforcement learning (RL) remains an open challenge. RL algorithms rely on observing rewards to train the agent, and if informative rewards are sparse the agent learns slowly or may not learn at all. To improve exploration…

Machine Learning · Computer Science 2024-11-12 Simone Parisi , Alireza Kazemipour , Michael Bowling

Show me the Way: Intrinsic Motivation from Demonstrations

The study of exploration in the domain of decision making has a long history but remains actively debated. From the vast literature that addressed this topic for decades under various points of view (e.g., developmental psychology,…

Machine Learning · Computer Science 2021-01-14 Léonard Hussenot , Robert Dadashi , Matthieu Geist , Olivier Pietquin

Temporal Difference Uncertainties as a Signal for Exploration

An effective approach to exploration in reinforcement learning is to rely on an agent's uncertainty over the optimal policy, which can yield near-optimal exploration strategies in tabular settings. However, in non-tabular settings that…

Artificial Intelligence · Computer Science 2021-07-02 Sebastian Flennerhag , Jane X. Wang , Pablo Sprechmann , Francesco Visin , Alexandre Galashov , Steven Kapturowski , Diana L. Borsa , Nicolas Heess , Andre Barreto , Razvan Pascanu

The Role of Exploration for Task Transfer in Reinforcement Learning

The exploration-exploitation trade-off in reinforcement learning (RL) is a well-known and much-studied problem that balances greedy action selection with novel experience, and the study of exploration methods is usually only considered in…

Machine Learning · Computer Science 2022-10-13 Jonathan C Balloch , Julia Kim , Jessica L Inman , Mark O Riedl

Diversity-Driven Exploration Strategy for Deep Reinforcement Learning

Efficient exploration remains a challenging research problem in reinforcement learning, especially when an environment contains large state spaces, deceptive local optima, or sparse rewards. To tackle this problem, we present a…

Artificial Intelligence · Computer Science 2018-10-30 Zhang-Wei Hong , Tzu-Yun Shann , Shih-Yang Su , Yi-Hsiang Chang , Chun-Yi Lee

Exploratory Not Explanatory: Counterfactual Analysis of Saliency Maps for Deep Reinforcement Learning

Saliency maps are frequently used to support explanations of the behavior of deep reinforcement learning (RL) agents. However, a review of how saliency maps are used in practice indicates that the derived explanations are often…

Machine Learning · Computer Science 2020-02-24 Akanksha Atrey , Kaleigh Clary , David Jensen

A Survey of Exploration Methods in Reinforcement Learning

Exploration is an essential component of reinforcement learning algorithms, where agents need to learn how to predict and control unknown and often stochastic environments. Reinforcement learning agents depend crucially on exploration to…

Machine Learning · Computer Science 2021-09-03 Susan Amin , Maziar Gomrokchi , Harsh Satija , Herke van Hoof , Doina Precup

Atari-GPT: Benchmarking Multimodal Large Language Models as Low-Level Policies in Atari Games

Recent advancements in large language models (LLMs) have expanded their capabilities beyond traditional text-based tasks to multimodal domains, integrating visual, auditory, and textual data. While multimodal LLMs have been extensively…

Artificial Intelligence · Computer Science 2024-12-03 Nicholas R. Waytowich , Devin White , MD Sunbeam , Vinicius G. Goecks

An Atari Model Zoo for Analyzing, Visualizing, and Comparing Deep Reinforcement Learning Agents

Much human and computational effort has aimed to improve how deep reinforcement learning algorithms perform on benchmarks such as the Atari Learning Environment. Comparatively less effort has focused on understanding what has been learned…

Neural and Evolutionary Computing · Computer Science 2019-05-31 Felipe Petroski Such , Vashisht Madhavan , Rosanne Liu , Rui Wang , Pablo Samuel Castro , Yulun Li , Jiale Zhi , Ludwig Schubert , Marc G. Bellemare , Jeff Clune , Joel Lehman

Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning

The exploration & exploitation dilemma poses significant challenges in reinforcement learning (RL). Recently, curiosity-based exploration methods achieved great success in tackling hard-exploration problems. However, they necessitate…

Machine Learning · Computer Science 2024-12-06 Yiran Wang , Chenshu Liu , Yunfan Li , Sanae Amani , Bolei Zhou , Lin F. Yang