English
Related papers

Related papers: When should agents explore?

200 papers

Most exploration research on reinforcement learning (RL) has paid attention to `the way of exploration', which is `how to explore'. The other exploration research, `when to explore', has not been the main focus of RL exploration research.…

Artificial Intelligence · Computer Science 2024-05-07 JaeYoon Kim , Junyu Xuan , Christy Liang , Farookh Hussain

In this paper, a unified framework for exploration in reinforcement learning (RL) is proposed based on an option-critic model. The proposed framework learns to integrate a set of diverse exploration strategies so that the agent can…

Machine Learning · Computer Science 2024-09-10 Woojun Kim , Jeonghye Kim , Youngchul Sung

Recent advancements in agentic test-time scaling allow models to gather environmental feedback before committing to final actions. A key limitation of existing methods is that they typically employ undifferentiated exploration strategies,…

Artificial Intelligence · Computer Science 2026-05-13 Xingyuan Hua , Sheng Yue , Ju Ren

Reinforcement learning (RL) is a powerful machine learning technique that has been successfully applied to a wide variety of problems. However, it can be unpredictable and produce suboptimal results in complicated learning environments.…

Multiagent Systems · Computer Science 2024-11-19 Brian Mintz , Feng Fu

Incomplete knowledge of the environment leads an agent to make decisions under uncertainty. One of the major dilemmas in Reinforcement Learning (RL) where an autonomous agent has to balance two contrasting needs in making its decisions is:…

Machine Learning · Statistics 2024-02-21 Valentina Zangirolami , Matteo Borrotti

Reinforcement learning (RL) has become essential to the post-training of large language models (LLMs) for reasoning, agentic capabilities and alignment. Successful RL relies on sufficient exploration of diverse actions by the model during…

Exploration is essential for reinforcement learning (RL). To face the challenges of exploration, we consider a reward-free RL framework that completely separates exploration from exploitation and brings new challenges for exploration…

Machine Learning · Computer Science 2020-12-11 Chuheng Zhang , Yuanying Cai , Longbo Huang , Jian Li

In lifelong learning, an agent learns throughout its entire life without resets, in a constantly changing environment, as we humans do. Consequently, lifelong learning comes with a plethora of research problems such as continual domain…

Balancing exploration and exploitation is a central goal in reinforcement learning (RL). Despite recent advances in enhancing large language model (LLM) reasoning, most methods lean toward exploitation, and increasingly encounter…

Computation and Language · Computer Science 2025-11-11 Daixuan Cheng , Shaohan Huang , Xuekai Zhu , Bo Dai , Wayne Xin Zhao , Zhenliang Zhang , Furu Wei

In deep reinforcement learning (RL) research, there has been a concerted effort to design more efficient and productive exploration methods while solving sparse-reward problems. These exploration methods often share common principles (e.g.,…

Machine Learning · Computer Science 2024-04-04 Jonathan C. Balloch , Rishav Bhagat , Geigh Zollicoffer , Ruoran Jia , Julia Kim , Mark O. Riedl

Exploration in reinforcement learning (RL) remains an open challenge. RL algorithms rely on observing rewards to train the agent, and if informative rewards are sparse the agent learns slowly or may not learn at all. To improve exploration…

Machine Learning · Computer Science 2024-11-12 Simone Parisi , Alireza Kazemipour , Michael Bowling

The study of exploration in the domain of decision making has a long history but remains actively debated. From the vast literature that addressed this topic for decades under various points of view (e.g., developmental psychology,…

Machine Learning · Computer Science 2021-01-14 Léonard Hussenot , Robert Dadashi , Matthieu Geist , Olivier Pietquin

An effective approach to exploration in reinforcement learning is to rely on an agent's uncertainty over the optimal policy, which can yield near-optimal exploration strategies in tabular settings. However, in non-tabular settings that…

The exploration--exploitation trade-off in reinforcement learning (RL) is a well-known and much-studied problem that balances greedy action selection with novel experience, and the study of exploration methods is usually only considered in…

Machine Learning · Computer Science 2022-10-13 Jonathan C Balloch , Julia Kim , and Jessica L Inman , Mark O Riedl

Efficient exploration remains a challenging research problem in reinforcement learning, especially when an environment contains large state spaces, deceptive local optima, or sparse rewards. To tackle this problem, we present a…

Artificial Intelligence · Computer Science 2018-10-30 Zhang-Wei Hong , Tzu-Yun Shann , Shih-Yang Su , Yi-Hsiang Chang , Chun-Yi Lee

Saliency maps are frequently used to support explanations of the behavior of deep reinforcement learning (RL) agents. However, a review of how saliency maps are used in practice indicates that the derived explanations are often…

Machine Learning · Computer Science 2020-02-24 Akanksha Atrey , Kaleigh Clary , David Jensen

Exploration is an essential component of reinforcement learning algorithms, where agents need to learn how to predict and control unknown and often stochastic environments. Reinforcement learning agents depend crucially on exploration to…

Machine Learning · Computer Science 2021-09-03 Susan Amin , Maziar Gomrokchi , Harsh Satija , Herke van Hoof , Doina Precup

Recent advancements in large language models (LLMs) have expanded their capabilities beyond traditional text-based tasks to multimodal domains, integrating visual, auditory, and textual data. While multimodal LLMs have been extensively…

Artificial Intelligence · Computer Science 2024-12-03 Nicholas R. Waytowich , Devin White , MD Sunbeam , Vinicius G. Goecks

Much human and computational effort has aimed to improve how deep reinforcement learning algorithms perform on benchmarks such as the Atari Learning Environment. Comparatively less effort has focused on understanding what has been learned…

Neural and Evolutionary Computing · Computer Science 2019-05-31 Felipe Petroski Such , Vashisht Madhavan , Rosanne Liu , Rui Wang , Pablo Samuel Castro , Yulun Li , Jiale Zhi , Ludwig Schubert , Marc G. Bellemare , Jeff Clune , Joel Lehman

The exploration \& exploitation dilemma poses significant challenges in reinforcement learning (RL). Recently, curiosity-based exploration methods achieved great success in tackling hard-exploration problems. However, they necessitate…

Machine Learning · Computer Science 2024-12-06 Yiran Wang , Chenshu Liu , Yunfan Li , Sanae Amani , Bolei Zhou , Lin F. Yang
‹ Prev 1 2 3 10 Next ›