Related papers: Agent Spaces

A Survey of Exploration Methods in Reinforcement Learning

Exploration is an essential component of reinforcement learning algorithms, where agents need to learn how to predict and control unknown and often stochastic environments. Reinforcement learning agents depend crucially on exploration to…

Machine Learning · Computer Science 2021-09-03 Susan Amin , Maziar Gomrokchi , Harsh Satija , Herke van Hoof , Doina Precup

Exploration and Incentives in Reinforcement Learning

How do you incentivize self-interested agents to $\textit{explore}$ when they prefer to $\textit{exploit}$? We consider complex exploration problems, where each agent faces the same (but unknown) MDP. In contrast with traditional…

Machine Learning · Computer Science 2023-02-21 Max Simchowitz , Aleksandrs Slivkins

Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets

The objective of a reinforcement learning agent is to discover better actions through exploration. However, typical exploration techniques aim to maximize rewards, often incurring high costs in both exploration and learning processes. We…

Machine Learning · Computer Science 2024-12-24 Akane Tsuboya , Yu Kono , Tatsuji Takahashi

Reinforcement Learning Driven Multi-Robot Exploration via Explicit Communication and Density-Based Frontier Search

Collaborative multi-agent exploration of unknown environments is crucial for search and rescue operations. Effective real-world deployment must address challenges such as limited inter-agent communication and static and dynamic obstacles.…

Robotics · Computer Science 2024-12-31 Gabriele Calzolari , Vidya Sumathy , Christoforos Kanellakis , George Nikolakopoulos

Constrained Exploration in Reinforcement Learning with Optimality Preservation

We consider a class of reinforcement-learning systems in which the agent follows a behavior policy to explore a discrete state-action space to find an optimal policy while adhering to some restriction on its behavior. Such restriction may…

Machine Learning · Computer Science 2023-04-07 Peter C. Y. Chen

Satisficing Exploration for Deep Reinforcement Learning

A default assumption in the design of reinforcement-learning algorithms is that a decision-making agent always explores to learn optimal behavior. In sufficiently complex environments that approach the vastness and scale of the real world,…

Machine Learning · Computer Science 2024-07-23 Dilip Arumugam , Saurabh Kumar , Ramki Gummadi , Benjamin Van Roy

Meta-Learning to Explore via Memory Density Feedback

Exploration algorithms for reinforcement learning typically replace or augment the reward function with an additional ``intrinsic'' reward that trains the agent to seek previously unseen states of the environment. Here, we consider an…

Machine Learning · Computer Science 2025-09-30 Kevin McKee , Eric Alt , Andrew Grebenisan , Mick van Gelderen , Gary Miguel

A Theory of Abstraction in Reinforcement Learning

Reinforcement learning defines the problem facing agents that learn to make good decisions through action and observation alone. To be effective problem solvers, such agents must efficiently explore vast worlds, assign credit from delayed…

Machine Learning · Computer Science 2022-03-02 David Abel

Reinforcement Learning: A Survey

This paper surveys the field of reinforcement learning from a computer-science perspective. It is written to be accessible to researchers familiar with machine learning. Both the historical basis of the field and a broad selection of…

Artificial Intelligence · Computer Science 2014-11-17 L. P. Kaelbling , M. L. Littman , A. W. Moore

A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning

In this paper we consider the problem of how a reinforcement learning agent that is tasked with solving a sequence of reinforcement learning problems (a sequence of Markov decision processes) can use knowledge acquired early in its lifetime…

Machine Learning · Computer Science 2019-02-05 Francisco M. Garcia , Philip S. Thomas

Learning to Explore by Reinforcement over High-Level Options

Autonomous 3D environment exploration is a fundamental task for various applications such as navigation. The goal of exploration is to investigate a new environment and build its occupancy map efficiently. In this paper, we propose a new…

Artificial Intelligence · Computer Science 2021-11-03 Liu Juncheng , McCane Brendan , Mills Steven

An agent design with goal reaching guarantees for enhancement of learning

Reinforcement learning is commonly concerned with problems of maximizing accumulated rewards in Markov decision processes. Oftentimes, a certain goal state or a subset of the state space attain maximal reward. In such a case, the…

Artificial Intelligence · Computer Science 2024-08-23 Pavel Osinenko , Grigory Yaremenko , Georgiy Malaniya , Anton Bolychev , Alexander Gepperth

Open-Ended Learning Leads to Generally Capable Agents

In this work we create agents that can perform well beyond a single, individual task, that exhibit much wider generalisation of behaviour to a massive, rich space of challenges. We define a universe of tasks within an environment domain and…

Machine Learning · Computer Science 2021-08-03 Open Ended Learning Team , Adam Stooke , Anuj Mahajan , Catarina Barros , Charlie Deck , Jakob Bauer , Jakub Sygnowski , Maja Trebacz , Max Jaderberg , Michael Mathieu , Nat McAleese , Nathalie Bradley-Schmieg , Nathaniel Wong , Nicolas Porcel , Roberta Raileanu , Steph Hughes-Fitt , Valentin Dalibard , Wojciech Marian Czarnecki

Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning

One of the remaining challenges in reinforcement learning is to develop agents that can generalise to novel scenarios they might encounter once deployed. This challenge is often framed in a multi-task setting where agents train on a fixed…

Machine Learning · Computer Science 2024-09-19 Max Weltevrede , Felix Kaubek , Matthijs T. J. Spaan , Wendelin Böhmer

Constrained Exploration and Recovery from Experience Shaping

We consider the problem of reinforcement learning under safety requirements, in which an agent is trained to complete a given task, typically formalized as the maximization of a reward signal over time, while concurrently avoiding…

Machine Learning · Computer Science 2018-09-25 Tu-Hoa Pham , Giovanni De Magistris , Don Joven Agravante , Subhajit Chaudhury , Asim Munawar , Ryuki Tachibana

Explainable Reinforcement Learning via Model Transforms

Understanding emerging behaviors of reinforcement learning (RL) agents may be difficult since such agents are often trained in complex environments using highly complex decision making procedures. This has given rise to a variety of…

Artificial Intelligence · Computer Science 2022-12-02 Mira Finkelstein , Lucy Liu , Nitsan Levy Schlot , Yoav Kolumbus , David C. Parkes , Jeffrey S. Rosenshein , Sarah Keren

Disentangling Exploration of Large Language Models by Optimal Exploitation

Exploration is a crucial skill for in-context reinforcement learning in unknown environments. However, it remains unclear if large language models can effectively explore a partially hidden state space. This work isolates exploration as the…

Machine Learning · Computer Science 2025-08-26 Tim Grams , Patrick Betz , Sascha Marton , Stefan Lüdtke , Christian Bartelt

A Short Survey on Probabilistic Reinforcement Learning

A reinforcement learning agent tries to maximize its cumulative payoff by interacting in an unknown environment. It is important for the agent to explore suboptimal actions as well as to pick actions with highest known rewards. Yet, in…

Machine Learning · Computer Science 2019-01-23 Reazul Hasan Russel

Reinforcement Unlearning

Machine unlearning refers to the process of mitigating the influence of specific training data on machine learning models based on removal requests from data owners. However, one important area that has been largely overlooked in the…

Cryptography and Security · Computer Science 2025-07-17 Dayong Ye , Tianqing Zhu , Congcong Zhu , Derui Wang , Kun Gao , Zewei Shi , Sheng Shen , Wanlei Zhou , Minhui Xue

Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain

Deep Reinforcement Learning (DRL) and Deep Multi-agent Reinforcement Learning (MARL) have achieved significant successes across a wide range of domains, including game AI, autonomous vehicles, robotics, and so on. However, DRL and deep MARL…

Artificial Intelligence · Computer Science 2023-02-03 Jianye Hao , Tianpei Yang , Hongyao Tang , Chenjia Bai , Jinyi Liu , Zhaopeng Meng , Peng Liu , Zhen Wang