Related papers: State Action Separable Reinforcement Learning

Reactive Reinforcement Learning in Asynchronous Environments

The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or…

Artificial Intelligence · Computer Science 2018-06-29 Jaden B. Travnik , Kory W. Mathewson , Richard S. Sutton , Patrick M. Pilarski

State-wise Safe Reinforcement Learning: A Survey

Despite the tremendous success of Reinforcement Learning (RL) algorithms in simulation environments, applying RL to real-world applications still faces many challenges. A major concern is safety, in another word, constraint satisfaction.…

Machine Learning · Computer Science 2023-07-04 Weiye Zhao , Tairan He , Rui Chen , Tianhao Wei , Changliu Liu

Representation Learning for Continuous Action Spaces is Beneficial for Efficient Policy Learning

Deep reinforcement learning (DRL) breaks through the bottlenecks of traditional reinforcement learning (RL) with the help of the perception capability of deep learning and has been widely applied in real-world problems.While model-free RL,…

Machine Learning · Computer Science 2022-11-28 Tingting Zhao , Ying Wang , Wei Sun , Yarui Chen , Gang Niub , Masashi Sugiyama

Efficient Policy Learning for Non-Stationary MDPs under Adversarial Manipulation

A Markov Decision Process (MDP) is a popular model for reinforcement learning. However, its commonly used assumption of stationary dynamics and rewards is too stringent and fails to hold in adversarial, nonstationary, or multi-agent…

Machine Learning · Computer Science 2019-08-22 Tiancheng Yu , Suvrit Sra

Social Interpretable Reinforcement Learning

Reinforcement Learning (RL) bears the promise of being a game-changer in many applications. However, since most of the literature in the field is currently focused on opaque models, the use of RL in high-stakes scenarios, where…

Machine Learning · Computer Science 2025-01-22 Leonardo Lucio Custode , Giovanni Iacca

Safe Reinforcement Learning Using Advantage-Based Intervention

Many sequential decision problems involve finding a policy that maximizes total reward while obeying safety constraints. Although much recent research has focused on the development of safe reinforcement learning (RL) algorithms that…

Machine Learning · Computer Science 2021-07-20 Nolan Wagener , Byron Boots , Ching-An Cheng

Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act

Traditionally, Reinforcement Learning (RL) aims at deciding how to act optimally for an artificial agent. We argue that deciding when to act is equally important. As humans, we drift from default, instinctive or memorized behaviors to…

Machine Learning · Computer Science 2022-03-17 Alexis Jacq , Johan Ferret , Olivier Pietquin , Matthieu Geist

A generalized stacked reinforcement learning method for sampled systems

A common setting of reinforcement learning (RL) is a Markov decision process (MDP) in which the environment is a stochastic discrete-time dynamical system. Whereas MDPs are suitable in such applications as video-games or puzzles, physical…

Robotics · Computer Science 2022-11-29 Pavel Osinenko , Dmitrii Dobriborsci , Grigory Yaremenko , Georgiy Malaniya

An Experimental Design Perspective on Model-Based Reinforcement Learning

In many practical applications of RL, it is expensive to observe state transitions from the environment. For example, in the problem of plasma control for nuclear fusion, computing the next state for a given state-action pair requires…

Machine Learning · Computer Science 2022-03-16 Viraj Mehta , Biswajit Paria , Jeff Schneider , Stefano Ermon , Willie Neiswanger

Multi-Agent Reinforcement Learning: A Report on Challenges and Approaches

Reinforcement Learning (RL) is a learning paradigm concerned with learning to control a system so as to maximize an objective over the long term. This approach to learning has received immense interest in recent times and success manifests…

Artificial Intelligence · Computer Science 2018-07-26 Sanyam Kapoor

Constrained Markov Decision Processes via Backward Value Functions

Although Reinforcement Learning (RL) algorithms have found tremendous success in simulated domains, they often cannot directly be applied to physical systems, especially in cases where there are hard constraints to satisfy (e.g. on safety…

Machine Learning · Computer Science 2020-08-28 Harsh Satija , Philip Amortila , Joelle Pineau

Inverse Reinforcement Learning with Simultaneous Estimation of Rewards and Dynamics

Inverse Reinforcement Learning (IRL) describes the problem of learning an unknown reward function of a Markov Decision Process (MDP) from observed behavior of an agent. Since the agent's behavior originates in its policy and MDP policies…

Artificial Intelligence · Computer Science 2016-04-14 Michael Herman , Tobias Gindele , Jörg Wagner , Felix Schmitt , Wolfram Burgard

Planning and Learning with Stochastic Action Sets

In many practical uses of reinforcement learning (RL) the set of actions available at a given state is a random variable, with realizations governed by an exogenous stochastic process. Somewhat surprisingly, the foundations for such…

Artificial Intelligence · Computer Science 2021-02-16 Craig Boutilier , Alon Cohen , Amit Daniely , Avinatan Hassidim , Yishay Mansour , Ofer Meshi , Martin Mladenov , Dale Schuurmans

Provable Multi-Task Reinforcement Learning: A Representation Learning Framework with Low Rank Rewards

Multi-task representation learning (MTRL) is an approach that learns shared latent representations across related tasks, facilitating collaborative learning that improves the overall learning efficiency. This paper studies MTRL for…

Machine Learning · Computer Science 2026-04-07 Yaoze Guo , Shana Moothedath

Deep Reinforcement Learning Discovers Internal Models

Deep Reinforcement Learning (DRL) is a trending field of research, showing great promise in challenging problems such as playing Atari, solving Go and controlling robots. While DRL agents perform well in practice we are still lacking the…

Artificial Intelligence · Computer Science 2016-06-17 Nir Baram , Tom Zahavy , Shie Mannor

A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning

While reinforcement learning (RL) provides a framework for learning through trial and error, translating RL algorithms into the real world has remained challenging. A major hurdle to real-world application arises from the development of…

Machine Learning · Computer Science 2022-05-12 Archit Sharma , Rehaan Ahmad , Chelsea Finn

Low-Dimensional State and Action Representation Learning with MDP Homomorphism Metrics

Deep Reinforcement Learning has shown its ability in solving complicated problems directly from high-dimensional observations. However, in end-to-end settings, Reinforcement Learning algorithms are not sample-efficient and requires long…

Machine Learning · Computer Science 2021-07-06 Nicolò Botteghi , Mannes Poel , Beril Sirmacek , Christoph Brune

Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization

Reinforcement learning (RL) has demonstrated impressive performance in decision-making tasks like embodied control, autonomous driving and financial trading. In many decision-making tasks, the agents often encounter the problem of executing…

Machine Learning · Computer Science 2024-07-23 Jing-Cheng Pang , Tian Xu , Shengyi Jiang , Yu-Ren Liu , Yang Yu

Expert-Free Online Transfer Learning in Multi-Agent Reinforcement Learning

Reinforcement Learning (RL) enables an intelligent agent to optimise its performance in a task by continuously taking action from an observed state and receiving a feedback from the environment in form of rewards. RL typically uses tables…

Artificial Intelligence · Computer Science 2025-01-28 Alberto Castagna

Learning Abstract Models for Strategic Exploration and Fast Reward Transfer

Model-based reinforcement learning (RL) is appealing because (i) it enables planning and thus more strategic exploration, and (ii) by decoupling dynamics from rewards, it enables fast transfer to new reward functions. However, learning an…

Machine Learning · Computer Science 2020-07-14 Evan Zheran Liu , Ramtin Keramati , Sudarshan Seshadri , Kelvin Guu , Panupong Pasupat , Emma Brunskill , Percy Liang