Related papers: Safe Reinforcement Learning Using Advantage-Based …

Safety Optimized Reinforcement Learning via Multi-Objective Policy Optimization

Safe reinforcement learning (Safe RL) refers to a class of techniques that aim to prevent RL algorithms from violating constraints in the process of decision-making and exploration during trial and error. In this paper, a novel model-free…

Systems and Control · Electrical Eng. & Systems 2024-08-14 Homayoun Honari , Mehran Ghafarian Tamizi , Homayoun Najjaran

Probabilistic Shielding for Safe Reinforcement Learning

In real-life scenarios, a Reinforcement Learning (RL) agent aiming to maximise their reward, must often also behave in a safe manner, including at training time. Thus, much attention in recent years has been given to Safe RL, where an agent…

Machine Learning · Statistics 2025-03-26 Edwin Hamel-De le Court , Francesco Belardinelli , Alexander W. Goodall

Reinforcement Learning in a Safety-Embedded MDP with Trajectory Optimization

Safe Reinforcement Learning (RL) plays an important role in applying RL algorithms to safety-critical real-world applications, addressing the trade-off between maximizing rewards and adhering to safety constraints. This work introduces a…

Robotics · Computer Science 2024-07-16 Fan Yang , Wenxuan Zhou , Zuxin Liu , Ding Zhao , David Held

Safe Reinforcement Learning via Probabilistic Shields

This paper targets the efficient construction of a safety shield for decision making in scenarios that incorporate uncertainty. Markov decision processes (MDPs) are prominent models to capture such planning problems. Reinforcement learning…

Artificial Intelligence · Computer Science 2019-11-26 Nils Jansen , Bettina Könighofer , Sebastian Junges , Alexandru C. Serban , Roderick Bloem

Provably Optimal Reinforcement Learning under Safety Filtering

Recent advances in reinforcement learning (RL) enable its use on increasingly complex tasks, but the lack of formal safety guarantees still limits its application in safety-critical settings. A common practical approach is to augment the RL…

Machine Learning · Computer Science 2026-02-12 Donggeon David Oh , Duy P. Nguyen , Haimin Hu , Jaime F. Fisac

Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles

Recently, safe reinforcement learning (RL) with the actor-critic structure for continuous control tasks has received increasing attention. It is still challenging to learn a near-optimal control policy with safety and convergence…

Machine Learning · Computer Science 2024-02-06 Xinglong Zhang , Yaoqian Peng , Biao Luo , Wei Pan , Xin Xu , Haibin Xie

A Survey of Safe Reinforcement Learning and Constrained MDPs: A Technical Survey on Single-Agent and Multi-Agent Safety

Safe Reinforcement Learning (SafeRL) is the subfield of reinforcement learning that explicitly deals with safety constraints during the learning and deployment of agents. This survey provides a mathematically rigorous overview of SafeRL…

Machine Learning · Computer Science 2026-04-30 Ankita Kushwaha , Kiran Ravish , Preeti Lamba , Pawan Kumar

Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm

During initial iterations of training in most Reinforcement Learning (RL) algorithms, agents perform a significant number of random exploratory steps. In the real world, this can limit the practicality of these algorithms as it can lead to…

Machine Learning · Computer Science 2022-10-17 Ashish Kumar Jayant , Shalabh Bhatnagar

Learning over Forward-Invariant Policy Classes: Reinforcement Learning without Safety Concerns

This paper proposes a safe reinforcement learning (RL) framework based on forward-invariance-induced action-space design. The control problem is cast as a Markov decision process, but instead of relying on runtime shielding or penalty-based…

Systems and Control · Electrical Eng. & Systems 2026-04-10 Chieh Tsai , Muhammad Junayed Hasan Zahed , Salim Hariri , Hossein Rastgoftar

Constrained Markov Decision Processes via Backward Value Functions

Although Reinforcement Learning (RL) algorithms have found tremendous success in simulated domains, they often cannot directly be applied to physical systems, especially in cases where there are hard constraints to satisfy (e.g. on safety…

Machine Learning · Computer Science 2020-08-28 Harsh Satija , Philip Amortila , Joelle Pineau

Safe reinforcement learning control for continuous-time nonlinear systems without a backup controller

This paper proposes an on-policy reinforcement learning (RL) control algorithm that solves the optimal regulation problem for a class of uncertain continuous-time nonlinear systems under user-defined state constraints. We formulate the safe…

Systems and Control · Electrical Eng. & Systems 2022-09-20 Soutrik Bandyopadhyay , Shubhendu Bhasin

A Lyapunov-based Approach to Safe Reinforcement Learning

In many real-world reinforcement learning (RL) problems, besides optimizing the main objective function, an agent must concurrently avoid violating a number of constraints. In particular, besides optimizing performance it is crucial to…

Machine Learning · Computer Science 2018-05-22 Yinlam Chow , Ofir Nachum , Edgar Duenez-Guzman , Mohammad Ghavamzadeh

Safe Distributional Reinforcement Learning

Safety in reinforcement learning (RL) is a key property in both training and execution in many domains such as autonomous driving or finance. In this paper, we formalize it with a constrained RL formulation in the distributional RL setting.…

Machine Learning · Computer Science 2021-03-01 Jianyi Zhang , Paul Weng

Sampling-Based Safe Reinforcement Learning

Safe exploration remains a fundamental challenge in reinforcement learning (RL), limiting the deployment of RL agents in the real world. We propose Sampling-Based Safe Reinforcement Learning (SBSRL), a model-based RL algorithm that…

Machine Learning · Computer Science 2026-05-20 Luca Vignola , Bruce D. Lee , Manish Prajapat , Manuel Wendl , Melanie Zeilinger , Andreas Krause , Yarden As

Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning

In safe offline reinforcement learning (RL), the objective is to develop a policy that maximizes cumulative rewards while strictly adhering to safety constraints, utilizing only offline data. Traditional methods often face difficulties in…

Machine Learning · Computer Science 2026-02-11 Prajwal Koirala , Zhanhong Jiang , Soumik Sarkar , Cody Fleming

Constrained Variational Policy Optimization for Safe Reinforcement Learning

Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before deploying them to safety-critical applications. Previous primal-dual style approaches suffer from instability issues and lack optimality…

Machine Learning · Computer Science 2022-06-20 Zuxin Liu , Zhepeng Cen , Vladislav Isenbaev , Wei Liu , Zhiwei Steven Wu , Bo Li , Ding Zhao

Safe Reinforcement Learning with Chance-constrained Model Predictive Control

Real-world reinforcement learning (RL) problems often demand that agents behave safely by obeying a set of designed constraints. We address the challenge of safe RL by coupling a safety guide based on model predictive control (MPC) with a…

Machine Learning · Computer Science 2022-03-30 Samuel Pfrommer , Tanmay Gautam , Alec Zhou , Somayeh Sojoudi

Learning Safe Autonomous Driving Policies Using Predictive Safety Representations

Safe reinforcement learning (SafeRL) is a prominent paradigm for autonomous driving, where agents are required to optimize performance under strict safety requirements. This dual objective creates a fundamental tension, as overly…

Machine Learning · Computer Science 2025-12-24 Mahesh Keswani , Raunak Bhattacharyya

Safe Reinforcement Learning via Hierarchical Adaptive Chance-Constraint Safeguards

Ensuring safety in Reinforcement Learning (RL), typically framed as a Constrained Markov Decision Process (CMDP), is crucial for real-world exploration applications. Current approaches in handling CMDP struggle to balance optimality and…

Robotics · Computer Science 2024-03-07 Zhaorun Chen , Zhuokai Zhao , Tairan He , Binhao Chen , Xuhao Zhao , Liang Gong , Chengliang Liu

Evaluating Model-free Reinforcement Learning toward Safety-critical Tasks

Safety comes first in many real-world applications involving autonomous agents. Despite a large number of reinforcement learning (RL) methods focusing on safety-critical tasks, there is still a lack of high-quality evaluation of those…

Machine Learning · Computer Science 2022-12-13 Linrui Zhang , Qin Zhang , Li Shen , Bo Yuan , Xueqian Wang , Dacheng Tao