Related papers: Safe Continuous Control with Constrained Model-Bas…

Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm

During initial iterations of training in most Reinforcement Learning (RL) algorithms, agents perform a significant number of random exploratory steps. In the real world, this can limit the practicality of these algorithms as it can lead to…

Machine Learning · Computer Science 2022-10-17 Ashish Kumar Jayant , Shalabh Bhatnagar

Learning-based Model Predictive Control for Safe Exploration and Reinforcement Learning

Reinforcement learning has been successfully used to solve difficult tasks in complex unknown environments. However, these methods typically do not provide any safety guarantees during the learning process. This is particularly problematic,…

Systems and Control · Electrical Eng. & Systems 2019-07-02 Torsten Koller , Felix Berkenkamp , Matteo Turchetta , Joschka Boedecker , Andreas Krause

Sampling-Based Safe Reinforcement Learning

Safe exploration remains a fundamental challenge in reinforcement learning (RL), limiting the deployment of RL agents in the real world. We propose Sampling-Based Safe Reinforcement Learning (SBSRL), a model-based RL algorithm that…

Machine Learning · Computer Science 2026-05-20 Luca Vignola , Bruce D. Lee , Manish Prajapat , Manuel Wendl , Melanie Zeilinger , Andreas Krause , Yarden As

Constrained Variational Policy Optimization for Safe Reinforcement Learning

Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before deploying them to safety-critical applications. Previous primal-dual style approaches suffer from instability issues and lack optimality…

Machine Learning · Computer Science 2022-06-20 Zuxin Liu , Zhepeng Cen , Vladislav Isenbaev , Wei Liu , Zhiwei Steven Wu , Bo Li , Ding Zhao

Efficient Exploration Using Extra Safety Budget in Constrained Policy Optimization

Reinforcement learning (RL) has achieved promising results on most robotic control tasks. Safety of learning-based controllers is an essential notion of ensuring the effectiveness of the controllers. Current methods adopt whole consistency…

Robotics · Computer Science 2023-07-31 Haotian Xu , Shengjie Wang , Zhaolei Wang , Yunzhe Zhang , Qing Zhuo , Yang Gao , Tao Zhang

Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles

Recently, safe reinforcement learning (RL) with the actor-critic structure for continuous control tasks has received increasing attention. It is still challenging to learn a near-optimal control policy with safety and convergence…

Machine Learning · Computer Science 2024-02-06 Xinglong Zhang , Yaoqian Peng , Biao Luo , Wei Pan , Xin Xu , Haibin Xie

Safe Planning and Policy Optimization via World Model Learning

Reinforcement Learning (RL) applications in real-world scenarios must prioritize safety and reliability, which impose strict constraints on agent behavior. Model-based RL leverages predictive world models for action planning and policy…

Artificial Intelligence · Computer Science 2025-06-06 Artem Latyshev , Gregory Gorbov , Aleksandr I. Panov

Constrained Model-based Reinforcement Learning with Robust Cross-Entropy Method

This paper studies the constrained/safe reinforcement learning (RL) problem with sparse indicator signals for constraint violations. We propose a model-based approach to enable RL agents to effectively explore the environment with unknown…

Artificial Intelligence · Computer Science 2021-03-09 Zuxin Liu , Hongyi Zhou , Baiming Chen , Sicheng Zhong , Martial Hebert , Ding Zhao

Safety Optimized Reinforcement Learning via Multi-Objective Policy Optimization

Safe reinforcement learning (Safe RL) refers to a class of techniques that aim to prevent RL algorithms from violating constraints in the process of decision-making and exploration during trial and error. In this paper, a novel model-free…

Systems and Control · Electrical Eng. & Systems 2024-08-14 Homayoun Honari , Mehran Ghafarian Tamizi , Homayoun Najjaran

Exploration via Planning for Information about the Optimal Trajectory

Many potential applications of reinforcement learning (RL) are stymied by the large numbers of samples required to learn an effective policy. This is especially true when applying RL to real-world control tasks, e.g. in the sciences or…

Machine Learning · Computer Science 2022-10-11 Viraj Mehta , Ian Char , Joseph Abbate , Rory Conlin , Mark D. Boyer , Stefano Ermon , Jeff Schneider , Willie Neiswanger

Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs

Many physical systems have underlying safety considerations that require that the policy employed ensures the satisfaction of a set of constraints. The analytical formulation usually takes the form of a Constrained Markov Decision Process…

Machine Learning · Computer Science 2021-03-03 Aria HasanzadeZonuzy , Archana Bura , Dileep Kalathil , Srinivas Shakkottai

Model-Free Active Exploration in Reinforcement Learning

We study the problem of exploration in Reinforcement Learning and present a novel model-free solution. We adopt an information-theoretical viewpoint and start from the instance-specific lower bound of the number of samples that have to be…

Machine Learning · Computer Science 2024-07-02 Alessio Russo , Alexandre Proutiere

Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis

This paper addresses the problem of maintaining safety during training in Reinforcement Learning (RL), such that the safety constraint violations are bounded at any point during learning. In a variety of RL applications the safety of the…

Machine Learning · Computer Science 2023-12-19 Rohan Mitta , Hosein Hasanbeig , Jun Wang , Daniel Kroening , Yiannis Kantaros , Alessandro Abate

Conservative Safety Critics for Exploration

Safe exploration presents a major challenge in reinforcement learning (RL): when active data collection requires deploying partially trained policies, we must ensure that these policies avoid catastrophically unsafe regions, while still…

Machine Learning · Computer Science 2021-04-27 Homanga Bharadhwaj , Aviral Kumar , Nicholas Rhinehart , Sergey Levine , Florian Shkurti , Animesh Garg

Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones

Safety remains a central obstacle preventing widespread use of RL in the real world: learning new tasks in uncertain environments requires extensive exploration, but safety requires limiting exploration. We propose Recovery RL, an algorithm…

Machine Learning · Computer Science 2021-05-19 Brijen Thananjeyan , Ashwin Balakrishna , Suraj Nair , Michael Luo , Krishnan Srinivasan , Minho Hwang , Joseph E. Gonzalez , Julian Ibarz , Chelsea Finn , Ken Goldberg

Off-Policy Safe Reinforcement Learning with Constrained Optimistic Exploration

When safety is formulated as a limit of cumulative cost, safe reinforcement learning (RL) aims to learn policies that maximize return subject to the cost constraint in data collection and deployment. Off-policy safe RL methods, although…

Machine Learning · Computer Science 2026-03-26 Guopeng Li , Matthijs T. J. Spaan , Julian F. P. Kooij

Explicit Explore-Exploit Algorithms in Continuous State Spaces

We present a new model-based algorithm for reinforcement learning (RL) which consists of explicit exploration and exploitation phases, and is applicable in large or infinite state spaces. The algorithm maintains a set of dynamics models…

Machine Learning · Computer Science 2019-12-03 Mikael Henaff

Safe reinforcement learning for probabilistic reachability and safety specifications: A Lyapunov-based approach

Emerging applications in robotics and autonomous systems, such as autonomous driving and robotic surgery, often involve critical safety constraints that must be satisfied even when information about system models is limited. In this regard,…

Robotics · Computer Science 2020-02-25 Subin Huh , Insoon Yang

ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning

Reinforcement learning (RL) is ubiquitous in the development of modern AI systems. However, state-of-the-art RL agents require extensive, and potentially unsafe, interactions with their environments to learn effectively. These limitations…

Machine Learning · Computer Science 2025-08-01 Yarden As , Bhavya Sukhija , Lenart Treven , Carmelo Sferrazza , Stelian Coros , Andreas Krause

Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving

Reinforcement learning (RL) has been widely used in decision-making and control tasks, but the risk is very high for the agent in the training process due to the requirements of interaction with the environment, which seriously limits its…

Machine Learning · Computer Science 2024-09-13 Xuemin Hu , Pan Chen , Yijun Wen , Bo Tang , Long Chen