Related papers: Solving Parameter-Robust Avoid Problems with Unkno…

Iterative Reachability Estimation for Safe Reinforcement Learning

Ensuring safety is important for the practical deployment of reinforcement learning (RL). Various challenges must be addressed, such as handling stochasticity in the environments, providing rigorous guarantees of persistent state-wise…

Machine Learning · Computer Science 2023-09-26 Milan Ganai , Zheng Gong , Chenning Yu , Sylvia Herbert , Sicun Gao

Reachability Constrained Reinforcement Learning

Constrained reinforcement learning (CRL) has gained significant interest recently, since safety constraints satisfaction is critical for real-world problems. However, existing CRL methods constraining discounted cumulative costs generally…

Machine Learning · Computer Science 2022-06-08 Dongjie Yu , Haitong Ma , Shengbo Eben Li , Jianyu Chen

Feasible Policy Iteration for Safe Reinforcement Learning

Safety is the priority concern when applying reinforcement learning (RL) algorithms to real-world control problems. While policy iteration provides a fundamental algorithm for standard RL, an analogous theoretical algorithm for safe RL…

Machine Learning · Computer Science 2025-03-14 Yujie Yang , Zhilong Zheng , Shengbo Eben Li , Wei Xu , Jingjing Liu , Xianyuan Zhan , Ya-Qin Zhang

Reinforcement Learning for Reachability: Guaranteeing Asymptotic Optimality

Reinforcement learning (RL) for reachability specifications is fundamental in sequential decision-making, yet theoretical guarantees remain less explored. A recent work achieves asymptotic convergence to optimal policies. However, this…

Machine Learning · Computer Science 2026-05-26 Amogh Palasamudram , Jakub Svoboda , Suguman Bansal , Krishnendu Chatterjee

The Feasibility Theory of Constrained Reinforcement Learning: A Tutorial Study

Satisfying safety constraints is a priority concern when solving optimal control problems (OCPs). Due to the existence of infeasibility phenomenon, where a constraint-satisfying solution cannot be found, it is necessary to identify a…

Systems and Control · Electrical Eng. & Systems 2026-04-21 Yujie Yang , Zhilong Zheng , Masayoshi Tomizuka , Changliu Liu , Shengbo Eben Li

Rule-Guided Reinforcement Learning Policy Evaluation and Improvement

We consider the challenging problem of using domain knowledge to improve deep reinforcement learning policies. To this end, we propose LEGIBLE, a novel approach, following a multi-step process, which starts by mining rules from a deep RL…

Machine Learning · Computer Science 2025-03-13 Martin Tappler , Ignacio D. Lopez-Miguel , Sebastian Tschiatschek , Ezio Bartocci

Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments

Robust reinforcement learning (RL) considers the problem of learning policies that perform well in the worst case among a set of possible environment parameter values. In real-world environments, choosing the set of possible values for…

Machine Learning · Computer Science 2022-10-05 JB Lanier , Stephen McAleer , Pierre Baldi , Roy Fox

Learning Reusable Options for Multi-Task Reinforcement Learning

Reinforcement learning (RL) has become an increasingly active area of research in recent years. Although there are many algorithms that allow an agent to solve tasks efficiently, they often ignore the possibility that prior experience…

Artificial Intelligence · Computer Science 2020-01-07 Francisco M. Garcia , Chris Nota , Philip S. Thomas

On the Equilibrium between Feasible Zone and Uncertain Model in Safe Exploration

Ensuring the safety of environmental exploration is a critical problem in reinforcement learning (RL). While limiting exploration to a feasible zone has become widely accepted as a way to ensure safety, key questions remain unresolved: what…

Machine Learning · Computer Science 2026-02-05 Yujie Yang , Zhilong Zheng , Shengbo Eben Li

Learning to Generate All Feasible Actions

Modern cyber-physical systems are becoming increasingly complex to model, thus motivating data-driven techniques such as reinforcement learning (RL) to find appropriate control agents. However, most systems are subject to hard constraints…

Machine Learning · Computer Science 2024-07-08 Mirco Theile , Daniele Bernardini , Raphael Trumpp , Cristina Piazza , Marco Caccamo , Alberto L. Sangiovanni-Vincentelli

Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety

The safety constraints commonly used by existing safe reinforcement learning (RL) methods are defined only on expectation of initial states, but allow each certain state to be unsafe, which is unsatisfying for real-world safety-critical…

Machine Learning · Computer Science 2021-06-15 Haitong Ma , Yang Guan , Shegnbo Eben Li , Xiangteng Zhang , Sifa Zheng , Jianyu Chen

Smart Exploration in Reinforcement Learning using Bounded Uncertainty Models

Reinforcement learning (RL) is a powerful framework for decision-making in uncertain environments, but it often requires large amounts of data to learn an optimal policy. We address this challenge by incorporating prior model knowledge to…

Machine Learning · Computer Science 2026-01-29 J. S. van Hulst , W. P. M. H. Heemels , D. J. Antunes

ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning

Reinforcement learning (RL) is ubiquitous in the development of modern AI systems. However, state-of-the-art RL agents require extensive, and potentially unsafe, interactions with their environments to learn effectively. These limitations…

Machine Learning · Computer Science 2025-08-01 Yarden As , Bhavya Sukhija , Lenart Treven , Carmelo Sferrazza , Stelian Coros , Andreas Krause

Policy Bifurcation in Safe Reinforcement Learning

Safe reinforcement learning (RL) offers advanced solutions to constrained optimal control problems. Existing studies in safe RL implicitly assume continuity in policy functions, where policies map states to actions in a smooth,…

Machine Learning · Computer Science 2024-03-29 Wenjun Zou , Yao Lyu , Jie Li , Yujie Yang , Shengbo Eben Li , Jingliang Duan , Xianyuan Zhan , Jingjing Liu , Yaqin Zhang , Keqiang Li

Beyond Hard Constraints: Budget-Conditioned Reachability For Safe Offline Reinforcement Learning

Sequential decision making using Markov Decision Process underpins many realworld applications. Both model-based and model free methods have achieved strong results in these settings. However, real-world tasks must balance reward…

Machine Learning · Computer Science 2026-04-01 Janaka Chathuranga Brahmanage , Akshat Kumar

Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis

This paper addresses the problem of maintaining safety during training in Reinforcement Learning (RL), such that the safety constraint violations are bounded at any point during learning. In a variety of RL applications the safety of the…

Machine Learning · Computer Science 2023-12-19 Rohan Mitta , Hosein Hasanbeig , Jun Wang , Daniel Kroening , Yiannis Kantaros , Alessandro Abate

Safe Exploration in Reinforcement Learning: A Generalized Formulation and Algorithms

Safe exploration is essential for the practical use of reinforcement learning (RL) in many real-world scenarios. In this paper, we present a generalized safe exploration (GSE) problem as a unified formulation of common safe exploration…

Machine Learning · Computer Science 2023-10-06 Akifumi Wachi , Wataru Hashimoto , Xun Shen , Kazumune Hashimoto

Constrained Reinforcement Learning Under Model Mismatch

Existing studies on constrained reinforcement learning (RL) may obtain a well-performing policy in the training environment. However, when deployed in a real environment, it may easily violate constraints that were originally satisfied…

Machine Learning · Computer Science 2024-05-06 Zhongchang Sun , Sihong He , Fei Miao , Shaofeng Zou

R2L: Reliable Reinforcement Learning: Guaranteed Return & Reliable Policies in Reinforcement Learning

In this work, we address the problem of determining reliable policies in reinforcement learning (RL), with a focus on optimization under uncertainty and the need for performance guarantees. While classical RL algorithms aim at maximizing…

Machine Learning · Computer Science 2025-10-22 Nadir Farhi

Some Supervision Required: Incorporating Oracle Policies in Reinforcement Learning via Epistemic Uncertainty Metrics

An inherent problem of reinforcement learning is performing exploration of an environment through random actions, of which a large portion can be unproductive. Instead, exploration can be improved by initializing the learning policy with an…

Machine Learning · Computer Science 2023-08-22 Jun Jet Tai , Jordan K. Terry , Mauro S. Innocente , James Brusey , Nadjim Horri