Related papers: Constrained Model-Free Reinforcement Learning for …

Constrained Reinforcement Learning for Dynamic Optimization under Uncertainty

Dynamic real-time optimization (DRTO) is a challenging task due to the fact that optimal operating conditions must be computed in real time. The main bottleneck in the industrial application of DRTO is the presence of uncertainty. Many…

Machine Learning · Computer Science 2020-06-05 Panagiotis Petsagkourakis, Ilya Orson Sandoval, Eric Bradford, Dongda Zhang, Ehecatl Antonio del Río Chanona

Reinforcement Learning for Quantum Control under Physical Constraints

Quantum control is concerned with the realisation of desired dynamics in quantum systems, serving as a linchpin for advancing quantum technologies and fundamental research. Analytic approaches and standard optimisation algorithms do not…

Quantum Physics · Physics 2025-05-29 Jan Ole Ernst, Aniket Chatterjee, Tim Franzmeyer, Axel Kuhn

Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms

Reinforcement Learning (RL) serves as a versatile framework for sequential decision-making, finding applications across diverse domains such as robotics, autonomous driving, recommendation systems, supply chain optimization, biology,…

Machine Learning · Computer Science 2024-08-26 Vaneet Aggarwal, Washim Uddin Mondal, Qinbo Bai

Safe Chance Constrained Reinforcement Learning for Batch Process Control

Reinforcement Learning (RL) controllers have generated excitement within the control community. The primary advantage of RL controllers relative to existing methods is their ability to optimize uncertain systems independently of explicit…

Machine Learning · Computer Science 2021-12-07 Max Mowbray, Panagiotis Petsagkourakis, Ehecatl Antonio del Río Chanona, Dongda Zhang

Density Constrained Reinforcement Learning

We study constrained reinforcement learning (CRL) from a novel perspective by setting constraints directly on state density functions, rather than the value functions considered by previous works. State density has a clear physical and…

Machine Learning · Computer Science 2021-06-25 Zengyi Qin, Yuxiao Chen, Chuchu Fan

Constrained Meta Reinforcement Learning with Provable Test-Time Safety

Meta reinforcement learning (RL) allows agents to leverage experience across a distribution of tasks on which the agent can train at will, enabling faster learning of optimal policies on new test tasks. Despite its success in improving…

Machine Learning · Computer Science 2026-05-27 Tingting Ni, Maryam Kamgarpour

Constrained Variational Policy Optimization for Safe Reinforcement Learning

Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before deploying them to safety-critical applications. Previous primal-dual style approaches suffer from instability issues and lack optimality…

Machine Learning · Computer Science 2022-06-20 Zuxin Liu, Zhepeng Cen, Vladislav Isenbaev, Wei Liu, Zhiwei Steven Wu, Bo Li, Ding Zhao

Constraints Penalized Q-learning for Safe Offline Reinforcement Learning

We study the problem of safe offline reinforcement learning (RL), where the goal is to learn a policy that maximizes long-term reward while satisfying safety constraints given only offline data, without further interaction with the environment.…

Machine Learning · Computer Science 2022-04-11 Haoran Xu, Xianyuan Zhan, Xiangyu Zhu

Probabilistically safe and efficient model-based reinforcement learning

This paper proposes tackling safety-critical stochastic Reinforcement Learning (RL) tasks with a sample-based, model-based approach. At the core of the method lies a Model Predictive Control (MPC) scheme that acts as function approximation,…

Systems and Control · Electrical Eng. & Systems 2025-07-30 Filippo Airaldi, Bart De Schutter, Azita Dabiri

Probabilistic Satisfaction of Temporal Logic Constraints in Reinforcement Learning via Adaptive Policy-Switching

Constrained Reinforcement Learning (CRL) is a subset of machine learning that introduces constraints into the traditional reinforcement learning (RL) framework. Unlike conventional RL which aims solely to maximize cumulative rewards, CRL…

Artificial Intelligence · Computer Science 2024-12-02 Xiaoshan Lin, Sadık Bera Yüksel, Yasin Yazıcıoğlu, Derya Aksaray

Reinforcement Learning with Convex Constraints

In standard reinforcement learning (RL), a learning agent seeks to optimize the overall reward. However, many key aspects of a desired behavior are more naturally expressed as constraints. For instance, the designer may want to limit the…

Machine Learning · Computer Science 2021-01-29 Sobhan Miryoosefi, Kianté Brantley, Hal Daumé, Miroslav Dudik, Robert Schapire

ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs

In recent years, Reinforcement Learning (RL) has been applied to real-world problems with increasing success. Such applications often require putting constraints on the agent's behavior. Existing algorithms for constrained RL (CRL) rely on…

Machine Learning · Computer Science 2023-03-07 Ted Moskovitz, Brendan O'Donoghue, Vivek Veeriah, Sebastian Flennerhag, Satinder Singh, Tom Zahavy

Reinforcement Learning with $\omega$-Regular Objectives and Constraints

Reinforcement learning (RL) commonly relies on scalar rewards with limited ability to express temporal, conditional, or safety-critical goals, and can lead to reward hacking. Temporal logic expressible via the more general class of…

Artificial Intelligence · Computer Science 2025-11-26 Dominik Wagner, Leon Witzman, Luke Ong

Resilient Constrained Reinforcement Learning

We study a class of constrained reinforcement learning (RL) problems in which multiple constraint specifications are not identified before training. It is challenging to identify appropriate constraint specifications due to the undefined…

Optimization and Control · Mathematics 2024-01-02 Dongsheng Ding, Zhengyan Huan, Alejandro Ribeiro

Constrained Model-based Reinforcement Learning with Robust Cross-Entropy Method

This paper studies the constrained/safe reinforcement learning (RL) problem with sparse indicator signals for constraint violations. We propose a model-based approach to enable RL agents to effectively explore the environment with unknown…

Artificial Intelligence · Computer Science 2021-03-09 Zuxin Liu, Hongyi Zhou, Baiming Chen, Sicheng Zhong, Martial Hebert, Ding Zhao

Cautious Reinforcement Learning with Logical Constraints

This paper presents the concept of an adaptive safe padding that forces Reinforcement Learning (RL) to synthesise optimal control policies while ensuring safety during the learning process. Policies are synthesised to satisfy a goal,…

Machine Learning · Computer Science 2020-03-24 Mohammadhosein Hasanbeig, Alessandro Abate, Daniel Kroening

Offline Reinforcement Learning for Autonomous Driving with Safety and Exploration Enhancement

Reinforcement learning (RL) is a powerful data-driven control method that has been largely explored in autonomous driving tasks. However, conventional RL approaches learn control policies through trial-and-error interactions with the…

Robotics · Computer Science 2021-11-03 Tianyu Shi, Dong Chen, Kaian Chen, Zhaojian Li

Reinforcement Learning Under Probabilistic Spatio-Temporal Constraints with Time Windows

We propose an automata-theoretic approach for reinforcement learning (RL) under complex spatio-temporal constraints with time windows. The problem is formulated using a Markov decision process under a bounded temporal logic constraint.…

Artificial Intelligence · Computer Science 2023-08-01 Xiaoshan Lin, Abbasali Koochakzadeh, Yasin Yazicioglu, Derya Aksaray

A Safe Reinforcement Learning Algorithm for Supervisory Control of Power Plants

Traditional control theory-based methods require tailored engineering for each system and constant fine-tuning. In power plant control, one often needs to obtain a precise representation of the system dynamics and carefully design the…

Systems and Control · Electrical Eng. & Systems 2024-09-21 Yixuan Sun, Sami Khairy, Richard B. Vilim, Rui Hu, Akshay J. Dave

A Convex Programming Approach to Data-Driven Risk-Averse Reinforcement Learning

This paper presents a model-free reinforcement learning (RL) algorithm to solve the risk-averse optimal control (RAOC) problem for discrete-time nonlinear systems. While successful RL algorithms have been presented to learn optimal control…

Systems and Control · Electrical Eng. & Systems 2021-03-29 Yuzhen Han, Majid Mazouchi, Subramanya Nageshrao, Hamidreza Modares