Related papers: Value constrained model-free continuous control

A Safe Reinforcement Learning Algorithm for Supervisory Control of Power Plants

Traditional control theory-based methods require tailored engineering for each system and constant fine-tuning. In power plant control, one often needs to obtain a precise representation of the system dynamics and carefully design the…

Systems and Control · Electrical Eng. & Systems 2024-09-21 Yixuan Sun, Sami Khairy, Richard B. Vilim, Rui Hu, Akshay J. Dave

Constraints as Rewards: Reinforcement Learning for Robots without Reward Functions

Reinforcement learning has become an essential algorithm for generating complex robotic behaviors. However, to learn such behaviors, it is necessary to design a reward function that describes the task, which often consists of multiple…

Robotics · Computer Science 2025-01-10 Yu Ishihara, Noriaki Takasugi, Kotaro Kawakami, Masaya Kinoshita, Kazumi Aoyama

Reinforcement Learning Based Power Control for Reliable Mission-Critical Wireless Transmission

In this paper, we investigate sequential power allocation over fast varying channels for mission-critical applications, aiming to minimize the expected sum power while guaranteeing the transmission success probability. In particular, a…

Information Theory · Computer Science 2023-06-09 Chongtao Guo, Zhengchao Li, Le Liang, Geoffrey Ye Li

A Model-Free Optimal Control Method With Fixed Terminal States and Delay

Model-free algorithms are brought into the control system's research with the emergence of reinforcement learning algorithms. However, there are two practical challenges of reinforcement learning-based methods. First, learning by…

Systems and Control · Electrical Eng. & Systems 2024-09-18 Mi Zhou, Erik Verriest, Chaouki Abdallah

Predictive Lagrangian Optimization for Constrained Reinforcement Learning

Constrained optimization is popularly seen in reinforcement learning for addressing complex control tasks. From the perspective of dynamic system, iteratively solving a constrained optimization problem can be framed as the temporal…

Machine Learning · Computer Science 2025-01-28 Tianqi Zhang, Puzhen Yuan, Guojian Zhan, Ziyu Lin, Yao Lyu, Zhenzhi Qin, Jingliang Duan, Liping Zhang, Shengbo Eben Li

Deep Constrained Q-learning

In many real world applications, reinforcement learning agents have to optimize multiple objectives while following certain rules or satisfying a list of constraints. Classical methods based on reward shaping, i.e. a weighted combination of…

Machine Learning · Computer Science 2020-09-15 Gabriel Kalweit, Maria Huegle, Moritz Werling, Joschka Boedecker

Model predictive control-based value estimation for efficient reinforcement learning

Reinforcement learning suffers from limitations in real practices primarily due to the number of required interactions with virtual environments. It results in a challenging problem because we are implausible to obtain a local optimal…

Machine Learning · Computer Science 2024-10-28 Qizhen Wu, Kexin Liu, Lei Chen

Towards a Practical Understanding of Lagrangian Methods in Safe Reinforcement Learning

Safe reinforcement learning addresses constrained optimization problems where maximizing performance must be balanced against safety constraints, and Lagrangian methods are a widely used approach for this purpose. However, the effectiveness…

Machine Learning · Computer Science 2026-03-24 Lindsay Spoor, Álvaro Serra-Gómez, Aske Plaat, Thomas Moerland

Responsive Safety in Reinforcement Learning by PID Lagrangian Methods

Lagrangian methods are widely used algorithms for constrained optimization problems, but their learning dynamics exhibit oscillations and overshoot which, when applied to safe reinforcement learning, leads to constraint-violating behavior…

Optimization and Control · Mathematics 2020-07-09 Adam Stooke, Joshua Achiam, Pieter Abbeel

Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm

During initial iterations of training in most Reinforcement Learning (RL) algorithms, agents perform a significant number of random exploratory steps. In the real world, this can limit the practicality of these algorithms as it can lead to…

Machine Learning · Computer Science 2022-10-17 Ashish Kumar Jayant, Shalabh Bhatnagar

State Augmented Constrained Reinforcement Learning: Overcoming the Limitations of Learning with Rewards

A common formulation of constrained reinforcement learning involves multiple rewards that must individually accumulate to given thresholds. In this class of problems, we show a simple example in which the desired optimal policy cannot be…

Machine Learning · Computer Science 2023-09-22 Miguel Calvo-Fullana, Santiago Paternain, Luiz F. O. Chamon, Alejandro Ribeiro

Safe Continuous Control with Constrained Model-Based Policy Optimization

The applicability of reinforcement learning (RL) algorithms in real-world domains often requires adherence to safety constraints, a need difficult to address given the asymptotic nature of the classic RL optimization objective. In contrast…

Machine Learning · Computer Science 2021-04-15 Moritz A. Zanger, Karam Daaboul, J. Marius Zöllner

Accelerating Safe Reinforcement Learning with Constraint-mismatched Policies

We consider the problem of reinforcement learning when provided with (1) a baseline control policy and (2) a set of constraints that the learner must satisfy. The baseline policy can arise from demonstration data or a teacher agent and may…

Machine Learning · Computer Science 2021-07-13 Tsung-Yen Yang, Justinian Rosca, Karthik Narasimhan, Peter J. Ramadge

Resilient Constrained Learning

When deploying machine learning solutions, they must satisfy multiple requirements beyond accuracy, such as fairness, robustness, or safety. These requirements are imposed during training either implicitly, using penalties, or explicitly,…

Machine Learning · Computer Science 2024-01-12 Ignacio Hounie, Alejandro Ribeiro, Luiz F. O. Chamon

On Reward-Balancing Methods for Reinforcement Learning

This paper investigates the so-called reward-balancing methods, a novel class of algorithms for solving discounted-return reinforcement learning (RL) problems. These methods consist of iteratively adjusting the reward function to transform…

Optimization and Control · Mathematics 2026-04-23 Simone Baroncini, Bahman Gharesifard, Giuseppe Notarstefano

A Pontryagin Perspective on Reinforcement Learning

Reinforcement learning has traditionally focused on learning state-dependent policies to solve optimal control problems in a closed-loop fashion. In this work, we introduce the paradigm of open-loop reinforcement learning where a fixed…

Machine Learning · Computer Science 2025-04-23 Onno Eberhard, Claire Vernade, Michael Muehlebach

Neural Lyapunov and Optimal Control

Despite impressive results, reinforcement learning (RL) suffers from slow convergence and requires a large variety of tuning strategies. In this paper, we investigate the ability of RL algorithms on simple continuous control tasks. We show…

Robotics · Computer Science 2024-02-16 Daniel Layeghi, Steve Tonneau, Michael Mistry

Constrained Model-Free Reinforcement Learning for Process Optimization

Reinforcement learning (RL) is a control approach that can handle nonlinear stochastic optimal control problems. However, despite the promise exhibited, RL has yet to see marked translation to industrial practice primarily due to its…

Machine Learning · Computer Science 2021-04-15 Elton Pan, Panagiotis Petsagkourakis, Max Mowbray, Dongda Zhang, Antonio del Rio-Chanona

Robust Model-free Reinforcement Learning with Multi-objective Bayesian Optimization

In reinforcement learning (RL), an autonomous agent learns to perform complex tasks by maximizing an exogenous reward signal while interacting with its environment. In real-world applications, test conditions may differ substantially from…

Robotics · Computer Science 2019-10-30 Matteo Turchetta, Andreas Krause, Sebastian Trimpe

Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

Learning-based control methods typically assume stationary system dynamics, an assumption often violated in real-world systems due to drift, wear, or changing operating conditions. We study reinforcement learning for control under…

Machine Learning · Computer Science 2026-04-03 Klemens Iten, Bruce Lee, Chenhao Li, Lenart Treven, Andreas Krause, Bhavya Sukhija