Related papers: Proximal Reliability Optimization for Reinforcemen…

A stabilizing reinforcement learning approach for sampled systems with partially unknown models

Reinforcement learning is commonly associated with training of reward-maximizing (or cost-minimizing) agents, in other words, controllers. It can be applied in model-free or model-based fashion, using a priori or online collected system…

Systems and Control · Electrical Eng. & Systems 2022-09-01 Lukas Beckenbach , Pavel Osinenko , Stefan Streif

Robust Reinforcement Learning with Distributional Risk-averse formulation

Robust Reinforcement Learning tries to make predictions more robust to changes in the dynamics or rewards of the system. This problem is particularly important when the dynamics and rewards of the environment are estimated from the data. In…

Machine Learning · Computer Science 2022-06-15 Pierre Clavier , Stéphanie Allassonière , Erwan Le Pennec

Robust Constrained Reinforcement Learning

Constrained reinforcement learning is to maximize the expected reward subject to constraints on utilities/costs. However, the training environment may not be the same as the test one, due to, e.g., modeling error, adversarial attack,…

Machine Learning · Computer Science 2022-09-16 Yue Wang , Fei Miao , Shaofeng Zou

Can Learned Optimization Make Reinforcement Learning Less Difficult?

While reinforcement learning (RL) holds great potential for decision making in the real world, it suffers from a number of unique difficulties which often need specific consideration. In particular: it is highly non-stationary; suffers from…

Machine Learning · Computer Science 2025-04-16 Alexander David Goldie , Chris Lu , Matthew Thomas Jackson , Shimon Whiteson , Jakob Nicolaus Foerster

Reinforcement Learning for Robotics and Control with Active Uncertainty Reduction

Model-free reinforcement learning based methods such as Proximal Policy Optimization, or Q-learning typically require thousands of interactions with the environment to approximate the optimum controller which may not always be feasible in…

Machine Learning · Computer Science 2019-05-16 Narendra Patwardhan , Zequn Wang

Model predictive control-based value estimation for efficient reinforcement learning

Reinforcement learning suffers from limitations in real practices primarily due to the number of required interactions with virtual environments. It results in a challenging problem because we are implausible to obtain a local optimal…

Machine Learning · Computer Science 2024-10-28 Qizhen Wu , Kexin Liu , Lei Chen

A Survey of Reinforcement Learning For Economics

This survey (re)introduces reinforcement learning methods to economists. The curse of dimensionality limits how far exact dynamic programming can be effectively applied, forcing us to rely on suitably "small" problems or our ability to…

General Economics · Economics 2026-03-25 Pranjal Rawat

Performative Reinforcement Learning

We introduce the framework of performative reinforcement learning where the policy chosen by the learner affects the underlying reward and transition dynamics of the environment. Following the recent literature on performative…

Machine Learning · Computer Science 2023-06-08 Debmalya Mandal , Stelios Triantafyllou , Goran Radanovic

On Reward-Balancing Methods for Reinforcement Learning

This paper investigates the so-called reward-balancing methods, a novel class of algorithms for solving discounted-return reinforcement learning (RL) problems. These methods consist of iteratively adjusting the reward function to transform…

Optimization and Control · Mathematics 2026-04-23 Simone Baroncini , Bahman Gharesifard , Giuseppe Notarstefano

Learning-based Model Predictive Control for Safe Exploration and Reinforcement Learning

Reinforcement learning has been successfully used to solve difficult tasks in complex unknown environments. However, these methods typically do not provide any safety guarantees during the learning process. This is particularly problematic,…

Systems and Control · Electrical Eng. & Systems 2019-07-02 Torsten Koller , Felix Berkenkamp , Matteo Turchetta , Joschka Boedecker , Andreas Krause

A note on stabilizing reinforcement learning

Reinforcement learning is a general methodology of adaptive optimal control that has attracted much attention in various fields ranging from video game industry to robot manipulators. Despite its remarkable performance demonstrations, plain…

Dynamical Systems · Mathematics 2022-06-14 Pavel Osinenko , Grigory Yaremenko , Ilya Osokin

Adaptive control of a mechatronic system using constrained residual reinforcement learning

We propose a simple, practical and intuitive approach to improve the performance of a conventional controller in uncertain environments using deep reinforcement learning while maintaining safe operation. Our approach is motivated by the…

Systems and Control · Electrical Eng. & Systems 2021-10-07 Tom Staessens , Tom Lefebvre , Guillaume Crevecoeur

Robustifying Reinforcement Learning Policies with $\mathcal{L}_1$ Adaptive Control

A reinforcement learning (RL) policy trained in a nominal environment could fail in a new/perturbed environment due to the existence of dynamic variations. Existing robust methods try to obtain a fixed policy for all envisioned dynamic…

Machine Learning · Computer Science 2022-03-10 Yikun Cheng , Pan Zhao , Manan Gandhi , Bo Li , Evangelos Theodorou , Naira Hovakimyan

Viability of Future Actions: Robust Safety in Reinforcement Learning via Entropy Regularization

Despite the many recent advances in reinforcement learning (RL), the question of learning policies that robustly satisfy state constraints under unknown disturbances remains open. In this paper, we offer a new perspective on achieving…

Machine Learning · Computer Science 2025-12-23 Pierre-François Massiani , Alexander von Rohr , Lukas Haverbeck , Sebastian Trimpe

Maintaining Plasticity in Reinforcement Learning: A Cost-Aware Framework for Aerial Robot Control in Non-stationary Environments

Reinforcement learning (RL) has demonstrated the ability to maintain the plasticity of the policy throughout short-term training in aerial robot control. However, these policies have been shown to loss of plasticity when extended to…

Robotics · Computer Science 2025-03-11 Ali Tahir Karasahin , Ziniu Wu , Basaran Bahadir Kocer

Safe Model-based Reinforcement Learning with Stability Guarantees

Reinforcement learning is a powerful paradigm for learning optimal policies from experimental data. However, to find optimal policies, most reinforcement learning algorithms explore all possible actions, which may be harmful for real-world…

Machine Learning · Statistics 2017-11-15 Felix Berkenkamp , Matteo Turchetta , Angela P. Schoellig , Andreas Krause

Learning to Optimize for Reinforcement Learning

In recent years, by leveraging more data, computation, and diverse tasks, learned optimizers have achieved remarkable success in supervised learning, outperforming classical hand-designed optimizers. Reinforcement learning (RL) is…

Machine Learning · Computer Science 2024-06-05 Qingfeng Lan , A. Rupam Mahmood , Shuicheng Yan , Zhongwen Xu

Adaptive Reinforcement Learning for Dynamic Configuration Allocation in Pre-Production Testing

Ensuring reliability in modern software systems requires rigorous pre-production testing across highly heterogeneous and evolving environments. Because exhaustive evaluation is infeasible, practitioners must decide how to allocate limited…

Software Engineering · Computer Science 2025-10-08 Yu Zhu

An Optical Control Environment for Benchmarking Reinforcement Learning Algorithms

Deep reinforcement learning has the potential to address various scientific problems. In this paper, we implement an optics simulation environment for reinforcement learning based controllers. The environment captures the essence of…

Machine Learning · Computer Science 2023-10-03 Abulikemu Abuduweili , Changliu Liu

Ctrl-Z: Recovering from Instability in Reinforcement Learning

When learning behavior, training data is often generated by the learner itself; this can result in unstable training dynamics, and this problem has particularly important applications in safety-sensitive real-world control tasks such as…

Machine Learning · Computer Science 2019-10-10 Vibhavari Dasagi , Jake Bruce , Thierry Peynot , Jürgen Leitner