Related papers: Decoupling regularization from the action space

Viability of Future Actions: Robust Safety in Reinforcement Learning via Entropy Regularization

Despite the many recent advances in reinforcement learning (RL), the question of learning policies that robustly satisfy state constraints under unknown disturbances remains open. In this paper, we offer a new perspective on achieving…

Machine Learning · Computer Science 2025-12-23 Pierre-François Massiani , Alexander von Rohr , Lukas Haverbeck , Sebastian Trimpe

Convergence Theorems for Entropy-Regularized and Distributional Reinforcement Learning

In the pursuit of finding an optimal policy, reinforcement learning (RL) methods generally ignore the properties of learned policies apart from their expected return. Thus, even when successful, it is difficult to characterize which…

Machine Learning · Computer Science 2025-10-10 Yash Jhaveri , Harley Wiltzer , Patrick Shafto , Marc G. Bellemare , David Meger

Reinforcement Learning with Adaptive Regularization for Safe Control of Critical Systems

Reinforcement Learning (RL) is a powerful method for controlling dynamic systems, but its learning mechanism can lead to unpredictable actions that undermine the safety of critical systems. Here, we propose RL with Adaptive Regularization…

Machine Learning · Computer Science 2024-11-01 Haozhe Tian , Homayoun Hamedmoghadam , Robert Shorten , Pietro Ferraro

Decoupling Dynamics and Reward for Transfer Learning

Current reinforcement learning (RL) methods can successfully learn single tasks but often generalize poorly to modest perturbations in task domain or training procedure. In this work, we present a decoupled learning strategy for RL that…

Machine Learning · Computer Science 2018-05-10 Amy Zhang , Harsh Satija , Joelle Pineau

Control Regularization for Reduced Variance Reinforcement Learning

Dealing with high variance is a significant challenge in model-free reinforcement learning (RL). Existing methods are unreliable, exhibiting high variance in performance from run to run using different initializations/seeds. Focusing on…

Machine Learning · Computer Science 2019-05-15 Richard Cheng , Abhinav Verma , Gabor Orosz , Swarat Chaudhuri , Yisong Yue , Joel W. Burdick

Action Space Shaping in Deep Reinforcement Learning

Reinforcement learning (RL) has been successful in training agents in various learning environments, including video-games. However, such work modifies and shrinks the action space from the game's original. This is to avoid trying…

Artificial Intelligence · Computer Science 2020-05-27 Anssi Kanervisto , Christian Scheller , Ville Hautamäki

Robust Reinforcement Learning in Continuous Control Tasks with Uncertainty Set Regularization

Reinforcement learning (RL) is recognized as lacking generalization and robustness under environmental perturbations, which excessively restricts its application for real-world robotics. Prior work claimed that adding regularization to the…

Machine Learning · Computer Science 2023-12-06 Yuan Zhang , Jianhong Wang , Joschka Boedecker

Reinforcement Learning Approach to Shortcuts between Thermodynamic States with Extra Constraints

We propose a systematic method based on reinforcement learning (RL) techniques to find the optimal path that can minimize the total entropy production between two equilibrium states of open systems at the same temperature in a given fixed…

Quantum Physics · Physics 2022-06-07 Rongxing Xu

Behavior-Consistent Deep Reinforcement Learning

Reinforcement learning (RL) often exhibits high variance across training runs, leading to unreliable performance and posing a major challenge to deployment in real-world domains. In this work, we address the challenge of cross-run policy…

Machine Learning · Computer Science 2026-05-22 Marcel Hussing , Liv G. d'Aliberti , Claas Voelcker , Benjamin Eysenbach , Eric Eaton

Safe Exploration in Continuous Action Spaces

We address the problem of deploying a reinforcement learning (RL) agent on a physical system such as a datacenter cooling unit or robot, where critical constraints must never be violated. We show how to exploit the typically smooth dynamics…

Artificial Intelligence · Computer Science 2018-01-29 Gal Dalal , Krishnamurthy Dvijotham , Matej Vecerik , Todd Hester , Cosmin Paduraru , Yuval Tassa

Entropy Regularized Reinforcement Learning with Cascading Networks

Deep Reinforcement Learning (Deep RL) has had incredible achievements on high dimensional problems, yet its learning process remains unstable even on the simplest tasks. Deep RL uses neural networks as function approximators. These neural…

Machine Learning · Computer Science 2022-10-18 Riccardo Della Vecchia , Alena Shilova , Philippe Preux , Riad Akrour

REBEL: Reward Regularization-Based Approach for Robotic Reinforcement Learning from Human Feedback

The effectiveness of reinforcement learning (RL) agents in continuous control robotics tasks is mainly dependent on the design of the underlying reward function, which is highly prone to reward hacking. A misalignment between the reward…

Robotics · Computer Science 2025-01-22 Souradip Chakraborty , Anukriti Singh , Amisha Bhaskar , Pratap Tokekar , Dinesh Manocha , Amrit Singh Bedi

Reinforcement Learning in Real Option Models

We investigate an entropy-regularized reinforcement learning (RL) approach to optimal stopping problems motivated by real option models. Classical stopping rules are strict and non-randomized, limiting natural exploration in RL settings. To…

Optimization and Control · Mathematics 2026-02-18 Jodi Dianetti , Giorgio Ferrari , Renyuan Xu

Regularization Matters in Policy Optimization

Deep Reinforcement Learning (Deep RL) has been receiving increasingly more attention thanks to its encouraging performance on a variety of control tasks. Yet, conventional regularization techniques in training neural networks (e.g., $L_2$…

Machine Learning · Computer Science 2021-11-30 Zhuang Liu , Xuanlin Li , Bingyi Kang , Trevor Darrell

Task Switching Without Forgetting via Proximal Decoupling

In continual learning, the primary challenge is to learn new information without forgetting old knowledge. A common solution addresses this trade-off through regularization, penalizing changes to parameters critical for previous tasks. In…

Machine Learning · Computer Science 2026-04-22 Pourya Shamsolmoali , Masoumeh Zareapoor , Eric Granger , William A. P. Smith , Yue Lu

Deconfounding Reinforcement Learning in Observational Settings

We propose a general formulation for addressing reinforcement learning (RL) problems in settings with observational data. That is, we consider the problem of learning good policies solely from historical data in which unobserved factors…

Machine Learning · Computer Science 2018-12-31 Chaochao Lu , Bernhard Schölkopf , José Miguel Hernández-Lobato

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning

Recent advancements in off-policy Reinforcement Learning (RL) have significantly improved sample efficiency, primarily due to the incorporation of various forms of regularization that enable more gradient update steps than traditional…

Machine Learning · Computer Science 2024-06-21 Michal Nauman , Michał Bortkiewicz , Piotr Miłoś , Tomasz Trzciński , Mateusz Ostaszewski , Marek Cygan

State Entropy Regularization for Robust Reinforcement Learning

State entropy regularization has empirically shown better exploration and sample complexity in reinforcement learning (RL). However, its theoretical guarantees have not been studied. In this paper, we show that state entropy regularization…

Machine Learning · Computer Science 2025-12-02 Yonatan Ashlag , Uri Koren , Mirco Mutti , Esther Derman , Pierre-Luc Bacon , Shie Mannor

Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control

Recent work has shown that reinforcement learning (RL) is a promising approach to control dynamical systems described by partial differential equations (PDE). This paper shows how to use RL to tackle more general PDE control problems that…

Machine Learning · Computer Science 2018-06-20 Yangchen Pan , Amir-massoud Farahmand , Martha White , Saleh Nabi , Piyush Grover , Daniel Nikovski

Efficient Deep Reinforcement Learning Requires Regulating Overfitting

Deep reinforcement learning algorithms that learn policies by trial-and-error must learn from limited amounts of data collected by actively interacting with the environment. While many prior works have shown that proper regularization…

Machine Learning · Computer Science 2023-04-21 Qiyang Li , Aviral Kumar , Ilya Kostrikov , Sergey Levine