English
Related papers

Related papers: Decoupling regularization from the action space

200 papers

Despite the many recent advances in reinforcement learning (RL), the question of learning policies that robustly satisfy state constraints under unknown disturbances remains open. In this paper, we offer a new perspective on achieving…

Machine Learning · Computer Science 2025-12-23 Pierre-François Massiani , Alexander von Rohr , Lukas Haverbeck , Sebastian Trimpe

In the pursuit of finding an optimal policy, reinforcement learning (RL) methods generally ignore the properties of learned policies apart from their expected return. Thus, even when successful, it is difficult to characterize which…

Machine Learning · Computer Science 2025-10-10 Yash Jhaveri , Harley Wiltzer , Patrick Shafto , Marc G. Bellemare , David Meger

Reinforcement Learning (RL) is a powerful method for controlling dynamic systems, but its learning mechanism can lead to unpredictable actions that undermine the safety of critical systems. Here, we propose RL with Adaptive Regularization…

Machine Learning · Computer Science 2024-11-01 Haozhe Tian , Homayoun Hamedmoghadam , Robert Shorten , Pietro Ferraro

Current reinforcement learning (RL) methods can successfully learn single tasks but often generalize poorly to modest perturbations in task domain or training procedure. In this work, we present a decoupled learning strategy for RL that…

Machine Learning · Computer Science 2018-05-10 Amy Zhang , Harsh Satija , Joelle Pineau

Dealing with high variance is a significant challenge in model-free reinforcement learning (RL). Existing methods are unreliable, exhibiting high variance in performance from run to run using different initializations/seeds. Focusing on…

Machine Learning · Computer Science 2019-05-15 Richard Cheng , Abhinav Verma , Gabor Orosz , Swarat Chaudhuri , Yisong Yue , Joel W. Burdick

Reinforcement learning (RL) has been successful in training agents in various learning environments, including video-games. However, such work modifies and shrinks the action space from the game's original. This is to avoid trying…

Artificial Intelligence · Computer Science 2020-05-27 Anssi Kanervisto , Christian Scheller , Ville Hautamäki

Reinforcement learning (RL) is recognized as lacking generalization and robustness under environmental perturbations, which excessively restricts its application for real-world robotics. Prior work claimed that adding regularization to the…

Machine Learning · Computer Science 2023-12-06 Yuan Zhang , Jianhong Wang , Joschka Boedecker

We propose a systematic method based on reinforcement learning (RL) techniques to find the optimal path that can minimize the total entropy production between two equilibrium states of open systems at the same temperature in a given fixed…

Quantum Physics · Physics 2022-06-07 Rongxing Xu

Reinforcement learning (RL) often exhibits high variance across training runs, leading to unreliable performance and posing a major challenge to deployment in real-world domains. In this work, we address the challenge of cross-run policy…

Machine Learning · Computer Science 2026-05-22 Marcel Hussing , Liv G. d'Aliberti , Claas Voelcker , Benjamin Eysenbach , Eric Eaton

We address the problem of deploying a reinforcement learning (RL) agent on a physical system such as a datacenter cooling unit or robot, where critical constraints must never be violated. We show how to exploit the typically smooth dynamics…

Artificial Intelligence · Computer Science 2018-01-29 Gal Dalal , Krishnamurthy Dvijotham , Matej Vecerik , Todd Hester , Cosmin Paduraru , Yuval Tassa

Deep Reinforcement Learning (Deep RL) has had incredible achievements on high dimensional problems, yet its learning process remains unstable even on the simplest tasks. Deep RL uses neural networks as function approximators. These neural…

Machine Learning · Computer Science 2022-10-18 Riccardo Della Vecchia , Alena Shilova , Philippe Preux , Riad Akrour

The effectiveness of reinforcement learning (RL) agents in continuous control robotics tasks is mainly dependent on the design of the underlying reward function, which is highly prone to reward hacking. A misalignment between the reward…

We investigate an entropy-regularized reinforcement learning (RL) approach to optimal stopping problems motivated by real option models. Classical stopping rules are strict and non-randomized, limiting natural exploration in RL settings. To…

Optimization and Control · Mathematics 2026-02-18 Jodi Dianetti , Giorgio Ferrari , Renyuan Xu

Deep Reinforcement Learning (Deep RL) has been receiving increasingly more attention thanks to its encouraging performance on a variety of control tasks. Yet, conventional regularization techniques in training neural networks (e.g., $L_2$…

Machine Learning · Computer Science 2021-11-30 Zhuang Liu , Xuanlin Li , Bingyi Kang , Trevor Darrell

In continual learning, the primary challenge is to learn new information without forgetting old knowledge. A common solution addresses this trade-off through regularization, penalizing changes to parameters critical for previous tasks. In…

Machine Learning · Computer Science 2026-04-22 Pourya Shamsolmoali , Masoumeh Zareapoor , Eric Granger , William A. P. Smith , Yue Lu

We propose a general formulation for addressing reinforcement learning (RL) problems in settings with observational data. That is, we consider the problem of learning good policies solely from historical data in which unobserved factors…

Machine Learning · Computer Science 2018-12-31 Chaochao Lu , Bernhard Schölkopf , José Miguel Hernández-Lobato

Recent advancements in off-policy Reinforcement Learning (RL) have significantly improved sample efficiency, primarily due to the incorporation of various forms of regularization that enable more gradient update steps than traditional…

Machine Learning · Computer Science 2024-06-21 Michal Nauman , Michał Bortkiewicz , Piotr Miłoś , Tomasz Trzciński , Mateusz Ostaszewski , Marek Cygan

State entropy regularization has empirically shown better exploration and sample complexity in reinforcement learning (RL). However, its theoretical guarantees have not been studied. In this paper, we show that state entropy regularization…

Machine Learning · Computer Science 2025-12-02 Yonatan Ashlag , Uri Koren , Mirco Mutti , Esther Derman , Pierre-Luc Bacon , Shie Mannor

Recent work has shown that reinforcement learning (RL) is a promising approach to control dynamical systems described by partial differential equations (PDE). This paper shows how to use RL to tackle more general PDE control problems that…

Machine Learning · Computer Science 2018-06-20 Yangchen Pan , Amir-massoud Farahmand , Martha White , Saleh Nabi , Piyush Grover , Daniel Nikovski

Deep reinforcement learning algorithms that learn policies by trial-and-error must learn from limited amounts of data collected by actively interacting with the environment. While many prior works have shown that proper regularization…

Machine Learning · Computer Science 2023-04-21 Qiyang Li , Aviral Kumar , Ilya Kostrikov , Sergey Levine
‹ Prev 1 2 3 10 Next ›