Related papers: Physics-Informed Policy Optimization via Analytic …

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning

Recent advancements in off-policy Reinforcement Learning (RL) have significantly improved sample efficiency, primarily due to the incorporation of various forms of regularization that enable more gradient update steps than traditional…

Machine Learning · Computer Science 2024-06-21 Michal Nauman , Michał Bortkiewicz , Piotr Miłoś , Tomasz Trzciński , Mateusz Ostaszewski , Marek Cygan

Physics-Informed Model-Based Reinforcement Learning

We apply reinforcement learning (RL) to robotics tasks. One of the drawbacks of traditional RL algorithms has been their poor sample efficiency. One approach to improve the sample efficiency is model-based RL. In our model-based RL…

Machine Learning · Computer Science 2023-05-16 Adithya Ramesh , Balaraman Ravindran

Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models

We focus on developing efficient and reliable policy optimization strategies for robot learning with real-world data. In recent years, policy gradient methods have emerged as a promising paradigm for training control policies in simulation.…

Machine Learning · Computer Science 2023-11-07 Tyler Westenbroek , Jacob Levy , David Fridovich-Keil

A Safe Reinforcement Learning Algorithm for Supervisory Control of Power Plants

Traditional control theory-based methods require tailored engineering for each system and constant fine-tuning. In power plant control, one often needs to obtain a precise representation of the system dynamics and carefully design the…

Systems and Control · Electrical Eng. & Systems 2024-09-21 Yixuan Sun , Sami Khairy , Richard B. Vilim , Rui Hu , Akshay J. Dave

Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation

Recent advances in GPU-based parallel simulation have enabled practitioners to collect large amounts of data and train complex control policies using deep reinforcement learning (RL), on commodity GPUs. However, such successes for RL in…

Machine Learning · Computer Science 2025-03-03 Eliot Xing , Vernon Luk , Jean Oh

Model-based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian

Safety is essential for reinforcement learning (RL) applied in the real world. Adding chance constraints (or probabilistic constraints) is a suitable way to enhance RL safety under uncertainty. Existing chance-constrained RL methods like…

Machine Learning · Computer Science 2021-08-27 Baiyu Peng , Jingliang Duan , Jianyu Chen , Shengbo Eben Li , Genjin Xie , Congsheng Zhang , Yang Guan , Yao Mu , Enxin Sun

Reinforcement Learning with Adaptive Regularization for Safe Control of Critical Systems

Reinforcement Learning (RL) is a powerful method for controlling dynamic systems, but its learning mechanism can lead to unpredictable actions that undermine the safety of critical systems. Here, we propose RL with Adaptive Regularization…

Machine Learning · Computer Science 2024-11-01 Haozhe Tian , Homayoun Hamedmoghadam , Robert Shorten , Pietro Ferraro

Optimization Algorithm for Feedback and Feedforward Policies towards Robot Control Robust to Sensing Failures

Model-free or learning-based control, in particular reinforcement learning (RL), is expected to be applied to complex robotic tasks. Traditional RL requires the policy being optimized to be state-dependent; that is, the policy is a kind of…

Machine Learning · Computer Science 2022-08-09 Taisuke Kobayashi , Kenta Yoshizawa

On the Convergence of Approximate and Regularized Policy Iteration Schemes

Entropy regularized algorithms such as Soft Q-learning and Soft Actor-Critic, recently showed state-of-the-art performance on a number of challenging reinforcement learning (RL) tasks. The regularized formulation modifies the standard RL…

Machine Learning · Statistics 2019-10-15 Elena Smirnova , Elvis Dohmatob

Reinforcement Learning Control of Robotic Knee with Human in the Loop by Flexible Policy Iteration

We are motivated by the real challenges presented in a human-robot system to develop new designs that are efficient at data level and with performance guarantees such as stability and optimality at systems level. Existing…

Systems and Control · Electrical Eng. & Systems 2021-01-19 Xiang Gao , Jennie Si , Yue Wen , Minhan Li , He Huang

Responsive Safety in Reinforcement Learning by PID Lagrangian Methods

Lagrangian methods are widely used algorithms for constrained optimization problems, but their learning dynamics exhibit oscillations and overshoot which, when applied to safe reinforcement learning, leads to constraint-violating behavior…

Optimization and Control · Mathematics 2020-07-09 Adam Stooke , Joshua Achiam , Pieter Abbeel

Lipschitz-Regularized Critics Lead to Policy Robustness Against Transition Dynamics Uncertainty

Uncertainties in transition dynamics pose a critical challenge in reinforcement learning (RL), often resulting in performance degradation of trained policies when deployed on hardware. Many robust RL approaches follow two strategies:…

Machine Learning · Computer Science 2026-01-29 Xulin Chen , Ruipeng Liu , Zhenyu Gan , Garrett E. Katz

Optimisation of the Accelerator Control by Reinforcement Learning: A Simulation-Based Approach

Optimizing accelerator control is a critical challenge in experimental particle physics, requiring significant manual effort and resource expenditure. Traditional tuning methods are often time-consuming and reliant on expert input,…

Accelerator Physics · Physics 2026-01-27 Anwar Ibrahim , Denis Derkach , Alexey Petrenko , Fedor Ratnikov , Maxim Kaledin

SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning

Model-based reinforcement learning (RL) has proven to be a data efficient approach for learning control tasks but is difficult to utilize in domains with complex observations such as images. In this paper, we present a method for learning…

Machine Learning · Computer Science 2019-06-25 Marvin Zhang , Sharad Vikram , Laura Smith , Pieter Abbeel , Matthew J. Johnson , Sergey Levine

Towards Learning Controllable Representations of Physical Systems

Learned representations of dynamical systems reduce dimensionality, potentially supporting downstream reinforcement learning (RL). However, no established methods predict a representation's suitability for control and evaluation is largely…

Machine Learning · Computer Science 2020-11-25 Kevin Haninger , Raul Vicente Garcia , Joerg Krueger

Proximal Reliability Optimization for Reinforcement Learning

Despite numerous advances, reinforcement learning remains far from widespread acceptance for autonomous controller design compared to classical methods, owing to its limited ability to effectively tackle the reality gap. The reliance on…

Machine Learning · Computer Science 2024-09-23 Narendra Patwardhan , Zequn Wang

Predictive Lagrangian Optimization for Constrained Reinforcement Learning

Constrained optimization is popularly seen in reinforcement learning for addressing complex control tasks. From the perspective of dynamic system, iteratively solving a constrained optimization problem can be framed as the temporal…

Machine Learning · Computer Science 2025-01-28 Tianqi Zhang , Puzhen Yuan , Guojian Zhan , Ziyu Lin , Yao Lyu , Zhenzhi Qin , Jingliang Duan , Liping Zhang , Shengbo Eben Li

ACERAC: Efficient reinforcement learning in fine time discretization

One of the main goals of reinforcement learning (RL) is to provide a way for physical machines to learn optimal behavior instead of being programmed. However, effective control of the machines usually requires fine time discretization. The…

Machine Learning · Computer Science 2022-07-12 Jakub Łyskawa , Paweł Wawrzyński

Performative Reinforcement Learning with Linear Markov Decision Process

We study the setting of performative reinforcement learning where the deployed policy affects both the reward and the transition of the underlying Markov decision process. Prior work [MTR23] has addressed this problem…

Machine Learning · Computer Science 2025-03-18 Debmalya Mandal , Goran Radanovic

Regularization Matters in Policy Optimization

Deep Reinforcement Learning (Deep RL) has been receiving increasingly more attention thanks to its encouraging performance on a variety of control tasks. Yet, conventional regularization techniques in training neural networks (e.g., $L_2$…

Machine Learning · Computer Science 2021-11-30 Zhuang Liu , Xuanlin Li , Bingyi Kang , Trevor Darrell