Related papers: Accelerated Policy Learning with Parallel Differen…

Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

Deep reinforcement learning (RL) algorithms can use high-capacity deep networks to learn directly from image observations. However, these high-dimensional observation spaces present a number of challenges in practice, since the policy must…

Machine Learning · Computer Science 2020-10-27 Alex X. Lee, Anusha Nagabandi, Pieter Abbeel, Sergey Levine

Accelerating Visual-Policy Learning through Parallel Differentiable Simulation

In this work, we propose a computationally efficient algorithm for visual policy learning that leverages differentiable simulation and first-order analytical policy gradients. Our approach decouples the rendering process from the computation…

Machine Learning · Computer Science 2025-11-12 Haoxiang You, Yilang Liu, Ian Abraham

Distributional Soft Actor-Critic with Harmonic Gradient for Safe and Efficient Autonomous Driving in Multi-lane Scenarios

Reinforcement learning (RL), known for its self-evolution capability, offers a promising approach to training high-level autonomous driving systems. However, handling constraints remains a significant challenge for existing RL algorithms,…

Robotics · Computer Science 2025-05-21 Feihong Zhang, Guojian Zhan, Bin Shuai, Tianyi Zhang, Jingliang Duan, Shengbo Eben Li

Learning Quadrotor Control From Visual Features Using Differentiable Simulation

The sample inefficiency of reinforcement learning (RL) remains a significant challenge in robotics. RL requires large-scale simulation and can still cause long training times, slowing research and innovation. This issue is particularly…

Robotics · Computer Science 2026-01-16 Johannes Heeg, Yunlong Song, Davide Scaramuzza

Influence-Augmented Local Simulators: A Scalable Solution for Fast Deep RL in Large Networked Systems

Learning effective policies for real-world problems is still an open challenge for the field of reinforcement learning (RL). The main limitation is the amount of data needed and the pace at which that data can be obtained. In this paper,…

Machine Learning · Computer Science 2022-02-04 Miguel Suau, Jinke He, Matthijs T. J. Spaan, Frans A. Oliehoek

Stabilizing Reinforcement Learning in Differentiable Multiphysics Simulation

Recent advances in GPU-based parallel simulation have enabled practitioners to collect large amounts of data and train complex control policies using deep reinforcement learning (RL), on commodity GPUs. However, such successes for RL in…

Machine Learning · Computer Science 2025-03-03 Eliot Xing, Vernon Luk, Jean Oh

Deep Reinforcement Learning with Adjustments

Deep reinforcement learning (RL) algorithms can learn complex policies to optimize agent operation over time. RL algorithms have shown promising results in solving complicated problems in recent years. However, their application on…

Machine Learning · Computer Science 2021-09-29 Hamed Khorasgani, Haiyan Wang, Chetan Gupta, Susumu Serita

Unlocking the Potential of Simulators: Design with RL in Mind

Using Reinforcement Learning (RL) in simulation to construct policies useful in real life is challenging. This is often attributed to the sequential decision making aspect: inaccuracies in simulation accumulate over multiple steps, hence…

Machine Learning · Computer Science 2017-06-09 Rika Antonova, Silvia Cruciani

Adaptable Automation with Modular Deep Reinforcement Learning and Policy Transfer

Recent advances in deep Reinforcement Learning (RL) have created unprecedented opportunities for intelligent automation, where a machine can autonomously learn an optimal policy for performing a given task. However, current deep RL…

Machine Learning · Computer Science 2021-05-27 Zohreh Raziei, Mohsen Moghaddam

Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization

Reinforcement learning (RL) is attracting increasing interest in autonomous driving due to its potential to solve complex classification and control problems. However, existing RL algorithms are rarely applied to real vehicles for two…

Machine Learning · Computer Science 2020-03-04 Lu Wen, Jingliang Duan, Shengbo Eben Li, Shaobing Xu, Huei Peng

Towards Bridging the Gap between Large-Scale Pretraining and Efficient Finetuning for Humanoid Control

Reinforcement learning (RL) is widely used for humanoid control, with on-policy methods such as Proximal Policy Optimization (PPO) enabling robust training via large-scale parallel simulation and, in some cases, zero-shot deployment to real…

Robotics · Computer Science 2026-02-24 Weidong Huang, Zhehan Li, Hangxin Liu, Biao Hou, Yao Su, Jingwen Zhang

Decentralizing Multi-Agent Reinforcement Learning with Temporal Causal Information

Reinforcement learning (RL) algorithms can find an optimal policy for a single agent to accomplish a particular task. However, many real-world problems require multiple agents to collaborate in order to achieve a common goal. For example, a…

Machine Learning · Computer Science 2025-10-20 Jan Corazza, Hadi Partovi Aria, Hyohun Kim, Daniel Neider, Zhe Xu

ACERAC: Efficient reinforcement learning in fine time discretization

One of the main goals of reinforcement learning (RL) is to provide a way for physical machines to learn optimal behavior instead of being programmed. However, effective control of the machines usually requires fine time discretization. The…

Machine Learning · Computer Science 2022-07-12 Jakub Łyskawa, Paweł Wawrzyński

Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations

In recent years, fully differentiable rigid body physics simulators have been developed, which can be used to simulate a wide range of robotic systems. In the context of reinforcement learning for control, these simulators theoretically…

Machine Learning · Computer Science 2022-03-08 Sean Gillen, Katie Byl

Safe Reinforcement Learning with Dual Robustness

Reinforcement learning (RL) agents are vulnerable to adversarial disturbances, which can deteriorate task performance or compromise safety specifications. Existing methods either address safety requirements under the assumption of no…

Machine Learning · Computer Science 2023-09-14 Zeyang Li, Chuxiong Hu, Yunan Wang, Yujie Yang, Shengbo Eben Li

Solving Challenging Control Problems Using Two-Staged Deep Reinforcement Learning

We present a deep reinforcement learning (deep RL) algorithm that consists of learning-based motion planning and imitation to tackle challenging control problems. Deep RL has been an effective tool for solving many high-dimensional…

Robotics · Computer Science 2023-03-02 Nitish Sontakke, Sehoon Ha

Identifying Policy Gradient Subspaces

Policy gradient methods hold great potential for solving complex continuous control tasks. Still, their training efficiency can be improved by exploiting structure within the optimization problem. Recent work indicates that supervised…

Machine Learning · Computer Science 2024-03-19 Jan Schneider, Pierre Schumacher, Simon Guist, Le Chen, Daniel Häufle, Bernhard Schölkopf, Dieter Büchler

Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles

Recently, safe reinforcement learning (RL) with the actor-critic structure for continuous control tasks has received increasing attention. It is still challenging to learn a near-optimal control policy with safety and convergence…

Machine Learning · Computer Science 2024-02-06 Xinglong Zhang, Yaoqian Peng, Biao Luo, Wei Pan, Xin Xu, Haibin Xie

Deployable Reinforcement Learning with Variable Control Rate

Deploying controllers trained with Reinforcement Learning (RL) on real robots can be challenging: RL relies on agents' policies being modeled as Markov Decision Processes (MDPs), which assume an inherently discrete passage of time. The use…

Robotics · Computer Science 2024-04-03 Dong Wang, Giovanni Beltrame

Learning fast changing slow in spiking neural networks

Reinforcement learning (RL) faces substantial challenges when applied to real-life problems, primarily stemming from the scarcity of available data due to limited interactions with the environment. This limitation is exacerbated by the fact…

Neural and Evolutionary Computing · Computer Science 2024-04-10 Cristiano Capone, Paolo Muratore