Related papers: Constrained Attractor Selection Using Deep Reinfor…

Attractor Selection in Nonlinear Energy Harvesting Using Deep Reinforcement Learning

Recent research efforts demonstrate that the intentional use of nonlinearity enhances the capabilities of energy harvesting systems. One of the primary challenges that arise in nonlinear harvesters is that nonlinearities can often result in…

Systems and Control · Electrical Eng. & Systems 2020-10-06 Xue-She Wang , Brian P. Mann

Deterministic Policy Gradient for Reinforcement Learning with Continuous Time and State

The theory of continuous-time reinforcement learning (RL) has progressed rapidly in recent years. While the ultimate objective of RL is typically to learn deterministic control policies, most existing continuous-time RL methods rely on…

Machine Learning · Computer Science 2026-03-17 Ziheng Cheng , Xin Guo , Yufei Zhang

Data-efficient Deep Reinforcement Learning for Dexterous Manipulation

Deep learning and reinforcement learning methods have recently been used to solve a variety of problems in continuous control domains. An obvious application of these techniques is dexterous manipulation tasks in robotics which are…

Machine Learning · Computer Science 2017-04-12 Ivaylo Popov , Nicolas Heess , Timothy Lillicrap , Roland Hafner , Gabriel Barth-Maron , Matej Vecerik , Thomas Lampe , Yuval Tassa , Tom Erez , Martin Riedmiller

Towards continuous control of flippers for a multi-terrain robot using deep reinforcement learning

In this paper we focus on developing a control algorithm for multi-terrain tracked robots with flippers using a reinforcement learning (RL) approach. The work is based on the deep deterministic policy gradient (DDPG) algorithm, proven to be…

Robotics · Computer Science 2017-09-26 Giuseppe Paolo , Lei Tai , Ming Liu

A Reinforcement Learning Approach to Non-prehensile Manipulation through Sliding

Although robotic applications increasingly demand versatile and dynamic object handling, most existing techniques are predominantly focused on grasp-based manipulation, limiting their applicability in non-prehensile tasks. To address this…

Robotics · Computer Science 2025-02-25 Hamidreza Raei , Elena De Momi , Arash Ajoudani

Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control

This paper presents a robust reinforcement learning algorithm called robust deterministic policy gradient (RDPG), which reformulates the H-infinity control problem as a two-player zero-sum dynamic game between a user and an adversary. The…

Robotics · Computer Science 2025-12-04 Taeho Lee , Donghwan Lee

Learning Constrained Adaptive Differentiable Predictive Control Policies With Guarantees

We present differentiable predictive control (DPC), a method for learning constrained neural control policies for linear systems with probabilistic performance guarantees. We employ automatic differentiation to obtain direct policy…

Systems and Control · Electrical Eng. & Systems 2022-01-28 Jan Drgona , Aaron Tuor , Draguna Vrabie

Deterministic policy gradient based optimal control with probabilistic constraints

This paper studies a deep deterministic policy gradient (DDPG) based actor critic (AC) reinforcement learning (RL) technique to control a linear discrete-time system with a quadratic control cost while ensuring a constraint on the…

Systems and Control · Electrical Eng. & Systems 2023-12-22 Arunava Naha , Subhrakanti Dey

Deep Reinforcement Learning Control for Disturbance Rejection in a Nonlinear Dynamic System with Parametric Uncertainty

This work describes a technique for active rejection of multiple independent and time-correlated stochastic disturbances for a nonlinear flexible inverted pendulum with cart system with uncertain model parameters. The control law is…

Systems and Control · Electrical Eng. & Systems 2024-04-09 Vincent W. Hill

Learning with Stochastic Guidance for Navigation

Due to the sparse rewards and high degree of environment variation, reinforcement learning approaches such as Deep Deterministic Policy Gradient (DDPG) are plagued by issues of high variance when applied in complex real world environments.…

Robotics · Computer Science 2018-11-28 Linhai Xie , Yishu Miao , Sen Wang , Phil Blunsom , Zhihua Wang , Changhao Chen , Andrew Markham , Niki Trigoni

Deep Reinforcement Learning for Robotic Manipulation under Distribution Shift with Bounded Extremum Seeking

Reinforcement learning has shown strong performance in robotic manipulation, but learned policies often degrade in performance when test conditions differ from the training distribution. This limitation is especially important in…

Robotics · Computer Science 2026-04-02 Shaifalee Saxena , Rafael Fierro , Alexander Scheinker

Drag-reduction strategies in wall-bounded turbulent flows using deep reinforcement learning

In this work we compare different drag-reduction strategies that compute their actuation based on the fluctuations at a given wall-normal location in turbulent open channel flow. In order to perform this study, we implement and describe in…

Fluid Dynamics · Physics 2023-09-07 L. Guastoni , J. Rabault , H. Azizpour , R. Vinuesa

Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control

Uncertainty quantification is one of the central challenges for machine learning in real-world applications. In reinforcement learning, an agent confronts two kinds of uncertainty, called epistemic uncertainty and aleatoric uncertainty.…

Machine Learning · Computer Science 2023-07-06 Takuya Kanazawa , Haiyan Wang , Chetan Gupta

A Deep Reinforcement Learning Approach for Traffic Signal Control Optimization

Inefficient traffic signal control methods may cause numerous problems, such as traffic congestion and waste of energy. Reinforcement learning (RL) is a trending data-driven approach for adaptive traffic signal control in complex urban…

Signal Processing · Electrical Eng. & Systems 2021-07-14 Zhenning Li , Chengzhong Xu , Guohui Zhang

Distributed Distributional Deterministic Policy Gradients

This work adopts the very successful distributional perspective on reinforcement learning and adapts it to the continuous control setting. We combine this within a distributed framework for off-policy learning in order to develop what we…

Machine Learning · Computer Science 2018-04-25 Gabriel Barth-Maron , Matthew W. Hoffman , David Budden , Will Dabney , Dan Horgan , Dhruva TB , Alistair Muldal , Nicolas Heess , Timothy Lillicrap

Multi-vehicle Flocking Control with Deep Deterministic Policy Gradient Method

Flocking control has been studied extensively along with the wide application of multi-vehicle systems. In this paper the Multi-vehicles System (MVS) flocking control with collision avoidance and communication preserving is considered based…

Robotics · Computer Science 2018-06-04 Yang Lyu , Quan Pan , Jinwen Hu , Chunhui Zhao , Shuai Liu

Deep Reinforcement Learning for Autonomous Driving

Reinforcement learning has steadily improved and outperform human in lots of traditional games since the resurgence of deep neural network. However, these success is not easy to be copied to autonomous driving because the state spaces in…

Computer Vision and Pattern Recognition · Computer Science 2019-05-21 Sen Wang , Daoyuan Jia , Xinshuo Weng

Trajectory Optimization for Unknown Constrained Systems using Reinforcement Learning

In this paper, we propose a reinforcement learning-based algorithm for trajectory optimization for constrained dynamical systems. This problem is motivated by the fact that for most robotic systems, the dynamics may not always be known.…

Machine Learning · Statistics 2020-03-05 Kei Ota , Devesh K. Jha , Tomoaki Oiki , Mamoru Miura , Takashi Nammoto , Daniel Nikovski , Toshisada Mariyama

Curiosity-Driven Experience Prioritization via Density Estimation

In Reinforcement Learning (RL), an agent explores the environment and collects trajectories into the memory buffer for later learning. However, the collected trajectories can easily be imbalanced with respect to the achieved goal states.…

Machine Learning · Computer Science 2020-05-27 Rui Zhao , Volker Tresp

Deterministic Value-Policy Gradients

Reinforcement learning algorithms such as the deep deterministic policy gradient algorithm (DDPG) has been widely used in continuous control tasks. However, the model-free DDPG algorithm suffers from high sample complexity. In this paper we…

Machine Learning · Computer Science 2019-11-14 Qingpeng Cai , Ling Pan , Pingzhong Tang