Related papers: Regularly Updated Deterministic Policy Gradient Al…

Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control

This paper presents a robust reinforcement learning algorithm called robust deterministic policy gradient (RDPG), which reformulates the H-infinity control problem as a two-player zero-sum dynamic game between a user and an adversary. The…

Robotics · Computer Science 2025-12-04 Taeho Lee , Donghwan Lee

Edge Delayed Deep Deterministic Policy Gradient: efficient continuous control for edge scenarios

Deep Reinforcement Learning is gaining increasing attention thanks to its capability to learn complex policies in high-dimensional settings. Recent advancements utilize a dual-network architecture to learn optimal policies through the…

Machine Learning · Computer Science 2025-10-14 Alberto Sinigaglia , Niccolò Turcato , Ruggero Carli , Gian Antonio Susto

Deterministic Policy Gradient for Reinforcement Learning with Continuous Time and State

The theory of continuous-time reinforcement learning (RL) has progressed rapidly in recent years. While the ultimate objective of RL is typically to learn deterministic control policies, most existing continuous-time RL methods rely on…

Machine Learning · Computer Science 2026-03-17 Ziheng Cheng , Xin Guo , Yufei Zhang

Deterministic Value-Policy Gradients

Reinforcement learning algorithms such as the deep deterministic policy gradient algorithm (DDPG) have been widely used in continuous control tasks. However, the model-free DDPG algorithm suffers from high sample complexity. In this paper we…

Machine Learning · Computer Science 2019-11-14 Qingpeng Cai , Ling Pan , Pingzhong Tang

Adversarial Deep Reinforcement Learning in Portfolio Management

In this paper, we implement three state-of-art continuous reinforcement learning algorithms, Deep Deterministic Policy Gradient (DDPG), Proximal Policy Optimization (PPO) and Policy Gradient (PG) in portfolio management. All of them are…

Portfolio Management · Quantitative Finance 2018-11-20 Zhipeng Liang , Hao Chen , Junhao Zhu , Kangkang Jiang , Yanran Li

Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments

Deep Deterministic Policy Gradient (DDPG) has been proved to be a successful reinforcement learning (RL) algorithm for continuous control tasks. However, DDPG still suffers from data insufficiency and training inefficiency, especially in…

Machine Learning · Computer Science 2019-03-05 Zhizheng Zhang , Jiale Chen , Zhibo Chen , Weiping Li

Data-efficient Deep Reinforcement Learning for Dexterous Manipulation

Deep learning and reinforcement learning methods have recently been used to solve a variety of problems in continuous control domains. An obvious application of these techniques is dexterous manipulation tasks in robotics which are…

Machine Learning · Computer Science 2017-04-12 Ivaylo Popov , Nicolas Heess , Timothy Lillicrap , Roland Hafner , Gabriel Barth-Maron , Matej Vecerik , Thomas Lampe , Yuval Tassa , Tom Erez , Martin Riedmiller

Deep reinforcement learning for optimal trading with partial information

Reinforcement Learning (RL) applied to financial problems has been the subject of a lively area of research. The use of RL for optimal trading strategies that exploit latent information in the market is, to the best of our knowledge, not…

Trading and Market Microstructure · Quantitative Finance 2025-11-04 Andrea Macrì , Sebastian Jaimungal , Fabrizio Lillo

Distributed Distributional Deterministic Policy Gradients

This work adopts the very successful distributional perspective on reinforcement learning and adapts it to the continuous control setting. We combine this within a distributed framework for off-policy learning in order to develop what we…

Machine Learning · Computer Science 2018-04-25 Gabriel Barth-Maron , Matthew W. Hoffman , David Budden , Will Dabney , Dan Horgan , Dhruva TB , Alistair Muldal , Nicolas Heess , Timothy Lillicrap

Deterministic policy gradient based optimal control with probabilistic constraints

This paper studies a deep deterministic policy gradient (DDPG) based actor critic (AC) reinforcement learning (RL) technique to control a linear discrete-time system with a quadratic control cost while ensuring a constraint on the…

Systems and Control · Electrical Eng. & Systems 2023-12-22 Arunava Naha , Subhrakanti Dey

DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning

This paper prescribes a suite of techniques for off-policy Reinforcement Learning (RL) that simplify the training process and reduce the sample complexity. First, we show that simple Deterministic Policy Gradient works remarkably well as…

Machine Learning · Computer Science 2020-06-30 Rasool Fakoor , Pratik Chaudhari , Alexander J. Smola

Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control

Uncertainty quantification is one of the central challenges for machine learning in real-world applications. In reinforcement learning, an agent confronts two kinds of uncertainty, called epistemic uncertainty and aleatoric uncertainty.…

Machine Learning · Computer Science 2023-07-06 Takuya Kanazawa , Haiyan Wang , Chetan Gupta

Deterministic Policy Gradients With General State Transitions

We study a reinforcement learning setting, where the state transition function is a convex combination of a stochastic continuous function and a deterministic function. Such a setting generalizes the widely-studied stochastic state…

Machine Learning · Computer Science 2018-10-03 Qingpeng Cai , Ling Pan , Pingzhong Tang

Zeroth-order Deterministic Policy Gradient

Deterministic Policy Gradient (DPG) removes a level of randomness from standard randomized-action Policy Gradient (PG), and demonstrates substantial empirical success for tackling complex dynamic problems involving Markov decision…

Machine Learning · Computer Science 2020-07-14 Harshat Kumar , Dionysios S. Kalogerias , George J. Pappas , Alejandro Ribeiro

Taming the Adversary: Stable Minimax Deep Deterministic Policy Gradient via Fractional Objectives

Reinforcement learning (RL) has achieved remarkable success in a wide range of control and decision-making tasks. However, RL agents often exhibit unstable or degraded performance when deployed in environments subject to unexpected external…

Machine Learning · Computer Science 2026-03-13 Taeho Lee , Donghwan Lee

Deep Reinforcement Learning for Online Optimal Execution Strategies

This paper tackles the challenge of learning non-Markovian optimal execution strategies in dynamic financial markets. We introduce a novel actor-critic algorithm based on Deep Deterministic Policy Gradient (DDPG) to address this issue, with…

Machine Learning · Computer Science 2024-10-18 Alessandro Micheli , Mélodie Monod

Application of linear regression and quasi-Newton methods to the deep reinforcement learning in continuous action cases

The linear regression (LR) method offers the advantage that optimal parameters can be calculated relatively easily, although its representation capability is more limited than that of the deep learning technique. To improve deep reinforcement…

Machine Learning · Computer Science 2025-04-28 Hisato Komatsu

Solving Reach-Avoid-Stay Problems Using Deep Deterministic Policy Gradients

Reach-Avoid-Stay (RAS) optimal control enables systems such as robots and air taxis to reach their targets, avoid obstacles, and stay near the target. However, current methods for RAS often struggle with handling complex, dynamic…

Systems and Control · Electrical Eng. & Systems 2024-10-10 Gabriel Chenevert , Jingqi Li , Achyuta kannan , Sangjae Bae , Donggun Lee

Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient

Deep reinforcement learning (RL) algorithms typically parameterize the policy as a deep network that outputs either a deterministic action or a stochastic one modeled as a Gaussian distribution, hence restricting learning to a single…

Machine Learning · Computer Science 2024-06-04 Zechu Li , Rickmer Krohn , Tao Chen , Anurag Ajay , Pulkit Agrawal , Georgia Chalvatzaki

Quantum reinforcement learning in continuous action space

Quantum reinforcement learning (QRL) is a promising paradigm for near-term quantum devices. While existing QRL methods have shown success in discrete action spaces, extending these techniques to continuous domains is challenging due to the…

Quantum Physics · Physics 2025-03-19 Shaojun Wu , Shan Jin , Dingding Wen , Donghong Han , Xiaoting Wang