Related papers: Gaining efficiency in deep policy gradient method …

Computational methods for stochastic control with metric interval temporal logic specifications

This paper studies an optimal control problem for continuous-time stochastic systems subject to reachability objectives specified in a subclass of metric interval temporal logic specifications, a temporal logic with real-time constraints.…

Systems and Control · Computer Science 2015-04-21 Jie Fu , Ufuk Topcu

Optimal Control of Discrete-Time Nonlinear Systems

This paper focuses on optimal control problem for a class of discrete-time nonlinear systems. In practical applications, computation time is a crucial consideration when solving nonlinear optimal control problems, especially under real-time…

Optimization and Control · Mathematics 2025-04-01 Chuanzhi Lv , Xunmin Yin , Hongdan Li , Huanshui Zhang

An Incremental Sampling-based Algorithm for Stochastic Optimal Control

In this paper, we consider a class of continuous-time, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation methods and sampling-based algorithms for deterministic path planning,…

Robotics · Computer Science 2012-02-27 Vu Anh Huynh , Sertac Karaman , Emilio Frazzoli

Fast Gradient Method for Model Predictive Control with Input Rate and Amplitude Constraints

This paper is concerned with the computing efficiency of model predictive control (MPC) problems for dynamical systems with both rate and amplitude constraints on the inputs. Instead of augmenting the decision variables of the underlying…

Optimization and Control · Mathematics 2020-03-13 Idris Kempf , Paul Goulart , Stephen Duncan

Deterministic Trajectory Optimization through Probabilistic Optimal Control

In this article, we discuss two algorithms tailored to discrete-time deterministic finite-horizon nonlinear optimal control problems or so-called deterministic trajectory optimization problems. Both algorithms can be derived from an…

Optimization and Control · Mathematics 2024-12-10 Mohammad Mahmoudi Filabadi , Tom Lefebvre , Guillaume Crevecoeur

Policy Gradient Methods for Discrete Time Linear Quadratic Regulator With Random Parameters

This paper studies an infinite horizon optimal control problem for discrete-time linear system and quadratic criteria, both with random parameters which are independent and identically distributed with respect to time. In this general…

Optimization and Control · Mathematics 2024-03-04 Deyue Li

A Policy Gradient Framework for Stochastic Optimal Control Problems with Global Convergence Guarantee

We consider policy gradient methods for stochastic optimal control problem in continuous time. In particular, we analyze the gradient flow for the control, viewed as a continuous time limit of the policy gradient method. We prove the global…

Optimization and Control · Mathematics 2025-04-15 Mo Zhou , Jianfeng Lu

Depth-Adaptive Neural Networks from the Optimal Control viewpoint

In recent years, deep learning has been connected with optimal control as a way to define a notion of a continuous underlying learning problem. In this view, neural networks can be interpreted as a discretization of a parametric Ordinary…

Optimization and Control · Mathematics 2020-07-07 Joubine Aghili , Olga Mula

Numerical analysis for the pure Neumann control problem using the gradient discretisation method

The article discusses the gradient discretisation method (GDM) for distributed optimal control problems governed by diffusion equation with pure Neumann boundary condition. Using the GDM framework enables to develop an analysis that…

Numerical Analysis · Mathematics 2018-10-09 Jerome Droniou , Neela Nataraj , Devika Shylaja

Convergence of Proximal Policy Gradient Method for Problems with Control Dependent Diffusion Coefficients

We prove convergence of the proximal policy gradient method for a class of constrained stochastic control problems with control in both the drift and diffusion of the state process. The problem requires either the running or terminal cost…

Optimization and Control · Mathematics 2025-05-27 Ashley Davey , Harry Zheng

Approximate Q-Learning for Controlled Diffusion Processes and its Near Optimality

We study a Q learning algorithm for continuous time stochastic control problems. The proposed algorithm uses the sampled state process by discretizing the state and control action spaces under piece-wise constant control processes. We show…

Optimization and Control · Mathematics 2023-03-10 Erhan Bayraktar , Ali Devran Kara

Faster Policy Learning with Continuous-Time Gradients

We study the estimation of policy gradients for continuous-time systems with known dynamics. By reframing policy learning in continuous-time, we show that it is possible construct a more efficient and accurate gradient estimator. The…

Machine Learning · Computer Science 2021-06-25 Samuel Ainsworth , Kendall Lowrey , John Thickstun , Zaid Harchaoui , Siddhartha Srinivasa

Convergence of policy gradient methods for finite-horizon exploratory linear-quadratic control problems

We study the global linear convergence of policy gradient (PG) methods for finite-horizon continuous-time exploratory linear-quadratic control (LQC) problems. The setting includes stochastic LQC problems with indefinite costs and allows…

Optimization and Control · Mathematics 2024-03-05 Michael Giegrich , Christoph Reisinger , Yufei Zhang

Policy gradient learning methods for stochastic control with exit time and applications to share repurchase pricing

We develop policy gradients methods for stochastic control with exit time in a model-free setting. We propose two types of algorithms for learning either directly the optimal policy or by learning alternately the value function (critic) and…

Computational Finance · Quantitative Finance 2023-02-16 Mohamed Hamdouche , Pierre Henry-Labordere , Huyen Pham

Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching

We propose a comprehensive framework for policy gradient methods tailored to continuous time reinforcement learning. This is based on the connection between stochastic control problems and randomised problems, enabling applications across…

Optimization and Control · Mathematics 2024-05-01 Robert Denkert , Huyên Pham , Xavier Warin

Deep combinatorial optimisation for optimal stopping time problems : application to swing options pricing

A new method for stochastic control based on neural networks and using randomisation of discrete random variables is proposed and applied to optimal stopping time problems. The method models directly the policy and does not need the…

Computational Finance · Quantitative Finance 2021-01-11 Thomas Deschatre , Joseph Mikael

A tree structure algorithm for optimal control problems with state constraints

We present a tree structure algorithm for optimal control problems with state constraints. We prove a convergence result for a discrete time approximation of the value function based on a novel formulation of the constrained problem. Then…

Numerical Analysis · Mathematics 2020-09-29 Alessandro Alla , Maurizio Falcone , Luca Saluzzi

GPU-Accelerated Policy Optimization via Batch Automatic Differentiation of Gaussian Processes for Real-World Control

The ability of Gaussian processes (GPs) to predict the behavior of dynamical systems as a more sample-efficient alternative to parametric models seems promising for real-world robotics research. However, the computational complexity of GPs…

Robotics · Computer Science 2022-03-01 Abdolreza Taheri , Joni Pajarinen , Reza Ghabcheloo

Discrete-time approximation for stochastic optimal control problems under the $G$-expectation framework

In this paper, we propose a class of discrete-time approximation schemes for stochastic optimal control problems under the $G$-expectation framework. The proposed schemes are constructed recursively based on piecewise constant policy. We…

Optimization and Control · Mathematics 2021-10-05 Lianzi Jiang

The Discrete-Time Geometric Maximum Principle

We establish a variety of results extending the well-known Pontryagin maximum principle of optimal control to discrete-time optimal control problems posed on smooth manifolds. These results are organized around a new theorem on critical and…

Optimization and Control · Mathematics 2017-07-14 Robert Kipka , Rohit Gupta