Related papers: Gaining efficiency in deep policy gradient method …

200 papers

This paper studies an optimal control problem for continuous-time stochastic systems subject to reachability objectives specified in a subclass of metric interval temporal logic specifications, a temporal logic with real-time constraints.…

Systems and Control · Computer Science 2015-04-21 Jie Fu, Ufuk Topcu

This paper focuses on the optimal control problem for a class of discrete-time nonlinear systems. In practical applications, computation time is a crucial consideration when solving nonlinear optimal control problems, especially under real-time…

Optimization and Control · Mathematics 2025-04-01 Chuanzhi Lv, Xunmin Yin, Hongdan Li, Huanshui Zhang

In this paper, we consider a class of continuous-time, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation methods and sampling-based algorithms for deterministic path planning,…

Robotics · Computer Science 2012-02-27 Vu Anh Huynh, Sertac Karaman, Emilio Frazzoli

This paper is concerned with the computing efficiency of model predictive control (MPC) problems for dynamical systems with both rate and amplitude constraints on the inputs. Instead of augmenting the decision variables of the underlying…

Optimization and Control · Mathematics 2020-03-13 Idris Kempf, Paul Goulart, Stephen Duncan

In this article, we discuss two algorithms tailored to discrete-time deterministic finite-horizon nonlinear optimal control problems or so-called deterministic trajectory optimization problems. Both algorithms can be derived from an…

Optimization and Control · Mathematics 2024-12-10 Mohammad Mahmoudi Filabadi, Tom Lefebvre, Guillaume Crevecoeur

This paper studies an infinite horizon optimal control problem for a discrete-time linear system and quadratic criteria, both with random parameters which are independent and identically distributed with respect to time. In this general…

Optimization and Control · Mathematics 2024-03-04 Deyue Li

We consider policy gradient methods for stochastic optimal control problems in continuous time. In particular, we analyze the gradient flow for the control, viewed as a continuous time limit of the policy gradient method. We prove the global…

Optimization and Control · Mathematics 2025-04-15 Mo Zhou, Jianfeng Lu

In recent years, deep learning has been connected with optimal control as a way to define a notion of a continuous underlying learning problem. In this view, neural networks can be interpreted as a discretization of a parametric Ordinary…

Optimization and Control · Mathematics 2020-07-07 Joubine Aghili, Olga Mula

The article discusses the gradient discretisation method (GDM) for distributed optimal control problems governed by a diffusion equation with pure Neumann boundary conditions. Using the GDM framework makes it possible to develop an analysis that…

Numerical Analysis · Mathematics 2018-10-09 Jerome Droniou, Neela Nataraj, Devika Shylaja

We prove convergence of the proximal policy gradient method for a class of constrained stochastic control problems with control in both the drift and diffusion of the state process. The problem requires either the running or terminal cost…

Optimization and Control · Mathematics 2025-05-27 Ashley Davey, Harry Zheng

We study a Q-learning algorithm for continuous time stochastic control problems. The proposed algorithm uses the sampled state process by discretizing the state and control action spaces under piecewise constant control processes. We show…

Optimization and Control · Mathematics 2023-03-10 Erhan Bayraktar, Ali Devran Kara

We study the estimation of policy gradients for continuous-time systems with known dynamics. By reframing policy learning in continuous-time, we show that it is possible to construct a more efficient and accurate gradient estimator. The…

Machine Learning · Computer Science 2021-06-25 Samuel Ainsworth, Kendall Lowrey, John Thickstun, Zaid Harchaoui, Siddhartha Srinivasa

We study the global linear convergence of policy gradient (PG) methods for finite-horizon continuous-time exploratory linear-quadratic control (LQC) problems. The setting includes stochastic LQC problems with indefinite costs and allows…

Optimization and Control · Mathematics 2024-03-05 Michael Giegrich, Christoph Reisinger, Yufei Zhang

We develop policy gradient methods for stochastic control with exit time in a model-free setting. We propose two types of algorithms for learning either directly the optimal policy or by learning alternately the value function (critic) and…

Computational Finance · Quantitative Finance 2023-02-16 Mohamed Hamdouche, Pierre Henry-Labordere, Huyen Pham

We propose a comprehensive framework for policy gradient methods tailored to continuous time reinforcement learning. This is based on the connection between stochastic control problems and randomised problems, enabling applications across…

Optimization and Control · Mathematics 2024-05-01 Robert Denkert, Huyên Pham, Xavier Warin

A new method for stochastic control based on neural networks and using randomisation of discrete random variables is proposed and applied to optimal stopping time problems. The method models directly the policy and does not need the…

Computational Finance · Quantitative Finance 2021-01-11 Thomas Deschatre, Joseph Mikael

We present a tree structure algorithm for optimal control problems with state constraints. We prove a convergence result for a discrete time approximation of the value function based on a novel formulation of the constrained problem. Then…

Numerical Analysis · Mathematics 2020-09-29 Alessandro Alla, Maurizio Falcone, Luca Saluzzi

The ability of Gaussian processes (GPs) to predict the behavior of dynamical systems as a more sample-efficient alternative to parametric models seems promising for real-world robotics research. However, the computational complexity of GPs…

Robotics · Computer Science 2022-03-01 Abdolreza Taheri, Joni Pajarinen, Reza Ghabcheloo

In this paper, we propose a class of discrete-time approximation schemes for stochastic optimal control problems under the $G$-expectation framework. The proposed schemes are constructed recursively based on a piecewise constant policy. We…

Optimization and Control · Mathematics 2021-10-05 Lianzi Jiang

We establish a variety of results extending the well-known Pontryagin maximum principle of optimal control to discrete-time optimal control problems posed on smooth manifolds. These results are organized around a new theorem on critical and…

Optimization and Control · Mathematics 2017-07-14 Robert Kipka, Rohit Gupta