Related papers: Maximum Entropy Differential Dynamic Programming

Robust Differential Dynamic Programming

Differential Dynamic Programming is an optimal control technique often used for trajectory generation. Many variations of this algorithm have been developed in the literature, including algorithms for stochastic dynamics or state and input…

Optimization and Control · Mathematics · 2022-05-26 · Dennis Gramlich, Carsten W. Scherer, Christian Ebenbauer

Constrained Differential Dynamic Programming Revisited

Differential Dynamic Programming (DDP) has become a well-established method for unconstrained trajectory optimization. Despite its several applications in robotics and controls, however, a widely successful constrained version of the…

Optimization and Control · Mathematics · 2020-05-05 · Yuichiro Aoyama, George Boutselis, Akash Patel, Evangelos A. Theodorou

Dynamic Programming-based Approximate Optimal Control for Model-Based Reinforcement Learning

This article proposes an improved trajectory optimization approach for stochastic optimal control of dynamical systems affected by measurement noise by combining optimal control with maximum likelihood techniques to improve the reduction of…

Systems and Control · Electrical Eng. & Systems · 2023-12-25 · Prakash Mallick, Zhiyong Chen

Accelerated Point-wise Maximum Approach to Approximate Dynamic Programming

We describe an approximate dynamic programming approach to compute lower bounds on the optimal value function for a discrete time, continuous space, infinite horizon setting. The approach iteratively constructs a family of lower bounding…

Systems and Control · Electrical Eng. & Systems · 2024-12-20 · Paul N. Beuchat, Joseph Warrington, John Lygeros

Relationships Between the Maximum Principle and Dynamic Programming for Infinite Dimensional Stochastic Control Systems

The Pontryagin-type maximum principle and Bellman's dynamic programming principle serve as two of the most important tools in solving optimal control problems. There is a huge literature on the study of the relationship between them. The main…

Optimization and Control · Mathematics · 2021-12-30 · Liangying Chen, Qi Lü

Parameterized Differential Dynamic Programming

Differential Dynamic Programming (DDP) is an efficient trajectory optimization algorithm relying on second-order approximations of a system's dynamics and cost function, and has recently been applied to optimize systems with time-invariant…

Optimization and Control · Mathematics · 2022-04-11 · Alex Oshin, Matthew D. Houghton, Michael J. Acheson, Irene M. Gregory, Evangelos A. Theodorou

Dynamic Programming Subject to Total Variation Distance Ambiguity

The aim of this paper is to address optimality of stochastic control strategies via dynamic programming subject to total variation distance ambiguity on the conditional distribution of the controlled process. We formulate the stochastic…

Optimization and Control · Mathematics · 2014-02-06 · Ioannis Tzortzis, Charalambos D. Charalambous, Themistoklis Charalambous

Tropical Dynamic Programming for Lipschitz Multistage Stochastic Programming

We present an algorithm called Tropical Dynamic Programming (TDP) which builds upper and lower approximations of the Bellman value functions in risk-neutral Multistage Stochastic Programming (MSP), with independent noises of finite…

Optimization and Control · Mathematics · 2020-10-22 · Marianne Akian, Jean-Philippe Chancelier, Benoît Tran

Exact Dynamic Programming for Positive Systems with Linear Optimal Cost

Recent work [Ran22] formulated a class of optimal control problems involving positive linear systems, linear stage costs, and elementwise constraints on control. It was shown that the problem admits linear optimal cost and the associated…

Optimization and Control · Mathematics · 2023-09-27 · Yuchao Li, Anders Rantzer

A Dynamic Programming Formulation for the Nonlinear Filter

This paper builds on our recent work where we presented a dual stochastic optimal control formulation of the nonlinear filtering problem [1]. The constraint for the dual problem is a backward stochastic differential equation (BSDE). The…

Optimization and Control · Mathematics · 2021-11-02 · Jin Won Kim, Prashant G. Mehta

Soft Policy Gradient Method for Maximum Entropy Deep Reinforcement Learning

Maximum entropy deep reinforcement learning (RL) methods have been demonstrated on a range of challenging continuous tasks. However, existing methods either suffer from severe instability when training on large off-policy data or cannot…

Machine Learning · Computer Science · 2019-09-10 · Wenjie Shi, Shiji Song, Cheng Wu

Generalized Maximum Entropy Differential Dynamic Programming

We present a sampling-based trajectory optimization method derived from the maximum entropy formulation of Differential Dynamic Programming with Tsallis entropy. This method is a generalization of the legacy work with Shannon entropy, which…

Optimization and Control · Mathematics · 2024-09-18 · Yuichiro Aoyama, Evangelos A. Theodorou

Continuous-Time Robust Dynamic Programming

This paper presents a new theory, known as robust dynamic programming, for a class of continuous-time dynamical systems. Different from traditional dynamic programming (DP) methods, this new theory serves as a fundamental tool to analyze…

Optimization and Control · Mathematics · 2018-09-18 · Tao Bian, Zhong-Ping Jiang

Maximum Likelihood Constraint Inference from Stochastic Demonstrations

When an expert operates a perilous dynamic system, ideal constraint information is tacitly contained in their demonstrated trajectories and controls. The likelihood of these demonstrations can be computed, given the system dynamics and task…

Systems and Control · Electrical Eng. & Systems · 2021-02-26 · David L. McPherson, Kaylene C. Stocking, S. Shankar Sastry

Entropy Maximization for Markov Decision Processes Under Temporal Logic Constraints

We study the problem of synthesizing a policy that maximizes the entropy of a Markov decision process (MDP) subject to a temporal logic constraint. Such a policy minimizes the predictability of the paths it generates, or dually, maximizes…

Optimization and Control · Mathematics · 2019-06-17 · Yagiz Savas, Melkior Ornik, Murat Cubuktepe, Mustafa O. Karabag, Ufuk Topcu

Interior Point Differential Dynamic Programming

This paper introduces a novel Differential Dynamic Programming (DDP) algorithm for solving discrete-time finite-horizon optimal control problems with inequality constraints. Two variants, namely Feasible- and Infeasible-IPDDP algorithms,…

Systems and Control · Electrical Eng. & Systems · 2020-10-21 · Andrei Pavlov, Iman Shames, Chris Manzie

Relationships Between the Maximum Principle and Dynamic Programming for Infinite Dimensional Non-Markovian Stochastic Control Systems

This paper investigates the relationship between Pontryagin's maximum principle and dynamic programming principle in the context of stochastic optimal control systems governed by stochastic evolution equations with random coefficients in…

Optimization and Control · Mathematics · 2025-11-05 · Dingqian Gao, Qi Lü

Dynamic Programming Principles for Optimal Stopping with Expectation Constraint

We analyze an optimal stopping problem with a constraint on the expected cost. When the reward function and cost function are Lipschitz continuous in state variable, we show that the value of such an optimal stopping problem is a continuous…

Optimization and Control · Mathematics · 2017-08-08 · Erhan Bayraktar, Song Yao

A Generalization of Bellman's Equation with Application to Path Planning, Obstacle Avoidance and Invariant Set Estimation

The standard Dynamic Programming (DP) formulation can be used to solve Multi-Stage Optimization Problems (MSOP's) with additively separable objective functions. In this paper we consider a larger class of MSOP's with monotonically backward…

Optimization and Control · Mathematics · 2020-10-15 · Morgan Jones, Matthew Peet

A (Slightly) Improved Deterministic Approximation Algorithm for Metric TSP

We show that the max entropy algorithm can be derandomized (with respect to a particular objective function) to give a deterministic $3/2-\epsilon$ approximation algorithm for metric TSP for some $\epsilon > 10^{-36}$. To obtain our result,…

Data Structures and Algorithms · Computer Science · 2022-12-14 · Anna R. Karlin, Nathan Klein, Shayan Oveis Gharan