Related papers: Differentiable Dynamic Programming for Structured …

Dynamic programming by polymorphic semiring algebraic shortcut fusion

Dynamic programming (DP) is an algorithmic design paradigm for the efficient, exact solution of otherwise intractable, combinatorial problems. However, DP algorithm design is often presented in an ad-hoc manner. It is sometimes difficult to…

Data Structures and Algorithms · Computer Science 2024-05-17 Max A. Little , Xi He , Ugur Kayas

Constrained Differential Dynamic Programming Revisited

Differential Dynamic Programming (DDP) has become a well established method for unconstrained trajectory optimization. Despite its several applications in robotics and controls however, a widely successful constrained version of the…

Optimization and Control · Mathematics 2020-05-05 Yuichiro Aoyama , George Boutselis , Akash Patel , Evangelos A. Theodorou

On Connections between Constrained Optimization and Reinforcement Learning

Dynamic Programming (DP) provides standard algorithms to solve Markov Decision Processes. However, these algorithms generally do not optimize a scalar objective function. In this paper, we draw connections between DP and (constrained)…

Machine Learning · Computer Science 2019-10-30 Nino Vieillard , Olivier Pietquin , Matthieu Geist

Robust Differential Dynamic Programming

Differential Dynamic Programming is an optimal control technique often used for trajectory generation. Many variations of this algorithm have been developed in the literature, including algorithms for stochastic dynamics or state and input…

Optimization and Control · Mathematics 2022-05-26 Dennis Gramlich , Carsten W. Scherer , Christian Ebenbauer

Deep Policy Dynamic Programming for Vehicle Routing Problems

Routing problems are a class of combinatorial problems with many practical applications. Recently, end-to-end deep learning methods have been proposed to learn approximate solution heuristics for such problems. In contrast, classical…

Machine Learning · Computer Science 2021-12-06 Wouter Kool , Herke van Hoof , Joaquim Gromicho , Max Welling

Distributionally Robust Differential Dynamic Programming with Wasserstein Distance

Differential dynamic programming (DDP) is a popular technique for solving nonlinear optimal control problems with locally quadratic approximations. However, existing DDP methods are not designed for stochastic systems with unknown…

Systems and Control · Electrical Eng. & Systems 2023-05-18 Astghik Hakobyan , Insoon Yang

Differentiable programming and its applications to dynamical systems

Differentiable programming is the combination of classical neural networks modules with algorithmic ones in an end-to-end differentiable model. These new models, that use automatic differentiation to calculate gradients, have new learning…

Dynamical Systems · Mathematics 2020-05-05 Adrián Hernández , José M. Amigó

Dual Dynamic Programming with cut selection: convergence proof and numerical experiments

We consider convex optimization problems formulated using dynamic programming equations. Such problems can be solved using the Dual Dynamic Programming algorithm combined with the Level 1 cut selection strategy or the Territory algorithm to…

Optimization and Control · Mathematics 2017-05-26 Vincent Guigues

A Unified Perspective on Multiple Shooting In Differential Dynamic Programming

Differential Dynamic Programming (DDP) is an efficient computational tool for solving nonlinear optimal control problems. It was originally designed as a single shooting method and thus is sensitive to the initial guess supplied. This work…

Robotics · Computer Science 2023-09-29 He Li , Wenhao Yu , Tingnan Zhang , Patrick M. Wensing

Differential Dynamic Programming for time-delayed systems

Trajectory optimization considers the problem of deciding how to control a dynamical system to move along a trajectory which minimizes some cost function. Differential Dynamic Programming (DDP) is an optimal control method which utilizes a…

Systems and Control · Computer Science 2017-01-10 David D. Fan , Evangelos A. Theodorou

An exact dynamic programming algorithm, lower and upper bounds, applied to the large block sale problem

In this article, we address a class of non convex, integer, non linear mathematical programs using dynamic programming. The mathematical program considered, whose properties are studied in this article, may be used to model the optimal…

Discrete Mathematics · Computer Science 2021-12-28 David Nizard , Nicolas Dupin , Dominique Quadri

A Differential Game Theoretic Neural Optimizer for Training Residual Networks

Connections between Deep Neural Networks (DNNs) training and optimal control theory has attracted considerable attention as a principled tool of algorithmic design. Differential Dynamic Programming (DDP) neural optimizer is a recently…

Machine Learning · Computer Science 2020-07-20 Guan-Horng Liu , Tianrong Chen , Evangelos A. Theodorou

PODDP: Partially Observable Differential Dynamic Programming for Latent Belief Space Planning

Autonomous agents are limited in their ability to observe the world state. Partially observable Markov decision processes (POMDPs) formally model the problem of planning under world state uncertainty, but POMDPs with continuous actions and…

Robotics · Computer Science 2020-07-08 Dicong Qiu , Yibiao Zhao , Chris L. Baker

Constrained Differential Dynamic Programming: A primal-dual augmented Lagrangian approach

Trajectory optimization is an efficient approach for solving optimal control problems for complex robotic systems. It relies on two key components: first the transcription into a sparse nonlinear program, and second the corresponding solver…

Robotics · Computer Science 2022-10-31 Wilson Jallet , Antoine Bambade , Nicolas Mansard , Justin Carpentier

DDPNOpt: Differential Dynamic Programming Neural Optimizer

Interpretation of Deep Neural Networks (DNNs) training as an optimal control problem with nonlinear dynamical systems has received considerable attention recently, yet the algorithmic development remains relatively limited. In this work, we…

Machine Learning · Computer Science 2021-06-14 Guan-Horng Liu , Tianrong Chen , Evangelos A. Theodorou

Parameterized Differential Dynamic Programming

Differential Dynamic Programming (DDP) is an efficient trajectory optimization algorithm relying on second-order approximations of a system's dynamics and cost function, and has recently been applied to optimize systems with time-invariant…

Optimization and Control · Mathematics 2022-04-11 Alex Oshin , Matthew D. Houghton , Michael J. Acheson , Irene M. Gregory , Evangelos A. Theodorou

Differential Dynamic Programming with Nonlinear Safety Constraints Under System Uncertainties

Safe operation of systems such as robots requires them to plan and execute trajectories subject to safety constraints. When those systems are subject to uncertainties in their dynamics, it is challenging to ensure that the constraints are…

Robotics · Computer Science 2022-01-13 Gokhan Alcan , Ville Kyrki

Fast, Differentiable and Sparse Top-k: a Convex Analysis Perspective

The top-k operator returns a sparse vector, where the non-zero values correspond to the k largest values of the input. Unfortunately, because it is a discontinuous function, it is difficult to incorporate in neural networks trained…

Machine Learning · Computer Science 2023-06-06 Michael E. Sander , Joan Puigcerver , Josip Djolonga , Gabriel Peyré , Mathieu Blondel

Robustness and risk management via distributional dynamic programming

In dynamic programming (DP) and reinforcement learning (RL), an agent learns to act optimally in terms of expected long-term return by sequentially interacting with its environment modeled by a Markov decision process (MDP). More generally…

Machine Learning · Computer Science 2022-01-03 Mastane Achab , Gergely Neu

Differentiable Optimal Control via Differential Dynamic Programming

Robot design optimization, imitation learning and system identification share a common problem which requires optimization over robot or task parameters at the same time as optimizing the robot motion. To solve these problems, we can use…

Robotics · Computer Science 2022-09-05 Traiko Dinev , Carlos Mastalli , Vladimir Ivan , Steve Tonneau , Sethu Vijayakumar