Related papers: Tropical Dynamic Programming for Lipschitz Multist…

Topological Approximate Dynamic Programming under Temporal Logic Constraints

In this paper, we develop a Topological Approximate Dynamic Programming (TADP) method for planning in stochastic systems modeled as Markov Decision Processes to maximize the probability of satisfying high-level system specifications expressed…

Optimization and Control · Mathematics 2020-08-04 Lening Li , Jie Fu

Approximate Dynamic Programming based on Projection onto the (min,+) subsemimodule

We develop a new Approximate Dynamic Programming (ADP) method for infinite horizon discounted reward Markov Decision Processes (MDP) based on projection onto a subsemimodule. We approximate the value function in terms of a $(\min,+)$ linear…

Systems and Control · Computer Science 2014-03-18 Chandrashekar Lakshminarayanan , Shalabh Bhatnagar

Maximum Entropy Differential Dynamic Programming

In this paper, we present a novel maximum entropy formulation of the Differential Dynamic Programming algorithm and derive two variants using unimodal and multimodal value functions parameterizations. By combining the maximum entropy…

Optimization and Control · Mathematics 2022-03-01 Oswin So , Ziyi Wang , Evangelos A. Theodorou

Analysis of approximate linear programming solution to Markov decision problem with log barrier function

There are two primary approaches to solving Markov decision problems (MDPs): dynamic programming based on the Bellman equation and linear programming (LP). Dynamic programming methods are the most widely used and form the foundation of both…

Artificial Intelligence · Computer Science 2026-02-24 Donghwan Lee , Hyukjun Yang , Bum Geun Park

Approximate dynamic programming with $(\min,+)$ linear function approximation for Markov decision processes

Markov Decision Processes (MDP) is a useful framework to cast optimal sequential decision making problems. Given any MDP the aim is to find the optimal action selection mechanism i.e., the optimal policy. Typically, the optimal policy…

Systems and Control · Computer Science 2014-03-18 Chandrashekar Lakshminarayanan , Shalabh Bhatnagar

A Generalization of Bellman's Equation with Application to Path Planning, Obstacle Avoidance and Invariant Set Estimation

The standard Dynamic Programming (DP) formulation can be used to solve Multi-Stage Optimization Problems (MSOP's) with additively separable objective functions. In this paper we consider a larger class of MSOP's with monotonically backward…

Optimization and Control · Mathematics 2020-10-15 Morgan Jones , Matthew Peet

Transformer-based Stagewise Decomposition for Large-Scale Multistage Stochastic Optimization

Solving large-scale multistage stochastic programming (MSP) problems poses a significant challenge as commonly used stagewise decomposition algorithms, including stochastic dual dynamic programming (SDDP), face growing time complexity as…

Machine Learning · Computer Science 2025-02-12 Chanyeong Kim , Jongwoong Park , Hyunglip Bae , Woo Chang Kim

Approximate Dynamic Programming via a Smoothed Linear Program

We present a novel linear program for the approximation of the dynamic programming cost-to-go function in high-dimensional stochastic control problems. LP approaches to approximate DP have typically relied on a natural 'projection' of a…

Optimization and Control · Mathematics 2009-10-05 V. V. Desai , V. F. Farias , C. C. Moallemi

An efficient DP algorithm on a tree-structure for finite horizon optimal control problems

The classical Dynamic Programming (DP) approach to optimal control problems is based on the characterization of the value function as the unique viscosity solution of a Hamilton-Jacobi-Bellman (HJB) equation. The DP scheme for the numerical…

Numerical Analysis · Mathematics 2019-04-15 Alessandro Alla , Maurizio Falcone , Luca Saluzzi

Stochastic Lipschitz Dynamic Programming

We propose a new algorithm for solving multistage stochastic mixed integer linear programming (MILP) problems with complete continuous recourse. In a similar way to cutting plane methods, we construct nonlinear Lipschitz cuts to build lower…

Optimization and Control · Mathematics 2019-05-24 Shabbir Ahmed , Filipe Goulart Cabral , Bernardo Freitas Paulo da Costa

A stochastic algorithm for deterministic multistage optimization problems

Several attempts to dampen the curse of dimensionality problem of the Dynamic Programming approach for solving multistage optimization problems have been investigated. One popular way to address this issue is the Stochastic Dual Dynamic…

Optimization and Control · Mathematics 2020-10-09 Marianne Akian , Jean-Philippe Chancelier , Benoît Tran

An Alternating Approach to Approximate Dynamic Programming

In this paper, we give a new approximate dynamic programming (ADP) method to solve large-scale Markov decision programming (MDP) problems. In comparison with many classic ADP methods which have a large number of constraints, we formulate an…

Optimization and Control · Mathematics 2025-07-15 Di Zhang

An Approximate Dynamic Programming Algorithm for Monotone Value Functions

Many sequential decision problems can be formulated as Markov Decision Processes (MDPs) where the optimal value function (or cost-to-go function) can be shown to satisfy a monotone structure in some or all of its dimensions. When the state…

Optimization and Control · Mathematics 2015-09-03 Daniel R. Jiang , Warren B. Powell

Constrained Differential Dynamic Programming Revisited

Differential Dynamic Programming (DDP) has become a well established method for unconstrained trajectory optimization. Despite its several applications in robotics and controls however, a widely successful constrained version of the…

Optimization and Control · Mathematics 2020-05-05 Yuichiro Aoyama , George Boutselis , Akash Patel , Evangelos A. Theodorou

A Dynamic Programming Approach to Evaluating Multivariate Gaussian Probabilities

We propose a method of approximating multivariate Gaussian probabilities using dynamic programming. We show that solving the optimization problem associated with a class of discrete-time finite horizon Markov decision processes with…

Optimization and Control · Mathematics 2018-02-08 Morgan Jones , Matthew M. Peet

Unrolling Dynamic Programming via Graph Filters

Dynamic programming (DP) is a fundamental tool used across many engineering fields. The main goal of DP is to solve Bellman's optimality equations for a given Markov decision process (MDP). Standard methods like policy iteration exploit the…

Artificial Intelligence · Computer Science 2025-07-30 Sergio Rozada , Samuel Rey , Gonzalo Mateos , Antonio G. Marques

Approximate Dynamic Programming By Minimizing Distributionally Robust Bounds

Approximate dynamic programming is a popular method for solving large Markov decision processes. This paper describes a new class of approximate dynamic programming (ADP) methods, distributionally robust ADP, that address the curse of…

Machine Learning · Statistics 2012-05-22 Marek Petrik

Efficient Learning for Entropy-Regularized Markov Decision Processes via Multilevel Monte Carlo

Designing efficient learning algorithms with complexity guarantees for Markov decision processes (MDPs) with large or continuous state and action spaces remains a fundamental challenge. We address this challenge for entropy-regularized MDPs…

Machine Learning · Computer Science 2025-06-05 Matthieu Meunier , Christoph Reisinger , Yufei Zhang

Nonlinear Monte Carlo methods with polynomial runtime for Bellman equations of discrete time high-dimensional stochastic optimal control problems

Discrete time stochastic optimal control problems and Markov decision processes (MDPs), respectively, serve as fundamental models for problems that involve sequential decision making under uncertainty and as such constitute the theoretical…

Optimization and Control · Mathematics 2023-03-08 Christian Beck , Arnulf Jentzen , Konrad Kleinberg , Thomas Kruse

Approximate Dynamic Programming via Sum of Squares Programming

We describe an approximate dynamic programming method for stochastic control problems on infinite state and input spaces. The optimal value function is approximated by a linear combination of basis functions with coefficients as decision…

Optimization and Control · Mathematics 2012-12-07 Tyler H. Summers , Konstantin Kunz , Nikolaos Kariotoglou , Maryam Kamgarpour , Sean Summers , John Lygeros