Related papers: Gradient-Bounded Dynamic Programming with Submodul…

Gradient-Bounded Dynamic Programming for Submodular and Concave Extensible Value Functions with Probabilistic Performance Guarantees

We consider stochastic dynamic programming problems with high-dimensional, discrete state-spaces and finite, discrete-time horizons that prohibit direct computation of the value function from a given Bellman equation for all states and time…

Optimization and Control · Mathematics 2020-06-05 Denis Lebedev, Paul Goulart, Kostas Margellos

Dynamic Programming for Optimal Delivery Time Slot Pricing

We study the dynamic programming approach to revenue management in the context of attended home delivery. We draw on results from dynamic programming theory for Markov decision problems to show that the underlying Bellman operator has a…

Optimization and Control · Mathematics 2019-10-28 Denis Lebedev, Paul Goulart, Kostas Margellos

Accelerated Point-wise Maximum Approach to Approximate Dynamic Programming

We describe an approximate dynamic programming approach to compute lower bounds on the optimal value function for a discrete time, continuous space, infinite horizon setting. The approach iteratively constructs a family of lower bounding…

Systems and Control · Electrical Eng. & Systems 2024-12-20 Paul N. Beuchat, Joseph Warrington, John Lygeros

Generalized Dual Dynamic Programming for Infinite Horizon Problems in Continuous State and Action Spaces

We describe a nonlinear generalization of dual dynamic programming theory and its application to value function estimation for deterministic control problems over continuous state and action spaces, in a discrete-time infinite horizon…

Optimization and Control · Mathematics 2018-10-05 Joseph Warrington, Paul N. Beuchat, John Lygeros

A Concave Value Function Extension for the Dynamic Programming Approach to Revenue Management in Attended Home Delivery

We study the approximate dynamic programming approach to revenue management in the context of attended home delivery. We draw on results from dynamic programming theory for Markov decision problems, convex optimisation and discrete convex…

Optimization and Control · Mathematics 2019-03-18 Denis Lebedev, Paul Goulart, Kostas Margellos

A Convex Optimization Approach to Dynamic Programming in Continuous State and Action Spaces

In this paper, a convex optimization-based method is proposed for numerically solving dynamic programs in continuous state and action spaces. The key idea is to approximate the output of the Bellman operator at a particular state by the…

Optimization and Control · Mathematics 2020-10-23 Insoon Yang

Approximate Dynamic Programming for Delivery Time Slot Pricing: a Sensitivity Analysis

We consider the revenue management problem of finding profit-maximising prices for delivery time slots in the context of attended home delivery. This multi-stage optimal control problem admits a dynamic programming formulation that is…

Optimization and Control · Mathematics 2020-08-04 Denis Lebedev, Kostas Margellos, Paul Goulart

Approximate Dynamic Programming via Sum of Squares Programming

We describe an approximate dynamic programming method for stochastic control problems on infinite state and input spaces. The optimal value function is approximated by a linear combination of basis functions with coefficients as decision…

Optimization and Control · Mathematics 2012-12-07 Tyler H. Summers, Konstantin Kunz, Nikolaos Kariotoglou, Maryam Kamgarpour, Sean Summers, John Lygeros

A Uniform-grid Discretization Algorithm for Stochastic Control with Risk Constraints

In this paper, we present a discretization algorithm for finite horizon risk constrained dynamic programming algorithm in [Chow_Pavone_13]. Although in a theoretical standpoint, Bellman's recursion provides a systematic way to find optimal…

Optimization and Control · Mathematics 2015-01-12 Yin-Lam Chow, Marco Pavone

Dynamic programming and dimensionality in convex stochastic optimization and control

This paper studies stochastic optimization problems and associated Bellman equations in formats that allow for reduced dimensionality of the cost-to-go functions. In particular, we study stochastic control problems in the…

Optimization and Control · Mathematics 2025-05-20 Teemu Pennanen, Ari-Pekka Perkkiö

Value and Policy Iteration in Optimal Control and Adaptive Dynamic Programming

In this paper, we consider discrete-time infinite horizon problems of optimal control to a terminal set of states. These are the problems that are often taken as the starting point for adaptive dynamic programming. Under very general…

Systems and Control · Computer Science 2015-10-05 Dimitri P. Bertsekas

Quantum speedups for convex dynamic programming

We present a quantum algorithm to solve dynamic programming problems with convex value functions. For linear discrete-time systems with a $d$-dimensional state space of size $N$, the proposed algorithm outputs a quantum-mechanical…

Quantum Physics · Physics 2021-03-18 David Sutter, Giacomo Nannicini, Tobias Sutter, Stefan Woerner

Dynamic Programming Deconstructed: Transformations of the Bellman Equation and Computational Efficiency

Some approaches to solving challenging dynamic programming problems, such as Q-learning, begin by transforming the Bellman equation into an alternative functional equation, in order to open up a new line of attack. Our paper studies this…

Optimization and Control · Mathematics 2019-12-05 Qingyin Ma, John Stachurski

Value iteration for approximate dynamic programming under convexity

This paper studies value iteration for infinite horizon contracting Markov decision processes under convexity assumptions and when the state space is uncountable. The original value iteration is replaced with a more tractable form and the…

Optimization and Control · Mathematics 2018-02-21 Jeremy Yee

On Bellman equations for continuous-time policy evaluation I: discretization and approximation

We study the problem of computing the value function from a discretely-observed trajectory of a continuous-time diffusion process. We develop a new class of algorithms based on easily implementable numerical schemes that are compatible with…

Machine Learning · Computer Science 2024-07-09 Wenlong Mou, Yuhua Zhu

Dynamic Programming Through the Lens of Semismooth Newton-Type Methods (Extended Version)

Policy iteration and value iteration are at the core of many (approximate) dynamic programming methods. For Markov Decision Processes with finite state and action spaces, we show that they are instances of semismooth Newton-type methods to…

Optimization and Control · Mathematics 2022-06-28 Matilde Gargiani, Andrea Zanelli, Dominic Liao-McPherson, Tyler Summers, John Lygeros

Approximation Algorithms for Optimization of Combinatorial Dynamical Systems

This paper considers an optimization problem for a dynamical system whose evolution depends on a collection of binary decision variables. We develop scalable approximation algorithms with provable suboptimality bounds to provide…

Optimization and Control · Mathematics 2016-10-31 Insoon Yang, Samuel A. Burden, Ram Rajagopal, S. Shankar Sastry, Claire J. Tomlin

Regular Policies in Abstract Dynamic Programming

We consider challenging dynamic programming models where the associated Bellman equation, and the value and policy iteration algorithms commonly exhibit complex and even pathological behavior. Our analysis is based on the new notion of…

Optimization and Control · Mathematics 2016-09-13 Dimitri P. Bertsekas

An efficient DP algorithm on a tree-structure for finite horizon optimal control problems

The classical Dynamic Programming (DP) approach to optimal control problems is based on the characterization of the value function as the unique viscosity solution of a Hamilton-Jacobi-Bellman (HJB) equation. The DP scheme for the numerical…

Numerical Analysis · Mathematics 2019-04-15 Alessandro Alla, Maurizio Falcone, Luca Saluzzi

Dynamic Programming in Ordered Vector Space

New approaches to the theory of dynamic programming view dynamic programs as families of policy operators acting on partially ordered sets. In this paper, we extend these ideas by shifting from arbitrary partially ordered sets to ordered…

Optimization and Control · Mathematics 2026-02-02 Nisha Peng, John Stachurski