Related papers: Uniform value in Dynamic Programming

General limit value in Dynamic Programming

We consider a dynamic programming problem with arbitrary state space and bounded rewards. Is it possible to define in an unique way a limit value for the problem, where the "patience" of the decision-maker tends to infinity ? We consider,…

Optimization and Control · Mathematics 2013-01-04 Jérôme Renault

Dynamic Programming for Structured Continuous Markov Decision Problems

We describe an approach for exploiting structure in Markov Decision Processes with continuous state variables. At each step of the dynamic programming, the state space is dynamically partitioned into regions where the value function is the…

Artificial Intelligence · Computer Science 2012-07-19 Zhengzhu Feng , Richard Dearden , Nicolas Meuleau , Richard Washington

Semilinear Dynamic Programming: Analysis, Algorithms, and Certainty Equivalence Properties

We consider a broad class of dynamic programming (DP) problems that involve a partially linear structure and some positivity properties in their system equation and cost function. We address deterministic and stochastic problems, possibly…

Optimization and Control · Mathematics 2026-04-21 Yuchao Li , Dimitri Bertsekas

Safe Dynamic Programming

We incorporate safety specifications into dynamic programming. Explicitly, we address the minimization problem of a Markov decision process up to a stopping time with safety constraints. To incorporate safety into dynamic programming, we…

Optimization and Control · Mathematics 2021-09-09 Rafal Wisniewski , Manuela L. Bujorianu

Approximate Constrained Discounted Dynamic Programming with Uniform Feasibility and Optimality

We consider a dynamic programming (DP) approach to approximately solving an infinite-horizon constrained Markov decision process (CMDP) problem with a fixed initial-state for the expected total discounted-reward criterion with a…

Optimization and Control · Mathematics 2023-08-08 Hyeong Soo Chang

Pathwise uniform value in gambling houses and Partially Observable Markov Decision Processes

In several standard models of dynamic programming (gambling houses, MDPs, POMDPs), we prove the existence of a very robust notion of value for the infinitely repeated problem, namely the pathwise uniform value. This solves two open…

Optimization and Control · Mathematics 2015-09-09 Xavier Venel , Bruno Ziliotto

Dynamic Programming: From Local Optimality to Global Optimality

In the theory of dynamic programming, an optimal policy is a policy whose lifetime value dominates that of all other policies from every possible initial condition in the state space. This raises a natural question: when does optimality…

Optimization and Control · Mathematics 2025-05-13 John Stachurski , Jingni Yang , Ziyue Yang

Asymptotic Properties of Optimal Trajectories in Dynamic Programming

We prove in a dynamic programming framework that uniform convergence of the finite horizon values implies that asymptotically the average accumulated payoff is constant on optimal trajectories. We analyze and discuss several possible…

Optimization and Control · Mathematics 2010-12-24 Sylvain Sorin , Xavier Venel , Guillaume Vigeral

Dynamic Programming with State-Dependent Discounting

This paper extends the core results of discrete time infinite horizon dynamic programming to the case of state-dependent discounting. We obtain a condition on the discount factor process under which all of the standard optimality results…

General Economics · Economics 2020-10-15 John Stachurski , Junnan Zhang

The Principle of Optimality in Dynamic Programming: A Pedagogical Note

The principle of optimality is a fundamental aspect of dynamic programming, which states that the optimal solution to a dynamic optimization problem can be found by combining the optimal solutions to its sub-problems. While this principle…

Optimization and Control · Mathematics 2024-08-14 Bar Light

Dynamic Set Values for Nonzero Sum Games with Multiple Equilibriums

Nonzero sum games typically have multiple Nash equilibriums (or no equilibrium), and unlike the zero sum case, they may have different values at different equilibriums. Instead of focusing on the existence of individual equilibriums, we…

Optimization and Control · Mathematics 2020-08-27 Zachary Feinstein , Birgit Rudloff , Jianfeng Zhang

A dynamic programming principle with continuous solutions related to the $p$-Laplacian, $1 < p < \infty$

We study a Dynamic Programming Principle related to the $p$-Laplacian for $1 < p < \infty$. The main results are existence, uniqueness and continuity of solutions.

Analysis of PDEs · Mathematics 2015-05-01 Hans Hartikainen

Applications of Variable Discounting Dynamic Programming to Iterated Function Systems and Related Problems

We study existence and uniqueness of the fixed points solutions of a large class of non-linear variable discounted transfer operators associated to a sequential decision-making process. We establish regularity properties of these solutions,…

Dynamical Systems · Mathematics 2019-02-20 L. Cioletti , Elismar R. Oliveira

Discounted Continuous-time Markov Decision Processes with Unbounded Rates: the Dynamic Programming Approach

This paper deals with unconstrained discounted continuous-time Markov decision processes in Borel state and action spaces. Under some conditions imposed on the primitives, allowing unbounded transition rates and unbounded (from both above…

Optimization and Control · Mathematics 2011-03-02 Alexey Piunovskiy , Yi Zhang

Dynamic Programming for Optimal Delivery Time Slot Pricing

We study the dynamic programming approach to revenue management in the context of attended home delivery. We draw on results from dynamic programming theory for Markov decision problems to show that the underlying Bellman operator has a…

Optimization and Control · Mathematics 2019-10-28 Denis Lebedev , Paul Goulart , Kostas Margellos

Dynamic Programming for controlled Markov families: abstractly and over Martingale Measures

We describe an abstract control-theoretic framework in which the validity of the dynamic programming principle can be established in continuous time by a verification of a small number of structural properties. As an application we treat…

Optimization and Control · Mathematics 2014-03-18 Gordan Zitkovic

Value Asymptotics in Dynamic Games on Large Horizons

This paper is concerned with two-person dynamic zero-sum games. Let games for some family have common dynamics, running costs and capabilities of players, and let these games differ in densities only. We show that the Dynamic Programming…

Optimization and Control · Mathematics 2017-09-26 Dmitry Khlopin

Killed Markov Decision Processes on Finite Time Interval for Countable Models

We consider killed Markov decision processes for countable models on a finite time-interval. Existence of a uniform $\varepsilon$-optimal policy is proven. We show the correctness of the fundamental equation. The optimal control problem is…

Optimization and Control · Mathematics 2013-04-10 Nestor Parolya , Yaroslav Yeleyko

General Existence of Solutions to Dynamic Programming Principle

We provide an alternative approach to the existence of solutions to dynamic programming equations arising in the discrete game-theoretic interpretations for various nonlinear partial differential equations including the infinity Laplacian,…

Analysis of PDEs · Mathematics 2013-07-19 Qing Liu , Armin Schikorra

Gradient-Bounded Dynamic Programming with Submodular and Concave Extensible Value Functions

We consider dynamic programming problems with finite, discrete-time horizons and prohibitively high-dimensional, discrete state-spaces for direct computation of the value function from the Bellman equation. For the case that the value…

Optimization and Control · Mathematics 2020-05-25 Denis Lebedev , Paul Goulart , Kostas Margellos