Related papers: General limit value in Dynamic Programming

Uniform value in Dynamic Programming

We consider dynamic programming problems with a large time horizon, and give sufficient conditions for the existence of the uniform value. As a consequence, we obtain an existence result when the state space is precompact, payoffs are…

Optimization and Control · Mathematics 2009-04-20 Jérôme Renault

Tauberian theorem for value functions

For two-person dynamic zero-sum games (both discrete and continuous settings), we investigate the limit of value functions of finite horizon games with long run average cost as the time horizon tends to infinity and the limit of value…

Optimization and Control · Mathematics 2017-09-26 Dmitry Khlopin

Dynamic Programming Principles for Optimal Stopping with Expectation Constraint

We analyze an optimal stopping problem with a constraint on the expected cost. When the reward function and cost function are Lipschitz continuous in state variable, we show that the value of such an optimal stopping problem is a continuous…

Optimization and Control · Mathematics 2017-08-08 Erhan Bayraktar , Song Yao

Value Asymptotics in Dynamic Games on Large Horizons

This paper is concerned with two-person dynamic zero-sum games. Let games for some family have common dynamics, running costs and capabilities of players, and let these games differ in densities only. We show that the Dynamic Programming…

Optimization and Control · Mathematics 2017-09-26 Dmitry Khlopin

Pathwise uniform value in gambling houses and Partially Observable Markov Decision Processes

In several standard models of dynamic programming (gambling houses, MDPs, POMDPs), we prove the existence of a very robust notion of value for the infinitely repeated problem, namely the pathwise uniform value. This solves two open…

Optimization and Control · Mathematics 2015-09-09 Xavier Venel , Bruno Ziliotto

On asymptotic value for dynamic games with saddle point

The paper is concerned with two-person games with saddle point. We investigate the limits of value functions for long-time-average payoff, discounted average payoff, and the payoff that follows a probability density. Most of our assumptions…

Optimization and Control · Mathematics 2015-01-29 Dmitry Khlopin

Optimal Switching in Finite Horizon under State Constraints

We study an optimal switching problem with a state constraint: the controller is only allowed to choose strategies that keep the controlled diffusion in a closed domain. We prove that the value function associated with this problem is the…

Probability · Mathematics 2016-06-09 Idris Kharroubi

Global regularity of the value function in a stopper vs. singular-controller game

We study a class of zero-sum stochastic games between a stopper and a singular-controller, previously considered in [Bovo and De Angelis (2025)]. The underlying singularly-controlled dynamics takes values in…

Optimization and Control · Mathematics 2025-06-25 Andrea Bovo , Alessandro Milazzo

Analysis of the vanishing discount limit for optimal control problems in continuous and discrete time

A classical problem in ergodic continuous time control consists of studying the limit behavior of the optimal value of a discounted cost functional with infinite horizon as the discount factor $\lambda$ tends to zero. In the literature,…

Optimization and Control · Mathematics 2024-01-23 Piermarco Cannarsa , Stephane Gaubert , Cristian Mendico , Marc Quincampoix

Linear Phase Transition in Random Linear Constraint Satisfaction Problem

Our model is a generalized linear programming relaxation of a much studied random K-SAT problem. Specifically, a set of linear constraints C on K variables is fixed. From a pool of n variables, K variables are chosen uniformly at random and…

Probability · Mathematics 2007-05-23 David Gamarnik

A Concave Value Function Extension for the Dynamic Programming Approach to Revenue Management in Attended Home Delivery

We study the approximate dynamic programming approach to revenue management in the context of attended home delivery. We draw on results from dynamic programming theory for Markov decision problems, convex optimisation and discrete convex…

Optimization and Control · Mathematics 2019-03-18 Denis Lebedev , Paul Goulart , Kostas Margellos

Gradient-Bounded Dynamic Programming with Submodular and Concave Extensible Value Functions

We consider dynamic programming problems with finite, discrete-time horizons and prohibitively high-dimensional, discrete state-spaces for direct computation of the value function from the Bellman equation. For the case that the value…

Optimization and Control · Mathematics 2020-05-25 Denis Lebedev , Paul Goulart , Kostas Margellos

General Discounting versus Average Reward

Consider an agent interacting with an environment in cycles. In every interaction cycle the agent is rewarded for its performance. We compare the average reward U from cycle 1 to m (average value) with the future discounted reward V from…

Machine Learning · Computer Science 2007-05-23 Marcus Hutter

Continuity of the Value Function for Deterministic Optimal Impulse Control with Terminal State Constraint

Deterministic optimal impulse control problem with terminal state constraint is considered. Due to the appearance of the terminal state constraint, the value function might be discontinuous in general. The main contribution of this paper is…

Optimization and Control · Mathematics 2020-11-10 Yue Zhou , Xinwei Feng , Jiongmin Yong

Probabilistic analysis of a differential equation for linear programming

In this paper we address the complexity of solving linear programming problems with a set of differential equations that converge to a fixed point that represents the optimal solution. Assuming a probabilistic model, where the inputs are…

Computational Complexity · Computer Science 2007-05-23 Asa Ben-Hur , Joshua Feinberg , Shmuel Fishman , Hava T. Siegelmann

Risk-Sensitive Markov Decision Processes with Combined Metrics of Mean and Variance

This paper investigates the optimization problem of an infinite stage discrete time Markov decision process (MDP) with a long-run average metric considering both mean and variance of rewards together. Such performance metric is important…

Optimization and Control · Mathematics 2020-08-11 Li Xia

Generalized Dual Dynamic Programming for Infinite Horizon Problems in Continuous State and Action Spaces

We describe a nonlinear generalization of dual dynamic programming theory and its application to value function estimation for deterministic control problems over continuous state and action spaces, in a discrete-time infinite horizon…

Optimization and Control · Mathematics 2018-10-05 Joseph Warrington , Paul N. Beuchat , John Lygeros

Operator approach to values of stochastic games with varying stage duration

We study the links between the values of stochastic games with varying stage duration $h$, the corresponding Shapley operators $\bf{T}$ and ${\bf{T}}\_h$and the solution of $\dot f\_t = ({\bf{T}} - Id )f\_t$. Considering general non…

Optimization and Control · Mathematics 2016-01-11 Sylvain Sorin , Guillaume Vigeral

Uniform Value and Decidability in Ergodic Blind Stochastic Games

We study a class of two-player zero-sum stochastic games known as \textit{blind stochastic games}, where players neither observe the state nor receive any information about it during the game. A central concept for analyzing long-duration…

Optimization and Control · Mathematics 2025-11-24 Krishnendu Chatterjee , David Lurie , Raimundo Saona , Bruno Ziliotto

Gradient-Bounded Dynamic Programming for Submodular and Concave Extensible Value Functions with Probabilistic Performance Guarantees

We consider stochastic dynamic programming problems with high-dimensional, discrete state-spaces and finite, discrete-time horizons that prohibit direct computation of the value function from a given Bellman equation for all states and time…

Optimization and Control · Mathematics 2020-06-05 Denis Lebedev , Paul Goulart , Kostas Margellos