English
Related papers

Related papers: General limit value in Dynamic Programming

200 papers

We consider dynamic programming problems with a large time horizon, and give sufficient conditions for the existence of the uniform value. As a consequence, we obtain an existence result when the state space is precompact, payoffs are…

Optimization and Control · Mathematics 2009-04-20 Jérôme Renault

For two-person dynamic zero-sum games (both discrete and continuous settings), we investigate the limit of value functions of finite horizon games with long run average cost as the time horizon tends to infinity and the limit of value…

Optimization and Control · Mathematics 2017-09-26 Dmitry Khlopin

We analyze an optimal stopping problem with a constraint on the expected cost. When the reward function and cost function are Lipschitz continuous in state variable, we show that the value of such an optimal stopping problem is a continuous…

Optimization and Control · Mathematics 2017-08-08 Erhan Bayraktar , Song Yao

This paper is concerned with two-person dynamic zero-sum games. Let games for some family have common dynamics, running costs and capabilities of players, and let these games differ in densities only. We show that the Dynamic Programming…

Optimization and Control · Mathematics 2017-09-26 Dmitry Khlopin

In several standard models of dynamic programming (gambling houses, MDPs, POMDPs), we prove the existence of a very robust notion of value for the infinitely repeated problem, namely the pathwise uniform value. This solves two open…

Optimization and Control · Mathematics 2015-09-09 Xavier Venel , Bruno Ziliotto

The paper is concerned with two-person games with saddle point. We investigate the limits of value functions for long-time-average payoff, discounted average payoff, and the payoff that follows a probability density. Most of our assumptions…

Optimization and Control · Mathematics 2015-01-29 Dmitry Khlopin

We study an optimal switching problem with a state constraint: the controller is only allowed to choose strategies that keep the controlled diffusion in a closed domain. We prove that the value function associated with this problem is the…

Probability · Mathematics 2016-06-09 Idris Kharroubi

We study a class of zero-sum stochastic games between a stopper and a singular-controller, previously considered in [Bovo and De Angelis (2025)]. The underlying singularly-controlled dynamics takes values in…

Optimization and Control · Mathematics 2025-06-25 Andrea Bovo , Alessandro Milazzo

A classical problem in ergodic continuous time control consists of studying the limit behavior of the optimal value of a discounted cost functional with infinite horizon as the discount factor $\lambda$ tends to zero. In the literature,…

Optimization and Control · Mathematics 2024-01-23 Piermarco Cannarsa , Stephane Gaubert , Cristian Mendico , Marc Quincampoix

Our model is a generalized linear programming relaxation of a much studied random K-SAT problem. Specifically, a set of linear constraints C on K variables is fixed. From a pool of n variables, K variables are chosen uniformly at random and…

Probability · Mathematics 2007-05-23 David Gamarnik

We study the approximate dynamic programming approach to revenue management in the context of attended home delivery. We draw on results from dynamic programming theory for Markov decision problems, convex optimisation and discrete convex…

Optimization and Control · Mathematics 2019-03-18 Denis Lebedev , Paul Goulart , Kostas Margellos

We consider dynamic programming problems with finite, discrete-time horizons and prohibitively high-dimensional, discrete state-spaces for direct computation of the value function from the Bellman equation. For the case that the value…

Optimization and Control · Mathematics 2020-05-25 Denis Lebedev , Paul Goulart , Kostas Margellos

Consider an agent interacting with an environment in cycles. In every interaction cycle the agent is rewarded for its performance. We compare the average reward U from cycle 1 to m (average value) with the future discounted reward V from…

Machine Learning · Computer Science 2007-05-23 Marcus Hutter

Deterministic optimal impulse control problem with terminal state constraint is considered. Due to the appearance of the terminal state constraint, the value function might be discontinuous in general. The main contribution of this paper is…

Optimization and Control · Mathematics 2020-11-10 Yue Zhou , Xinwei Feng , Jiongmin Yong

In this paper we address the complexity of solving linear programming problems with a set of differential equations that converge to a fixed point that represents the optimal solution. Assuming a probabilistic model, where the inputs are…

Computational Complexity · Computer Science 2007-05-23 Asa Ben-Hur , Joshua Feinberg , Shmuel Fishman , Hava T. Siegelmann

This paper investigates the optimization problem of an infinite stage discrete time Markov decision process (MDP) with a long-run average metric considering both mean and variance of rewards together. Such performance metric is important…

Optimization and Control · Mathematics 2020-08-11 Li Xia

We describe a nonlinear generalization of dual dynamic programming theory and its application to value function estimation for deterministic control problems over continuous state and action spaces, in a discrete-time infinite horizon…

Optimization and Control · Mathematics 2018-10-05 Joseph Warrington , Paul N. Beuchat , John Lygeros

We study the links between the values of stochastic games with varying stage duration $h$, the corresponding Shapley operators $\bf{T}$ and ${\bf{T}}\_h$and the solution of $\dot f\_t = ({\bf{T}} - Id )f\_t$. Considering general non…

Optimization and Control · Mathematics 2016-01-11 Sylvain Sorin , Guillaume Vigeral

We study a class of two-player zero-sum stochastic games known as \textit{blind stochastic games}, where players neither observe the state nor receive any information about it during the game. A central concept for analyzing long-duration…

Optimization and Control · Mathematics 2025-11-24 Krishnendu Chatterjee , David Lurie , Raimundo Saona , Bruno Ziliotto

We consider stochastic dynamic programming problems with high-dimensional, discrete state-spaces and finite, discrete-time horizons that prohibit direct computation of the value function from a given Bellman equation for all states and time…

Optimization and Control · Mathematics 2020-06-05 Denis Lebedev , Paul Goulart , Kostas Margellos
‹ Prev 1 2 3 10 Next ›