English

A Relative Value Iteration Algorithm for Non-degenerate Controlled Diffusions

Optimization and Control 2019-03-20 v1

Abstract

The ergodic control problem for a non-degenerate controlled diffusion controlled through its drift is considered under a uniform stability condition that ensures the well-posedness of the associated Hamilton-Jacobi-Bellman (HJB) equation. A nonlinear parabolic evolution equation is then proposed as a continuous time continuous state space analog of White's `relative value iteration' algorithm for solving the ergodic dynamic programming equation for the finite state finite action case. Its convergence to the solution of the HJB equation is established using the theory of monotone dynamical systems and also, alternatively, by using the theory of reverse martingales.

Keywords

Cite

@article{arxiv.1110.1273,
  title  = {A Relative Value Iteration Algorithm for Non-degenerate Controlled Diffusions},
  author = {Ari Arapostathis and Vivek S. Borkar},
  journal= {arXiv preprint arXiv:1110.1273},
  year   = {2019}
}

Comments

17 pages

R2 v1 2026-06-21T19:16:07.259Z