Tighter Value-Function Approximations for POMDPs

Merlijn Krale; Wietze Koops; Sebastian Junges; Thiago D. Simão; Nils Jansen

Artificial Intelligence 2025-02-11 v1

Abstract

Solving partially observable Markov decision processes (POMDPs) typically requires reasoning about the values of exponentially many state beliefs. To achieve practical performance, state-of-the-art solvers use value bounds to guide this reasoning. However, sound upper value bounds are often expensive to compute, and there is a tradeoff between the tightness of such bounds and their computational cost. This paper introduces new upper value bounds that are provably tighter than the commonly used fast informed bound. Our empirical evaluation shows that, despite their additional computational overhead, the new upper bounds accelerate state-of-the-art POMDP solvers on a wide range of benchmarks.

Keywords

Markov decision processes; approximation algorithm; mixed precision training

Cite

@article{arxiv.2502.06523,
  title  = {Tighter Value-Function Approximations for POMDPs},
  author = {Merlijn Krale and Wietze Koops and Sebastian Junges and Thiago D. Simão and Nils Jansen},
  journal= {arXiv preprint arXiv:2502.06523},
  year   = {2025}
}

Comments

AAMAS 2025 submission
