English

Quantum speedups for convex dynamic programming

Quantum Physics 2021-03-18 v2 Optimization and Control

Abstract

We present a quantum algorithm to solve dynamic programming problems with convex value functions. For linear discrete-time systems with a dd-dimensional state space of size NN, the proposed algorithm outputs a quantum-mechanical representation of the value function in time O(TγdTpolylog(N,(T/ε)d))O(T \gamma^{dT}\mathrm{polylog}(N,(T/\varepsilon)^{d})), where ε\varepsilon is the accuracy of the solution, TT is the time horizon, and γ\gamma is a problem-specific parameter depending on the condition numbers of the cost functions. This allows us to evaluate the value function at any fixed state in time O(TγdTNpolylog(N,(T/ε)d))O(T \gamma^{dT}\sqrt{N}\,\mathrm{polylog}(N,(T/\varepsilon)^{d})), and the corresponding optimal action can be recovered by solving a convex program. The class of optimization problems to which our algorithm can be applied includes provably hard stochastic dynamic programs. Finally, we show that the algorithm obtains a quadratic speedup (up to polylogarithmic factors) compared to the classical Bellman approach on some dynamic programs with continuous state space that have γ=1\gamma=1.

Keywords

Cite

@article{arxiv.2011.11654,
  title  = {Quantum speedups for convex dynamic programming},
  author = {David Sutter and Giacomo Nannicini and Tobias Sutter and Stefan Woerner},
  journal= {arXiv preprint arXiv:2011.11654},
  year   = {2021}
}

Comments

33 pages; v2: error in the running time due to an error in the QLFT algorithm