English

An adaptive stochastic optimization algorithm for resource allocation

Machine Learning 2020-01-17 v3 Machine Learning Optimization and Control

Abstract

We consider the classical problem of sequential resource allocation where a decision maker must repeatedly divide a budget between several resources, each with diminishing returns. This can be recast as a specific stochastic optimization problem where the objective is to maximize the cumulative reward, or equivalently to minimize the regret. We construct an algorithm that is {\em adaptive} to the complexity of the problem, expressed in term of the regularity of the returns of the resources, measured by the exponent in the {\L}ojasiewicz inequality (or by their universal concavity parameter). Our parameter-independent algorithm recovers the optimal rates for strongly-concave functions and the classical fast rates of multi-armed bandit (for linear reward functions). Moreover, the algorithm improves existing results on stochastic optimization in this regret minimization setting for intermediate cases.

Keywords

Cite

@article{arxiv.1902.04376,
  title  = {An adaptive stochastic optimization algorithm for resource allocation},
  author = {Xavier Fontaine and Shie Mannor and Vianney Perchet},
  journal= {arXiv preprint arXiv:1902.04376},
  year   = {2020}
}

Comments

ALT2020, 45 pages, 9 figures