The Nonstochastic Control Problem
Machine Learning
2020-01-22 v2 Machine Learning
Abstract
We consider the problem of controlling an unknown linear dynamical system in the presence of (nonstochastic) adversarial perturbations and adversarial convex loss functions. In contrast to classical control, the a priori determination of an optimal controller here is hindered by the latter's dependence on the yet unknown perturbations and costs. Instead, we measure regret against an optimal linear policy in hindsight, and give the first efficient algorithm that guarantees a sublinear regret bound, scaling as T^{2/3}, in this setting.
Keywords
Cite
@article{arxiv.1911.12178,
title = {The Nonstochastic Control Problem},
author = {Elad Hazan and Sham M. Kakade and Karan Singh},
journal= {arXiv preprint arXiv:1911.12178},
year = {2020}
}
Comments
To appear at Algorithmic Learning Theory (ALT) 2020; small revisions from the last ver