English

Online Linear Optimization with Many Hints

Machine Learning 2020-10-08 v1

Abstract

We study an online linear optimization (OLO) problem in which the learner is provided access to KK "hint" vectors in each round prior to making a decision. In this setting, we devise an algorithm that obtains logarithmic regret whenever there exists a convex combination of the KK hints that has positive correlation with the cost vectors. This significantly extends prior work that considered only the case K=1K=1. To accomplish this, we develop a way to combine many arbitrary OLO algorithms to obtain regret only a logarithmically worse factor than the minimum regret of the original algorithms in hindsight; this result is of independent interest.

Keywords

Cite

@article{arxiv.2010.03082,
  title  = {Online Linear Optimization with Many Hints},
  author = {Aditya Bhaskara and Ashok Cutkosky and Ravi Kumar and Manish Purohit},
  journal= {arXiv preprint arXiv:2010.03082},
  year   = {2020}
}

Comments

Accepted at Neurips 2020