Online Learning under Delayed Feedback

Pooria Joulani; András György; Csaba Szepesvári

Online Learning under Delayed Feedback

Machine Learning 2015-07-02 v2 Artificial Intelligence Machine Learning

Authors: Pooria Joulani , András György , Csaba Szepesvári

Abstract

Online learning with delayed feedback has received increasing attention recently due to its several applications in distributed, web-based learning problems. In this paper we provide a systematic study of the topic, and analyze the effect of delay on the regret of online learning algorithms. Somewhat surprisingly, it turns out that delay increases the regret in a multiplicative way in adversarial problems, and in an additive way in stochastic problems. We give meta-algorithms that transform, in a black-box fashion, algorithms developed for the non-delayed case into ones that can handle the presence of delays in the feedback loop. Modifications of the well-known UCB algorithm are also developed for the bandit problem with delayed feedback, with the advantage over the meta-algorithms that they can be implemented with lower complexity.

Keywords

online learning education technology machine learning

Cite

@article{arxiv.1306.0686,
  title  = {Online Learning under Delayed Feedback},
  author = {Pooria Joulani and András György and Csaba Szepesvári},
  journal= {arXiv preprint arXiv:1306.0686},
  year   = {2015}
}

Comments

Extended version of a paper accepted to ICML-2013

Online Learning under Delayed Feedback

Abstract

Keywords

Cite

Comments

Related papers