Related papers: Online Learning with Predictable Sequences

200 papers

We study online classification when the learner has access to predictions about future examples. We design an online learner whose expected regret is never worse than the worst-case regret, gracefully improves with the quality of the…

Machine Learning · Computer Science 2024-05-24 Vinod Raman, Ambuj Tewari

Online learning methods yield sequential regret bounds under minimal assumptions and provide in-expectation risk bounds for statistical learning. However, despite the apparent advantage of online guarantees over their statistical…

Machine Learning · Computer Science 2023-08-16 Dirk van der Hoeven, Nikita Zhivotovskiy, Nicolò Cesa-Bianchi

In online learning, the data is provided in a sequential order, and the goal of the learner is to make online decisions to minimize overall regrets. This note is concerned with continuous-time models and algorithms for several online…

Machine Learning · Statistics 2024-05-20 Lexing Ying

Motivated by the predictable nature of real-life data streams, we study online regression when the learner has access to predictions about future examples. In the extreme case, called transductive online learning, the sequence of…

Machine Learning · Computer Science 2025-10-07 Vinod Raman, Shenghao Xie, Samson Zhou

In this paper, we investigate the framework of Online Convex Optimization (OCO) for online learning. OCO offers a very powerful online learning framework for many applications. In this context, we study a specific framework of OCO called…

Machine Learning · Computer Science 2022-11-01 Deepan Muthirayan, Jianjun Yuan, Pramod P. Khargonekar

We consider the problem of online learning where the sequence of actions played by the learner must adhere to an unknown safety constraint at every round. The goal is to minimize regret with respect to the best safe action in hindsight…

Machine Learning · Computer Science 2024-03-08 Karthik Sridharan, Seung Won Wilson Yoo

Much of the work in online learning focuses on the study of sublinear upper bounds on the regret. In this work, we initiate the study of best-case lower bounds in online convex optimization, wherein we bound the largest improvement an…

Machine Learning · Computer Science 2021-06-25 Cristóbal Guzmán, Nishant A. Mehta, Ali Mortazavi

One of the main strengths of online algorithms is their ability to adapt to arbitrary data sequences. This is especially important in nonparametric settings, where performance is measured against rich classes of comparator functions that…

Machine Learning · Computer Science 2020-11-03 Ilja Kuzborskij, Nicolò Cesa-Bianchi

In the convex optimization approach to online regret minimization, many methods have been developed to guarantee an $O(\sqrt{T})$ bound on regret for subdifferentiable convex loss functions with bounded subgradients, by using a reduction to…

Machine Learning · Computer Science 2016-09-20 Arthur Flajolet, Patrick Jaillet

Recent literature on online learning has focused on developing adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop…

Machine Learning · Computer Science 2015-01-27 Ali Jadbabaie, Alexander Rakhlin, Shahin Shahrampour, Karthik Sridharan

We propose a general framework for studying adaptive regret bounds in the online learning framework, including model selection bounds and data-dependent bounds. Given a data- or model-dependent bound we ask, "Does there exist some algorithm…

Machine Learning · Computer Science 2020-02-14 Dylan J. Foster, Alexander Rakhlin, Karthik Sridharan

In the random-order model for online learning, the sequence of losses is chosen upfront by an adversary and presented to the learner after a random permutation. Any random-order input is asymptotically equivalent to a stochastic…

Machine Learning · Computer Science 2025-10-06 Martino Bernasconi, Andrea Celli, Riccardo Colini-Baldeschi, Federico Fusco, Stefano Leonardi, Matteo Russo

We introduce algorithms for online, full-information prediction that are competitive with contextual tree experts of unknown complexity, in both probabilistic and adversarial settings. We show that by incorporating a probabilistic framework…

Machine Learning · Computer Science 2018-05-23 Vidya Muthukumar, Mitas Ray, Anant Sahai, Peter L. Bartlett

A natural goal when designing online learning algorithms for non-stationary environments is to bound the regret of the algorithm in terms of the temporal variation of the input sequence. Intuitively, when the variation is small, it should…

Machine Learning · Computer Science 2021-12-08 Gautam Goel, Babak Hassibi

This work focuses on the setting of dynamic regret in the context of online learning with full information. In particular, we analyze regret bounds with respect to the temporal variability of the loss functions. By assuming that the…

Machine Learning · Computer Science 2021-02-16 Nicolò Campolongo, Francesco Orabona

We study the effectiveness of stochastic side information in deterministic online learning scenarios. We propose a forecaster to predict a deterministic sequence where its performance is evaluated against an expert class. We assume that…

Machine Learning · Computer Science 2023-03-13 Junzhang Jia, Xuetong Wu, Jingge Zhu, Jamie Evans

We study the problem of uncertainty quantification via prediction sets, in an online setting where the data distribution may vary arbitrarily over time. Recent work develops online conformal prediction techniques that leverage regret…

Machine Learning · Computer Science 2023-02-16 Aadyot Bhatnagar, Huan Wang, Caiming Xiong, Yu Bai

Many prediction domains, such as ad placement, recommendation, trajectory prediction, and document summarization, require predicting a set or list of options. Such lists are often evaluated using submodular reward functions that measure…

Machine Learning · Computer Science 2013-05-14 Stephane Ross, Jiaji Zhou, Yisong Yue, Debadeepta Dey, J. Andrew Bagnell

We uncover a fairly general principle in online learning: If regret can be (approximately) expressed as a function of certain "sufficient statistics" for the data sequence, then there exists a special Burkholder function that 1) can be used…

Machine Learning · Computer Science 2018-03-22 Dylan J. Foster, Alexander Rakhlin, Karthik Sridharan

In this book, I introduce the basic concepts of Online Learning through the modern view of Online Convex Optimization. Here, online learning refers to the framework of regret minimization under worst-case assumptions. I present first-order…

Machine Learning · Computer Science 2026-04-28 Francesco Orabona