Related papers: Adaptive Online Learning

Online PAC-Bayes Learning

Most PAC-Bayesian bounds hold in the batch learning setting where data is collected at once, prior to inference or prediction. This somewhat departs from many contemporary learning problems where data streams are collected and the…

Machine Learning · Computer Science 2023-01-25 Maxime Haddouche , Benjamin Guedj

A closer look at temporal variability in dynamic online learning

This work focuses on the setting of dynamic regret in the context of online learning with full information. In particular, we analyze regret bounds with respect to the temporal variability of the loss functions. By assuming that the…

Machine Learning · Computer Science 2021-02-16 Nicolò Campolongo , Francesco Orabona

Regret Analysis: a control perspective

Online learning and model reference adaptive control have many interesting intersections. One area where they differ however is in how the algorithms are analyzed and what objective or metric is used to discriminate "good" algorithms from…

Systems and Control · Electrical Eng. & Systems 2025-01-24 Travis E. Gibson , Sawal Acharya

Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs

We develop a new approach to obtaining high probability regret bounds for online learning with bandit feedback against an adaptive adversary. While existing approaches all require carefully constructing optimistic and biased loss…

Machine Learning · Computer Science 2020-11-02 Chung-Wei Lee , Haipeng Luo , Chen-Yu Wei , Mengxiao Zhang

Temporal Variability in Implicit Online Learning

In the setting of online learning, Implicit algorithms turn out to be highly successful from a practical standpoint. However, the tightest regret analyses only show marginal improvements over Online Mirror Descent. In this work, we shed…

Machine Learning · Computer Science 2020-11-10 Nicolò Campolongo , Francesco Orabona

Adaptive Regret for Control of Time-Varying Dynamics

We consider the problem of online control of systems with time-varying linear dynamics. This is a general formulation that is motivated by the use of local linearization in control of nonlinear dynamical systems. To state meaningful…

Machine Learning · Computer Science 2022-02-15 Paula Gradu , Elad Hazan , Edgar Minasyan

High-Probability Risk Bounds via Sequential Predictors

Online learning methods yield sequential regret bounds under minimal assumptions and provide in-expectation risk bounds for statistical learning. However, despite the apparent advantage of online guarantees over their statistical…

Machine Learning · Computer Science 2023-08-16 Dirk van der Hoeven , Nikita Zhivotovskiy , Nicolò Cesa-Bianchi

Isotuning With Applications To Scale-Free Online Learning

We extend and combine several tools of the literature to design fast, adaptive, anytime and scale-free online learning algorithms. Scale-free regret bounds must scale linearly with the maximum loss, both toward large losses and toward very…

Machine Learning · Computer Science 2024-10-22 Laurent Orseau , Marcus Hutter

PAC-Bayes Meets Online Contextual Optimization

The predict-then-optimize paradigm bridges online learning and contextual optimization in dynamic environments. Previous works have investigated the sequential updating of predictors using feedback from downstream decisions to minimize…

Optimization and Control · Mathematics 2025-11-26 Zhuojun Xie , Adam Abdin , Yiping Fang

Online Optimization : Competing with Dynamic Comparators

Recent literature on online learning has focused on developing adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop…

Machine Learning · Computer Science 2015-01-27 Ali Jadbabaie , Alexander Rakhlin , Shahin Shahrampour , Karthik Sridharan

Improved Online Conformal Prediction via Strongly Adaptive Online Learning

We study the problem of uncertainty quantification via prediction sets, in an online setting where the data distribution may vary arbitrarily over time. Recent work develops online conformal prediction techniques that leverage regret…

Machine Learning · Computer Science 2023-02-16 Aadyot Bhatnagar , Huan Wang , Caiming Xiong , Yu Bai

Sparsity regret bounds for individual sequences in online linear regression

We consider the problem of online linear regression on arbitrary deterministic sequences when the ambient dimension d can be much larger than the number of time rounds T. We introduce the notion of sparsity regret bound, which is a…

Machine Learning · Statistics 2013-04-17 Sébastien Gerchinovitz

Adaptively Learning to Select-Rank in Online Platforms

Ranking algorithms are fundamental to various online platforms across e-commerce sites to content streaming services. Our research addresses the challenge of adaptively ranking items from a candidate pool for heterogeneous users, a key…

Machine Learning · Computer Science 2024-06-10 Jingyuan Wang , Perry Dong , Ying Jin , Ruohan Zhan , Zhengyuan Zhou

Online Learning with Predictable Sequences

We present methods for online linear optimization that take advantage of benign (as opposed to worst-case) sequences. Specifically if the sequence encountered by the learner is described well by a known "predictable process", the algorithms…

Machine Learning · Statistics 2014-05-27 Alexander Rakhlin , Karthik Sridharan

Adaptive Gradient-Based Meta-Learning Methods

We build a theoretical framework for designing and understanding practical meta-learning methods that integrates sophisticated formalizations of task-similarity with the extensive literature on online convex optimization and sequential…

Machine Learning · Computer Science 2019-12-10 Mikhail Khodak , Maria-Florina Balcan , Ameet Talwalkar

Adaptive Hedge

Most methods for decision-theoretic online learning are based on the Hedge algorithm, which takes a parameter called the learning rate. In most previous analyses the learning rate was carefully tuned to obtain optimal worst-case…

Machine Learning · Statistics 2015-03-04 Tim van Erven , Peter Grünwald , Wouter M. Koolen , Steven de Rooij

Adaptive Bound Optimization for Online Convex Optimization

We introduce a new online convex optimization algorithm that adaptively chooses its regularization function based on the loss functions observed so far. This is in contrast to previous algorithms that use a fixed regularization function…

Machine Learning · Computer Science 2010-07-08 H. Brendan McMahan , Matthew Streeter

Adaptive and Efficient Algorithms for Tracking the Best Expert

In this paper, we consider the problem of prediction with expert advice in dynamic environments. We choose tracking regret as the performance metric and develop two adaptive and efficient algorithms with data-dependent tracking regret…

Machine Learning · Computer Science 2020-02-11 Shiyin Lu , Lijun Zhang

Best of many worlds: Robust model selection for online supervised learning

We introduce algorithms for online, full-information prediction that are competitive with contextual tree experts of unknown complexity, in both probabilistic and adversarial settings. We show that by incorporating a probabilistic framework…

Machine Learning · Computer Science 2018-05-23 Vidya Muthukumar , Mitas Ray , Anant Sahai , Peter L. Bartlett

Locally-Adaptive Nonparametric Online Learning

One of the main strengths of online algorithms is their ability to adapt to arbitrary data sequences. This is especially important in nonparametric settings, where performance is measured against rich classes of comparator functions that…

Machine Learning · Computer Science 2020-11-03 Ilja Kuzborskij , Nicolò Cesa-Bianchi