Related papers: A Chaining Algorithm for Online Nonparametric Regr…

Online Nonparametric Regression

We establish optimal rates for online regression for arbitrary classes of regression functions in terms of the sequential entropy introduced in (Rakhlin, Sridharan, Tewari, 2010). The optimal rates are shown to exhibit a phase transition…

Machine Learning · Statistics 2014-02-12 Alexander Rakhlin , Karthik Sridharan

Uniform regret bounds over $R^d$ for the sequential linear regression problem with the square loss

We consider the setting of online linear regression for arbitrary deterministic sequences, with the square loss. We are interested in the aim set by Bartlett et al. (2015): obtain regret bounds that hold uniformly over all competitor…

Machine Learning · Statistics 2019-02-26 Pierre Gaillard , Sébastien Gerchinovitz , Malo Huard , Gilles Stoltz

Minimax-optimal and Locally-adaptive Online Nonparametric Regression

We study adversarial online nonparametric regression with general convex losses and propose a parameter-free learning algorithm that achieves minimax optimal rates. Our approach leverages chaining trees to compete against H{\"o}lder…

Statistics Theory · Mathematics 2025-04-14 Paul Liautaud , Pierre Gaillard , Olivier Wintenberger

Adaptive and optimal online linear regression on $\ell^1$-balls

We consider the problem of online linear regression on individual sequences. The goal in this paper is for the forecaster to output sequential predictions which are, after $T$ time rounds, almost as good as the ones output by the best…

Machine Learning · Statistics 2019-01-17 Sébastien Gerchinovitz , Jia Yuan Yu

High-Probability Risk Bounds via Sequential Predictors

Online learning methods yield sequential regret bounds under minimal assumptions and provide in-expectation risk bounds for statistical learning. However, despite the apparent advantage of online guarantees over their statistical…

Machine Learning · Computer Science 2023-08-16 Dirk van der Hoeven , Nikita Zhivotovskiy , Nicolò Cesa-Bianchi

Online estimation and control with optimal pathlength regret

A natural goal when designing online learning algorithms for non-stationary environments is to bound the regret of the algorithm in terms of the temporal variation of the input sequence. Intuitively, when the variation is small, it should…

Machine Learning · Computer Science 2021-12-08 Gautam Goel , Babak Hassibi

Online Optimization : Competing with Dynamic Comparators

Recent literature on online learning has focused on developing adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop…

Machine Learning · Computer Science 2015-01-27 Ali Jadbabaie , Alexander Rakhlin , Shahin Shahrampour , Karthik Sridharan

Parameter-free Regret in High Probability with Heavy Tails

We present new algorithms for online convex optimization over unbounded domains that obtain parameter-free regret in high-probability given access only to potentially heavy-tailed subgradient estimates. Previous work in unbounded domains…

Machine Learning · Statistics 2023-02-28 Jiujia Zhang , Ashok Cutkosky

Sparsity regret bounds for individual sequences in online linear regression

We consider the problem of online linear regression on arbitrary deterministic sequences when the ambient dimension d can be much larger than the number of time rounds T. We introduce the notion of sparsity regret bound, which is a…

Machine Learning · Statistics 2013-04-17 Sébastien Gerchinovitz

Stochastic Online Linear Regression: the Forward Algorithm to Replace Ridge

We consider the problem of online linear regression in the stochastic setting. We derive high probability regret bounds for online ridge regression and the forward algorithm. This enables us to compare online regression algorithms more…

Machine Learning · Computer Science 2021-11-03 Reda Ouhamma , Odalric Maillard , Vianney Perchet

Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning

We investigate contextual online learning with nonparametric (Lipschitz) comparison classes under different assumptions on losses and feedback information. For full information feedback and Lipschitz losses, we design the first explicit…

Machine Learning · Statistics 2017-07-03 Nicolò Cesa-Bianchi , Pierre Gaillard , Claudio Gentile , Sébastien Gerchinovitz

Fast Rates for Nonparametric Online Learning: From Realizability to Learning in Games

We study fast rates of convergence in the setting of nonparametric online regression, namely where regret is defined with respect to an arbitrary function class which has bounded complexity. Our contributions are two-fold: - In the…

Machine Learning · Computer Science 2022-04-13 Constantinos Daskalakis , Noah Golowich

Decentralized Parameter-Free Online Learning

We propose the first parameter-free decentralized online learning algorithms with network regret guarantees, which achieve sublinear regret without requiring hyperparameter tuning. This family of algorithms connects multi-agent coin-betting…

Machine Learning · Computer Science 2025-10-20 Tomas Ortega , Hamid Jafarkhani

Logistic Regression Regret: What's the Catch?

We address the problem of the achievable regret rates with online logistic regression. We derive lower bounds with logarithmic regret under $L_1$, $L_2$, and $L_\infty$ constraints on the parameter values. The bounds are dominated by $d/2…

Machine Learning · Computer Science 2020-02-20 Gil I. Shamir

Towards minimax policies for online linear optimization with bandit feedback

We address the online linear optimization problem with bandit feedback. Our contribution is twofold. First, we provide an algorithm (based on exponential weights) with a regret of order $\sqrt{d n \log N}$ for any finite action set with $N$…

Machine Learning · Computer Science 2012-02-15 Sébastien Bubeck , Nicolò Cesa-Bianchi , Sham M. Kakade

Online nonparametric regression with Sobolev kernels

In this work we investigate the variation of the online kernelized ridge regression algorithm in the setting of $d-$dimensional adversarial nonparametric regression. We derive the regret upper bounds on the classes of Sobolev spaces…

Statistics Theory · Mathematics 2021-07-14 Oleksandr Zadorozhnyi , Pierre Gaillard , Sebastien Gerschinovitz , Alessandro Rudi

Nearly Optimal Algorithms with Sublinear Computational Complexity for Online Kernel Regression

The trade-off between regret and computational cost is a fundamental problem for online kernel regression, and previous algorithms worked on the trade-off can not keep optimal regret bounds at a sublinear computational complexity. In this…

Machine Learning · Computer Science 2023-06-16 Junfan Li , Shizhong Liao

Adversarial Online Learning with Changing Action Sets: Efficient Algorithms with Approximate Regret Bounds

We revisit the problem of online learning with sleeping experts/bandits: in each time step, only a subset of the actions are available for the algorithm to choose from (and learn about). The work of Kleinberg et al. (2010) showed that there…

Machine Learning · Computer Science 2021-04-27 Ehsan Emamjomeh-Zadeh , Chen-Yu Wei , Haipeng Luo , David Kempe

Online Optimization in Dynamic Environments: Improved Regret Rates for Strongly Convex Problems

In this paper, we address tracking of a time-varying parameter with unknown dynamics. We formalize the problem as an instance of online optimization in a dynamic setting. Using online gradient descent, we propose a method that sequentially…

Machine Learning · Computer Science 2016-03-17 Aryan Mokhtari , Shahin Shahrampour , Ali Jadbabaie , Alejandro Ribeiro

Online Control of Unknown Time-Varying Dynamical Systems

We study online control of time-varying linear systems with unknown dynamics in the nonstochastic control model. At a high level, we demonstrate that this setting is \emph{qualitatively harder} than that of either unknown time-invariant or…

Machine Learning · Computer Science 2022-02-17 Edgar Minasyan , Paula Gradu , Max Simchowitz , Elad Hazan