Related papers: Implicit Parameter-free Online Learning with Trunc…

200 papers

We develop a modified online mirror descent framework that is suitable for building adaptive and parameter-free algorithms in unbounded domains. We leverage this technique to develop the first unconstrained online linear optimization…

Machine Learning · Computer Science 2024-02-12 Andrew Jacobsen, Ashok Cutkosky

In recent years, a number of parameter-free algorithms have been developed for online linear optimization over Hilbert spaces and for learning with expert advice. These algorithms achieve optimal regret bounds that depend on the unknown…

Machine Learning · Computer Science 2016-11-07 Francesco Orabona, Dávid Pál

We propose the first parameter-free decentralized online learning algorithms with network regret guarantees, which achieve sublinear regret without requiring hyperparameter tuning. This family of algorithms connects multi-agent coin-betting…

Machine Learning · Computer Science 2025-10-20 Tomas Ortega, Hamid Jafarkhani

We design and analyze algorithms for online linear optimization that have optimal regret and at the same time do not need to know any upper or lower bounds on the norm of the loss vectors. Our algorithms are instances of the Follow the…

Machine Learning · Computer Science 2016-12-15 Francesco Orabona, Dávid Pál

We show how to take any two parameter-free online learning algorithms with different regret guarantees and obtain a single algorithm whose regret is the minimum of the two base algorithms. Our method is embarrassingly simple: just add the…

Machine Learning · Statistics 2019-02-26 Ashok Cutkosky

We consider online learning with linear models, where the algorithm predicts on sequentially revealed instances (feature vectors), and is compared against the best linear function (comparator) in hindsight. Popular algorithms in this…

Machine Learning · Computer Science 2019-02-21 Michał Kempka, Wojciech Kotłowski, Manfred K. Warmuth

We study adversarial online nonparametric regression with general convex losses and propose a parameter-free learning algorithm that achieves minimax optimal rates. Our approach leverages chaining trees to compete against Hölder…

Statistics Theory · Mathematics 2025-04-14 Paul Liautaud, Pierre Gaillard, Olivier Wintenberger

We design algorithms for online linear optimization that have optimal regret and at the same time do not need to know any upper or lower bounds on the norm of the loss vectors. We achieve adaptiveness to norms of loss vectors by scale…

Machine Learning · Computer Science 2015-07-03 Francesco Orabona, David Pal

A crucial problem in reinforcement learning is learning the optimal policy. We study this in tabular infinite-horizon discounted Markov decision processes under the online setting. The existing algorithms either fail to achieve regret…

Machine Learning · Computer Science 2023-12-13 Xiang Ji, Gen Li

This paper describes a new parameter-free online learning algorithm for changing environments. In comparing against algorithms with the same time complexity as ours, we obtain a strongly adaptive regret bound that is a factor of at least…

Machine Learning · Statistics 2017-08-08 Kwang-Sung Jun, Francesco Orabona, Rebecca Willett, Stephen Wright

Algorithms for online learning typically require one or more boundedness assumptions: that the domain is bounded, that the losses are Lipschitz, or both. In this paper, we develop a new setting for online learning with unbounded domains and…

Machine Learning · Computer Science 2023-07-18 Andrew Jacobsen, Ashok Cutkosky

We propose an online learning algorithm for a class of machine learning models under a separable stochastic approximation framework. The essence of our idea lies in the observation that certain parameters in the models are easier to…

Machine Learning · Computer Science 2023-05-23 Min Gan, Xiang-xiang Su, Guang-yong Chen, Jing Chen

We propose an online convex optimization algorithm (RescaledExp) that achieves optimal regret in the unconstrained setting without prior knowledge of any bounds on the loss functions. We prove a lower bound showing an exponential separation…

Machine Learning · Computer Science 2017-03-09 Ashok Cutkosky, Kwabena Boahen

We consider the online version of the isotonic regression problem. Given a set of linearly ordered points (e.g., on the real line), the learner must predict labels sequentially at adversarially chosen positions and is evaluated by her total…

Machine Learning · Computer Science 2016-10-10 Wojciech Kotłowski, Wouter M. Koolen, Alan Malek

We design and analyze minimax-optimal algorithms for online linear optimization games where the player's choice is unconstrained. The player strives to minimize regret, the difference between his loss and the loss of a post-hoc benchmark…

Machine Learning · Computer Science 2013-02-12 H. Brendan McMahan

We present new algorithms for online convex optimization over unbounded domains that obtain parameter-free regret in high probability given access only to potentially heavy-tailed subgradient estimates. Previous work in unbounded domains…

Machine Learning · Statistics 2023-02-28 Jiujia Zhang, Ashok Cutkosky

We introduce several new black-box reductions that significantly improve the design of adaptive and parameter-free online learning algorithms by simplifying analysis, improving regret guarantees, and sometimes even improving runtime. We…

Machine Learning · Computer Science 2018-06-27 Ashok Cutkosky, Francesco Orabona

We develop parameter-free algorithms for unconstrained online learning with regret guarantees that scale with the gradient variation $V_T(u) = \sum_{t=2}^T \|\nabla f_t(u)-\nabla f_{t-1}(u)\|^2$. For $L$-smooth convex loss, we provide…

Machine Learning · Computer Science 2026-04-14 Yuheng Zhao, Andrew Jacobsen, Nicolò Cesa-Bianchi, Peng Zhao

Stochastic gradient descent algorithms for training linear and kernel predictors are gaining more and more importance, thanks to their scalability. While various methods have been proposed to speed up their convergence, the model selection…

Machine Learning · Computer Science 2014-06-17 Francesco Orabona

We consider an online two-stage stochastic optimization with long-term constraints over a finite horizon of $T$ periods. At each period, we take the first-stage action, observe a model parameter realization and then take the second-stage…

Machine Learning · Computer Science 2024-01-03 Piao Hu, Jiashuo Jiang, Guodong Lyu, Hao Su