English
Related papers

Related papers: A parameter-free hedging algorithm

200 papers

We propose the first parameter-free decentralized online learning algorithms with network regret guarantees, which achieve sublinear regret without requiring hyperparameter tuning. This family of algorithms connects multi-agent coin-betting…

Machine Learning · Computer Science 2025-10-20 Tomas Ortega , Hamid Jafarkhani

Most methods for decision-theoretic online learning are based on the Hedge algorithm, which takes a parameter called the learning rate. In most previous analyses the learning rate was carefully tuned to obtain optimal worst-case…

Machine Learning · Statistics 2015-03-04 Tim van Erven , Peter Grünwald , Wouter M. Koolen , Steven de Rooij

In the recent years, a number of parameter-free algorithms have been developed for online linear optimization over Hilbert spaces and for learning with expert advice. These algorithms achieve optimal regret bounds that depend on the unknown…

Machine Learning · Computer Science 2016-11-07 Francesco Orabona , Dávid Pál

This paper describes a new parameter-free online learning algorithm for changing environments. In comparing against algorithms with the same time complexity as ours, we obtain a strongly adaptive regret bound that is a factor of at least…

Machine Learning · Statistics 2017-08-08 Kwang-Sung Jun , Francesco Orabona , Rebecca Willett , Stephen Wright

Maintaining predictive accuracy in non-stationary environments requires online model selection to adapt autonomously to unknown distribution shifts. However, existing tuning-free algorithms face a fundamental trade-off between robustness…

Machine Learning · Computer Science 2026-05-27 Kei Takemura , Ryuta Matsuno , Keita Sakuma

A natural goal when designing online learning algorithms for non-stationary environments is to bound the regret of the algorithm in terms of the temporal variation of the input sequence. Intuitively, when the variation is small, it should…

Machine Learning · Computer Science 2021-12-08 Gautam Goel , Babak Hassibi

Much of the work in online learning focuses on the study of sublinear upper bounds on the regret. In this work, we initiate the study of best-case lower bounds in online convex optimization, wherein we bound the largest improvement an…

Machine Learning · Computer Science 2021-06-25 Cristóbal Guzmán , Nishant A. Mehta , Ali Mortazavi

We present new algorithms for online convex optimization over unbounded domains that obtain parameter-free regret in high-probability given access only to potentially heavy-tailed subgradient estimates. Previous work in unbounded domains…

Machine Learning · Statistics 2023-02-28 Jiujia Zhang , Ashok Cutkosky

Online learning algorithms have been successfully used to design caching policies with sublinear regret in the total number of requests, with no statistical assumption about the request sequence. Most existing algorithms involve…

Machine Learning · Computer Science 2025-03-05 Younes Ben Mazziane , Francescomaria Faticanti , Sara Alouf , Giovanni Neglia

Sequential prediction problems such as imitation learning, where future observations depend on previous predictions (actions), violate the common i.i.d. assumptions made in statistical learning. This leads to poor performance in theory and…

Machine Learning · Computer Science 2015-03-17 Stephane Ross , Geoffrey J. Gordon , J. Andrew Bagnell

We design and analyze algorithms for online linear optimization that have optimal regret and at the same time do not need to know any upper or lower bounds on the norm of the loss vectors. Our algorithms are instances of the Follow the…

Machine Learning · Computer Science 2016-12-15 Francesco Orabona , Dávid Pál

In online learning an algorithm plays against an environment with losses possibly picked by an adversary at each round. The generality of this framework includes problems that are not adversarial, for example offline optimization, or saddle…

Machine Learning · Computer Science 2021-02-04 Ryan D'Orazio , Ruitong Huang

We consider the problem of online learning in Linear Quadratic Control systems whose state transition and state-action transition matrices $A$ and $B$ may be initially unknown. We devise an online learning algorithm and provide guarantees…

Machine Learning · Computer Science 2021-09-30 Yassir Jedra , Alexandre Proutiere

We consider an online two-stage stochastic optimization with long-term constraints over a finite horizon of $T$ periods. At each period, we take the first-stage action, observe a model parameter realization and then take the second-stage…

Machine Learning · Computer Science 2024-01-03 Piao Hu , Jiashuo Jiang , Guodong Lyu , Hao Su

We extend and combine several tools of the literature to design fast, adaptive, anytime and scale-free online learning algorithms. Scale-free regret bounds must scale linearly with the maximum loss, both toward large losses and toward very…

Machine Learning · Computer Science 2024-10-22 Laurent Orseau , Marcus Hutter

This paper considers the stability of online learning algorithms and its implications for learnability (bounded regret). We introduce a novel quantity called {\em forward regret} that intuitively measures how good an online learning…

Machine Learning · Computer Science 2012-11-28 Ankan Saha , Prateek Jain , Ambuj Tewari

Parameter-free algorithms are online learning algorithms that do not require setting learning rates. They achieve optimal regret with respect to the distance between the initial point and any competitor. Yet, parameter-free algorithms do…

Machine Learning · Computer Science 2022-03-22 Keyi Chen , Ashok Cutkosky , Francesco Orabona

The design of effective online caching policies is an increasingly important problem for content distribution networks, online social networks and edge computing services, among other areas. This paper proposes a new algorithmic toolbox for…

Networking and Internet Architecture · Computer Science 2022-10-21 Naram Mhaisen , George Iosifidis , Douglas Leith

Online linear programming plays an important role in both revenue management and resource allocation, and recent research has focused on developing efficient first-order online learning algorithms. Despite the empirical success of…

Machine Learning · Computer Science 2025-01-08 Wenzhi Gao , Chunlin Sun , Chenyu Xue , Dongdong Ge , Yinyu Ye

We propose a new partial-observability model for online learning problems where the learner, besides its own loss, also observes some noisy feedback about the other actions, depending on the underlying structure of the problem. We represent…

Machine Learning · Computer Science 2026-04-16 Tomáš Kocák , Gergely Neu , Michal Valko
‹ Prev 1 2 3 10 Next ›