English
Related papers

Related papers: Adaptive Hedge

200 papers

In this paper, we study the behavior of the Hedge algorithm in the online stochastic setting. We prove that anytime Hedge with decreasing learning rate, which is one of the simplest algorithm for the problem of prediction with expert…

Machine Learning · Statistics 2019-07-10 Jaouad Mourtada , Stéphane Gaïffas

In online learning an algorithm plays against an environment with losses possibly picked by an adversary at each round. The generality of this framework includes problems that are not adversarial, for example offline optimization, or saddle…

Machine Learning · Computer Science 2021-02-04 Ryan D'Orazio , Ruitong Huang

A key challenge in online learning is that classical algorithms can be slow to adapt to changing environments. Recent studies have proposed "meta" algorithms that convert any online learning algorithm to one that is adaptive to changing…

Machine Learning · Statistics 2017-11-08 Kwang-Sung Jun , Francesco Orabona , Stephen Wright , Rebecca Willett

We propose a general framework for studying adaptive regret bounds in the online learning framework, including model selection bounds and data-dependent bounds. Given a data- or model-dependent bound we ask, "Does there exist some algorithm…

Machine Learning · Computer Science 2020-02-14 Dylan J. Foster , Alexander Rakhlin , Karthik Sridharan

Recent literature on online learning has focused on developing adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop…

Machine Learning · Computer Science 2015-01-27 Ali Jadbabaie , Alexander Rakhlin , Shahin Shahrampour , Karthik Sridharan

Maintaining predictive accuracy in non-stationary environments requires online model selection to adapt autonomously to unknown distribution shifts. However, existing tuning-free algorithms face a fundamental trade-off between robustness…

Machine Learning · Computer Science 2026-05-27 Kei Takemura , Ryuta Matsuno , Keita Sakuma

Hedge has been proposed as an adaptive scheme, which guides an agent's decision in resource selection and distribution problems that can be modeled as a multi-armed bandit full information game. Such problems are encountered in the areas of…

Machine Learning · Computer Science 2018-12-10 Miltiades E. Anagnostou , Maria A. Lambrou

Regret minimization is treated as the golden rule in the traditional study of online learning. However, regret minimization algorithms tend to converge to the static optimum, thus being suboptimal for changing environments. To address this…

Machine Learning · Computer Science 2020-02-07 Lijun Zhang , Shiyin Lu , Tianbao Yang

We show that the Hedge algorithm, a method that is widely used in Machine Learning, can be interpreted as a particular instance of Dual Averaging schemes, which have recently been introduced by Nesterov for regret minimization. Based on…

Optimization and Control · Mathematics 2011-12-07 Michel Baes , Michael Bürgisser

We aim to design strategies for sequential decision making that adjust to the difficulty of the learning problem. We study this question both in the setting of prediction with expert advice, and for more general combinatorial decision…

Machine Learning · Computer Science 2015-03-02 Wouter M. Koolen , Tim van Erven

Online learning and model reference adaptive control have many interesting intersections. One area where they differ however is in how the algorithms are analyzed and what objective or metric is used to discriminate "good" algorithms from…

Systems and Control · Electrical Eng. & Systems 2025-01-24 Travis E. Gibson , Sawal Acharya

We build a theoretical framework for designing and understanding practical meta-learning methods that integrates sophisticated formalizations of task-similarity with the extensive literature on online convex optimization and sequential…

Machine Learning · Computer Science 2019-12-10 Mikhail Khodak , Maria-Florina Balcan , Ameet Talwalkar

This paper describes a new parameter-free online learning algorithm for changing environments. In comparing against algorithms with the same time complexity as ours, we obtain a strongly adaptive regret bound that is a factor of at least…

Machine Learning · Statistics 2017-08-08 Kwang-Sung Jun , Francesco Orabona , Rebecca Willett , Stephen Wright

Practical online learning tasks are often naturally defined on unconstrained domains, where optimal algorithms for general convex losses are characterized by the notion of comparator adaptivity. In this paper, we design such algorithms in…

Machine Learning · Computer Science 2022-10-13 Zhiyu Zhang , Ashok Cutkosky , Ioannis Ch. Paschalidis

We study the problem of decision-theoretic online learning (DTOL). Motivated by practical applications, we focus on DTOL when the number of actions is very large. Previous algorithms for learning in this framework have a tunable learning…

Machine Learning · Computer Science 2010-01-19 Kamalika Chaudhuri , Yoav Freund , Daniel Hsu

Ranking algorithms are fundamental to various online platforms across e-commerce sites to content streaming services. Our research addresses the challenge of adaptively ranking items from a candidate pool for heterogeneous users, a key…

Machine Learning · Computer Science 2024-06-10 Jingyuan Wang , Perry Dong , Ying Jin , Ruohan Zhan , Zhengyuan Zhou

Online learning algorithms are designed to learn even when their input is generated by an adversary. The widely-accepted formal definition of an online algorithm's ability to learn is the game-theoretic notion of regret. We argue that the…

Machine Learning · Computer Science 2012-07-03 Raman Arora , Ofer Dekel , Ambuj Tewari

Adaptive gradient methods are the method of choice for optimization in machine learning and used to train the largest deep models. In this paper we study the problem of learning a local preconditioner, that can change as the data is…

Machine Learning · Computer Science 2023-01-27 Zhou Lu , Wenhan Xia , Sanjeev Arora , Elad Hazan

Prediction markets rely on liquidity to convert trades into informative prices, yet existing mechanisms fix liquidity ex ante. This restriction enforces a static trade-off between price responsiveness and worst-case loss despite inherently…

Computer Science and Game Theory · Computer Science 2026-05-12 Enrique Nueve , Bao Nguyen , Rafael Frongillo , Bo Waggoner

Any gradient descent optimization requires to choose a learning rate. With deeper and deeper models, tuning that learning rate can easily become tedious and does not necessarily lead to an ideal convergence. We propose a variation of the…

Machine Learning · Statistics 2018-04-10 Mathieu Ravaut , Satya Gorti
‹ Prev 1 2 3 10 Next ›