English
Related papers

Related papers: Optimal anytime regret with two experts

200 papers

Prediction with expert advice is a foundational problem in online learning. In instances with $T$ rounds and $n$ experts, the classical Multiplicative Weights Update method suffers at most $\sqrt{(T/2)\ln n}$ regret when $T$ is known…

Machine Learning · Computer Science 2022-03-16 Laura Greenstreet , Nicholas J. A. Harvey , Victor Sanches Portella

We study the prediction with expert advice setting, where the aim is to produce a decision by combining the decisions generated by a set of experts, e.g., independently running algorithms. We achieve the min-max optimal dynamic regret under…

Machine Learning · Computer Science 2022-08-09 Hakan Gokcesu , Suleyman S. Kozat

Prediction with experts' advice is one of the most fundamental problems in online learning and captures many of its technical challenges. A recent line of work has looked at online learning through the lens of differential equations and…

Machine Learning · Computer Science 2022-10-04 Victor Sanches Portella , Christopher Liaw , Nicholas J. A. Harvey

We study the classical problem of prediction with expert advice in the adversarial setting with a geometric stopping time. In 1965, Cover gave the optimal algorithm for the case of 2 experts. In this paper, we design the optimal algorithm,…

Machine Learning · Computer Science 2016-07-12 Nick Gravin , Yuval Peres , Balasubramanian Sivan

We present a new anytime algorithm that achieves near-optimal regret for any instance of finite stochastic partial monitoring. In particular, the new algorithm achieves the minimax regret, within logarithmic factors, for both "easy" and…

Machine Learning · Computer Science 2012-07-03 Gabor Bartok , Navid Zolghadr , Csaba Szepesvari

Consider the classical problem of predicting the next bit in a sequence of bits. A standard performance measure is {\em regret} (loss in payoff) with respect to a set of experts. For example if we measure performance with respect to two…

Machine Learning · Computer Science 2013-04-30 Rina Panigrahy , Preyas Popat

We study the problem of expert advice under partial bandit feedback setting and create a sequential minimax optimal algorithm. Our algorithm works with a more general partial monitoring setting, where, in contrast to the classical bandit…

Machine Learning · Computer Science 2022-04-15 Kaan Gokcesu , Hakan Gokcesu

The need for fast and robust optimization algorithms are of critical importance in all areas of machine learning. This paper treats the task of designing optimization algorithms as an optimal control problem. Using regret as a metric for an…

Machine Learning · Computer Science 2021-01-21 Philippe Casgrain , Anastasis Kratsios

In this paper, we consider the problem of prediction with expert advice in dynamic environments. We choose tracking regret as the performance metric and develop two adaptive and efficient algorithms with data-dependent tracking regret…

Machine Learning · Computer Science 2020-02-11 Shiyin Lu , Lijun Zhang

We consider a stochastic bandit problem with infinitely many arms. In this setting, the learner has no chance of trying all the arms even once and has to dedicate its limited number of samples only to a certain number of arms. All previous…

Machine Learning · Computer Science 2015-05-19 Alexandra Carpentier , Michal Valko

We consider a variation on the problem of prediction with expert advice, where new forecasters that were unknown until then may appear at each round. As often in prediction with expert advice, designing an algorithm that achieves…

Machine Learning · Statistics 2017-09-01 Jaouad Mourtada , Odalric-Ambrym Maillard

We address the problem of sequential prediction with expert advice in a non-stationary environment with long-term memory guarantees in the sense of Bousquet and Warmuth [4]. We give a linear-time algorithm that improves on the best known…

Machine Learning · Computer Science 2021-06-25 James Robinson , Mark Herbster

We study a variant of decision-theoretic online learning in which the set of experts that are available to Learner can shrink over time. This is a restricted version of the well-studied sleeping experts problem, itself a generalization of…

Machine Learning · Computer Science 2019-10-31 Hamid Shayestehmanesh , Sajjad Azami , Nishant A. Mehta

We consider the classical question of predicting binary sequences and study the {\em optimal} algorithms for obtaining the best possible regret and payoff functions for this problem. The question turns out to be also equivalent to the…

Machine Learning · Computer Science 2013-05-08 Alexandr Andoni , Rina Panigrahy

Regret minimization is treated as the golden rule in the traditional study of online learning. However, regret minimization algorithms tend to converge to the static optimum, thus being suboptimal for changing environments. To address this…

Machine Learning · Computer Science 2020-02-07 Lijun Zhang , Shiyin Lu , Tianbao Yang

We investigate the problem of cumulative regret minimization for individual sequence prediction with respect to the best expert in a finite family of size K under limited access to information. We assume that in each round, the learner can…

Statistics Theory · Mathematics 2022-10-06 El Mehdi Saad , G. Blanchard

We study a regret minimization problem with the existence of multiple best/near-optimal arms in the multi-armed bandit setting. We consider the case when the number of arms/actions is comparable or much larger than the time horizon, and…

Machine Learning · Statistics 2020-10-23 Yinglun Zhu , Robert Nowak

In this work, we propose an efficient minimax optimal global optimization algorithm for multivariate Lipschitz continuous functions. To evaluate the performance of our approach, we utilize the average regret instead of the traditional…

Machine Learning · Computer Science 2022-06-07 Kaan Gokcesu , Hakan Gokcesu

We consider linear stochastic bandits where the set of actions is an ellipsoid. We provide the first known minimax optimal algorithm for this problem. We first derive a novel information-theoretic lower bound on the regret of any algorithm,…

Machine Learning · Statistics 2025-02-25 Raymond Zhang , Hedi Hadiji , Richard Combes

We consider the classic problem of online convex optimisation. Whereas the notion of static regret is relevant for stationary problems, the notion of switching regret is more appropriate for non-stationary problems. A switching regret is…

Machine Learning · Computer Science 2025-03-07 Stephen Pasteris , Chris Hicks , Vasilios Mavroudis , Mark Herbster
‹ Prev 1 2 3 10 Next ›