Related papers: A new Hedging algorithm and its application to inf…

200 papers

The article is devoted to investigating the application of hedging strategies to online expert weight allocation under delayed feedback. As the main result, we develop the General Hedging algorithm $\mathcal{G}$ based on the exponential…

Machine Learning · Computer Science 2019-06-25 Alexander Korotin, Vladimir V'yugin, Evgeny Burnaev

We study how we can adapt a predictor to a non-stationary environment with advice from multiple experts. We study the problem under complete feedback when the best expert changes over time from a decision theoretic point of view. Proposed…

Machine Learning · Computer Science 2017-08-08 Vishnu Raj, Sheetal Kalyani

In this work, we aim to create a completely online algorithmic framework for prediction with expert advice that is translation-free and scale-free of the expert losses. Our goal is to create a generalized algorithm that is suitable for use…

Machine Learning · Computer Science 2020-09-10 Kaan Gokcesu, Hakan Gokcesu

We study prediction with expert advice in the setting where the losses are accumulated with some discounting: the impact of old losses may gradually vanish. We generalize the Aggregating Algorithm and the Aggregating Algorithm for…

Machine Learning · Computer Science 2010-06-07 Alexey Chernov, Fedor Zhdanov

We study the problem of incentive-compatible online learning with bandit feedback. In this class of problems, the experts are self-interested agents who might misrepresent their preferences with the goal of being selected most often. The…

Machine Learning · Computer Science 2024-05-13 Julian Zimmert, Teodor V. Marinov

We develop the setting of sequential prediction based on shifting experts and on a "smooth" version of the method of specialized experts. To aggregate experts' predictions, we use the AdaHedge algorithm, which is a version of the Hedge…

Machine Learning · Computer Science 2020-01-24 Vladimir V'yugin, Vladimir Trunov

In online learning an algorithm plays against an environment with losses possibly picked by an adversary at each round. The generality of this framework includes problems that are not adversarial, for example offline optimization, or saddle…

Machine Learning · Computer Science 2021-02-04 Ryan D'Orazio, Ruitong Huang

For each of $T$ time steps, $m$ experts report probability distributions over $n$ outcomes; we wish to learn to aggregate these forecasts in a way that attains a no-regret guarantee. We focus on the fundamental and practical aggregation…

Machine Learning · Computer Science 2023-10-11 Eric Neyman, Tim Roughgarden

In this paper, we investigate the online non-convex optimization problem, which generalizes the classic online convex optimization problem by relaxing the convexity assumption on the cost function. For this type of problem, the classic…

Machine Learning · Computer Science 2017-09-14 Lin Yang, Cheng Tan, Wing Shing Wong

We consider a general framework of online learning with expert advice where regret is defined with respect to sequences of experts accepted by a weighted automaton. Our framework covers several problems previously studied, including…

Machine Learning · Computer Science 2017-10-24 Mehryar Mohri, Scott Yang

Online machine learning systems need to adapt to domain shifts. Meanwhile, acquiring a label at every timestep is expensive. We propose a surprisingly simple algorithm that adaptively balances its regret and its number of label queries in…

Machine Learning · Computer Science 2021-03-01 Yining Chen, Haipeng Luo, Tengyu Ma, Chicheng Zhang

We study online learning settings in which experts act strategically to maximize their influence on the learning algorithm's predictions by potentially misreporting their beliefs about a sequence of binary events. Our goal is twofold.…

Machine Learning · Computer Science 2020-07-02 Rupert Freeman, David M. Pennock, Chara Podimata, Jennifer Wortman Vaughan

We study revenue optimization learning algorithms for posted-price auctions with strategic buyers. We analyze a very broad family of monotone regret minimization algorithms for this problem, which includes the previously best known…

Machine Learning · Computer Science 2014-11-25 Mehryar Mohri, Andres Muñoz Medina

We consider the problem of online stochastic optimization in a distributed setting with $M$ clients connected through a central server. We develop a distributed online learning algorithm that achieves order-optimal cumulative regret with…

Machine Learning · Computer Science 2023-06-07 Sudeep Salgia, Qing Zhao, Tamir Gabay, Kobi Cohen

We consider prediction with expert advice for strongly convex and bounded losses, and investigate trade-offs between regret and "variance" (i.e., squared difference of learner's predictions and best expert predictions). With $K$ experts,…

Machine Learning · Computer Science 2022-06-07 Dirk van der Hoeven, Nikita Zhivotovskiy, Nicolò Cesa-Bianchi

We consider a variation on the problem of prediction with expert advice, where new forecasters that were unknown until then may appear at each round. As often in prediction with expert advice, designing an algorithm that achieves…

Machine Learning · Statistics 2017-09-01 Jaouad Mourtada, Odalric-Ambrym Maillard

This paper describes a new parameter-free online learning algorithm for changing environments. In comparing against algorithms with the same time complexity as ours, we obtain a strongly adaptive regret bound that is a factor of at least…

Machine Learning · Statistics 2017-08-08 Kwang-Sung Jun, Francesco Orabona, Rebecca Willett, Stephen Wright

In this paper, we study a variant of the framework of online learning using expert advice with limited/bandit feedback. We consider each expert as a learning entity, seeking to more accurately reflect certain real-world applications. In…

Machine Learning · Computer Science 2017-02-21 Adish Singla, Hamed Hassani, Andreas Krause

We address online learning in complex auction settings, such as sponsored search auctions, where the value of the bidder is unknown to her, evolving in an arbitrary manner and observed only if the bidder wins an allocation. We leverage the…

Computer Science and Game Theory · Computer Science 2018-06-04 Zhe Feng, Chara Podimata, Vasilis Syrgkanis

In this paper, we study the behavior of the Hedge algorithm in the online stochastic setting. We prove that anytime Hedge with decreasing learning rate, which is one of the simplest algorithms for the problem of prediction with expert…

Machine Learning · Statistics 2019-07-10 Jaouad Mourtada, Stéphane Gaïffas