Related papers: Error-Correcting Tournaments

200 papers

This paper describes an efficient reduction of the learning problem of ranking to binary classification. The reduction guarantees an average pairwise misranking regret of at most that of the binary classifier regret, improving a recent…

Machine Learning · Computer Science 2007-12-07 Nir Ailon, Mehryar Mohri

The problem of bipartite ranking, where instances are labeled positive or negative and the goal is to learn a scoring function that minimizes the probability of mis-ranking a pair of positive and negative instances (or equivalently, that…

Machine Learning · Computer Science 2014-08-13 Shivani Agarwal

We investigate the problem of cumulative regret minimization for individual sequence prediction with respect to the best expert in a finite family of size K under limited access to information. We assume that in each round, the learner can…

Statistics Theory · Mathematics 2022-10-06 El Mehdi Saad, G. Blanchard

Motivated by alternating learning dynamics in two-player games, a recent work by Cevher et al. (2024) shows that $o(\sqrt{T})$ alternating regret is possible for any $T$-round adversarial Online Linear Optimization (OLO) problem, and left as…

Machine Learning · Computer Science 2025-06-19 Soumita Hait, Ping Li, Haipeng Luo, Mengxiao Zhang

We present an efficient second-order algorithm with $\tilde{O}(\frac{1}{\eta}\sqrt{T})$ regret for the bandit online multiclass problem. The regret bound holds simultaneously with respect to a family of loss functions parameterized by…

Machine Learning · Computer Science 2018-01-19 Alina Beygelzimer, Francesco Orabona, Chicheng Zhang

We consider the setting of online logistic regression and consider the regret with respect to the $\ell_2$-ball of radius $B$. It is known (see [Hazan et al., 2014]) that any proper algorithm which has logarithmic regret in the number of samples…

Machine Learning · Computer Science 2020-11-04 Rémi Jézéquel, Pierre Gaillard, Alessandro Rudi

We study online reinforcement learning in linear Markov decision processes with adversarial losses and bandit feedback, without prior knowledge on transitions or access to simulators. We introduce two algorithms that achieve improved regret…

Machine Learning · Computer Science 2023-10-19 Haolin Liu, Chen-Yu Wei, Julian Zimmert

The construction of multiclass classifiers from binary elements is studied in this paper, and performance is quantified by the regret, defined with respect to the Bayes optimal log-loss. We discuss two known methods. The first is one vs.…

Machine Learning · Computer Science 2021-08-13 Assaf Ben-Yishai, Or Ordentlich

In this paper, we investigate the existence of online learning algorithms with bandit feedback that simultaneously guarantee $O(1)$ regret compared to a given comparator strategy, and $\tilde{O}(\sqrt{T})$ regret compared to any fixed…

Machine Learning · Computer Science 2025-06-05 Adrian Müller, Jon Schneider, Stratis Skoulakis, Luca Viano, Volkan Cevher

We revisit online binary classification by shifting the focus from competing with the best-in-class binary loss to competing against relaxed benchmarks that capture smoothed notions of optimality. Instead of measuring regret relative to the…

Machine Learning · Statistics 2025-04-16 Omar Montasser, Abhishek Shetty, Nikita Zhivotovskiy

We study the problem of $K$-armed dueling bandits for both stochastic and adversarial environments, where the goal of the learner is to aggregate information through relative preferences of pairs of decision points queried in an online…

Machine Learning · Computer Science 2022-02-15 Aadirupa Saha, Pierre Gaillard

Recent literature on online learning has focused on developing adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop…

Machine Learning · Computer Science 2015-01-27 Ali Jadbabaie, Alexander Rakhlin, Shahin Shahrampour, Karthik Sridharan

Recent breakthrough results by Dagan, Daskalakis, Fishelson and Golowich [2023] and Peng and Rubinstein [2023] established an efficient algorithm attaining at most $\epsilon$ swap regret over extensive-form strategy spaces of dimension $N$…

Computer Science and Game Theory · Computer Science 2025-02-14 Brian Hu Zhang, Ioannis Anagnostides, Gabriele Farina, Tuomas Sandholm

We study the $K$-armed contextual dueling bandit problem, a sequential decision making setting in which the learner uses contextual information to make two decisions, but only observes preference-based feedback suggesting that one…

Machine Learning · Computer Science 2021-11-25 Aadirupa Saha, Akshay Krishnamurthy

Counterfactual regret minimization (CFR) is a family of iterative algorithms that are the most popular and, in practice, fastest approach to approximately solving large imperfect-information games. In this paper we introduce novel CFR…

Computer Science and Game Theory · Computer Science 2019-02-22 Noam Brown, Tuomas Sandholm

Tournaments are a widely used mechanism to rank alternatives in a noisy environment. This paper investigates a fundamental issue of economics in tournament design: what is the best usage of limited resources, that is, how should the…

Applications · Statistics 2022-05-24 Balázs R. Sziklai, Péter Biró, László Csató

A knockout tournament is one of the most simple and popular forms of competition. Here, we are given a binary tournament tree where all leaves are labeled with seed position names. The players participating in the tournament are assigned to…

Discrete Mathematics · Computer Science 2025-06-05 Klim Efremenko, Hendrik Molter, Meirav Zehavi

Large language models perform surprisingly well on many zero-shot classification tasks, but are difficult to fairly compare to supervised classifiers due to the lack of a modifiable decision boundary. In this work, we propose and evaluate a…

Computation and Language · Computer Science 2025-11-25 WonJin Yoon, Ian Bulovic, Timothy A. Miller

Our paper studies the setting of players using no-regret algorithms in various two-player games. We address whether having stronger regret guarantees or playing against an opponent with weaker regret guarantees yields higher utilities for…

Computer Science and Game Theory · Computer Science 2026-04-29 R. Xu, E. Yachbes, J. Zhang

The CFR framework has been a powerful tool for solving large-scale extensive-form games in practice. However, the theoretical rate at which past CFR-based algorithms converge to the Nash equilibrium is on the order of $O(T^{-1/2})$, where…

Computer Science and Game Theory · Computer Science 2019-02-14 Gabriele Farina, Christian Kroer, Noam Brown, Tuomas Sandholm