Related papers: Error-Correcting Tournaments

An efficient reduction of ranking to classification

This paper describes an efficient reduction of the learning problem of ranking to binary classification. The reduction guarantees an average pairwise misranking regret of at most that of the binary classifier regret, improving a recent…

Machine Learning · Computer Science 2007-12-07 Nir Ailon, Mehryar Mohri

Surrogate Regret Bounds for Bipartite Ranking via Strongly Proper Losses

The problem of bipartite ranking, where instances are labeled positive or negative and the goal is to learn a scoring function that minimizes the probability of mis-ranking a pair of positive and negative instances (or equivalently, that…

Machine Learning · Computer Science 2014-08-13 Shivani Agarwal

Constant regret for sequence prediction with limited advice

We investigate the problem of cumulative regret minimization for individual sequence prediction with respect to the best expert in a finite family of size K under limited access to information. We assume that in each round, the learner can…

Statistics Theory · Mathematics 2022-10-06 El Mehdi Saad, G. Blanchard

Alternating Regret for Online Convex Optimization

Motivated by alternating learning dynamics in two-player games, a recent work by Cevher et al. (2024) shows that $o(\sqrt{T})$ alternating regret is possible for any $T$-round adversarial Online Linear Optimization (OLO) problem, and left as…

Machine Learning · Computer Science 2025-06-19 Soumita Hait, Ping Li, Haipeng Luo, Mengxiao Zhang

Efficient Online Bandit Multiclass Learning with $\tilde{O}(\sqrt{T})$ Regret

We present an efficient second-order algorithm with $\tilde{O}(\frac{1}{\eta}\sqrt{T})$ regret for the bandit online multiclass problem. The regret bound holds simultaneously with respect to a family of loss functions parameterized by…

Machine Learning · Computer Science 2018-01-19 Alina Beygelzimer, Francesco Orabona, Chicheng Zhang

Efficient improper learning for online logistic regression

We consider the setting of online logistic regression and consider the regret with respect to the 2-ball of radius B. It is known (see [Hazan et al., 2014]) that any proper algorithm which has logarithmic regret in the number of samples…

Machine Learning · Computer Science 2020-11-04 Rémi Jézéquel, Pierre Gaillard, Alessandro Rudi

Towards Optimal Regret in Adversarial Linear MDPs with Bandit Feedback

We study online reinforcement learning in linear Markov decision processes with adversarial losses and bandit feedback, without prior knowledge on transitions or access to simulators. We introduce two algorithms that achieve improved regret…

Machine Learning · Computer Science 2023-10-19 Haolin Liu, Chen-Yu Wei, Julian Zimmert

Constructing Multiclass Classifiers using Binary Classifiers Under Log-Loss

The construction of multiclass classifiers from binary elements is studied in this paper, and performance is quantified by the regret, defined with respect to the Bayes optimal log-loss. We discuss two known methods. The first is one vs.…

Machine Learning · Computer Science 2021-08-13 Assaf Ben-Yishai, Or Ordentlich

Best of Both Worlds: Regret Minimization versus Minimax Play

In this paper, we investigate the existence of online learning algorithms with bandit feedback that simultaneously guarantee $O(1)$ regret compared to a given comparator strategy, and $\tilde{O}(\sqrt{T})$ regret compared to any fixed…

Machine Learning · Computer Science 2025-06-05 Adrian Müller, Jon Schneider, Stratis Skoulakis, Luca Viano, Volkan Cevher

Beyond Worst-Case Online Classification: VC-Based Regret Bounds for Relaxed Benchmarks

We revisit online binary classification by shifting the focus from competing with the best-in-class binary loss to competing against relaxed benchmarks that capture smoothed notions of optimality. Instead of measuring regret relative to the…

Machine Learning · Statistics 2025-04-16 Omar Montasser, Abhishek Shetty, Nikita Zhivotovskiy

Versatile Dueling Bandits: Best-of-both-World Analyses for Online Learning from Preferences

We study the problem of $K$-armed dueling bandits for both stochastic and adversarial environments, where the goal of the learner is to aggregate information through relative preferences of pairs of decision points queried in an online…

Machine Learning · Computer Science 2022-02-15 Aadirupa Saha, Pierre Gaillard

Online Optimization: Competing with Dynamic Comparators

Recent literature on online learning has focused on developing adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop…

Machine Learning · Computer Science 2015-01-27 Ali Jadbabaie, Alexander Rakhlin, Shahin Shahrampour, Karthik Sridharan

Efficient $\Phi$-Regret Minimization with Low-Degree Swap Deviations in Extensive-Form Games

Recent breakthrough results by Dagan, Daskalakis, Fishelson and Golowich [2023] and Peng and Rubinstein [2023] established an efficient algorithm attaining at most $\epsilon$ swap regret over extensive-form strategy spaces of dimension $N$…

Computer Science and Game Theory · Computer Science 2025-02-14 Brian Hu Zhang, Ioannis Anagnostides, Gabriele Farina, Tuomas Sandholm

Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability

We study the $K$-armed contextual dueling bandit problem, a sequential decision making setting in which the learner uses contextual information to make two decisions, but only observes preference-based feedback suggesting that one…

Machine Learning · Computer Science 2021-11-25 Aadirupa Saha, Akshay Krishnamurthy

Solving Imperfect-Information Games via Discounted Regret Minimization

Counterfactual regret minimization (CFR) is a family of iterative algorithms that are the most popular and, in practice, fastest approach to approximately solving large imperfect-information games. In this paper we introduce novel CFR…

Computer Science and Game Theory · Computer Science 2019-02-22 Noam Brown, Tuomas Sandholm

The efficacy of tournament designs

Tournaments are a widely used mechanism to rank alternatives in a noisy environment. This paper investigates a fundamental issue of economics in tournament design: what is the best usage of limited resources, that is, how should the…

Applications · Statistics 2022-05-24 Balázs R. Sziklai, Péter Biró, László Csató

Tournament Robustness via Redundancy

A knockout tournament is one of the simplest and most popular forms of competition. Here, we are given a binary tournament tree where all leaves are labeled with seed position names. The players participating in the tournament are assigned to…

Discrete Mathematics · Computer Science 2025-06-05 Klim Efremenko, Hendrik Molter, Meirav Zehavi

Using tournaments to calculate AUROC for zero-shot classification with LLMs

Large language models perform surprisingly well on many zero-shot classification tasks, but are difficult to fairly compare to supervised classifiers due to the lack of a modifiable decision boundary. In this work, we propose and evaluate a…

Computation and Language · Computer Science 2025-11-25 WonJin Yoon, Ian Bulovic, Timothy A. Miller

Hierarchies of No-regret Algorithms

Our paper studies the setting of players using no-regret algorithms in various two-player games. We address whether having stronger regret guarantees or playing against an opponent with weaker regret guarantees yields higher utilities for…

Computer Science and Game Theory · Computer Science 2026-04-29 R. Xu, E. Yachbes, J. Zhang

Stable-Predictive Optimistic Counterfactual Regret Minimization

The CFR framework has been a powerful tool for solving large-scale extensive-form games in practice. However, the theoretical rate at which past CFR-based algorithms converge to the Nash equilibrium is on the order of $O(T^{-1/2})$, where…

Computer Science and Game Theory · Computer Science 2019-02-14 Gabriele Farina, Christian Kroer, Noam Brown, Tuomas Sandholm