Related papers: Bandit Multiclass Linear Classification: Efficient…

Bandit Multiclass Linear Classification for the Group Linear Separable Case

We consider the online multiclass linear classification under the bandit feedback setting. Beygelzimer, Pál, Szörényi, Thiruvenkatachari, Wei, and Zhang [ICML'19] considered two notions of linear separability, weak and strong…

Machine Learning · Computer Science 2021-09-01 Jittat Fakcharoenphol, Chayutpong Prompak

A Perceptron-based Fine Approximation Technique for Linear Separation

This paper presents a novel online learning method that aims at finding a separator hyperplane between data points labelled as either positive or negative. Since weights and biases of artificial neurons can directly be related to…

Machine Learning · Computer Science 2023-09-13 Ákos Hajnal

Polynomial-Time Robust Multiclass Linear Classification under Gaussian Marginals

We study the task of agnostic learning of multiclass linear classifiers under the Gaussian distribution. Given labeled examples $(x, y)$ from a distribution over $\mathbb{R}^d \times [k]$, with Gaussian $x$-marginal, the goal is to output a…

Machine Learning · Computer Science 2026-05-21 Ilias Diakonikolas, Giannis Iakovidis, Mingchen Ma

Linear Separation via Optimism

Binary linear classification has been explored since the very early days of the machine learning literature. Perhaps the most classical algorithm is the Perceptron, where a weight vector used to classify examples is maintained, and additive…

Machine Learning · Computer Science 2020-11-18 Rafael Hanashiro, Jacob Abernethy

Fast Rates for Bandit PAC Multiclass Classification

We study multiclass PAC learning with bandit feedback, where inputs are classified into one of $K$ possible labels and feedback is limited to whether or not the predicted labels are correct. Our main contribution is in designing a novel…

Machine Learning · Computer Science 2024-06-19 Liad Erez, Alon Cohen, Tomer Koren, Yishay Mansour, Shay Moran

Efficient Online-Bandit Strategies for Minimax Learning Problems

Several learning problems involve solving min-max problems, e.g., empirical distributional robust learning or learning with non-standard aggregated losses. More specifically, these problems are convex-linear problems where the minimization…

Machine Learning · Computer Science 2021-06-07 Christophe Roux, Elias Wirth, Sebastian Pokutta, Thomas Kerdreux

Exact and efficient top-K inference for multi-target prediction by querying separable linear relational models

Many complex multi-target prediction problems that concern large target spaces are characterised by a need for efficient prediction strategies that avoid the computation of predictions for all targets explicitly. Examples of such problems…

Information Retrieval · Computer Science 2018-03-06 Michiel Stock, Krzysztof Dembczynski, Bernard De Baets, Willem Waegeman

Multiclass Classification using dilute bandit feedback

This paper introduces a new online learning framework for multiclass classification called learning with diluted bandit feedback. At every time step, the algorithm predicts a candidate label set instead of a single label for the observed…

Machine Learning · Computer Science 2021-05-19 Gaurav Batra, Naresh Manwani

Beyond Bandit Feedback in Online Multiclass Classification

We study the problem of online multiclass classification in a setting where the learner's feedback is determined by an arbitrary directed graph. While including bandit feedback as a special case, feedback graphs allow a much richer set of…

Machine Learning · Computer Science 2024-02-20 Dirk van der Hoeven, Federico Fusco, Nicolò Cesa-Bianchi

Bandit-Feedback Online Multiclass Classification: Variants and Tradeoffs

Consider the domain of multiclass classification within the adversarial online setting. What is the price of relying on bandit feedback as opposed to full information? To what extent can an adaptive adversary amplify the loss compared to an…

Machine Learning · Computer Science 2024-02-13 Yuval Filmus, Steve Hanneke, Idan Mehalel, Shay Moran

Divide and Learn: Multi-Objective Combinatorial Optimization at Scale

Multi-objective combinatorial optimization seeks Pareto-optimal solutions over exponentially large discrete spaces, yet existing methods sacrifice generality, scalability, or theoretical guarantees. We reformulate it as an online learning…

Machine Learning · Computer Science 2026-02-13 Esha Singh, Dongxia Wu, Chien-Yi Yang, Tajana Rosing, Rose Yu, Yi-An Ma

The Real Price of Bandit Information in Multiclass Classification

We revisit the classical problem of multiclass classification with bandit feedback (Kakade, Shalev-Shwartz and Tewari, 2008), where each input classifies to one of $K$ possible labels and feedback is restricted to whether the predicted…

Machine Learning · Computer Science 2024-06-21 Liad Erez, Alon Cohen, Tomer Koren, Yishay Mansour, Shay Moran

Decentralized Online Big Data Classification - a Bandit Framework

Distributed, online data mining systems have emerged as a result of applications requiring analysis of large amounts of correlated and high-dimensional data produced by multiple distributed data sources. We propose a distributed online data…

Machine Learning · Computer Science 2013-08-27 Cem Tekin, Mihaela van der Schaar

Does Sparsity Help in Learning Misspecified Linear Bandits?

Recently, the study of linear misspecified bandits has generated intriguing implications of the hardness of learning in bandits and reinforcement learning (RL). In particular, Du et al. (2020) show that even if a learner is given linear…

Machine Learning · Computer Science 2023-03-31 Jialin Dong, Lin F. Yang

Efficient and Robust Algorithms for Adversarial Linear Contextual Bandits

We consider an adversarial variant of the classic $K$-armed linear contextual bandit problem where the sequence of loss functions associated with each arm are allowed to change without restriction over time. Under the assumption that the…

Machine Learning · Computer Science 2022-05-25 Gergely Neu, Julia Olkhovskaya

Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits

A stochastic combinatorial semi-bandit is an online learning problem where at each step a learning agent chooses a subset of ground items subject to constraints, and then observes stochastic weights of these items and receives their sum as…

Machine Learning · Computer Science 2017-06-08 Branislav Kveton, Zheng Wen, Azin Ashkan, Csaba Szepesvari

The price of bandit information in multiclass online classification

We consider two scenarios of multiclass online learning of a hypothesis class $H\subseteq Y^X$. In the full information scenario, the learner is exposed to instances together with their labels. In the bandit scenario, the true…

Machine Learning · Computer Science 2013-07-10 Amit Daniely, Tom Helbertal

Efficient Online Large-Margin Classification via Dual Certificates

Online classification is a central problem in optimization, statistical learning and data science. Classical algorithms such as the perceptron offer efficient updates and finite mistake guarantees on linearly separable data, but they do not…

Optimization and Control · Mathematics 2025-09-25 Nam Ho-Nguyen, Fatma Kılınç-Karzan, Ellie Nguyen, Lingqing Shen

Learning Multiclass Classifier Under Noisy Bandit Feedback

This paper addresses the problem of multiclass classification with corrupted or noisy bandit feedback. In this setting, the learner may not receive true feedback. Instead, it receives feedback that has been flipped with some non-zero…

Machine Learning · Computer Science 2021-06-08 Mudit Agarwal, Naresh Manwani

Online Multiclass Boosting with Bandit Feedback

We present online boosting algorithms for multiclass classification with bandit feedback, where the learner only receives feedback about the correctness of its prediction. We propose an unbiased estimate of the loss using a randomized…

Machine Learning · Statistics 2019-02-26 Daniel T. Zhang, Young Hun Jung, Ambuj Tewari