English
Related papers

Related papers: Bandit Multiclass Linear Classification: Efficient…

200 papers

We consider the online multiclass linear classification under the bandit feedback setting. Beygelzimer, P\'{a}l, Sz\"{o}r\'{e}nyi, Thiruvenkatachari, Wei, and Zhang [ICML'19] considered two notions of linear separability, weak and strong…

Machine Learning · Computer Science 2021-09-01 Jittat Fakcharoenphol , Chayutpong Prompak

This paper presents a novel online learning method that aims at finding a separator hyperplane between data points labelled as either positive or negative. Since weights and biases of artificial neurons can directly be related to…

Machine Learning · Computer Science 2023-09-13 Ákos Hajnal

We study the task of agnostic learning of multiclass linear classifiers under the Gaussian distribution. Given labeled examples $(x, y)$ from a distribution over $\mathbb{R}^d \times [k]$, with Gaussian $x$-marginal, the goal is to output a…

Machine Learning · Computer Science 2026-05-21 Ilias Diakonikolas , Giannis Iakovidis , Mingchen Ma

Binary linear classification has been explored since the very early days of the machine learning literature. Perhaps the most classical algorithm is the Perceptron, where a weight vector used to classify examples is maintained, and additive…

Machine Learning · Computer Science 2020-11-18 Rafael Hanashiro , Jacob Abernethy

We study multiclass PAC learning with bandit feedback, where inputs are classified into one of $K$ possible labels and feedback is limited to whether or not the predicted labels are correct. Our main contribution is in designing a novel…

Machine Learning · Computer Science 2024-06-19 Liad Erez , Alon Cohen , Tomer Koren , Yishay Mansour , Shay Moran

Several learning problems involve solving min-max problems, e.g., empirical distributional robust learning or learning with non-standard aggregated losses. More specifically, these problems are convex-linear problems where the minimization…

Machine Learning · Computer Science 2021-06-07 Christophe Roux , Elias Wirth , Sebastian Pokutta , Thomas Kerdreux

Many complex multi-target prediction problems that concern large target spaces are characterised by a need for efficient prediction strategies that avoid the computation of predictions for all targets explicitly. Examples of such problems…

Information Retrieval · Computer Science 2018-03-06 Michiel Stock , Krzysztof Dembczynski , Bernard De Baets , Willem Waegeman

This paper introduces a new online learning framework for multiclass classification called learning with diluted bandit feedback. At every time step, the algorithm predicts a candidate label set instead of a single label for the observed…

Machine Learning · Computer Science 2021-05-19 Gaurav Batra , Naresh Manwani

We study the problem of online multiclass classification in a setting where the learner's feedback is determined by an arbitrary directed graph. While including bandit feedback as a special case, feedback graphs allow a much richer set of…

Machine Learning · Computer Science 2024-02-20 Dirk van der Hoeven , Federico Fusco , Nicolò Cesa-Bianchi

Consider the domain of multiclass classification within the adversarial online setting. What is the price of relying on bandit feedback as opposed to full information? To what extent can an adaptive adversary amplify the loss compared to an…

Machine Learning · Computer Science 2024-02-13 Yuval Filmus , Steve Hanneke , Idan Mehalel , Shay Moran

Multi-objective combinatorial optimization seeks Pareto-optimal solutions over exponentially large discrete spaces, yet existing methods sacrifice generality, scalability, or theoretical guarantees. We reformulate it as an online learning…

Machine Learning · Computer Science 2026-02-13 Esha Singh , Dongxia Wu , Chien-Yi Yang , Tajana Rosing , Rose Yu , Yi-An Ma

We revisit the classical problem of multiclass classification with bandit feedback (Kakade, Shalev-Shwartz and Tewari, 2008), where each input classifies to one of $K$ possible labels and feedback is restricted to whether the predicted…

Machine Learning · Computer Science 2024-06-21 Liad Erez , Alon Cohen , Tomer Koren , Yishay Mansour , Shay Moran

Distributed, online data mining systems have emerged as a result of applications requiring analysis of large amounts of correlated and high-dimensional data produced by multiple distributed data sources. We propose a distributed online data…

Machine Learning · Computer Science 2013-08-27 Cem Tekin , Mihaela van der Schaar

Recently, the study of linear misspecified bandits has generated intriguing implications of the hardness of learning in bandits and reinforcement learning (RL). In particular, Du et al. (2020) show that even if a learner is given linear…

Machine Learning · Computer Science 2023-03-31 Jialin Dong , Lin F. Yang

We consider an adversarial variant of the classic $K$-armed linear contextual bandit problem where the sequence of loss functions associated with each arm are allowed to change without restriction over time. Under the assumption that the…

Machine Learning · Computer Science 2022-05-25 Gergely Neu , Julia Olkhovskaya

A stochastic combinatorial semi-bandit is an online learning problem where at each step a learning agent chooses a subset of ground items subject to constraints, and then observes stochastic weights of these items and receives their sum as…

Machine Learning · Computer Science 2017-06-08 Branislav Kveton , Zheng Wen , Azin Ashkan , Csaba Szepesvari

We consider two scenarios of multiclass online learning of a hypothesis class $H\subseteq Y^X$. In the {\em full information} scenario, the learner is exposed to instances together with their labels. In the {\em bandit} scenario, the true…

Machine Learning · Computer Science 2013-07-10 Amit Daniely , Tom Helbertal

Online classification is a central problem in optimization, statistical learning and data science. Classical algorithms such as the perceptron offer efficient updates and finite mistake guarantees on linearly separable data, but they do not…

Optimization and Control · Mathematics 2025-09-25 Nam Ho-Nguyen , Fatma Kılınç-Karzan , Ellie Nguyen , Lingqing Shen

This paper addresses the problem of multiclass classification with corrupted or noisy bandit feedback. In this setting, the learner may not receive true feedback. Instead, it receives feedback that has been flipped with some non-zero…

Machine Learning · Computer Science 2021-06-08 Mudit Agarwal , Naresh Manwani

We present online boosting algorithms for multiclass classification with bandit feedback, where the learner only receives feedback about the correctness of its prediction. We propose an unbiased estimate of the loss using a randomized…

Machine Learning · Statistics 2019-02-26 Daniel T. Zhang , Young Hun Jung , Ambuj Tewari
‹ Prev 1 2 3 10 Next ›