English
Related papers

Related papers: Partial Feedback Online Learning

200 papers

We study a variant of online multiclass classification where the learner predicts a single label but receives a \textit{set of labels} as feedback. In this model, the learner is penalized for not outputting a label contained in the revealed…

Machine Learning · Computer Science 2024-06-21 Vinod Raman , Unique Subedi , Ambuj Tewari

In online binary classification under \emph{apple tasting} feedback, the learner only observes the true label if it predicts ``1". First studied by \cite{helmbold2000apple}, we revisit this classical partial-feedback setting and study…

Machine Learning · Computer Science 2024-06-21 Vinod Raman , Unique Subedi , Ananth Raman , Ambuj Tewari

We consider the problem of online classification under a privacy constraint. In this setting a learner observes sequentially a stream of labelled examples $(x_t, y_t)$, for $1 \leq t \leq T$, and returns at each iteration $t$ a hypothesis…

Machine Learning · Computer Science 2021-06-28 Noah Golowich , Roi Livni

We study the problem of online binary classification in settings where strategic agents can modify their observable features to receive a positive classification. We model the set of feasible manipulations by a directed graph over the…

Machine Learning · Computer Science 2024-07-17 Saba Ahmadi , Kunhe Yang , Hanrui Zhang

We study the problem of learning robust classifiers where the classifier will receive a perturbed input. Unlike robust PAC learning studied in prior work, here the clean data and its label are also adversarially chosen. We formulate this…

Machine Learning · Computer Science 2026-03-02 Sajad Ashkezari

We study fast rates of convergence in the setting of nonparametric online regression, namely where regret is defined with respect to an arbitrary function class which has bounded complexity. Our contributions are two-fold: - In the…

Machine Learning · Computer Science 2022-04-13 Constantinos Daskalakis , Noah Golowich

We consider combinatorial online learning with subset choices when only relative feedback information from subsets is available, instead of bandit or semi-bandit feedback which is absolute. Specifically, we study two regret minimisation…

Machine Learning · Computer Science 2020-02-28 Aadirupa Saha , Aditya Gopalan

This paper studies classification with an abstention option in the online setting. In this setting, examples arrive sequentially, the learner is given a hypothesis class $\mathcal H$, and the goal of the learner is to either predict a label…

Machine Learning · Computer Science 2016-09-29 Chicheng Zhang , Kamalika Chaudhuri

We study the online learnability of hypothesis classes with respect to arbitrary, but bounded loss functions. No characterization of online learnability is known at this level of generality. We give a new scale-sensitive combinatorial…

Machine Learning · Computer Science 2024-02-12 Vinod Raman , Unique Subedi , Ambuj Tewari

Motivated by the predictable nature of real-life in data streams, we study online regression when the learner has access to predictions about future examples. In the extreme case, called transductive online learning, the sequence of…

Machine Learning · Computer Science 2025-10-07 Vinod Raman , Shenghao Xie , Samson Zhou

We study online multiclass classification under bandit feedback. We extend the results of Daniely and Helbertal [2013] by showing that the finiteness of the Bandit Littlestone dimension is necessary and sufficient for bandit online…

Machine Learning · Computer Science 2024-01-23 Ananth Raman , Vinod Raman , Unique Subedi , Idan Mehalel , Ambuj Tewari

We consider the problem of learning an unknown subset $N_\text{target}$ of a domain in an online setting. In each round $t$, the learner predicts a set of items ${N}_t$ and receives one of two types of feedback, each with equal probability:…

Machine Learning · Computer Science 2026-05-12 Lee Cohen , Yishay Mansour , Shay Moran , Han Shao

We consider the problem of adversarial (non-stochastic) online learning with partial information feedback, where at each round, a decision maker selects an action from a finite set of alternatives. We develop a black-box approach for such…

Machine Learning · Computer Science 2021-07-28 Thodoris Lykouris , Karthik Sridharan , Eva Tardos

We investigate contextual online learning with nonparametric (Lipschitz) comparison classes under different assumptions on losses and feedback information. For full information feedback and Lipschitz losses, we design the first explicit…

Machine Learning · Statistics 2017-07-03 Nicolò Cesa-Bianchi , Pierre Gaillard , Claudio Gentile , Sébastien Gerchinovitz

We consider the problem of online combinatorial optimization under semi-bandit feedback, where a learner has to repeatedly pick actions from a combinatorial decision set in order to minimize the total losses associated with its decisions.…

Machine Learning · Computer Science 2015-06-11 Gergely Neu

We investigate the hardness of online reinforcement learning in fixed horizon, sparse linear Markov decision process (MDP), with a special focus on the high-dimensional regime where the ambient dimension is larger than the number of…

Machine Learning · Computer Science 2021-02-11 Botao Hao , Tor Lattimore , Csaba Szepesvári , Mengdi Wang

We consider the problem of multiclass transductive online learning when the number of labels can be unbounded. Previous works by Ben-David et al. [1997] and Hanneke et al. [2023b] only consider the case of binary and finite label spaces,…

Machine Learning · Computer Science 2024-11-05 Steve Hanneke , Vinod Raman , Amirreza Shaeiri , Unique Subedi

We propose a new partial-observability model for online learning problems where the learner, besides its own loss, also observes some noisy feedback about the other actions, depending on the underlying structure of the problem. We represent…

Machine Learning · Computer Science 2026-04-16 Tomáš Kocák , Gergely Neu , Michal Valko

We revisit the problem of private online learning, in which a learner receives a sequence of $T$ data points and has to respond at each time-step a hypothesis. It is required that the entire stream of output hypotheses should satisfy…

Machine Learning · Computer Science 2025-11-11 Bo Li , Wei Wang , Peng Ye

In many quantum tasks, there is an unknown quantum object that one wishes to learn. An online strategy for this task involves adaptively refining a hypothesis to reproduce such an object or its measurement statistics. A common evaluation…

Quantum Physics · Physics 2025-11-25 Akshay Bansal , Ian George , Soumik Ghosh , Jamie Sikora , Alice Zheng
‹ Prev 1 2 3 10 Next ›