English

Deterministic Apple Tasting

Machine Learning 2025-06-05 v2 Machine Learning

Abstract

In binary (0/10/1) online classification with apple tasting feedback, the learner receives feedback only when predicting 11. Besides some degenerate learning tasks, all previously known learning algorithms for this model are randomized. Consequently, prior to this work it was unknown whether deterministic apple tasting is generally feasible. In this work, we provide the first widely-applicable deterministic apple tasting learner, and show that in the realizable case, a hypothesis class is learnable if and only if it is deterministically learnable, confirming a conjecture of [Raman, Subedi, Raman, Tewari-24]. Quantitatively, we show that every class H\mathcal{H} is learnable with mistake bound O(L(H)TlogT)O \left(\sqrt{\mathtt{L}(\mathcal{H}) T \log T} \right) (where L(H)\mathtt{L}(\mathcal{H}) is the Littlestone dimension of H\mathcal{H}), and that this is tight for some classes. We further study the agnostic case, in which the best hypothesis makes at most kk many mistakes, and prove a trichotomy stating that every class H\mathcal{H} must be either easy, hard, or unlearnable. Easy classes have (both randomized and deterministic) mistake bound ΘH(k)\Theta_{\mathcal{H}}(k). Hard classes have randomized mistake bound Θ~H(k+T)\tilde{\Theta}_{\mathcal{H}} \left(k + \sqrt{T} \right), and deterministic mistake bound Θ~H(kT)\tilde{\Theta}_{\mathcal{H}} \left(\sqrt{k \cdot T} \right), where TT is the time horizon. Unlearnable classes have (both randomized and deterministic) mistake bound Θ(T)\Theta(T). Our upper bound is based on a deterministic algorithm for learning from expert advice with apple tasting feedback, a problem interesting in its own right. For this problem, we show that the optimal deterministic mistake bound is Θ(T(k+logn))\Theta \left(\sqrt{T (k + \log n)} \right) for all kk and Tn2TT \leq n \leq 2^T, where nn is the number of experts.

Keywords

Cite

@article{arxiv.2410.10404,
  title  = {Deterministic Apple Tasting},
  author = {Zachary Chase and Idan Mehalel},
  journal= {arXiv preprint arXiv:2410.10404},
  year   = {2025}
}
R2 v1 2026-06-28T19:20:26.324Z