English

Threshold rules for online sample selection

Data Structures and Algorithms 2010-07-20 v2

Abstract

We consider the following sample selection problem. We observe in an online fashion a sequence of samples, each endowed by a quality. Our goal is to either select or reject each sample, so as to maximize the aggregate quality of the subsample selected so far. There is a natural trade-off here between the rate of selection and the aggregate quality of the subsample. We show that for a number of such problems extremely simple and oblivious "threshold rules" for selection achieve optimal tradeoffs between rate of selection and aggregate quality in a probabilistic sense. In some cases we show that the same threshold rule is optimal for a large class of quality distributions and is thus oblivious in a strong sense.

Keywords

Cite

@article{arxiv.1002.5034,
  title  = {Threshold rules for online sample selection},
  author = {Eric Bach and Shuchi Chawla and Seeun Umboh},
  journal= {arXiv preprint arXiv:1002.5034},
  year   = {2010}
}
R2 v1 2026-06-21T14:51:42.356Z