English

Efficient Ensemble Selection from Binary and Pairwise Feedback

Computer Science and Game Theory 2026-05-12 v1 Artificial Intelligence Machine Learning

Abstract

Organizations increasingly deploy multiple AI systems across task domains, but selecting a small, high-performing ensemble can require costly model calls, benchmark runs, and human evaluation. We study this selection problem as a distributional variant of multiwinner voting: tasks are drawn from an unknown domain distribution, each task induces feedback over candidate experts, and a committee's value on a task is determined by its best-performing member. We analyze both binary feedback, for tasks with correct/incorrect outcomes, and pairwise feedback, for tasks where candidate outputs are compared by preference. In the binary setting, the induced objective is coverage. We give exhaustive-elicitation baselines and matching worst-case query lower bounds, and we design a failure-conditioned greedy algorithm that preserves the standard (11/e)(1-1/e) guarantee while obtaining instance-dependent query savings. In the pairwise setting, we study θ\theta-winning committees. We show that full-information optimization admits a PTAS but no EPTAS under Gap-ETH, and that the objective is monotone but not submodular. This motivates a weighted ordinal coverage relaxation, which is submodular and supports a failure-conditioned greedy oracle under pairwise feedback. We then convert this oracle back into θ\theta-type guarantees through finite-family auditing or a minimax wrapper. We also provide small-scale LLM experiments illustrating the predicted query savings and the role of complementarity in committee selection.

Keywords

Cite

@article{arxiv.2605.09588,
  title  = {Efficient Ensemble Selection from Binary and Pairwise Feedback},
  author = {Tzeh Yuan Neoh and Nicholas Teh and Je Qin Chooi and Paul W. Goldberg and Milind Tambe},
  journal= {arXiv preprint arXiv:2605.09588},
  year   = {2026}
}