English

An Approximation Algorithm for Graph Label Selection

Data Structures and Algorithms 2026-05-21 v2 Machine Learning

Abstract

In the graph label selection problem, one is given an nn-vertex graph and a budget kk, and seeks to select kk vertices whose labels enable accurate prediction of the labels on the remaining vertices. This problem formalizes distilling a small representative set from the whole graph. We present the first O~(log1.5n)\tilde{O}(\log^{1.5} n)-approximation algorithm for graph label selection under the standard budget constraint. Prior work either relies on resource augmentation, allowing substantially more than kk labeled vertices, or consists primarily of heuristics without provable guarantees. Finally, we demonstrate that practical heuristic variants of our algorithm scale to significantly larger graphs than previous methods, while essentially retaining their quality.

Keywords

Cite

@article{arxiv.2605.18623,
  title  = {An Approximation Algorithm for Graph Label Selection},
  author = {Josia John and Simon Meierhans and Maximilian Probst Gutenberg},
  journal= {arXiv preprint arXiv:2605.18623},
  year   = {2026}
}

Comments

Accepted at ICML 2026. 9 pages, 7 figures