English

How to select the largest k elements from evolving data?

Data Structures and Algorithms 2014-12-30 v1

Abstract

In this paper we investigate the top-kk-selection problem, i.e. determine the largest, second largest, ..., and the kk-th largest elements, in the dynamic data model. In this model the order of elements evolves dynamically over time. In each time step the algorithm can only probe the changes of data by comparing a pair of elements. Previously only two special cases were studied[2]: finding the largest element and the median; and sorting all elements. This paper systematically deals with k[n]k\in [n] and solves the problem almost completely. Specifically, we identify a critical point kk^* such that the top-kk-selection problem can be solved error-free with probability 1o(1)1-o(1) if and only if k=o(k)k=o(k^*). A lower bound of the error when k=Ω(k)k=\Omega(k^*) is also determined, which actually is tight under some condition. On the other hand, it is shown that the top-kk-set problem, which means finding the largest kk elements without sorting them, can be solved error-free for all k[n]k\in [n]. Additionally, we extend the dynamic data model and show that most of these results still hold.

Keywords

Cite

@article{arxiv.1412.8164,
  title  = {How to select the largest k elements from evolving data?},
  author = {Qin Huang and Xingwu Liu and Xiaoming Sun and Jialin Zhang},
  journal= {arXiv preprint arXiv:1412.8164},
  year   = {2014}
}

Comments

23 pages, 2 figures

R2 v1 2026-06-22T07:45:09.119Z