How to select the largest k elements from evolving data?
Abstract
In this paper we investigate the top-$k$-selection problem, i.e. determining the largest, second largest, ..., and the $k$-th largest elements, in the dynamic data model. In this model the order of elements evolves dynamically over time. In each time step the algorithm can only probe the changes of data by comparing a pair of elements. Previously only two special cases were studied [2]: finding the largest element and the median; and sorting all elements. This paper systematically deals with $k \in [n]$ and solves the problem almost completely. Specifically, we identify a critical point $k^*$ such that the top-$k$-selection problem can be solved error-free with probability $1-o(1)$ if and only if $k = o(k^*)$. A lower bound on the error when $k = \Omega(k^*)$ is also determined, and it is tight under some condition. On the other hand, it is shown that the top-$k$-set problem, which means finding the largest $k$ elements without sorting them, can be solved error-free for all $k \in [n]$. Additionally, we extend the dynamic data model and show that most of these results still hold.
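To make the setting concrete, the following is a minimal simulation sketch of the probing model, not the algorithm of the paper. The abstract does not specify how the order evolves, so the sketch assumes that a fixed number of uniformly random adjacent swaps happen after every comparison; the names `EvolvingOrder`, `swaps_per_step`, `compare`, `true_top_k` and `estimate_top_k` are hypothetical and introduced only for illustration.

```python
import random


class EvolvingOrder:
    """Hidden ranking of n elements that can only be probed by pairwise comparisons."""

    def __init__(self, n, swaps_per_step=1, seed=0):
        self._rng = random.Random(seed)
        self._order = list(range(n))  # _order[0] is the current largest element
        self._rng.shuffle(self._order)
        self._swaps_per_step = swaps_per_step

    def _evolve(self):
        # Assumption: the order changes by random swaps of adjacent ranks.
        for _ in range(self._swaps_per_step):
            i = self._rng.randrange(len(self._order) - 1)
            self._order[i], self._order[i + 1] = self._order[i + 1], self._order[i]

    def compare(self, a, b):
        """Return True if a currently ranks above b, then advance one time step."""
        result = self._order.index(a) < self._order.index(b)
        self._evolve()
        return result

    def true_top_k(self, k):
        """Ground truth, used only to measure the error of an estimate."""
        return list(self._order[:k])


def estimate_top_k(model, n, k):
    """Naive baseline: k rounds of maximum-finding, one comparison per time step."""
    remaining = list(range(n))
    top = []
    for _ in range(k):
        best = remaining[0]
        for x in remaining[1:]:
            if model.compare(x, best):
                best = x
        top.append(best)
        remaining.remove(best)
    return top


if __name__ == "__main__":
    model = EvolvingOrder(n=50, swaps_per_step=1, seed=1)
    guess = estimate_top_k(model, n=50, k=5)
    truth = model.true_top_k(5)
    # Ordered mismatches correspond to top-k-selection, set mismatches to top-k-set.
    print("ordered mismatches:", sum(g != t for g, t in zip(guess, truth)))
    print("set mismatches:", len(set(truth) - set(guess)))
```

With `swaps_per_step=0` the order is static and the baseline is exact; with evolving data the estimate can be stale by the time it is reported, which is the kind of error the results above quantify.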
Cite
@article{arxiv.1412.8164,
title = {How to select the largest k elements from evolving data?},
author = {Qin Huang and Xingwu Liu and Xiaoming Sun and Jialin Zhang},
journal = {arXiv preprint arXiv:1412.8164},
year = {2014}
}
Comments
23 pages, 2 figures