English

Noisy Sorting Without Resampling

Data Structures and Algorithms 2007-07-10 v1

Abstract

In this paper we study noisy sorting without re-sampling. In this problem there is an unknown order aπ(1)<...<aπ(n)a_{\pi(1)} < ... < a_{\pi(n)} where π\pi is a permutation on nn elements. The input is the status of (n2)n \choose 2 queries of the form q(ai,xj)q(a_i,x_j), where q(ai,aj)=+q(a_i,a_j) = + with probability at least 1/2+\ga1/2+\ga if π(i)>π(j)\pi(i) > \pi(j) for all pairs iji \neq j, where \ga>0\ga > 0 is a constant and q(ai,aj)=q(aj,ai)q(a_i,a_j) = -q(a_j,a_i) for all ii and jj. It is assumed that the errors are independent. Given the status of the queries the goal is to find the maximum likelihood order. In other words, the goal is find a permutation σ\sigma that minimizes the number of pairs σ(i)>σ(j)\sigma(i) > \sigma(j) where q(σ(i),σ(j))=q(\sigma(i),\sigma(j)) = -. The problem so defined is the feedback arc set problem on distributions of inputs, each of which is a tournament obtained as a noisy perturbations of a linear order. Note that when \ga<1/2\ga < 1/2 and nn is large, it is impossible to recover the original order π\pi. It is known that the weighted feedback are set problem on tournaments is NP-hard in general. Here we present an algorithm of running time nO(γ4)n^{O(\gamma^{-4})} and sampling complexity Oγ(nlogn)O_{\gamma}(n \log n) that with high probability solves the noisy sorting without re-sampling problem. We also show that if aσ(1),aσ(2),...,aσ(n)a_{\sigma(1)},a_{\sigma(2)},...,a_{\sigma(n)} is an optimal solution of the problem then it is ``close'' to the original order. More formally, with high probability it holds that iσ(i)π(i)=Θ(n)\sum_i |\sigma(i) - \pi(i)| = \Theta(n) and maxiσ(i)π(i)=Θ(logn)\max_i |\sigma(i) - \pi(i)| = \Theta(\log n). Our results are of interest in applications to ranking, such as ranking in sports, or ranking of search items based on comparisons by experts.

Keywords

Cite

@article{arxiv.0707.1051,
  title  = {Noisy Sorting Without Resampling},
  author = {Mark Braverman and Elchanan Mossel},
  journal= {arXiv preprint arXiv:0707.1051},
  year   = {2007}
}
R2 v1 2026-06-21T08:56:01.824Z