English

Precise Expression for the Algorithmic Information Distance

Information Theory 2020-09-02 v1 math.IT

Abstract

We consider the notion of information distance between two objects xx and yy introduced by Bennett, G\'acs, Li, Vit\'anyi, and Zurek in 1998 as the minimal length of a program that computes xx from yy as well as computing yy from xx. In this paper, it was proven that the distance is equal to max(K(xy),K(yx))\max (K(x|y),K(y|x)) up to additive logarithmic terms, and it was conjectured that this could not be improved to O(1)O(1) precision. We revisit subtle issues in the definition and prove this conjecture. We show that if the distance is at least logarithmic in the length, then this equality does hold with O(1)O(1) precision for strings of equal length. Thus for such strings, both the triangle inequality and the characterization hold with optimal precision. Finally, we extend the result to sets SS of bounded size. We show that for each constant~ss, the shortest program that prints an ss-element set S{0,1}nS \subseteq \{0,1\}^n given any of its elements, has length at most maxwSK(Sw)+O(1)\max_{w \in S} K(S|w) + O(1), provided this maximum is at least logarithmic in~nn.

Keywords

Cite

@article{arxiv.2009.00469,
  title  = {Precise Expression for the Algorithmic Information Distance},
  author = {Bruno Bauwens},
  journal= {arXiv preprint arXiv:2009.00469},
  year   = {2020}
}

Comments

arXiv admin note: substantial text overlap with arXiv:1807.11087