English

Testing (Conditional) Mutual Information

Data Structures and Algorithms 2025-06-05 v1 Information Theory math.IT

Abstract

We investigate the sample complexity of mutual information and conditional mutual information testing. For conditional mutual information testing, given access to independent samples of a triple of random variables (A,B,C)(A, B, C) with unknown distribution, we want to distinguish between two cases: (i) AA and CC are conditionally independent, i.e., I(A ⁣: ⁣CB)=0I(A\!:\!C|B) = 0, and (ii) AA and CC are conditionally dependent, i.e., I(A ⁣: ⁣CB)εI(A\!:\!C|B) \geq \varepsilon for some threshold ε\varepsilon. We establish an upper bound on the number of samples required to distinguish between the two cases with high confidence, as a function of ε\varepsilon and the three alphabet sizes. We conjecture that our bound is tight and show that this is indeed the case in several parameter regimes. For the special case of mutual information testing (when BB is trivial), we establish the necessary and sufficient number of samples required up to polylogarithmic terms. Our technical contributions include a novel method to efficiently simulate weakly correlated samples from the conditionally independent distribution PABPCBPBP_{A|B} P_{C|B} P_B given access to samples from an unknown distribution PABCP_{ABC}, and a new estimator for equivalence testing that can handle such correlated samples, which might be of independent interest.

Keywords

Cite

@article{arxiv.2506.03894,
  title  = {Testing (Conditional) Mutual Information},
  author = {Jan Seyfried and Sayantan Sen and Marco Tomamichel},
  journal= {arXiv preprint arXiv:2506.03894},
  year   = {2025}
}

Comments

79 pages, accepted for presentation at the Conference on Learning Theory (COLT) 2025