English

Parameter estimation for integer-valued Gibbs distributions

Probability 2023-08-21 v6 Computational Complexity Discrete Mathematics

Abstract

A central problem in computational statistics is to convert a procedure for sampling combinatorial from an objects into a procedure for counting those objects, and vice versa. Weconsider sampling problems coming from *Gibbs distributions*, which are probability distributions of the form μβΩ(ω)eβH(ω)\mu^\Omega_\beta(\omega) \propto e^{\beta H(\omega)} for β\beta in an interval [\beta_\min, \beta_\max] and H(ω){0}[1,n]H( \omega ) \in \{0 \} \cup [1, n]. The *partition function* is the normalization factor Z(β)=ωΩeβH(ω)Z(\beta)=\sum_{\omega \in\Omega}e^{\beta H(\omega)}. Two important parameters are the log partition ratio q = \log \tfrac{Z(\beta_\max)}{Z(\beta_\min)} and the vector of counts cx=H1(x)c_x = |H^{-1}(x)|. Our first result is an algorithm to estimate the counts cxc_x using roughly O~(qϵ2)\tilde O( \frac{q}{\epsilon^2}) samples for general Gibbs distributions and O~(n2ϵ2)\tilde O( \frac{n^2}{\epsilon^2} ) samples for integer-valued distributions (ignoring some second-order terms and parameters). We show this is optimal up to logarithmic factors. We illustrate with improved algorithms for counting connected subgraphs and perfect matchings in a graph. We develop a key subroutine for global estimation of the partition function. Specifically, we produce a data structure to estimate Z(β)Z(\beta) for \emph{all} values β\beta, without further samples. Constructing the data structure requires O(qlognϵ2)O(\frac{q \log n}{\epsilon^2}) samples for general Gibbs distributions and O(n2lognϵ2+nlogq)O(\frac{n^2 \log n}{\epsilon^2} + n \log q) samples for integer-valued distributions. This improves over a prior algorithm of Kolmogorov (2018) which computes the single point estimate Z(\beta_\max) using O~(qϵ2)\tilde O(\frac{q}{\epsilon^2}) samples. We also show that this complexity is optimal as a function of nn and qq up to logarithmic terms.

Keywords

Cite

@article{arxiv.1904.03139,
  title  = {Parameter estimation for integer-valued Gibbs distributions},
  author = {David G. Harris and Vladimir Kolmogorov},
  journal= {arXiv preprint arXiv:1904.03139},
  year   = {2023}
}

Comments

Superseded by arXiv:2007.10824 This version is obsolete