English

Estimating Graph Dimension with Cross-validated Eigenvalues

Methodology 2025-12-24 v2 Machine Learning Social and Information Networks Statistics Theory Statistics Theory

Abstract

In applied multivariate statistics, estimating the number of latent dimensions or the number of clusters, kk, is a fundamental and recurring problem. We study a sequence of statistics called "cross-validated eigenvalues." Under a large class of random graph models, including both Poisson and Bernoulli edges, without parametric assumptions, we provide a pp-value for each cross-validated eigenvalue. It tests the null hypothesis that the sample eigenvector is orthogonal to (i.e., uncorrelated with) the true latent dimensions. This approach naturally adapts to problems where some dimensions are not statistically detectable. In scenarios where all kk dimensions can be estimated, we show that our procedure consistently estimates kk. In simulations and data example, the proposed estimator compares favorably to alternative approaches in both computational and statistical performance.

Keywords

Cite

@article{arxiv.2108.03336,
  title  = {Estimating Graph Dimension with Cross-validated Eigenvalues},
  author = {Fan Chen and Sebastien Roch and Karl Rohe and Shuqi Yu},
  journal= {arXiv preprint arXiv:2108.03336},
  year   = {2025}
}

Comments

63 pages, 12 figures

R2 v1 2026-06-24T04:54:17.898Z