English

Sample completion, structured correlation, and Netflix problems

Machine Learning 2025-09-26 v1 Machine Learning Logic Statistics Theory Statistics Theory

Abstract

We develop a new high-dimensional statistical learning model which can take advantage of structured correlation in data even in the presence of randomness. We completely characterize learnability in this model in terms of VCNk,k{}_{k,k}-dimension (essentially kk-dependence from Shelah's classification theory). This model suggests a theoretical explanation for the success of certain algorithms in the 2006~Netflix Prize competition.

Keywords

Cite

@article{arxiv.2509.20404,
  title  = {Sample completion, structured correlation, and Netflix problems},
  author = {Leonardo N. Coregliano and Maryanthe Malliaris},
  journal= {arXiv preprint arXiv:2509.20404},
  year   = {2025}
}

Comments

97 pages, 1 figure

R2 v1 2026-07-01T05:54:39.632Z