Sample completion, structured correlation, and Netflix problems
Machine Learning
2025-09-26 v1 Machine Learning
Logic
Statistics Theory
Abstract
We develop a new high-dimensional statistical learning model which can take advantage of structured correlation in data even in the presence of randomness. We completely characterize learnability in this model in terms of VCN-dimension (essentially $k$-dependence from Shelah's classification theory). This model suggests a theoretical explanation for the success of certain algorithms in the 2006 Netflix Prize competition.
Cite
@article{arxiv.2509.20404,
title = {Sample completion, structured correlation, and Netflix problems},
author = {Leonardo N. Coregliano and Maryanthe Malliaris},
journal = {arXiv preprint arXiv:2509.20404},
year = {2025}
}
Comments
97 pages, 1 figure