English

Long-Context Linear System Identification

Machine Learning 2025-07-03 v2 Machine Learning Systems and Control Systems and Control Statistics Theory Statistics Theory

Abstract

This paper addresses the problem of long-context linear system identification, where the state xtx_t of a dynamical system at time tt depends linearly on previous states xsx_s over a fixed context window of length pp. We establish a sample complexity bound that matches the i.i.d. parametric rate up to logarithmic factors for a broad class of systems, extending previous works that considered only first-order dependencies. Our findings reveal a learning-without-mixing phenomenon, indicating that learning long-context linear autoregressive models is not hindered by slow mixing properties potentially associated with extended context windows. Additionally, we extend these results to (i) shared low-rank representations, where rank-regularized estimators improve the dependence of the rates on the dimensionality, and (ii) misspecified context lengths in strictly stable systems, where shorter contexts offer statistical advantages.

Keywords

Cite

@article{arxiv.2410.05690,
  title  = {Long-Context Linear System Identification},
  author = {Oğuz Kaan Yüksel and Mathieu Even and Nicolas Flammarion},
  journal= {arXiv preprint arXiv:2410.05690},
  year   = {2025}
}

Comments

Published at ICLR 2025. This version includes minor corrections and improved grammar from the published version. 34 pages, 4 figures

R2 v1 2026-06-28T19:12:27.590Z