Long-Context Linear System Identification

Oğuz Kaan Yüksel; Mathieu Even; Nicolas Flammarion

Long-Context Linear System Identification

Machine Learning 2025-07-03 v2 Machine Learning Systems and Control Systems and Control Statistics Theory Statistics Theory

Authors: Oğuz Kaan Yüksel , Mathieu Even , Nicolas Flammarion

View on arXiv ↗ PDF ↗

Abstract

This paper addresses the problem of long-context linear system identification, where the state $x_t$ of a dynamical system at time $t$ depends linearly on previous states $x_s$ over a fixed context window of length $p$ . We establish a sample complexity bound that matches the i.i.d. parametric rate up to logarithmic factors for a broad class of systems, extending previous works that considered only first-order dependencies. Our findings reveal a learning-without-mixing phenomenon, indicating that learning long-context linear autoregressive models is not hindered by slow mixing properties potentially associated with extended context windows. Additionally, we extend these results to (i) shared low-rank representations, where rank-regularized estimators improve the dependence of the rates on the dimensionality, and (ii) misspecified context lengths in strictly stable systems, where shorter contexts offer statistical advantages.

Keywords

latent variable models deep learning multi-task learning

Cite

@article{arxiv.2410.05690,
  title  = {Long-Context Linear System Identification},
  author = {Oğuz Kaan Yüksel and Mathieu Even and Nicolas Flammarion},
  journal= {arXiv preprint arXiv:2410.05690},
  year   = {2025}
}

Comments

Published at ICLR 2025. This version includes minor corrections and improved grammar from the published version. 34 pages, 4 figures

Long-Context Linear System Identification

Abstract

Keywords

Cite

Comments

Related papers