English

Learning Decision-Sufficient Representations for Linear Optimization

Optimization and Control 2026-05-25 v2 Computational Complexity Machine Learning

Abstract

We study how to construct compressed datasets that suffice to recover optimal decisions in linear programs with an unknown cost vector cc lying in a prior set C\mathcal{C}. Recent work by Bennouna et al. provides an exact geometric characterization of sufficient decision datasets (SDDs) via an intrinsic decision-relevant dimension dd^\star. However, their algorithm for constructing minimum-size SDDs requires solving mixed-integer programs. In this paper, we establish hardness results showing that computing dd^\star is NP-hard and deciding whether a dataset is globally sufficient is coNP-hard, thereby resolving a recent open problem posed by Bennouna et al. To address this worst-case intractability, we introduce pointwise sufficiency, a relaxation that requires sufficiency for an individual cost vector. Under nondegeneracy, we provide a polynomial-time cutting-plane algorithm for constructing pointwise-sufficient decision datasets. In a data-driven regime with i.i.d.\ costs, we further propose a cumulative algorithm that aggregates decision-relevant directions across samples, yielding a stable compression scheme of size at most dd^\star. This leads to a distribution-free PAC guarantee: with high probability over the training sample, the pointwise sufficiency failure probability on a fresh draw is at most O~(d/n)\tilde{O}(d^\star/n), and this rate is tight up to logarithmic factors. Finally, we apply decision-sufficient representations to contextual linear optimization, obtaining compressed predictors with generalization bounds scaling as O~(d/n)\tilde{O}(\sqrt{d^\star/n}) rather than O~(d/n)\tilde{O}(\sqrt{d/n}), where dd is the ambient cost dimension.

Keywords

Cite

@article{arxiv.2603.18551,
  title  = {Learning Decision-Sufficient Representations for Linear Optimization},
  author = {Yuhan Ye and Saurabh Amin and Asuman Ozdaglar},
  journal= {arXiv preprint arXiv:2603.18551},
  year   = {2026}
}

Comments

45 pages plus appendix, 2 figures. Accepted at COLT 2026

R2 v1 2026-07-01T11:27:33.750Z