English

Efficient Data-Driven Leverage Score Sampling Algorithm for the Minimum Volume Covering Ellipsoid Problem in Big Data

Optimization and Control 2024-11-07 v1 Computation

Abstract

The Minimum Volume Covering Ellipsoid (MVCE) problem, characterised by nn observations in dd dimensions where ndn \gg d, can be computationally very expensive in the big data regime. We apply methods from randomised numerical linear algebra to develop a data-driven leverage score sampling algorithm for solving MVCE, and establish theoretical error bounds and a convergence guarantee. Assuming the leverage scores follow a power law decay, we show that the computational complexity of computing the approximation for MVCE is reduced from O(nd2)\mathcal{O}(nd^2) to O(nd+poly(d))\mathcal{O}(nd + \text{poly}(d)), which is a significant improvement in big data problems. Numerical experiments demonstrate the efficacy of our new algorithm, showing that it substantially reduces computation time and yields near-optimal solutions.

Keywords

Cite

@article{arxiv.2411.03617,
  title  = {Efficient Data-Driven Leverage Score Sampling Algorithm for the Minimum Volume Covering Ellipsoid Problem in Big Data},
  author = {Elizabeth Harris and Ali Eshragh and Bishnu Lamichhane and Jordan Shaw-Carmody and Elizabeth Stojanovski},
  journal= {arXiv preprint arXiv:2411.03617},
  year   = {2024}
}

Comments

20 pages, 3 figures

R2 v1 2026-06-28T19:49:42.578Z