English

Two-Sided Weak Submodularity for Matroid Constrained Optimization and Regression

Data Structures and Algorithms 2023-01-19 v2

Abstract

We study the following problem: Given a variable of interest, we would like to find a best linear predictor for it by choosing a subset of kk relevant variables obeying a matroid constraint. This problem is a natural generalization of subset selection problems where it is necessary to spread observations amongst multiple different classes. We derive new, strengthened guarantees for this problem by improving the analysis of the residual random greedy algorithm and by developing a novel distorted local-search algorithm. To quantify our approximation guarantees, we refine the definition of weak submodularity by Das and Kempe and introduce the notion of an upper submodularity ratio, which we connect to the minimum kk-sparse eigenvalue of the covariance matrix. More generally, we look at the problem of maximizing a set function ff with lower and upper submodularity ratio γ\gamma and β\beta under a matroid constraint. For this problem, our algorithms have asymptotic approximation guarantee 1/21/2 and 1e11-e^{-1} as the function is closer to being submodular. As a second application, we show that the Bayesian A-optimal design objective falls into our framework, leading to new guarantees for this problem as well.

Keywords

Cite

@article{arxiv.2102.09644,
  title  = {Two-Sided Weak Submodularity for Matroid Constrained Optimization and Regression},
  author = {Theophile Thiery and Justin Ward},
  journal= {arXiv preprint arXiv:2102.09644},
  year   = {2023}
}

Comments

Appeared in COLT'22. 29 pages, 1 figure. Comments welcome. The earlier version of this paper contains a local-search algorithm's analysis which we substitute with the analysis of the residual random greedy algorithm