Related papers: An Improved Approximation Algorithm for the Column…

Optimal Column-Based Low-Rank Matrix Reconstruction

We prove that for any real-valued matrix $X \in \mathbb{R}^{m \times n}$, and positive integers $r \ge k$, there is a subset of $r$ columns of $X$ such that projecting $X$ onto their span gives a $\sqrt{\frac{r+1}{r-k+1}}$-approximation to best…

Data Structures and Algorithms · Computer Science 2015-03-19 Venkatesan Guruswami, Ali Kemal Sinop

Many (most?) column subset selection criteria are NP hard for a few columns

We consider a variety of criteria for selecting $k$ representative columns from a real $m \times n$ matrix $A$, when sufficiently few columns are required, i.e., $1 \le k \le \min\{\operatorname{rank}(A), m/3\}$. The criteria include the following optimization problems:…

Numerical Analysis · Mathematics 2026-04-13 Ilse C. F. Ipsen, Arvind K. Saibaba

New Subset Selection Algorithms for Low Rank Approximation: Offline and Online

Subset selection for the rank $k$ approximation of an $n\times d$ matrix $A$ offers improvements in the interpretability of matrices, as well as a variety of computational savings. This problem is well-understood when the error measure is…

Data Structures and Algorithms · Computer Science 2023-04-20 David P. Woodruff, Taisuke Yasuda

Optimal Analysis of Subset-Selection Based L_p Low Rank Approximation

We study the low rank approximation problem of any given matrix $A$ over $\mathbb{R}^{n\times m}$ and $\mathbb{C}^{n\times m}$ in entry-wise $\ell_p$ loss, that is, finding a rank-$k$ matrix $X$ such that $\|A-X\|_p$ is minimized. Unlike…

Machine Learning · Computer Science 2019-10-31 Chen Dan, Hong Wang, Hongyang Zhang, Yuchen Zhou, Pradeep Ravikumar

Low-rank approximation in the Frobenius norm by column and row subset selection

A CUR approximation of a matrix $A$ is a particular type of low-rank approximation $A \approx C U R$, where $C$ and $R$ consist of columns and rows of $A$, respectively. One way to obtain such an approximation is to apply column subset…

Numerical Analysis · Mathematics 2019-08-19 Alice Cortinovis, Daniel Kressner

Faster Subset Selection for Matrices and Applications

We study subset selection for matrices defined as follows: given a matrix $X \in \mathbb{R}^{n \times m}$ ($m > n$) and an oversampling parameter $k$ ($n \le k \le m$), select a subset of $k$ columns from $X$ such that the pseudo-inverse of…

Data Structures and Algorithms · Computer Science 2013-06-25 Haim Avron, Christos Boutsidis

Optimal $\ell_1$ Column Subset Selection and a Fast PTAS for Low Rank Approximation

We study the problem of entrywise $\ell_1$ low rank approximation. We give the first polynomial time column subset selection-based $\ell_1$ low rank approximation algorithm sampling $\tilde{O}(k)$ columns and achieving an…

Data Structures and Algorithms · Computer Science 2020-11-17 Arvind V. Mahankali, David P. Woodruff

Average Case Column Subset Selection for Entrywise $\ell_1$-Norm Loss

We study the column subset selection problem with respect to the entrywise $\ell_1$-norm loss. It is known that in the worst case, to obtain a good rank-$k$ approximation to a matrix, one needs an arbitrarily large $n^{\Omega(1)}$ number of…

Data Structures and Algorithms · Computer Science 2020-04-20 Zhao Song, David P. Woodruff, Peilin Zhong

Subset selection for matrices in spectral norm

We address the subset selection problem for matrices, where the goal is to select a subset of $k$ columns from a "short-and-fat" matrix $X \in \mathbb{R}^{m \times n}$, such that the pseudoinverse of the sampled submatrix has as small…

Numerical Analysis · Mathematics 2025-07-29 Ivan Kozyrev, Alexander Osinsky

Streaming and Distributed Algorithms for Robust Column Subset Selection

We give the first single-pass streaming algorithm for Column Subset Selection with respect to the entrywise $\ell_p$-norm with $1 \leq p < 2$. We study the $\ell_p$ norm loss since it is often considered more robust to noise than the…

Data Structures and Algorithms · Computer Science 2021-07-19 Shuli Jiang, Dongyu Li, Irene Mengze Li, Arvind V. Mahankali, David P. Woodruff

Provably Correct Algorithms for Matrix Column Subset Selection with Selectively Sampled Data

We consider the problem of matrix column subset selection, which selects a subset of columns from an input matrix such that the input can be well approximated by the span of the selected columns. Column subset selection has been applied to…

Machine Learning · Statistics 2018-01-26 Yining Wang, Aarti Singh

Fair Column Subset Selection

The problem of column subset selection asks for a subset of columns from an input matrix such that the matrix can be reconstructed as accurately as possible within the span of the selected columns. A natural extension is to consider a…

Machine Learning · Computer Science 2024-08-13 Antonis Matakos, Bruno Ordozgoiti, Suhas Thejaswi

Column Subset Selection and Nyström Approximation via Continuous Optimization

We propose a continuous optimization algorithm for the Column Subset Selection Problem (CSSP) and Nyström approximation. The CSSP and Nyström method construct low-rank approximations of matrices based on a predetermined subset of…

Methodology · Statistics 2023-04-20 Anant Mathur, Sarat Moka, Zdravko Botev

Interlacing Polynomial Method for the Column Subset Selection Problem

This paper investigates the spectral norm version of the column subset selection problem. Given a matrix $\mathbf{A}\in\mathbb{R}^{n\times d}$ and a positive integer $k\leq\text{rank}(\mathbf{A})$, the objective is to select exactly $k$…

Data Structures and Algorithms · Computer Science 2024-01-09 Jian-Feng Cai, Zhiqiang Xu, Zili Xu

Column Subset Selection, Matrix Factorization, and Eigenvalue Optimization

Given a fixed matrix, the problem of column subset selection requests a column submatrix that has favorable spectral properties. Most research from the algorithms and numerical linear algebra communities focuses on a variant called…

Numerical Analysis · Mathematics 2014-04-29 Joel A. Tropp

A determinantal point process for column subset selection

Dimensionality reduction is a first step of many machine learning pipelines. Two popular approaches are principal component analysis, which projects onto a small number of well chosen but non-interpretable directions, and feature selection,…

Machine Learning · Statistics 2018-12-27 Ayoub Belhadji, Rémi Bardenet, Pierre Chainais

Column Selection via Adaptive Sampling

Selecting a good column (or row) subset of massive data matrices has found many applications in data analysis and machine learning. We propose a new adaptive sampling algorithm that can be used to improve any relative-error column selection…

Data Structures and Algorithms · Computer Science 2015-10-15 Saurabh Paul, Malik Magdon-Ismail, Petros Drineas

Subset Selection for Matrices with Fixed Blocks

Subset selection for matrices is the task of extracting a column sub-matrix from a given matrix $B\in\mathbb{R}^{n\times m}$ with $m>n$ such that the pseudoinverse of the sampled matrix has as small Frobenius or spectral norm as possible.…

Data Structures and Algorithms · Computer Science 2020-03-04 Jiaxin Xie, Zhiqiang Xu

Adaptive randomized pivoting for column subset selection, DEIM, and low-rank approximation

We derive a new adaptive leverage score sampling strategy for solving the Column Subset Selection Problem (CSSP). The resulting algorithm, called Adaptive Randomized Pivoting, can be viewed as a randomization of Osinsky's recently proposed…

Numerical Analysis · Mathematics 2025-06-23 Alice Cortinovis, Daniel Kressner

Column subset selection is NP-complete

Let $M$ be a real $r\times c$ matrix and let $k$ be a positive integer. In the column subset selection problem (CSSP), we need to minimize the quantity $\|M-SA\|$, where $A$ can be an arbitrary $k\times c$ matrix, and $S$ runs over all…

Combinatorics · Mathematics 2017-01-12 Yaroslav Shitov