English
Related papers

Related papers: Provably Correct Algorithms for Matrix Column Subs…

200 papers

Selecting a good column (or row) subset of massive data matrices has found many applications in data analysis and machine learning. We propose a new adaptive sampling algorithm that can be used to improve any relative-error column selection…

Data Structures and Algorithms · Computer Science 2015-10-15 Saurabh Paul , Malik Magdon-Ismail , Petros Drineas

The problem of column subset selection asks for a subset of columns from an input matrix such that the matrix can be reconstructed as accurately as possible within the span of the selected columns. A natural extension is to consider a…

Machine Learning · Computer Science 2024-08-13 Antonis Matakos , Bruno Ordozgoiti , Suhas Thejaswi

The problem of column subset selection has recently attracted a large body of research, with feature selection serving as one obvious and important application. Among the techniques that have been applied to solve this problem, the greedy…

Data Structures and Algorithms · Computer Science 2021-11-16 Jason Altschuler , Aditya Bhaskara , Gang Fu , Vahab Mirrokni , Afshin Rostamizadeh , Morteza Zadimoghaddam

Low-rank matrix approximations are often used to help scale standard machine learning algorithms to large-scale problems. Recently, matrix coherence has been used to characterize the ability to extract global information from a subset of…

Machine Learning · Statistics 2010-09-07 Mehryar Mohri , Ameet Talwalkar

This paper considers the problem of matrix completion when some number of the columns are completely and arbitrarily corrupted, potentially by a malicious adversary. It is well-known that standard algorithms for matrix completion can return…

Machine Learning · Statistics 2016-04-26 Yudong Chen , Huan Xu , Constantine Caramanis , Sujay Sanghavi

Dimensionality reduction is a first step of many machine learning pipelines. Two popular approaches are principal component analysis, which projects onto a small number of well chosen but non-interpretable directions, and feature selection,…

Machine Learning · Statistics 2018-12-27 Ayoub Belhadji , Rémi Bardenet , Pierre Chainais

A novel matrix approximation problem is considered herein: observations based on a few fully sampled columns and quasi-polynomial structural side information are exploited. The framework is motivated by quantum chemistry problems wherein…

Signal Processing · Electrical Eng. & Systems 2023-05-23 Jeongmin Chae , Praneeth Narayanamurthy , Selin Bac , Shaama Mallikarjun Sharada , Urbashi Mitra

Given a fixed matrix, the problem of column subset selection requests a column submatrix that has favorable spectral properties. Most research from the algorithms and numerical linear algebra communities focuses on a variant called…

Numerical Analysis · Mathematics 2014-04-29 Joel A. Tropp

We propose a randomized method for solving linear programs with a large number of columns but a relatively small number of constraints. Since enumerating all the columns is usually unrealistic, such linear programs are commonly solved by…

Optimization and Control · Mathematics 2023-11-29 Yi-Chun Akchen , Velibor V. Mišić

We address the subset selection problem for matrices, where the goal is to select a subset of $k$ columns from a "short-and-fat" matrix $X \in \mathbb{R}^{m \times n}$, such that the pseudoinverse of the sampled submatrix has as small…

Numerical Analysis · Mathematics 2025-07-29 Ivan Kozyrev , Alexander Osinsky

In today's information systems, the availability of massive amounts of data necessitates the development of fast and accurate algorithms to summarize these data and represent them in a succinct format. One crucial problem in big data…

Data Structures and Algorithms · Computer Science 2013-12-27 Ahmed K. Farahat , Ahmed Elgohary , Ali Ghodsi , Mohamed S. Kamel

We study the problem of exact completion for $m \times n$ sized matrix of rank $r$ with the adaptive sampling method. We introduce a relation of the exact completion problem with the sparsest vector of column and row spaces (which we call…

Machine Learning · Computer Science 2022-03-08 Ilqar Ramazanli , Barnabas Poczos

In this paper, we consider matrix completion from non-uniformly sampled entries including fully observed and partially observed columns. Specifically, we assume that a small number of columns are randomly selected and fully observed, and…

Machine Learning · Computer Science 2018-06-28 Yuanyu Wan , Jinfeng Yi , Lijun Zhang

Symmetric positive semidefinite (SPSD) matrix approximation is an important problem with applications in kernel methods. However, existing SPSD matrix approximation methods such as the Nystr\"om method only have weak error bounds. In this…

Machine Learning · Computer Science 2016-05-23 Shusen Wang , Luo Luo , Zhihua Zhang

The problem of extracting a well conditioned submatrix from any rectangular matrix (with normalized columns) has been studied for some time in functional and harmonic analysis; see…

Functional Analysis · Mathematics 2016-12-07 Stephane Chretien , Sebastien Darses

Column selection is an essential tool for structure-preserving low-rank approximation, with wide-ranging applications across many fields, such as data science, machine learning, and theoretical chemistry. In this work, we develop unified…

Numerical Analysis · Mathematics 2024-08-09 Mark Fornace , Michael Lindsey

Subset selection for matrices is the task of extracting a column sub-matrix from a given matrix $B\in\mathbb{R}^{n\times m}$ with $m>n$ such that the pseudoinverse of the sampled matrix has as small Frobenius or spectral norm as possible.…

Data Structures and Algorithms · Computer Science 2020-03-04 Jiaxin Xie , Zhiqiang Xu

We study the column subset selection problem with respect to the entrywise $\ell_1$-norm loss. It is known that in the worst case, to obtain a good rank-$k$ approximation to a matrix, one needs an arbitrarily large $n^{\Omega(1)}$ number of…

Data Structures and Algorithms · Computer Science 2020-04-20 Zhao Song , David P. Woodruff , Peilin Zhong

We propose a continuous optimization algorithm for the Column Subset Selection Problem (CSSP) and Nystr\"om approximation. The CSSP and Nystr\"om method construct low-rank approximations of matrices based on a predetermined subset of…

Methodology · Statistics 2023-04-20 Anant Mathur , Sarat Moka , Zdravko Botev

We study subset selection for matrices defined as follows: given a matrix $\matX \in \R^{n \times m}$ ($m > n$) and an oversampling parameter $k$ ($n \le k \le m$), select a subset of $k$ columns from $\matX$ such that the pseudo-inverse of…

Data Structures and Algorithms · Computer Science 2013-06-25 Haim Avron , Christos Boutsidis
‹ Prev 1 2 3 10 Next ›