Related papers: Iterative Row Sampling

200 papers

Random sampling has become a critical tool in solving massive matrix problems. For linear regression, a small, manageable set of data rows can be randomly selected to approximate a tall, skinny data matrix, improving processing time…

Data Structures and Algorithms · Computer Science 2014-08-22 Michael B. Cohen, Yin Tat Lee, Cameron Musco, Christopher Musco, Richard Peng, Aaron Sidford

We present a new algorithm for finding a near optimal low-rank approximation of a matrix $A$ in $O(\operatorname{nnz}(A))$ time. Our method is based on a recursive sampling scheme for computing a representative subset of $A$'s columns, which is then used…

Data Structures and Algorithms · Computer Science 2016-10-10 Michael B. Cohen, Cameron Musco, Christopher Musco

The statistical leverage scores of a matrix $A$ are the squared row-norms of the matrix containing its (top) left singular vectors and the coherence is the largest leverage score. These quantities are of interest in recently-popular…

Data Structures and Algorithms · Computer Science 2012-12-06 Petros Drineas, Malik Magdon-Ismail, Michael W. Mahoney, David P. Woodruff

Iterative refinement is particularly popular for the numerical solution of linear systems of equations. We extend it to Low Rank Approximation of a matrix (LRA) and observe a close link between the resulting algorithm and oversampling techniques,…

Numerical Analysis · Mathematics 2024-11-28 Victor Y. Pan, Qi Luan, Soo Go

Given a matrix $\mathbf{A}\in\mathbb{R}^{n\times d}$ and a vector $b \in\mathbb{R}^{n}$, we show how to compute an $\epsilon$-approximate solution to the regression problem $ \min_{x\in\mathbb{R}^{d}}\frac{1}{2} \|\mathbf{A} x - b\|_{2}^{2}…

Machine Learning · Statistics 2017-11-23 Naman Agarwal, Sham Kakade, Rahul Kidambi, Yin Tat Lee, Praneeth Netrapalli, Aaron Sidford

We focus on the use of row sampling for approximating matrix algorithms. We give applications to matrix multiplication, sparse matrix reconstruction, and $\ell_2$ regression. For a matrix $\mathbf{A}\in\mathbb{R}^{m\times d}$ which…

Data Structures and Algorithms · Computer Science 2010-08-04 Malik Magdon-Ismail

Recently, a class of algorithms combining classical fixed point iterations with repeated random sparsification of approximate solution vectors has been successfully applied to eigenproblems with matrices as large as $10^{108} \times…

Numerical Analysis · Mathematics 2025-04-28 Jonathan Weare, Robert J. Webber

We provide new high-accuracy randomized algorithms for solving linear systems and regression problems that are well-conditioned except for $k$ large singular values. For solving such a $d \times d$ positive definite system, our algorithms…

Data Structures and Algorithms · Computer Science 2025-07-17 Michał Dereziński, Aaron Sidford

Randomized algorithms for very large matrix problems have received a great deal of attention in recent years. Much of this work was motivated by problems in large-scale data analysis, and this work was performed by individuals from many…

Data Structures and Algorithms · Computer Science 2011-11-16 Michael W. Mahoney

One popular method for dealing with large-scale data sets is sampling. For example, by using the empirical statistical leverage scores as an importance sampling distribution, the method of algorithmic leveraging samples and rescales…

Methodology · Statistics 2013-06-25 Ping Ma, Michael W. Mahoney, Bin Yu

Efficiently solving sparse linear algebraic equations is an important research topic of numerical simulation. Commonly used approaches include direct methods and iterative methods. Compared with the direct methods, the iterative methods…

Numerical Analysis · Mathematics 2023-10-11 Haifeng Zou, Xiaowen Xu, Chen-Song Zhang

In this letter, we propose an algorithm for recovery of sparse and low rank components of matrices using an iterative method with adaptive thresholding. In each iteration, the low rank and sparse components are obtained using a thresholding…

Numerical Analysis · Computer Science 2017-04-13 Nematollah Zarmehi, Farokh Marvasti

Leverage scores have become essential in statistics and machine learning, aiding regression analysis, randomized matrix computations, and various other tasks. This paper delves into the inverse problem, aiming to recover the intrinsic model…

Machine Learning · Computer Science 2024-08-22 Chenyang Li, Zhao Song, Zhaoxing Xu, Junze Yin

Finding a small spectral approximation for a tall $n \times d$ matrix $A$ is a fundamental numerical primitive. For a number of reasons, one often seeks an approximation whose rows are sampled from those of $A$. Row sampling improves…

Data Structures and Algorithms · Computer Science 2016-04-20 Michael B. Cohen, Cameron Musco, Jakub Pachocki

In this paper, we obtain improved running times for regression and top eigenvector computation for numerically sparse matrices. Given a data matrix $A \in \mathbb{R}^{n \times d}$ where every row $a \in \mathbb{R}^d$ has $\|a\|_2^2 \leq L$…

Data Structures and Algorithms · Computer Science 2018-11-28 Neha Gupta, Aaron Sidford

While leverage score sampling provides powerful tools for approximating solutions to large least squares problems, the cost of computing exact scores and sampling often prohibits practical application. This paper addresses this challenge by…

Numerical Analysis · Mathematics 2025-04-29 Osman Asif Malik, Yiming Xu, Nuojin Cheng, Stephen Becker, Alireza Doostan, Akil Narayan

Matrix recovery from sparse observations is an extensively studied topic arising in various applications, such as recommendation systems and signal processing, and it includes the matrix completion and compressed sensing models as special…

Methodology · Statistics 2026-04-13 Ziyuan Chen, Ying Yang, Fang Yao

Leverage score sampling provides an appealing way to perform approximate computations for large matrices. Indeed, it allows one to derive faithful approximations with a complexity adapted to the problem at hand. Yet, performing leverage scores…

Machine Learning · Statistics 2019-01-25 Alessandro Rudi, Daniele Calandriello, Luigi Carratino, Lorenzo Rosasco

Given a data matrix $X \in \mathbb{R}^{n\times d}$ and a response vector $y \in \mathbb{R}^{n}$ with $n>d$, it costs $O(n d^2)$ time and $O(n d)$ space to solve the least squares regression (LSR) problem. When $n$ and $d$ are both large, exactly solving…

Machine Learning · Computer Science 2014-04-08 Shusen Wang

The low-rank canonical polyadic tensor decomposition is useful in data analysis and can be computed by solving a sequence of overdetermined least squares subproblems. Motivated by consideration of sparse tensors, we propose sketching each…

Numerical Analysis · Mathematics 2022-01-05 Brett W. Larsen, Tamara G. Kolda