Related papers: Exploiting Numerical Sparsity for Efficient Learni…

Robust Shift-and-Invert Preconditioning: Faster and More Sample Efficient Algorithms for Eigenvector Computation

We provide faster algorithms and improved sample complexities for approximating the top eigenvector of a matrix. Offline Setting: Given an $n \times d$ matrix $A$, we show how to compute an $\epsilon$ approximate top eigenvector in time…

Data Structures and Algorithms · Computer Science 2016-05-31 Chi Jin , Sham M. Kakade , Cameron Musco , Praneeth Netrapalli , Aaron Sidford

Faster $p$-Norm Regression Using Sparsity

For a matrix $A\in \mathbb{R}^{n\times d}$ with $n\geq d$, we consider the dual problems of $\min \|Ax-b\|_p^p, \, b\in \mathbb{R}^n$ and $\min_{A^\top x=b} \|x\|_p^p,\, b\in \mathbb{R}^d$. We improve the runtimes for solving these problems…

Data Structures and Algorithms · Computer Science 2021-11-22 Mehrdad Ghadiri , Richard Peng , Santosh S. Vempala

Total Least Squares Regression in Input Sparsity Time

In the total least squares problem, one is given an $m \times n$ matrix $A$, and an $m \times d$ matrix $B$, and one seeks to "correct" both $A$ and $B$, obtaining matrices $\hat{A}$ and $\hat{B}$, so that there exists an $X$ satisfying the…

Data Structures and Algorithms · Computer Science 2019-09-30 Huaian Diao , Zhao Song , David P. Woodruff , Xin Yang

Iterative Row Sampling

There has been significant interest and progress recently in algorithms that solve regression problems involving tall and thin matrices in input sparsity time. These algorithms find shorter equivalent of a n*d matrix where n >> d, which…

Data Structures and Algorithms · Computer Science 2013-04-05 Mu Li , Gary L. Miller , Richard Peng

Approaching Optimality for Solving Dense Linear Systems with Low-Rank Structure

We provide new high-accuracy randomized algorithms for solving linear systems and regression problems that are well-conditioned except for $k$ large singular values. For solving such $d \times d$ positive definite system our algorithms…

Data Structures and Algorithms · Computer Science 2025-07-17 Michał Dereziński , Aaron Sidford

Faster Eigenvector Computation via Shift-and-Invert Preconditioning

We give faster algorithms and improved sample complexities for estimating the top eigenvector of a matrix $\Sigma$ -- i.e. computing a unit vector $x$ such that $x^T \Sigma x \ge (1-\epsilon)\lambda_1(\Sigma)$: Offline Eigenvector…

Data Structures and Algorithms · Computer Science 2016-05-30 Dan Garber , Elad Hazan , Chi Jin , Sham M. Kakade , Cameron Musco , Praneeth Netrapalli , Aaron Sidford

Sublinear Time Numerical Linear Algebra for Structured Matrices

We show how to solve a number of problems in numerical linear algebra, such as least squares regression, $\ell_p$-regression for any $p \geq 1$, low rank approximation, and kernel regression, in time $T(A) \poly(\log(nd))$, where for a…

Machine Learning · Computer Science 2019-12-13 Xiaofei Shi , David P. Woodruff

Input Sparsity Time Low-Rank Approximation via Ridge Leverage Score Sampling

We present a new algorithm for finding a near optimal low-rank approximation of a matrix $A$ in $O(nnz(A))$ time. Our method is based on a recursive sampling scheme for computing a representative subset of $A$'s columns, which is then used…

Data Structures and Algorithms · Computer Science 2016-10-10 Michael B. Cohen , Cameron Musco , Christopher Musco

Sparse Regression via Range Counting

The sparse regression problem, also known as best subset selection problem, can be cast as follows: Given a set $S$ of $n$ points in $\mathbb{R}^d$, a point $y\in \mathbb{R}^d$, and an integer $2 \leq k \leq d$, find an affine combination…

Data Structures and Algorithms · Computer Science 2020-01-01 Jean Cardinal , Aurélien Ooms

Low Rank Approximation and Regression in Input Sparsity Time

We design a new distribution over $\poly(r \eps^{-1}) \times n$ matrices $S$ so that for any fixed $n \times d$ matrix $A$ of rank $r$, with probability at least 9/10, $\norm{SAx}_2 = (1 \pm \eps)\norm{Ax}_2$ simultaneously for all $x \in…

Data Structures and Algorithms · Computer Science 2013-04-08 Kenneth L. Clarkson , David P. Woodruff

Low-Bandwidth Matrix Multiplication: Faster Algorithms and More General Forms of Sparsity

In prior work, Gupta et al. (SPAA 2022) presented a distributed algorithm for multiplying sparse $n \times n$ matrices, using $n$ computers. They assumed that the input matrices are uniformly sparse--there are at most $d$ non-zeros in each…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-05-24 Chetan Gupta , Janne H. Korhonen , Jan Studený , Jukka Suomela , Hossein Vahidi

Robust Sparse Regression with Non-Isotropic Designs

We develop a technique to design efficiently computable estimators for sparse linear regression in the simultaneous presence of two adversaries: oblivious and adaptive. We design several robust algorithms that outperform the state of the…

Machine Learning · Computer Science 2024-11-01 Chih-Hung Liu , Gleb Novikov

A Fast Spectral Algorithm for Mean Estimation with Sub-Gaussian Rates

We study the algorithmic problem of estimating the mean of heavy-tailed random vector in $\mathbb{R}^d$, given $n$ i.i.d. samples. The goal is to design an efficient estimator that attains the optimal sub-gaussian error bound, only assuming…

Statistics Theory · Mathematics 2020-02-19 Zhixian Lei , Kyle Luh , Prayaag Venkat , Fred Zhang

Sublinear Time Eigenvalue Approximation via Random Sampling

We study the problem of approximating the eigenspectrum of a symmetric matrix $\mathbf A \in \mathbb{R}^{n \times n}$ with bounded entries (i.e., $\|\mathbf A\|_{\infty} \leq 1$). We present a simple sublinear time algorithm that…

Data Structures and Algorithms · Computer Science 2022-07-25 Rajarshi Bhattacharjee , Gregory Dexter , Petros Drineas , Cameron Musco , Archan Ray

Efficiently Constructing Sparse Navigable Graphs

Graph-based nearest neighbor search methods have seen a surge of popularity in recent years, offering state-of-the-art performance across a wide variety of applications. Central to these methods is the task of constructing a sparse…

Data Structures and Algorithms · Computer Science 2025-10-24 Alex Conway , Laxman Dhulipala , Martin Farach-Colton , Rob Johnson , Ben Landrum , Christopher Musco , Yarin Shechter , Torsten Suel , Richard Wen

Scaling Up Differentially Private LASSO Regularized Logistic Regression via Faster Frank-Wolfe Iterations

To the best of our knowledge, there are no methods today for training differentially private regression models on sparse input data. To remedy this, we adapt the Frank-Wolfe algorithm for $L_1$ penalized linear regression to be aware of…

Machine Learning · Computer Science 2023-11-01 Edward Raff , Amol Khanna , Fred Lu

From Hop Reduction to Sparsification for Negative Length Shortest Paths

The textbook algorithm for real-weighted single-source shortest paths takes $O(mn)$ time on a graph with $m$ edges and $n$ vertices. A recent breakthrough algorithm by [Fin24] takes $\tilde{O}(mn^{8/9})$ randomized time. The running time…

Data Structures and Algorithms · Computer Science 2025-12-01 Kent Quanrud , Navid Tajkhorshid

Semi-Random Sparse Recovery in Nearly-Linear Time

Sparse recovery is one of the most fundamental and well-studied inverse problems. Standard statistical formulations of the problem are provably solved by general convex programming techniques and more practical, fast (nearly-linear time)…

Data Structures and Algorithms · Computer Science 2022-03-09 Jonathan A. Kelner , Jerry Li , Allen Liu , Aaron Sidford , Kevin Tian

Solving Large Top-K Graph Eigenproblems with a Memory and Compute-optimized FPGA Design

Large-scale eigenvalue computations on sparse matrices are a key component of graph analytics techniques based on spectral methods. In such applications, an exhaustive computation of all eigenvalues and eigenvectors is impractical and…

Hardware Architecture · Computer Science 2021-03-19 Francesco Sgherzi , Alberto Parravicini , Marco Siracusa , Marco Domenico Santambrogio

Discrepancy Minimization in Input-Sparsity Time

A recent work by [Larsen, SODA 2023] introduced a faster combinatorial alternative to Bansal's SDP algorithm for finding a coloring $x \in \{-1, 1\}^n$ that approximately minimizes the discrepancy $\mathrm{disc}(A, x) := | A x |_{\infty}$…

Data Structures and Algorithms · Computer Science 2025-05-27 Yichuan Deng , Xiaoyu Li , Zhao Song , Omri Weinstein