English
Related papers

Related papers: Exploiting Numerical Sparsity for Efficient Learni…

200 papers

We provide faster algorithms and improved sample complexities for approximating the top eigenvector of a matrix. Offline Setting: Given an $n \times d$ matrix $A$, we show how to compute an $\epsilon$ approximate top eigenvector in time…

Data Structures and Algorithms · Computer Science 2016-05-31 Chi Jin , Sham M. Kakade , Cameron Musco , Praneeth Netrapalli , Aaron Sidford

For a matrix $A\in \mathbb{R}^{n\times d}$ with $n\geq d$, we consider the dual problems of $\min \|Ax-b\|_p^p, \, b\in \mathbb{R}^n$ and $\min_{A^\top x=b} \|x\|_p^p,\, b\in \mathbb{R}^d$. We improve the runtimes for solving these problems…

Data Structures and Algorithms · Computer Science 2021-11-22 Mehrdad Ghadiri , Richard Peng , Santosh S. Vempala

In the total least squares problem, one is given an $m \times n$ matrix $A$, and an $m \times d$ matrix $B$, and one seeks to "correct" both $A$ and $B$, obtaining matrices $\hat{A}$ and $\hat{B}$, so that there exists an $X$ satisfying the…

Data Structures and Algorithms · Computer Science 2019-09-30 Huaian Diao , Zhao Song , David P. Woodruff , Xin Yang

There has been significant interest and progress recently in algorithms that solve regression problems involving tall and thin matrices in input sparsity time. These algorithms find shorter equivalent of a n*d matrix where n >> d, which…

Data Structures and Algorithms · Computer Science 2013-04-05 Mu Li , Gary L. Miller , Richard Peng

We provide new high-accuracy randomized algorithms for solving linear systems and regression problems that are well-conditioned except for $k$ large singular values. For solving such $d \times d$ positive definite system our algorithms…

Data Structures and Algorithms · Computer Science 2025-07-17 Michał Dereziński , Aaron Sidford

We give faster algorithms and improved sample complexities for estimating the top eigenvector of a matrix $\Sigma$ -- i.e. computing a unit vector $x$ such that $x^T \Sigma x \ge (1-\epsilon)\lambda_1(\Sigma)$: Offline Eigenvector…

Data Structures and Algorithms · Computer Science 2016-05-30 Dan Garber , Elad Hazan , Chi Jin , Sham M. Kakade , Cameron Musco , Praneeth Netrapalli , Aaron Sidford

We show how to solve a number of problems in numerical linear algebra, such as least squares regression, $\ell_p$-regression for any $p \geq 1$, low rank approximation, and kernel regression, in time $T(A) \poly(\log(nd))$, where for a…

Machine Learning · Computer Science 2019-12-13 Xiaofei Shi , David P. Woodruff

We present a new algorithm for finding a near optimal low-rank approximation of a matrix $A$ in $O(nnz(A))$ time. Our method is based on a recursive sampling scheme for computing a representative subset of $A$'s columns, which is then used…

Data Structures and Algorithms · Computer Science 2016-10-10 Michael B. Cohen , Cameron Musco , Christopher Musco

The sparse regression problem, also known as best subset selection problem, can be cast as follows: Given a set $S$ of $n$ points in $\mathbb{R}^d$, a point $y\in \mathbb{R}^d$, and an integer $2 \leq k \leq d$, find an affine combination…

Data Structures and Algorithms · Computer Science 2020-01-01 Jean Cardinal , Aurélien Ooms

We design a new distribution over $\poly(r \eps^{-1}) \times n$ matrices $S$ so that for any fixed $n \times d$ matrix $A$ of rank $r$, with probability at least 9/10, $\norm{SAx}_2 = (1 \pm \eps)\norm{Ax}_2$ simultaneously for all $x \in…

Data Structures and Algorithms · Computer Science 2013-04-08 Kenneth L. Clarkson , David P. Woodruff

In prior work, Gupta et al. (SPAA 2022) presented a distributed algorithm for multiplying sparse $n \times n$ matrices, using $n$ computers. They assumed that the input matrices are uniformly sparse--there are at most $d$ non-zeros in each…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-05-24 Chetan Gupta , Janne H. Korhonen , Jan Studený , Jukka Suomela , Hossein Vahidi

We develop a technique to design efficiently computable estimators for sparse linear regression in the simultaneous presence of two adversaries: oblivious and adaptive. We design several robust algorithms that outperform the state of the…

Machine Learning · Computer Science 2024-11-01 Chih-Hung Liu , Gleb Novikov

We study the algorithmic problem of estimating the mean of heavy-tailed random vector in $\mathbb{R}^d$, given $n$ i.i.d. samples. The goal is to design an efficient estimator that attains the optimal sub-gaussian error bound, only assuming…

Statistics Theory · Mathematics 2020-02-19 Zhixian Lei , Kyle Luh , Prayaag Venkat , Fred Zhang

We study the problem of approximating the eigenspectrum of a symmetric matrix $\mathbf A \in \mathbb{R}^{n \times n}$ with bounded entries (i.e., $\|\mathbf A\|_{\infty} \leq 1$). We present a simple sublinear time algorithm that…

Data Structures and Algorithms · Computer Science 2022-07-25 Rajarshi Bhattacharjee , Gregory Dexter , Petros Drineas , Cameron Musco , Archan Ray

Graph-based nearest neighbor search methods have seen a surge of popularity in recent years, offering state-of-the-art performance across a wide variety of applications. Central to these methods is the task of constructing a sparse…

Data Structures and Algorithms · Computer Science 2025-10-24 Alex Conway , Laxman Dhulipala , Martin Farach-Colton , Rob Johnson , Ben Landrum , Christopher Musco , Yarin Shechter , Torsten Suel , Richard Wen

To the best of our knowledge, there are no methods today for training differentially private regression models on sparse input data. To remedy this, we adapt the Frank-Wolfe algorithm for $L_1$ penalized linear regression to be aware of…

Machine Learning · Computer Science 2023-11-01 Edward Raff , Amol Khanna , Fred Lu

The textbook algorithm for real-weighted single-source shortest paths takes $O(mn)$ time on a graph with $m$ edges and $n$ vertices. A recent breakthrough algorithm by [Fin24] takes $\tilde{O}(mn^{8/9})$ randomized time. The running time…

Data Structures and Algorithms · Computer Science 2025-12-01 Kent Quanrud , Navid Tajkhorshid

Sparse recovery is one of the most fundamental and well-studied inverse problems. Standard statistical formulations of the problem are provably solved by general convex programming techniques and more practical, fast (nearly-linear time)…

Data Structures and Algorithms · Computer Science 2022-03-09 Jonathan A. Kelner , Jerry Li , Allen Liu , Aaron Sidford , Kevin Tian

Large-scale eigenvalue computations on sparse matrices are a key component of graph analytics techniques based on spectral methods. In such applications, an exhaustive computation of all eigenvalues and eigenvectors is impractical and…

Hardware Architecture · Computer Science 2021-03-19 Francesco Sgherzi , Alberto Parravicini , Marco Siracusa , Marco Domenico Santambrogio

A recent work by [Larsen, SODA 2023] introduced a faster combinatorial alternative to Bansal's SDP algorithm for finding a coloring $x \in \{-1, 1\}^n$ that approximately minimizes the discrepancy $\mathrm{disc}(A, x) := | A x |_{\infty}$…

Data Structures and Algorithms · Computer Science 2025-05-27 Yichuan Deng , Xiaoyu Li , Zhao Song , Omri Weinstein
‹ Prev 1 2 3 10 Next ›