Related papers: Deterministic Coreset for Lp Subspace

200 papers

A coreset of a dataset with $n$ examples and $d$ features is a weighted subset of examples that is sufficient for solving downstream data analytic tasks. Nearly optimal constructions of coresets for least squares and $\ell_p$ linear…

Data Structures and Algorithms · Computer Science 2024-06-05 David P. Woodruff, Taisuke Yasuda

An accurate coreset is a weighted subset of the original dataset such that a model trained on it maintains the same level of accuracy as a model trained on the full dataset. Primarily, these coresets have been studied for a…

Machine Learning · Computer Science 2024-12-31 Sanskar Ranjan, Supratim Shit

The $\ell_p$ subspace approximation problem is an NP-hard low rank approximation problem that generalizes the median hyperplane ($p = 1$), principal component analysis ($p = 2$), and center hyperplane problems ($p = \infty$). A popular…

Data Structures and Algorithms · Computer Science 2025-04-04 David P. Woodruff, Taisuke Yasuda

Oblivious low-distortion subspace embeddings are a crucial building block for numerical linear algebra problems. We show for any real $p, 1 \leq p < \infty$, given a matrix $M \in \mathbb{R}^{n \times d}$ with $n \gg d$, with constant…

Data Structures and Algorithms · Computer Science 2014-03-19 David P. Woodruff, Qin Zhang

We show how to construct in linear time coresets of constant size for farthest point problems in fixed-dimensional hyperbolic space. Our coresets provide both an arbitrarily small relative error and additive error $\varepsilon$. More…

Computational Geometry · Computer Science 2025-11-03 Eunku Park, Antoine Vigneron

We consider the problem of subset selection for $\ell_{p}$ subspace approximation, that is, to efficiently find a \emph{small} subset of data points such that solving the problem optimally for this subset gives a good approximation to…

Machine Learning · Computer Science 2022-04-27 Amit Deshpande, Rameshwar Pratap

With the dramatic growth in the number of application domains that generate probabilistic, noisy and uncertain data, there has been an increasing interest in designing algorithms for geometric or combinatorial optimization problems over…

Data Structures and Algorithms · Computer Science 2016-05-24 Lingxiao Huang, Jian Li, Jeff M. Phillips, Haitao Wang

Given a point set $P\subset \mathbb{R}^d$, the kernel density estimate of $P$ is defined as \[ \overline{\mathcal{G}}_P(x) = \frac{1}{\left|P\right|}\sum_{p\in P}e^{-\left\lVert x-p \right\rVert^2} \] for any $x\in\mathbb{R}^d$. We study…

Data Structures and Algorithms · Computer Science 2022-02-22 Wai Ming Tai

A coreset (or core-set) is a small weighted \emph{subset} $Q$ of an input set $P$ with respect to a given \emph{monotonic} function $f:\mathbb{R}\to\mathbb{R}$ that \emph{provably} approximates its fitting loss $\sum_{p\in P}f(p\cdot x)$ to…

Machine Learning · Computer Science 2021-12-24 Elad Tolochinsky, Ibrahim Jubran, Dan Feldman

We provide fast algorithms for overconstrained $\ell_p$ regression and related problems: for an $n\times d$ input matrix $A$ and vector $b\in\mathbb{R}^n$, in $O(nd\log n)$ time we reduce the problem $\min_{x\in\mathbb{R}^d} \|Ax-b\|_p$ to…

Data Structures and Algorithms · Computer Science 2014-04-08 Kenneth L. Clarkson, Petros Drineas, Malik Magdon-Ismail, Michael W. Mahoney, Xiangrui Meng, David P. Woodruff

The Lp regression problem takes as input a matrix $A \in \Real^{n \times d}$, a vector $b \in \Real^n$, and a number $p \in [1,\infty)$, and it returns as output a number ${\cal Z}$ and a vector $x_{opt} \in \Real^d$ such that ${\cal Z} =…

Data Structures and Algorithms · Computer Science 2007-07-13 Anirban Dasgupta, Petros Drineas, Boulos Harb, Ravi Kumar, Michael W. Mahoney

With input sizes becoming massive, coresets -- small yet representative summaries of the input -- are more relevant than ever. A weighted set $C_w$ that is a subset of the input is an $\varepsilon$-coreset if the cost of any feasible solution…

Data Structures and Algorithms · Computer Science 2020-09-29 Monika Henzinger, Sagar Kale

An $\ell_p$ oblivious subspace embedding is a distribution over $r \times n$ matrices $\Pi$ such that for any fixed $n \times d$ matrix $A$, $$\Pr_{\Pi}[\textrm{for all }x, \ \|Ax\|_p \leq \|\Pi Ax\|_p \leq \kappa \|Ax\|_p] \geq 9/10,$$…

Data Structures and Algorithms · Computer Science 2018-04-10 Ruosong Wang, David P. Woodruff

We give the first non-trivial decremental dynamic embedding of a weighted, undirected graph $G$ into $\ell_p$ space. Given a weighted graph $G$ undergoing a sequence of edge weight increases, the goal of this problem is to maintain a…

Data Structures and Algorithms · Computer Science 2024-08-15 Kiarash Banihashem, MohammadTaghi Hajiaghayi, Dariusz R. Kowalski, Jan Olkowski, Max Springer

We show that for every large enough integer $N$, there exists an $N$-point subset of $L_1$ such that for every $D>1$, embedding it into $\ell_1^d$ with distortion $D$ requires dimension $d$ at least $N^{\Omega(1/D^2)}$, and that for every…

Metric Geometry · Mathematics 2011-12-22 Oded Regev

Many existing algorithms for streaming geometric data analysis have been plagued by exponential dependencies in the space complexity, which are undesirable for processing high-dimensional data sets. In particular, once $d\geq\log n$, there…

Data Structures and Algorithms · Computer Science 2022-09-28 David P. Woodruff, Taisuke Yasuda

The Johnson-Lindenstrauss (JL) lemma is a fundamental result in dimensionality reduction, ensuring that any finite set $X \subseteq \mathbb{R}^d$ can be embedded into a lower-dimensional space $\mathbb{R}^k$ while approximately preserving…

Probability · Mathematics 2025-10-30 Rafael Chiclana, Mark Iwen

We give improved algorithms for the $\ell_{p}$-regression problem, $\min_{x} \|x\|_{p}$ such that $A x=b,$ for all $p \in (1,2) \cup (2,\infty).$ Our algorithms obtain a high accuracy solution in $\tilde{O}_{p}(m^{\frac{|p-2|}{2p + |p-2|}})…

Data Structures and Algorithms · Computer Science 2024-12-20 Deeksha Adil, Rasmus Kyng, Richard Peng, Sushant Sachdeva

A coreset of a given dataset and loss function is usually a small weighted set that approximates this loss for every query from a given set of queries. Coresets have been shown to be very useful in many applications. However, coresets construction…

Machine Learning · Computer Science 2021-11-05 Alaa Maalouf, Gilad Eini, Ben Mussay, Dan Feldman, Margarita Osadchy

The coresets approach, also called subsampling or subset selection, aims to select a subsample as a surrogate for the observed sample and has found extensive applications in large-scale data analysis. Existing coresets methods construct the…

Computation · Statistics 2024-09-17 Mengyu Li, Jun Yu, Tao Li, Cheng Meng