English
Related papers

Related papers: Randomized Dimension Reduction on Massive Data

200 papers

The scalability of statistical estimators is of increasing importance in modern applications. One approach to implementing scalable algorithms is to compress data into a low dimensional latent space using dimension reduction methods. In…

Machine Learning · Statistics 2015-04-14 Gregory Darnell , Stoyan Georgiev , Sayan Mukherjee , Barbara E Engelhardt

Large models and enormous data are essential driving forces of the unprecedented successes achieved by modern algorithms, especially in scientific computing and machine learning. Nevertheless, the growing dimensionality and model…

Machine Learning · Computer Science 2023-10-04 Yijun Dong

Dimensionality reduction techniques play important roles in the analysis of big data. Traditional dimensionality reduction approaches, such as principal component analysis (PCA) and linear discriminant analysis (LDA), have been studied…

Machine Learning · Computer Science 2018-05-31 Haozhe Xie , Jie Li , Hanqing Xue

Dimension reduction is often the first step in statistical modeling or prediction of multivariate spatial data. However, most existing dimension reduction techniques do not account for the spatial correlation between observations and do not…

Methodology · Statistics 2025-05-27 Si Cheng , Magali N. Blanco , Timothy V. Larson , Lianne Sheppard , Adam Szpiro , Ali Shojaie

In this paper, we study randomized reduction methods, which reduce high-dimensional features into low-dimensional space by randomized methods (e.g., random projection, random hashing), for large-scale high-dimensional classification.…

Machine Learning · Computer Science 2015-07-21 Tianbao Yang , Lijun Zhang , Rong Jin , Shenghuo Zhu

These notes are an overview of some classical linear methods in Multivariate Data Analysis. This is a good old domain, well established since the 60's, and refreshed timely as a key step in statistical learning. It can be presented as part…

Numerical Analysis · Mathematics 2023-05-25 Alain Franc

This paper, broadly speaking, covers the use of randomness in two main areas: low-rank approximation and kernel methods. Low-rank approximation is very important in numerical linear algebra. Many applications depend on matrix decomposition…

Numerical Analysis · Mathematics 2020-08-12 Rishi Advani , Madison Crim , Sean O'Hagan

This paper presents a randomized algorithm for computing the near-optimal low-rank dynamic mode decomposition (DMD). Randomized algorithms are emerging techniques to compute low-rank matrix approximations at a fraction of the cost of…

Numerical Analysis · Mathematics 2019-11-28 N. Benjamin Erichson , Lionel Mathelin , Steven L. Brunton , J. Nathan Kutz

We consider multi-class classification problems for high dimensional data. Following the idea of reduced-rank linear discriminant analysis (LDA), we introduce a new dimension reduction tool with a flavor of supervised principal component…

Methodology · Statistics 2017-03-28 Yue Selena Niu , Ning Hao , Bin Dong

In this work we show that the classification performance of high-dimensional structural MRI data with only a small set of training examples is improved by the usage of dimension reduction methods. We assessed two different dimension…

Machine Learning · Computer Science 2015-05-27 Andreas Grünauer , Markus Vincze

The development and use of dimension reduction methods is prevalent in modern statistical literature. This paper reviews a class of dimension reduction techniques which aim to simultaneously select relevant predictors and find clusters…

Methodology · Statistics 2022-02-18 Suchit Mehrotra

Data are not only ubiquitous in society, but are increasingly complex both in size and dimensionality. Dimension reduction offers researchers and scholars the ability to make such complex, high dimensional data spaces simpler and more…

Machine Learning · Computer Science 2021-03-15 Philip D. Waggoner

In our "big data" age, the size and complexity of data is steadily increasing. Methods for dimension reduction are ever more popular and useful. Two distinct types of dimension reduction are "data-oblivious" methods such as random…

Statistics Theory · Mathematics 2021-03-30 Fan Yang , Sifan Liu , Edgar Dobriban , David P. Woodruff

Dimension reduction is often an important step in the analysis of high-dimensional data. PCA is a popular technique to find the best low-dimensional approximation of high-dimensional data. However, classical PCA is very sensitive to…

Computation · Statistics 2019-01-14 Holger Cevallos-Valdiviezo , Stefan Van Aelst

This paper explores and analyzes two randomized designs for robust Principal Component Analysis (PCA) employing low-dimensional data sketching. In one design, a data sketch is constructed using random column sampling followed by low…

Machine Learning · Statistics 2017-03-21 Mostafa Rahmani , George Atia

Principal component analysis (PCA) is a well-known linear dimension-reduction method that has been widely used in data analysis and modeling. It is an unsupervised learning technique that identifies a suitable linear subspace for the input…

Machine Learning · Statistics 2021-09-10 Shaojie Xu , Joel Vaughan , Jie Chen , Agus Sudjianto , Vijayan Nair

Linear dimensionality reduction methods are a cornerstone of analyzing high dimensional data, due to their simple geometric interpretations and typically attractive computational properties. These methods capture many data features of…

Machine Learning · Statistics 2016-03-22 John P. Cunningham , Zoubin Ghahramani

This paper is concerned with the problem of low rank plus sparse matrix decomposition for big data. Conventional algorithms for matrix decomposition use the entire data to extract the low-rank and sparse components, and are based on…

Numerical Analysis · Computer Science 2017-03-17 Mostafa Rahmani , George Atia

Low-rank matrix approximations, such as the truncated singular value decomposition and the rank-revealing QR decomposition, play a central role in data analysis and scientific computing. This work surveys and extends recent research which…

Numerical Analysis · Mathematics 2014-04-29 Nathan Halko , Per-Gunnar Martinsson , Joel A. Tropp

In high-dimensional multivariate regression problems, enforcing low rank in the coefficient matrix offers effective dimension reduction, which greatly facilitates parameter estimation and model interpretation. However, commonly-used…

Statistics Theory · Mathematics 2017-07-18 Yiyuan She , Kun Chen
‹ Prev 1 2 3 10 Next ›