Related papers: Representing complex data using localized principa…

200 papers

Big data is transforming our world, revolutionizing operations and analytics everywhere, from financial engineering to biomedical sciences. The complexity of big data often makes dimension reduction techniques necessary before conducting…

Methodology · Statistics 2018-01-08 Jianqing Fan, Qiang Sun, Wen-Xin Zhou, Ziwei Zhu

Principal component analysis (PCA) is a well-known linear dimension-reduction method that has been widely used in data analysis and modeling. It is an unsupervised learning technique that identifies a suitable linear subspace for the input…

Machine Learning · Statistics 2021-09-10 Shaojie Xu, Joel Vaughan, Jie Chen, Agus Sudjianto, Vijayan Nair

Principal component analysis (PCA) is a widespread technique for data analysis that relies on the covariance-correlation matrix of the analyzed data. However, to work properly with high-dimensional data, PCA poses severe mathematical…

Quantitative Methods · Quantitative Biology 2018-10-18 Luigi Leonardo Palese

Principal Component Analysis (PCA) is a very successful dimensionality reduction technique, widely used in predictive modeling. A key factor in its widespread use in this domain is the fact that the projection of a dataset onto its first…

Machine Learning · Statistics 2017-05-19 Xianghui Luo, Robert J. Durrant

Principal component analysis (PCA) is often used for analyzing data in the most diverse areas. In this work, we report an integrated approach to several theoretical and practical aspects of PCA. We start by providing, in an intuitive and…

Computational Engineering, Finance, and Science · Computer Science 2021-06-09 Felipe L. Gewers, Gustavo R. Ferreira, Henrique F. de Arruda, Filipi N. Silva, Cesar H. Comin, Diego R. Amancio, Luciano da F. Costa

Principal components analysis (PCA) is a classical method for the reduction of dimensionality of data in the form of n observations (or cases) of a vector with p variables. For a simple model of factor analysis type, it is proved that…

Statistics Theory · Mathematics 2009-01-29 Iain M Johnstone, Arthur Yu Lu

Principal Component Analysis (PCA) is one of the most important methods to handle high dimensional data. However, most of the studies on PCA aim to minimize the loss after projection, which usually measures the Euclidean distance, though in…

Machine Learning · Computer Science 2019-03-19 Kai Liu, Qiuwei Li, Hua Wang, Gongguo Tang

Principal Component Analysis (PCA) is a well-known procedure to reduce the intrinsic complexity of a dataset, essentially through simplifying the covariance structure or the correlation structure. We introduce a novel algebraic, model-based…

Methodology · Statistics 2021-12-09 Martin Schlather, Felix Reinbott

Dimensionality reduction is a crucial step for pattern recognition and data mining tasks to overcome the curse of dimensionality. Principal component analysis (PCA) is a traditional technique for unsupervised dimensionality reduction, which…

Machine Learning · Computer Science 2017-05-04 Zan Gao, Guotai Zhang, Feiping Nie, Hua Zhang

Principal component analysis (PCA) is one of the most popular dimension reduction techniques in statistics and is especially powerful when a multivariate distribution is concentrated near a lower-dimensional subspace. Multivariate extreme…

Methodology · Statistics 2025-07-15 Felix Reinbott, Anja Janßen

Principal component analysis (PCA) is a widely used dimension reduction tool in the analysis of many kinds of high-dimensional data. It is used in signal processing, mechanical engineering, psychometrics, and other fields under different…

Methodology · Statistics 2014-01-15 Ngoc Mai Tran, Maria Osipenko, Wolfgang Karl Haerdle

This paper introduces a Projected Principal Component Analysis (Projected-PCA), which applies principal component analysis to the data matrix projected (smoothed) onto a given linear space spanned by covariates. When it applies to…

Methodology · Statistics 2016-01-18 Jianqing Fan, Yuan Liao, Weichen Wang

Principal components analysis (PCA) is a well-known technique for approximating a tabular data set by a low rank matrix. Here, we extend the idea of PCA to handle arbitrary data sets consisting of numerical, Boolean, categorical, ordinal,…

Machine Learning · Statistics 2015-05-06 Madeleine Udell, Corinne Horn, Reza Zadeh, Stephen Boyd

Principal component analysis (PCA) for binary data, known as logistic PCA, has become a popular alternative to dimensionality reduction of binary data. It is motivated as an extension of ordinary PCA by means of a matrix factorization, akin…

Machine Learning · Statistics 2020-09-08 Andrew J. Landgraf, Yoonkyung Lee

Principal Component Analysis (PCA) is the workhorse tool for dimensionality reduction in this era of big data. While often overlooked, the purpose of PCA is not only to reduce data dimensionality, but also to yield features that are…

Machine Learning · Computer Science 2021-11-30 Arpita Gang, Waheed U. Bajwa

Methodologies for multidimensionality reduction aim at discovering low-dimensional manifolds where data ranges. Principal Component Analysis (PCA) is very effective if the data have linear structure, but fails in identifying a possible…

Numerical Analysis · Mathematics 2021-01-14 Alberto García-González, Antonio Huerta, Sergio Zlotnik, Pedro Díez

Principal component analysis (PCA) is a widely used unsupervised dimensionality reduction technique in machine learning, applied across various fields such as bioinformatics, computer vision and finance. However, when the response variables…

Applications · Statistics 2025-06-25 Theodosios Papazoglou, Guosheng Yin

Accurate predictions of pollutant concentrations at new locations are often of interest in air pollution studies on fine particulate matter (PM$_{2.5}$), in which data is usually not measured at all study locations. PM$_{2.5}$ is also a…

Applications · Statistics 2020-05-19 Phuong T. Vu, Timothy V. Larson, Adam A. Szpiro

Principal Component Analysis (PCA) is a dimension reduction technique. It produces inconsistent estimators when the dimensionality is moderate to high, which is often the problem in modern large-scale applications where algorithm…

Computation · Statistics 2016-01-29 Qiaoya Zhang, Yiyuan She

Principal component analysis (PCA) is a popular dimension reduction technique often used to visualize high-dimensional data structures. In genomics, this can involve millions of variables, but only tens to hundreds of observations.…

Statistics Theory · Mathematics 2020-06-11 Kristoffer Hellton, Magne Thoresen