Related papers: Principal Component Analysis based frameworks for …

Blockwise Principal Component Analysis for monotone missing data imputation and dimensionality reduction

Monotone missing data is a common problem in data analysis. However, imputation combined with dimensionality reduction can be computationally expensive, especially with the increasing size of datasets. To address this issue, we propose a…

Machine Learning · Computer Science 2024-01-11 Tu T. Do , Mai Anh Vu , Tuan L. Vo , Hoang Thien Ly , Thu Nguyen , Steven A. Hicks , Michael A. Riegler , Pål Halvorsen , Binh T. Nguyen

Multiple imputation for continuous variables using a Bayesian principal component analysis

We propose a multiple imputation method based on principal component analysis (PCA) to deal with incomplete continuous data. To reflect the uncertainty of the parameters from one imputation to the next, we use a Bayesian treatment of the…

Methodology · Statistics 2015-08-20 Vincent Audigier , François Husson , Julie Josse

Maximum Margin Principal Components

Principal Component Analysis (PCA) is a very successful dimensionality reduction technique, widely used in predictive modeling. A key factor in its widespread use in this domain is the fact that the projection of a dataset onto its first…

Machine Learning · Statistics 2017-05-19 Xianghui Luo , Robert J. Durrant

FAST-PCA: A Fast and Exact Algorithm for Distributed Principal Component Analysis

Principal Component Analysis (PCA) is a fundamental data preprocessing tool in the world of machine learning. While PCA is often thought of as a dimensionality reduction method, the purpose of PCA is actually two-fold: dimension reduction…

Machine Learning · Computer Science 2023-01-25 Arpita Gang , Waheed U. Bajwa

Principal component analysis for big data

Big data is transforming our world, revolutionizing operations and analytics everywhere, from financial engineering to biomedical sciences. The complexity of big data often makes dimension reduction techniques necessary before conducting…

Methodology · Statistics 2018-01-08 Jianqing Fan , Qiang Sun , Wen-Xin Zhou , Ziwei Zhu

Principal Component Analysis: A Natural Approach to Data Exploration

Principal component analysis (PCA) is often used for analyzing data in the most diverse areas. In this work, we report an integrated approach to several theoretical and practical aspects of PCA. We start by providing, in an intuitive and…

Computational Engineering, Finance, and Science · Computer Science 2021-06-09 Felipe L. Gewers , Gustavo R. Ferreira , Henrique F. de Arruda , Filipi N. Silva , Cesar H. Comin , Diego R. Amancio , Luciano da F. Costa

Integrated Principal Components Analysis

Data integration, or the strategic analysis of multiple sources of data simultaneously, can often lead to discoveries that may be hidden in individualistic analyses of a single data source. We develop a new unsupervised data integration…

Methodology · Statistics 2021-04-06 Tiffany M. Tang , Genevera I. Allen

A memory-based method to select the number of relevant components in Principal Component Analysis

We propose a new data-driven method to select the optimal number of relevant components in Principal Component Analysis (PCA). This new method applies to correlation matrices whose time autocorrelation function decays more slowly than an…

Statistical Finance · Quantitative Finance 2019-10-07 Anshul Verma , Pierpaolo Vivo , Tiziana Di Matteo

Autoencoder, Principal Component Analysis and Support Vector Regression for Data Imputation

Data collection often results in records that have missing values or variables. This investigation compares 3 different data imputation models and identifies their merits by using accuracy measures. Autoencoder Neural Networks, Principal…

Artificial Intelligence · Computer Science 2007-09-18 Vukosi N. Marivate , Fulufhelo V. Nelwamodo , Tshilidzi Marwala

Principal Component Analysis with Noisy and/or Missing Data

We present a method for performing Principal Component Analysis (PCA) on noisy datasets with missing values. Estimates of the measurement error are used to weight the input data such that compared to classic PCA, the resulting eigenvectors…

Instrumentation and Methods for Astrophysics · Physics 2015-06-11 Stephen Bailey

Generalized Low Rank Models

Principal components analysis (PCA) is a well-known technique for approximating a tabular data set by a low rank matrix. Here, we extend the idea of PCA to handle arbitrary data sets consisting of numerical, Boolean, categorical, ordinal,…

Machine Learning · Statistics 2015-05-06 Madeleine Udell , Corinne Horn , Reza Zadeh , Stephen Boyd

Sparse Principal Component Analysis with missing observations

In this paper, we study the problem of sparse Principal Component Analysis (PCA) in the high-dimensional setting with missing observations. Our goal is to estimate the first principal component when we only have access to partial…

Statistics Theory · Mathematics 2012-06-04 Karim Lounici

Sparse Generalized Principal Component Analysis for Large-scale Applications beyond Gaussianity

Principal Component Analysis (PCA) is a dimension reduction technique. It produces inconsistent estimators when the dimensionality is moderate to high, which is often the problem in modern large-scale applications where algorithm…

Computation · Statistics 2016-01-29 Qiaoya Zhang , Yiyuan She

Robust Principal Component Analysis Based On Maximum Correntropy Power Iterations

Principal component analysis (PCA) is recognised as a quintessential data analysis technique when it comes to describing linear relationships between the features of a dataset. However, the well-known sensitivity of PCA to non-Gaussian…

Machine Learning · Statistics 2019-10-28 Jean P. Chereau , Bruno Scalzo Dees , Danilo P. Mandic

Supervised dimensionality reduction for multiple imputation by chained equations

Multivariate imputation by chained equations (MICE) is one of the most popular approaches to address missing values in a data set. This approach requires specifying a univariate imputation model for every variable under imputation. The…

Methodology · Statistics 2023-11-01 Edoardo Costantini , Kyle M. Lang , Klaas Sijtsma

A random version of principal component analysis in data clustering

Principal component analysis (PCA) is a widespread technique for data analysis that relies on the covariance-correlation matrix of the analyzed data. However to properly work with high-dimensional data, PCA poses severe mathematical…

Quantitative Methods · Quantitative Biology 2018-10-18 Luigi Leonardo Palese

Functional Classwise Principal Component Analysis: A Novel Classification Framework

In recent times, functional data analysis (FDA) has been successfully applied in the field of high dimensional data classification. In this paper, we present a novel classification framework using functional data and classwise Principal…

Machine Learning · Statistics 2021-06-29 Avishek Chatterjee , Satyaki Mazumder , Koel Das

Multiple imputation using dimension reduction techniques for high-dimensional data

Missing data present challenges in data analysis. Naive analyses such as complete-case and available-case analysis may introduce bias and loss of efficiency, and produce unreliable results. Multiple imputation (MI) is one of the most widely…

Methodology · Statistics 2019-05-15 Domonique W. Hodge , Sandra E. Safo , Qi Long

PCA-Based Missing Information Imputation for Real-Time Crash Likelihood Prediction Under Imbalanced Data

The real-time crash likelihood prediction has been an important research topic. Various classifiers, such as support vector machine (SVM) and tree-based boosting algorithms, have been proposed in traffic safety studies. However, few…

Machine Learning · Computer Science 2018-02-13 Jintao Ke , Shuaichao Zhang , Hai Yang , Xiqun Chen

A novel approach for Fair Principal Component Analysis based on eigendecomposition

Principal component analysis (PCA), a ubiquitous dimensionality reduction technique in signal processing, searches for a projection matrix that minimizes the mean squared error between the reduced dataset and the original one. Since…

Machine Learning · Computer Science 2022-08-25 Guilherme Dean Pelegrina , Leonardo Tomazeli Duarte