Related papers: Visualizing probabilistic models: Intensive Princi…

Visualizing theory space: Isometric embedding of probabilistic predictions, from the Ising model to the cosmic microwave background

We develop an intensive embedding for visualizing the space of all predictions for probabalistic models, using replica theory. Our embedding is isometric (preserves the distinguishability between models) and faithful (yields low-dimensional…

Statistical Mechanics · Physics 2017-09-08 Katherine N. Quinn , Francesco De Bernardis , Michael D. Niemack , James P. Sethna

Disentangling Interpretable Factors with Supervised Independent Subspace Principal Component Analysis

The success of machine learning models relies heavily on effectively representing high-dimensional data. However, ensuring data representations capture human-understandable concepts remains difficult, often requiring the incorporation of…

Machine Learning · Statistics 2024-11-01 Jiayu Su , David A. Knowles , Raul Rabadan

Intrinsic dimension estimation of data by principal component analysis

Estimating intrinsic dimensionality of data is a classic problem in pattern recognition and statistics. Principal Component Analysis (PCA) is a powerful tool in discovering dimensionality of data sets with a linear structure; it, however,…

Computer Vision and Pattern Recognition · Computer Science 2010-02-11 Mingyu Fan , Nannan Gu , Hong Qiao , Bo Zhang

Combining Structured and Unstructured Randomness in Large Scale PCA

Principal Component Analysis (PCA) is a ubiquitous tool with many applications in machine learning including feature construction, subspace embedding, and outlier detection. In this paper, we present an algorithm for computing the top…

Machine Learning · Computer Science 2013-10-25 Nikos Karampatziakis , Paul Mineiro

Principal Component Analysis Using Structural Similarity Index for Images

Despite the advances of deep learning in specific tasks using images, the principled assessment of image fidelity and similarity is still a critical ability to develop. As it has been shown that Mean Squared Error (MSE) is insufficient for…

Image and Video Processing · Electrical Eng. & Systems 2019-08-27 Benyamin Ghojogh , Fakhri Karray , Mark Crowley

Dimensionality reduction to maximize prediction generalization capability

Generalization of time series prediction remains an important open issue in machine learning, wherein earlier methods have either large generalization error or local minima. We develop an analytically solvable, unsupervised learning scheme…

Machine Learning · Statistics 2022-01-21 Takuya Isomura , Taro Toyoizumi

Integrated Principal Components Analysis

Data integration, or the strategic analysis of multiple sources of data simultaneously, can often lead to discoveries that may be hidden in individualistic analyses of a single data source. We develop a new unsupervised data integration…

Methodology · Statistics 2021-04-06 Tiffany M. Tang , Genevera I. Allen

On the Convergence of the Dynamic Inner PCA Algorithm

Dynamic inner principal component analysis (DiPCA) is a powerful method for the analysis of time-dependent multivariate data. DiPCA extracts dynamic latent variables that capture the most dominant temporal trends by solving a large-scale,…

Systems and Control · Electrical Eng. & Systems 2020-03-16 Sungho Shin , Alex D. Smith , S. Joe Qin , Victor M. Zavala

When and why are principal component scores a good tool for visualizing high-dimensional data?

Principal component analysis (PCA) is a popular dimension reduction technique often used to visualize high-dimensional data structures. In genomics, this can involve millions of variables, but only tens to hundreds of observations.…

Statistics Theory · Mathematics 2020-06-11 Kristoffer Hellton , Magne Thoresen

Uncertainty-Aware Principal Component Analysis

We present a technique to perform dimensionality reduction on data that is subject to uncertainty. Our method is a generalization of traditional principal component analysis (PCA) to multivariate probability distributions. In comparison to…

Machine Learning · Computer Science 2019-10-14 Jochen Görtler , Thilo Spinner , Dirk Streeb , Daniel Weiskopf , Oliver Deussen

Supervised Linear Dimension-Reduction Methods: Review, Extensions, and Comparisons

Principal component analysis (PCA) is a well-known linear dimension-reduction method that has been widely used in data analysis and modeling. It is an unsupervised learning technique that identifies a suitable linear subspace for the input…

Machine Learning · Statistics 2021-09-10 Shaojie Xu , Joel Vaughan , Jie Chen , Agus Sudjianto , Vijayan Nair

Automatic dimensionality selection for principal component analysis models with the ignorance score

Principal component analysis (PCA) is by far the most widespread tool for unsupervised learning with high-dimensional data sets. Its application is popularly studied for the purpose of exploratory data analysis and online process…

Applications · Statistics 2019-02-12 Stefania Russo , Guangyu Li , Kris Villez

Robust Principal Component Analysis on Graphs

Principal Component Analysis (PCA) is the most widely used tool for linear dimensionality reduction and clustering. Still it is highly sensitive to outliers and does not scale well with respect to the number of data samples. Robust PCA…

Computer Vision and Pattern Recognition · Computer Science 2015-04-24 Nauman Shahid , Vassilis Kalofolias , Xavier Bresson , Michael Bronstein , Pierre Vandergheynst

Nonlinear Independent Component Analysis for Principled Disentanglement in Unsupervised Deep Learning

A central problem in unsupervised deep learning is how to find useful representations of high-dimensional data, sometimes called "disentanglement". Most approaches are heuristic and lack a proper theoretical foundation. In linear…

Machine Learning · Computer Science 2023-09-06 Aapo Hyvarinen , Ilyes Khemakhem , Hiroshi Morioka

Introduction to Principal Components Analysis

Understanding the inverse equivalent width - luminosity relationship (Baldwin Effect), the topic of this meeting, requires extracting information on continuum and emission line parameters from samples of AGN. We wish to discover whether,…

Astrophysics · Physics 2007-05-23 Paul J. Francis , Beverley J. Wills

Deep Kernel Principal Component Analysis for Multi-level Feature Learning

Principal Component Analysis (PCA) and its nonlinear extension Kernel PCA (KPCA) are widely used across science and industry for data analysis and dimensionality reduction. Modern deep learning tools have achieved great empirical success,…

Machine Learning · Computer Science 2023-02-23 Francesco Tonin , Qinghua Tao , Panagiotis Patrinos , Johan A. K. Suykens

Extrinsic Principal Component Analysis

One develops a fast computational methodology for principal component analysis on manifolds. Instead of estimating intrinsic principal components on an object space with a Riemannian structure, one embeds the object space in a numerical…

Methodology · Statistics 2024-10-04 Ka Chun Wong , Vic Patrangenaru , Robert L. Paige , Mihaela Pricop Jeckstadt

Robust Principal Component Analysis using Density Power Divergence

Principal component analysis (PCA) is a widely employed statistical tool used primarily for dimensionality reduction. However, it is known to be adversely affected by the presence of outlying observations in the sample, which is quite…

Methodology · Statistics 2023-09-26 Subhrajyoty Roy , Ayanendranath Basu , Abhik Ghosh

Robust Principal Component Analysis: A Median of Means Approach

Principal Component Analysis (PCA) is a fundamental tool for data visualization, denoising, and dimensionality reduction. It is widely popular in Statistics, Machine Learning, Computer Vision, and related fields. However, PCA is well-known…

Machine Learning · Statistics 2023-07-21 Debolina Paul , Saptarshi Chakraborty , Swagatam Das

Iterative Supervised Principal Components

In high-dimensional prediction problems, where the number of features may greatly exceed the number of training instances, fully Bayesian approach with a sparsifying prior is known to produce good results but is computationally challenging.…

Methodology · Statistics 2018-10-15 Juho Piironen , Aki Vehtari