English
Related papers

Related papers: Dimension estimation in PCA model using high-dimen…

200 papers

In this paper, we develop new statistical theory for probabilistic principal component analysis models in high dimensions. The focus is the estimation of the noise variance, which is an important and unresolved issue when the number of…

Statistics Theory · Mathematics 2014-06-23 Damien Passemier , Zhaoyuan Li , Jian-Feng Yao

Estimating intrinsic dimensionality of data is a classic problem in pattern recognition and statistics. Principal Component Analysis (PCA) is a powerful tool in discovering dimensionality of data sets with a linear structure; it, however,…

Computer Vision and Pattern Recognition · Computer Science 2010-02-11 Mingyu Fan , Nannan Gu , Hong Qiao , Bo Zhang

Principal Component Analysis (PCA) is a classical method for reducing the dimensionality of data by projecting them onto a subspace that captures most of their variation. Effective use of PCA in modern applications requires understanding…

Statistics Theory · Mathematics 2019-06-14 David Hong , Laura Balzano , Jeffrey A. Fessler

High dimensional data and systems with many degrees of freedom are often characterized by covariance matrices. In this paper, we consider the problem of simultaneously estimating the dimension of the principal (dominant) subspace of these…

Numerical Analysis · Computer Science 2018-10-10 Shashanka Ubaru , Abd-Krim Seghouane , Yousef Saad

The success of algorithms in the analysis of high-dimensional data is often attributed to the manifold hypothesis, which supposes that this data lie on or near a manifold of much lower dimension. It is often useful to determine or estimate…

Machine Learning · Statistics 2024-09-10 Anna C. Gilbert , Kevin O'Neill

Modern data are increasingly both high-dimensional and heteroscedastic. This paper considers the challenge of estimating underlying principal components from high-dimensional data with noise that is heteroscedastic across samples, i.e.,…

Statistics Theory · Mathematics 2022-09-14 David Hong , Fan Yang , Jeffrey A. Fessler , Laura Balzano

Principal component analysis (PCA) is one of the most widely used dimension reduction and multivariate statistical techniques. From a probabilistic perspective, PCA seeks a low-dimensional representation of data in the presence of…

Machine Learning · Computer Science 2021-01-06 Chihao Zhang , Kuo Gai , Shihua Zhang

Principal component analysis (PCA) is arguably the most widely used approach for large-dimensional factor analysis. While it is effective when the factors are sufficiently strong, it can be inconsistent when the factors are weak and/or the…

Methodology · Statistics 2025-08-22 Zhongyuan Lyu , Ming Yuan

Dimension reduction is often an important step in the analysis of high-dimensional data. PCA is a popular technique to find the best low-dimensional approximation of high-dimensional data. However, classical PCA is very sensitive to…

Computation · Statistics 2019-01-14 Holger Cevallos-Valdiviezo , Stefan Van Aelst

Learning augmented is a machine learning concept built to improve the performance of a method or model, such as enhancing its ability to predict and generalize data or features, or testing the reliability of the method by introducing noise…

Machine Learning · Computer Science 2024-01-09 Issam K. O Jabari , Shofiyah , Pradiptya Kahvi S , Novi Nur Putriwijaya , Novanto Yudistira

This paper considers the estimation and inference of the low-rank components in high-dimensional matrix-variate factor models, where each dimension of the matrix-variates ($p \times q$) is comparable to or greater than the number of…

Statistics Theory · Mathematics 2022-10-20 Elynn Y. Chen , Jianqing Fan

High dimensional data has introduced challenges that are difficult to address when attempting to implement classical approaches of statistical process control. This has made it a topic of interest for research due in recent years. However,…

Applications · Statistics 2019-04-23 Mohammad Nabhan , Yajun Mei , Jianjun Shi

Dimensionality reduction methods are very common in the field of high dimensional data analysis. Typically, algorithms for dimensionality reduction are computationally expensive. Therefore, their applications for the analysis of massive…

Machine Learning · Statistics 2015-11-04 Yariv Aizenbud , Amit Bermanis , Amir Averbuch

In this work we show that the classification performance of high-dimensional structural MRI data with only a small set of training examples is improved by the usage of dimension reduction methods. We assessed two different dimension…

Machine Learning · Computer Science 2015-05-27 Andreas Grünauer , Markus Vincze

We investigate whether the standard dimensionality reduction technique of PCA inadvertently produces data representations with different fidelity for two different populations. We show on several real-world data sets, PCA has higher…

Machine Learning · Computer Science 2018-11-02 Samira Samadi , Uthaipon Tantipongpipat , Jamie Morgenstern , Mohit Singh , Santosh Vempala

Tensor-valued data benefits greatly from dimension reduction as the reduction in size is exponential in the number of modes. To achieve maximal reduction without loss in information, our objective in this work is to give an automated…

Methodology · Statistics 2022-07-22 Una Radojicic , Niko Lietzen , Klaus Nordhausen , Joni Virta

Principal component analysis (PCA) is a well-known linear dimension-reduction method that has been widely used in data analysis and modeling. It is an unsupervised learning technique that identifies a suitable linear subspace for the input…

Machine Learning · Statistics 2021-09-10 Shaojie Xu , Joel Vaughan , Jie Chen , Agus Sudjianto , Vijayan Nair

It is a standard assumption that datasets in high dimension have an internal structure which means that they in fact lie on, or near, subsets of a lower dimension. In many instances it is important to understand the real dimension of the…

Machine Learning · Statistics 2025-07-21 James A. D. Binnie , Paweł Dłotko , John Harvey , Jakub Malinowski , Ka Man Yim

Dimensionality reduction (DR) methods are commonly used for analyzing and visualizing multidimensional data. However, when data is a live streaming feed, conventional DR methods cannot be directly used because of their computational…

Graphics · Computer Science 2019-10-16 Takanori Fujiwara , Jia-Kai Chou , Shilpika , Panpan Xu , Liu Ren , Kwan-Liu Ma

We present a Bayesian model selection approach to estimate the intrinsic dimensionality of a high-dimensional dataset. To this end, we introduce a novel formulation of the probabilisitic principal component analysis model based on a…

Methodology · Statistics 2019-05-22 Charles Bouveyron , Pierre Latouche , Pierre-Alexandre Mattei
‹ Prev 1 2 3 10 Next ›