English
Related papers

Related papers: PC Adjusted Testing for Low Dimensional Parameters

200 papers

We study principal components regression (PCR) in an asymptotic high-dimensional regression setting, where the number of data points is proportional to the dimension. We derive exact limiting formulas for the estimation and prediction…

Statistics Theory · Mathematics 2025-09-18 Alden Green , Elad Romanov

Principal component regression (PCR) is a popular technique for fixed-design error-in-variables regression, a generalization of the linear regression setting in which the observed covariates are corrupted with random noise. We provide the…

Machine Learning · Computer Science 2024-08-06 Anish Agarwal , Keegan Harris , Justin Whitehouse , Zhiwei Steven Wu

A number of settings arise in which it is of interest to predict Principal Component (PC) scores for new observations using data from an initial sample. In this paper, we demonstrate that naive approaches to PC score prediction can be…

Statistics Theory · Mathematics 2012-11-14 Seunggeun Lee , Fei Zou , Fred A. Wright

Principal component analysis continues to be a powerful tool in dimension reduction of high dimensional data. We assume a variance-diverging model and use the high-dimension, low-sample-size asymptotics to show that even though the…

Statistics Theory · Mathematics 2020-09-28 Sungkyu Jung

We analyze the prediction error of principal component regression (PCR) and prove high probability bounds for the corresponding squared risk conditional on the design. Our first main result shows that PCR performs comparably to the oracle…

Statistics Theory · Mathematics 2024-01-03 Laura Hucker , Martin Wahl

Principal Component Analysis (PCA) is an important tool of dimension reduction especially when the dimension (or the number of variables) is very high. Asymptotic studies where the sample size is fixed, and the dimension grows [i.e., High…

Statistics Theory · Mathematics 2009-11-20 Sungkyu Jung , J. S. Marron

With the development of high-throughput technologies, principal component analysis (PCA) in the high-dimensional regime is of great interest. Most of the existing theoretical and methodological results for high-dimensional PCA are based on…

Statistics Theory · Mathematics 2019-03-11 Rounak Dey , Seunggeun Lee

We consider high-dimensional generalized linear models when the covariates are contaminated by measurement error. Estimates from errors-in-variables regression models are well-known to be biased in traditional low-dimensional settings if…

Computation · Statistics 2020-01-06 Michael Byrd , Monnie McGee

Principal component analysis (PCA) is a widespread technique for data analysis that relies on the covariance-correlation matrix of the analyzed data. However to properly work with high-dimensional data, PCA poses severe mathematical…

Quantitative Methods · Quantitative Biology 2018-10-18 Luigi Leonardo Palese

Principal component analysis is a versatile tool to reduce dimensionality which has wide applications in statistics and machine learning. It is particularly useful for modeling data in high-dimensional scenarios where the number of…

Methodology · Statistics 2022-08-18 Xiaoyu Hu , Fang Yao

Sparse Principal Component Analysis (PCA) methods are efficient tools to reduce the dimension (or the number of variables) of complex data. Sparse principal components (PCs) are easier to interpret than conventional PCs, because most…

Statistics Theory · Mathematics 2011-04-22 Dan Shen , Haipeng Shen , J. S. Marron

Big data is transforming our world, revolutionizing operations and analytics everywhere, from financial engineering to biomedical sciences. The complexity of big data often makes dimension reduction techniques necessary before conducting…

Methodology · Statistics 2018-01-08 Jianqing Fan , Qiang Sun , Wen-Xin Zhou , Ziwei Zhu

Principal component regression results in lack of fit when important dimensions are omitted, which cannot be assessed from the eigenvalues. I show that the PC-regression estimator can also suffer from increased variance relative to ordinary…

Methodology · Statistics 2023-06-30 Bert van der Veen

The first order behavior of multivariate heavy-tailed random vectors above large radial thresholds is ruled by a limit measure in a regular variation framework. For a high dimensional vector, a reasonable assumption is that the support of…

Statistics Theory · Mathematics 2019-06-27 Holger Drees , Anne Sabourin

We provide a remedy for two concerns that have dogged the use of principal components in regression: (i) principal components are computed from the predictors alone and do not make apparent use of the response, and (ii) principal components…

Methodology · Statistics 2009-06-23 R. Dennis Cook , Liliana Forzani

Principal component analysis (PCA) is arguably the most widely used approach for large-dimensional factor analysis. While it is effective when the factors are sufficiently strong, it can be inconsistent when the factors are weak and/or the…

Methodology · Statistics 2025-08-22 Zhongyuan Lyu , Ming Yuan

Principal component analysis (PCA) aims at estimating the direction of maximal variability of a high-dimensional dataset. A natural question is: does this task become easier, and estimation more accurate, when we exploit additional…

Information Theory · Computer Science 2014-06-19 Andrea Montanari , Emile Richard

We propose a new method for statistical inference in generalized linear models. In the overparameterized regime, Principal Component Regression (PCR) reduces variance by projecting high-dimensional data to a low-dimensional principal…

Machine Learning · Statistics 2026-04-27 Yixuan Florence Wu , Yilun Zhu , Lei Cao , Naichen Shi

In this paper, we develop new statistical theory for probabilistic principal component analysis models in high dimensions. The focus is the estimation of the noise variance, which is an important and unresolved issue when the number of…

Statistics Theory · Mathematics 2014-06-23 Damien Passemier , Zhaoyuan Li , Jian-Feng Yao

Factor models are a class of powerful statistical models that have been widely used to deal with dependent measurements that arise frequently from various applications from genomics and neuroscience to economics and finance. As data are…

Methodology · Statistics 2018-08-14 Jianqing Fan , Kaizheng Wang , Yiqiao Zhong , Ziwei Zhu
‹ Prev 1 2 3 10 Next ›