English
Related papers

Related papers: Robust approach for variable selection with high d…

200 papers

The paper considers variable selection in linear regression models where the number of covariates is possibly much larger than the number of observations. High dimensionality of the data brings in many complications, such as (possibly…

Methodology · Statistics 2016-11-29 Haeran Cho , Piotr Fryzlewicz

Determining the number of factors in high-dimensional factor modeling is essential but challenging, especially when the data are heavy-tailed. In this paper, we introduce a new estimator based on the spectral properties of Spearman sample…

Methodology · Statistics 2024-08-29 Jiaxin Qiu , Zeng Li , Jianfeng Yao

It is of importance to develop statistical techniques to analyze high-dimensional data in the presence of both complex dependence and possible outliers in real-world applications such as imaging data analyses. We propose a new robust…

Methodology · Statistics 2021-10-01 Bingyuan Liu , Qi Zhang , Lingzhou Xue , Peter X. K. Song , Jian Kang

A ubiquitous feature of data of our era is their extra-large sizes and dimensions. Analyzing such high-dimensional data poses significant challenges, since the feature dimension is often much larger than the sample size. This thesis…

Statistics Theory · Mathematics 2025-09-11 Kai Yang

We herein propose a new robust estimation method based on random projections that is adaptive and, automatically produces a robust estimate, while enabling easy computations for high or infinite dimensional data. Under some restricted…

Methodology · Statistics 2023-12-29 Ricardo Fraiman , Marcela Svarc

Estimating covariance matrices with high-dimensional complex data presents significant challenges, particularly concerning positive definiteness, sparsity, and numerical stability. Existing robust sparse estimators often fail to guarantee…

Methodology · Statistics 2025-12-30 Shaoxin Wang , Ziyun Ma

This paper provides the relevant literature with a complete toolkit for conducting robust estimation and inference about the parameters of interest involved in a high-dimensional panel data framework. Specifically, (1) we allow for…

Econometrics · Economics 2025-02-13 Jiti Gao , Fei Liu , Bin Peng , Yayi Yan

We provide a novel -- and to the best of our knowledge, the first -- algorithm for high dimensional sparse regression with constant fraction of corruptions in explanatory and/or response variables. Our algorithm recovers the true sparse…

Machine Learning · Computer Science 2019-05-31 Liu Liu , Yanyao Shen , Tianyang Li , Constantine Caramanis

Statistical learning evolves quickly with more and more sophisticated models proposed to incorporate the complicated data structure from modern scientific and business problems. Varying index coefficient models extend varying coefficient…

Statistics Theory · Mathematics 2019-03-05 Li Jialiang , Lv Jing

Due to the highly non-convex nature of large-scale robust parameter estimation, avoiding poor local minima is challenging in real-world applications where input data is contaminated by a large or unknown fraction of outliers. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2020-03-23 Huu Le , Christopher Zach

Cellwise outliers are widespread in data and traditional robust methods may fail when applied to datasets under such contamination. We propose a variable selection procedure, that uses a pairwise robust estimator to obtain an initial…

Methodology · Statistics 2023-09-06 Peng Su , Garth Tarr , Samuel Muller

We propose a robust variable selection procedure using a divergence based M-estimator combined with a penalty function. It produces robust estimates of the regression parameters and simultaneously selects the important explanatory…

Methodology · Statistics 2020-01-01 Abhijit Mandal , Samiran Ghosh

This paper studies the case of possibly high-dimensional covariates in the regression discontinuity design (RDD) analysis. In particular, we propose estimation and inference methods for the RDD models with covariate selection which perform…

Econometrics · Economics 2026-01-21 Yoichi Arai , Taisuke Otsu , Myung Hwan Seo

Logistic regression involving high-dimensional covariates is a practically important problem. Often the goal is variable selection, i.e., determining which few of the many covariates are associated with the binary response. Unfortunately,…

Computation · Statistics 2025-02-18 Yiqi Tang , Ryan Martin

This paper tackles the problem of robust covariance matrix estimation when the data is incomplete. Classical statistical estimation methodologies are usually built upon the Gaussian assumption, whereas existing robust estimation ones assume…

We address high dimensional covariance estimation for elliptical distributed samples, which are also known as spherically invariant random vectors (SIRV) or compound-Gaussian processes. Specifically we consider shrinkage methods that are…

Methodology · Statistics 2015-05-20 Yilun Chen , Ami Wiesel , Alfred O. Hero

We consider the problem of online active learning to collect data for regression modeling. Specifically, we consider a decision maker with a limited experimentation budget who must efficiently learn an underlying linear population model.…

Machine Learning · Statistics 2016-12-22 Carlos Riquelme , Ramesh Johari , Baosen Zhang

We propose a nonconvex estimator for joint multivariate regression and precision matrix estimation in the high dimensional regime, under sparsity constraints. A gradient descent algorithm with hard thresholding is developed to solve the…

Machine Learning · Statistics 2016-06-03 Jinghui Chen , Quanquan Gu

High-dimensional multivariate longitudinal data, which arise when many outcome variables are measured repeatedly over time, are becoming increasingly common in social, behavioral and health sciences. We propose a latent variable model for…

Methodology · Statistics 2025-12-09 Sze Ming Lee , Yunxiao Chen , Tony Sit

We propose generalized additive partial linear models for complex data which allow one to capture nonlinear patterns of some covariates, in the presence of linear components. The proposed method improves estimation efficiency and increases…

Statistics Theory · Mathematics 2014-05-26 Li Wang , Lan Xue , Annie Qu , Hua Liang
‹ Prev 1 2 3 10 Next ›