Related papers: Robust Variable Selection for High-dimensional Reg…

Robust Variable Selection in High-dimensional Nonparametric Additive Model

Additive models belong to the class of structured nonparametric regression models that do not suffer from the curse of dimensionality. Finding the additive components that are nonzero when the true model is assumed to be sparse is an…

Methodology · Statistics 2025-05-08 Suneel Babu Chatla , Abhijit Mandal

RIGID: Robust Linear Regression with Missing Data

We present a robust framework to perform linear regression with missing entries in the features. By considering an elliptical data distribution, and specifically a multivariate normal model, we are able to conditionally formulate a…

Machine Learning · Computer Science 2022-11-10 Alireza Aghasi , MohammadJavad Feizollahi , Saeed Ghadimi

Robust Variable Selection Criteria for the Penalized Regression

We propose a robust variable selection procedure using a divergence based M-estimator combined with a penalty function. It produces robust estimates of the regression parameters and simultaneously selects the important explanatory…

Methodology · Statistics 2020-01-01 Abhijit Mandal , Samiran Ghosh

High-dimensional outlier detection and variable selection via adaptive weighted mean regression

This paper proposes an adaptive penalized weighted mean regression for outlier detection of high-dimensional data. In comparison to existing approaches based on the mean shift model, the proposed estimators demonstrate robustness against…

Statistics Theory · Mathematics 2023-06-27 Jiaqi Li , Linglong Kong , Bei Jiang , Wei Tu

An ensemble learning method for variable selection: application to high dimensional data and missing values

Standard approaches for variable selection in linear models are not tailored to deal properly with high-dimensional and incomplete data. Currently, methods dedicated to high-dimensional data handle missing values by ad-hoc strategies, like…

Methodology · Statistics 2021-06-09 Avner Bar-Hen , Vincent Audigier

Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data

When outcomes are missing for reasons beyond an investigator's control, there are two different ways to adjust a parameter estimate for covariates that may be related both to the outcome and to missingness. One approach is to model the…

Methodology · Statistics 2008-12-18 Joseph D. Y. Kang , Joseph L. Schafer

Using Instruments for Selection to Adjust for Selection Bias in Mendelian Randomization

Selection bias is a common concern in epidemiologic studies. In the literature, selection bias is often viewed as a missing data problem. Popular approaches to adjust for bias due to missing data, such as inverse probability weighting, rely…

Methodology · Statistics 2024-04-16 Apostolos Gkatzionis , Eric J. Tchetgen Tchetgen , Jon Heron , Kate Northstone , Kate Tilling

Robust location estimators in regression models with covariates and responses missing at random

This paper deals with robust marginal estimation under a general regression model when missing data occur in the response and also in some of covariates. The target is a marginal location parameter which is given through an $M-$functional.…

Methodology · Statistics 2020-05-08 Ana M. Bianco , Graciela Boente , Wenceslao González-Manteiga , Ana Pérez-González

Robust variable selection for model-based learning in presence of adulteration

The problem of identifying the most discriminating features when performing supervised learning has been extensively investigated. In particular, several methods for variable selection in model-based classification have been proposed.…

Applications · Statistics 2020-12-16 Andrea Cappozzo , Francesca Greselin , Thomas Brendan Murphy

Variable selection in doubly truncated regression

Doubly truncated data arise in many areas such as astronomy, econometrics, and medical studies. For the regression analysis with doubly truncated response variables, the existence of double truncation may bring bias for estimation as well…

Methodology · Statistics 2021-10-22 Ming Zheng , Chanjuan Lin , Wen Yu

Variable Selection for Additive Partial Linear Quantile Regression with Missing Covariates

The standard quantile regression model assumes a linear relationship at the quantile of interest and that all variables are observed. We relax these assumptions by considering a partial linear model while allowing for missing linear…

Methodology · Statistics 2016-06-07 Ben Sherwood

Doubly Robust Inference for Targeted Minimum Loss Based Estimation in Randomized Trials with Missing Outcome Data

Missing outcome data is one of the principal threats to the validity of treatment effect estimates from randomized trials. The outcome distributions of participants with missing and observed data are often different, which increases the…

Methodology · Statistics 2017-04-06 Iván Díaz , Mark J. van der Laan

Robust Estimation and Shrinkage in Ultrahigh Dimensional Expectile Regression with Heavy Tails and Variance Heterogeneity

High-dimensional data subject to heavy-tailed phenomena and heterogeneity are commonly encountered in various scientific fields and bring new challenges to the classical statistical methods. In this paper, we combine the asymmetric square…

Statistics Theory · Mathematics 2019-10-02 Jun Zhao , Guan'ao Yan , Yi Zhang

Robust location estimation with missing data

In a missing-data setting, we have a sample in which a vector of explanatory variables x_i is observed for every subject i, while scalar outcomes y_i are missing by happenstance on some individuals. In this work we propose robust estimates…

Statistics Theory · Mathematics 2010-09-20 Mariela Sued , Victor J. Yohai

Robust propensity score weighting estimation under missing at random

Missing data is frequently encountered in many areas of statistics. Propensity score weighting is a popular method for handling missing data. The propensity score method employs a response propensity model, but correct specification of the…

Methodology · Statistics 2024-03-28 Hengfang Wang , Jae Kwang Kim , Jeongseop Han , Youngjo Lee

Efficient Test-based Variable Selection for High-dimensional Linear Models

Variable selection plays a fundamental role in high-dimensional data analysis. Various methods have been developed for variable selection in recent years. Well-known examples are forward stepwise regression (FSR) and least angle regression…

Methodology · Statistics 2018-02-01 Siliang Gong , Kai Zhang , Yufeng Liu

Robust Estimation and Variable Selection for the Accelerated Failure Time Model

This paper considers robust modeling of the survival time for cancer patients. Accurate prediction can be helpful for developing therapeutic and care strategies. We propose a unified Expectation-Maximization approach combined with the…

Methodology · Statistics 2019-12-23 Yi Li , Muxuan Liang , Lu Mao , Sijian Wang

Robust adaptive Lasso in high-dimensional logistic regression

Penalized logistic regression is extremely useful for binary classification with large number of covariates (higher than the sample size), having several real life applications, including genomic disease classification. However, the…

Methodology · Statistics 2023-04-10 Ayanendranath Basu , Abhik Ghosh , María Jaenada , Leandro Pardo

Variational Bayesian Multiple Imputation in High-Dimensional Regression Models With Missing Responses

Multiple imputation has become one of the standard methods in drawing inferences in many incomplete data applications. Applications of multiple imputation in relatively more complex settings, such as high-dimensional clustered data, require…

Methodology · Statistics 2025-04-08 Qiushuang Li , Recai Yucel

Variable selection with missing data in both covariates and outcomes: Imputation and machine learning

The missing data issue is ubiquitous in health studies. Variable selection in the presence of both missing covariates and outcomes is an important statistical research topic but has been less studied. Existing literature focuses on…

Methodology · Statistics 2021-07-09 Liangyuan Hu , Jung-Yi Joyce Lin , Jiayi Ji