English

Robust Variable Selection for High-dimensional Regression with Missing Data and Measurement Errors

Methodology 2025-07-01 v3 Machine Learning

Abstract

In our paper, we focus on robust variable selection for missing data and measurement error. Missing data and measurement errors can lead to confusing data distribution. We propose an exponential loss function with a tuning parameter to apply to Missing and measurement errors data. By adjusting the parameter, the loss function can be better and more robust under various data distributions. We use inverse probability weighting and additive error models to address missing data and measurement errors. Also, we find that the Atan punishment method works better. We used Monte Carlo simulations to assess the validity of robust variable selection and validated our findings with the breast cancer dataset.

Keywords

Cite

@article{arxiv.2410.16722,
  title  = {Robust Variable Selection for High-dimensional Regression with Missing Data and Measurement Errors},
  author = {Zhenhao Zhang and Yunquan Song},
  journal= {arXiv preprint arXiv:2410.16722},
  year   = {2025}
}

Comments

I finished this work in 2023 when I was an undergraduate Student intern in the Department of Data Science and Statistics