Related papers: RIGID: Robust Linear Regression with Missing Data

Sparse Linear Regression With Missing Data

This paper proposes a fast and accurate method for sparse regression in the presence of missing data. The underlying statistical model encapsulates the low-dimensional structure of the incomplete data matrix and the sparsity of the…

Machine Learning · Statistics 2015-03-31 Ravi Ganti , Rebecca M. Willett

Robust reduced rank regression under heavy-tailed noise and missing data via non-convex penalization

Reduced rank regression (RRR) is a fundamental tool for modeling multiple responses through low-dimensional latent structures, offering both interpretability and strong predictive performance in high-dimensional settings. Classical RRR…

Methodology · Statistics 2026-01-01 The Tien Mai

A Randomized Missing Data Approach to Robust Filtering and Forecasting

We put forward a simple new randomized missing data (RMD) approach to robust filtering of state-space models, motivated by the idea that the inclusion of only a small fraction of available highly precise measurements can still extract most…

Methodology · Statistics 2022-10-21 Dobrislav Dobrev , Derek Hansen , Pawel Szerszen

Robust Nonparametric Regression for Compositional Data: the Simplicial--Real case

Statistical analysis on compositional data has gained a lot of attention due to their great potential of applications. A feature of these data is that they are multivariate vectors that lie in the simplex, that is, the components of each…

Methodology · Statistics 2025-05-22 Ana M. Bianco , Graciela Boente , Wenceslao González--Manteiga , Francisco Gude Sampedro , Ana Pérez--González

Robust linear regression with broad distributions of errors

We consider the problem of linear fitting of noisy data in the case of broad (say $\alpha$-stable) distributions of random impacts ("noise"), which can lack even the first moment. This situation, common in statistical physics of small…

Data Analysis, Statistics and Probability · Physics 2015-05-27 Eugene B. Postnikov , Igor M. Sokolov

A User-Friendly Computational Framework for Robust Structured Regression with the L$_2$ Criterion

We introduce a user-friendly computational framework for implementing robust versions of a wide variety of structured regression methods with the L$_{2}$ criterion. In addition to introducing an algorithm for performing L$_{2}$E regression,…

Computation · Statistics 2021-09-15 Jocelyn T. Chi , Eric C. Chi

Conformal Inference For Missing Data under Multiple Robust Learning

We develop a novel approach to tackle the common but challenging problem of conformal inference for missing data in machine learning, focusing on Missing at Random (MAR) data. We propose a new procedure Conformal prediction for Missing data…

Methodology · Statistics 2025-10-22 Wenlu Tang , Hongni Wang , Xingcai Zhou , Bei Jiang , Linglong Kong

Robust Optimal Designs when Missing Data Happen at Random

In this article, we investigate the robust optimal design problem for the prediction of response when the fitted regression models are only approximately specified, and observations might be missing completely at random. The intuitive idea…

Methodology · Statistics 2022-10-19 Rui Hu , Ion Bica , Zhichun Zhai

Robust High-Dimensional Linear Regression

The effectiveness of supervised learning techniques has made them ubiquitous in research and practice. In high-dimensional settings, supervised learning commonly relies on dimensionality reduction to improve performance and identify the…

Machine Learning · Computer Science 2016-08-11 Chang Liu , Bo Li , Yevgeniy Vorobeychik , Alina Oprea

Robust Variable Selection for High-dimensional Regression with Missing Data and Measurement Errors

In our paper, we focus on robust variable selection for missing data and measurement error. Missing data and measurement errors can lead to confusing data distribution. We propose an exponential loss function with a tuning parameter to…

Methodology · Statistics 2025-07-01 Zhenhao Zhang , Yunquan Song

A Hypergradient Approach to Robust Regression without Correspondence

We consider a variant of regression problem, where the correspondence between input and output data is not available. Such shuffled data is commonly observed in many real world problems. Taking flow cytometry as an example, the measuring…

Machine Learning · Computer Science 2021-02-12 Yujia Xie , Yixiu Mao , Simiao Zuo , Hongteng Xu , Xiaojing Ye , Tuo Zhao , Hongyuan Zha

Robust location estimation with missing data

In a missing-data setting, we have a sample in which a vector of explanatory variables x_i is observed for every subject i, while scalar outcomes y_i are missing by happenstance on some individuals. In this work we propose robust estimates…

Statistics Theory · Mathematics 2010-09-20 Mariela Sued , Victor J. Yohai

Robust Matrix Regression

Modern technologies are producing datasets with complex intrinsic structures, and they can be naturally represented as matrices instead of vectors. To preserve the latent data structures during processing, modern regression approaches…

Machine Learning · Computer Science 2016-11-16 Hang Zhang , Fengyuan Zhu , Shixin Li

Adaptive Optimization for Prediction with Missing Data

When training predictive models on data with missing entries, the most widely used and versatile approach is a pipeline technique where we first impute missing entries and then compute predictions. In this paper, we view prediction with…

Machine Learning · Computer Science 2025-02-25 Dimitris Bertsimas , Arthur Delarue , Jean Pauphilet

Robust Regression over Averaged Uncertainty

We propose a new formulation of robust regression by integrating all realizations of the uncertainty set and taking an averaged approach to obtain the optimal solution for the ordinary least squares regression problem. We show that this…

Machine Learning · Computer Science 2024-10-10 Dimitris Bertsimas , Yu Ma

Robust functional regression based on principal components

Functional data analysis is a fast evolving branch of modern statistics and the functional linear model has become popular in recent years. However, most estimation methods for this model rely on generalized least squares procedures and…

Methodology · Statistics 2020-06-24 Ioannis Kalogridis , Stefan Van Aelst

RIFLE: Imputation and Robust Inference from Low Order Marginals

The ubiquity of missing values in real-world datasets poses a challenge for statistical inference and can prevent similar datasets from being analyzed in the same study, precluding many existing datasets from being used for new analyses.…

Machine Learning · Computer Science 2023-09-14 Sina Baharlouei , Kelechi Ogudu , Sze-chuan Suen , Meisam Razaviyayn

On identification in ill-posed linear regression

A novel framework is introduced to formalize identifiability in well-specified but ill-posed linear regression models. The framework is distribution-free and accommodates highly correlated features that may or may not relate to the…

Statistics Theory · Mathematics 2026-03-05 Gianluca Finocchio , Tatyana Krivobokova

What to Expect of Classifiers? Reasoning about Logistic Regression with Missing Features

While discriminative classifiers often yield strong predictive performance, missing feature values at prediction time can still be a challenge. Classifiers may not behave as expected under certain ways of substituting the missing values,…

Machine Learning · Computer Science 2019-06-04 Pasha Khosravi , Yitao Liang , YooJung Choi , Guy Van den Broeck

Robust Linear Regression for General Feature Distribution

We investigate robust linear regression where data may be contaminated by an oblivious adversary, i.e., an adversary than may know the data distribution but is otherwise oblivious to the realizations of the data samples. This model has been…

Machine Learning · Computer Science 2022-02-07 Tom Norman , Nir Weinberger , Kfir Y. Levy