English
Related papers

Related papers: Computing Robust Leverage Diagnostics when the Des…

200 papers

With rapid advances in information technology, massive datasets are collected in all fields of science, such as biology, chemistry, and social science. Useful or meaningful information is extracted from these data often through statistical…

Methodology · Statistics 2021-09-22 Wenxuan Zhong , Yiwen Liu , Peng Zeng

The dependency structure of multivariate data can be analyzed using the covariance matrix $\Sigma$. In many fields the precision matrix $\Sigma^{-1}$ is even more informative. As the sample covariance estimator is singular in…

Methodology · Statistics 2015-06-04 Viktoria Öllerer , Christophe Croux

Penalized logistic regression is extremely useful for binary classification with large number of covariates (higher than the sample size), having several real life applications, including genomic disease classification. However, the…

Methodology · Statistics 2023-04-10 Ayanendranath Basu , Abhik Ghosh , María Jaenada , Leandro Pardo

Cellwise outliers are widespread in data and traditional robust methods may fail when applied to datasets under such contamination. We propose a variable selection procedure, that uses a pairwise robust estimator to obtain an initial…

Methodology · Statistics 2023-09-06 Peng Su , Garth Tarr , Samuel Muller

A robust estimator is proposed for the parameters that characterize the linear regression problem. It is based on the notion of shrinkages, often used in Finance and previously studied for outlier detection in multivariate data. A thorough…

Methodology · Statistics 2020-02-07 Elisa Cabana , Rosa E. Lillo , Henry Laniado

Datasets containing both categorical and continuous variables are frequently encountered in many areas, and with the rapid development of modern measurement technologies, the dimensions of these variables can be very high. Despite the…

Methodology · Statistics 2024-01-03 Binyan Jiang , Chenlei Leng , Cheng Wang , Zhongqing Yang , Xinyang Yu

The robustness of classifiers has become a question of paramount importance in the past few years. Indeed, it has been shown that state-of-the-art deep learning architectures can easily be fooled with imperceptible changes to their inputs.…

Computer Vision and Pattern Recognition · Computer Science 2020-06-12 Théo Giraudon , Vincent Gripon , Matthias Löwe , Franck Vermet

High-dimensional data subject to heavy-tailed phenomena and heterogeneity are commonly encountered in various scientific fields and bring new challenges to the classical statistical methods. In this paper, we combine the asymmetric square…

Statistics Theory · Mathematics 2019-10-02 Jun Zhao , Guan'ao Yan , Yi Zhang

This paper tackles the problem of robust covariance matrix estimation when the data is incomplete. Classical statistical estimation methodologies are usually built upon the Gaussian assumption, whereas existing robust estimation ones assume…

Among semiparametric regression models, partially linear additive models provide a useful tool to include additive nonparametric components as well as a parametric component, when explaining the relationship between the response and a set…

Methodology · Statistics 2024-02-01 Graciela Boente , Alejandra Martínez

Errors-in-variables is a long-standing, difficult issue in linear regression; and progress depends in part on new identifying assumptions. I characterize measurement error as bad-leverage points and assume that fewer than half the sample…

Econometrics · Economics 2020-03-17 Eric Blankmeyer

Estimating covariance matrices with high-dimensional complex data presents significant challenges, particularly concerning positive definiteness, sparsity, and numerical stability. Existing robust sparse estimators often fail to guarantee…

Methodology · Statistics 2025-12-30 Shaoxin Wang , Ziyun Ma

A robust estimator for a wide family of mixtures of linear regression is presented. Robustness is based on the joint adoption of the Cluster Weighted Model and of an estimator based on trimming and restrictions. The selected model provides…

Methodology · Statistics 2015-02-05 L. A. Garcia-Escudero , A. Gordaliza , F. Greselin , S. Ingrassia , A. Mayo-Iscar

The parameters of the log-logistic distribution are generally estimated based on classical methods such as maximum likelihood estimation, whereas these methods usually result in severe biased estimates when the data contain outliers. In…

Methodology · Statistics 2022-09-16 Zhuanzhuan Ma , Min Wang , Chanseok Park

This paper addresses the problem of providing robust estimators under a functional logistic regression model. Logistic regression is a popular tool in classification problems with two populations. As in functional linear regression,…

Methodology · Statistics 2023-08-16 Graciela Boente , Marina Valdora

The effectiveness of supervised learning techniques has made them ubiquitous in research and practice. In high-dimensional settings, supervised learning commonly relies on dimensionality reduction to improve performance and identify the…

Machine Learning · Computer Science 2016-08-11 Chang Liu , Bo Li , Yevgeniy Vorobeychik , Alina Oprea

We propose a robust variable selection procedure using a divergence based M-estimator combined with a penalty function. It produces robust estimates of the regression parameters and simultaneously selects the important explanatory…

Methodology · Statistics 2020-01-01 Abhijit Mandal , Samiran Ghosh

We propose a residual randomization procedure designed for robust Lasso-based inference in the high-dimensional setting. Compared to earlier work that focuses on sub-Gaussian errors, the proposed procedure is designed to work robustly in…

Methodology · Statistics 2021-08-20 Y. Samuel Wang , Si Kai Lee , Panos Toulis , Mladen Kolar

Robust design is one of the main tools employed by engineers for the facilitation of the design of high-quality processes. However, most real-world processes invariably contend with external uncontrollable factors, often denoted as outliers…

Methodology · Statistics 2023-09-12 Xuehong Gao , Zhijin Chen , Bosung Kim , Chanseok Park

Recent advances in deep learning have achieved impressive gains in classification accuracy on a variety of types of data, including images and text. Despite these gains, however, concerns have been raised about the calibration, robustness,…

Machine Learning · Computer Science 2018-11-20 Dallas Card , Michael Zhang , Noah A. Smith
‹ Prev 1 2 3 10 Next ›