English
Related papers

Related papers: Cellwise Robust Discriminant Analysis

200 papers

Quadratic and Linear Discriminant Analysis (QDA/LDA) are the most often applied classification rules under normality. In QDA, a separate covariance matrix is estimated for each group. If there are more variables than observations in the…

Methodology · Statistics 2016-12-26 Stéphanie Aerts , Ines Wilms

Discriminant analysis (DA) is one of the most popular methods for classification due to its conceptual simplicity, low computational cost, and often solid performance. In its standard form, DA uses the arithmetic mean and sample covariance…

Methodology · Statistics 2026-05-12 Mia Hubert , Jakob Raymaekers , Peter J. Rousseeuw

In statistics and machine learning, the traditional meaning of the terms `outlier' and `anomaly' is a case in the dataset that behaves differently from the bulk of the data. This raises suspicion that it may belong to a different…

Methodology · Statistics 2026-04-17 Mia Hubert , Jakob Raymaekers , Peter J. Rousseeuw

Principal component analysis (PCA) is a fundamental tool for analyzing multivariate data. Here the focus is on dimension reduction to the principal subspace, characterized by its projection matrix. The classical principal subspace can be…

Methodology · Statistics 2026-05-29 Fabio Centofanti , Mia Hubert , Peter J. Rousseeuw

We propose a data-analytic method for detecting cellwise outliers. Given a robust covariance matrix, outlying cells (entries) in a row are found by the cellHandler technique which combines lasso regression with a stepwise application of…

Methodology · Statistics 2024-07-08 Jakob Raymaekers , Peter J. Rousseeuw

The usual Minimum Covariance Determinant (MCD) estimator of a covariance matrix is robust against casewise outliers. These are cases (that is, rows of the data matrix) that behave differently from the majority of cases, raising suspicion…

Methodology · Statistics 2024-07-08 Jakob Raymaekers , Peter J. Rousseeuw

Multivariate linear regression is a fundamental statistical task, but classical estimators such as ordinary least squares are highly sensitive to outliers. These may occur as casewise outliers that affect entire observations, or as outlying…

Methodology · Statistics 2026-05-11 Fabio Centofanti , Mia Hubert , Peter J. Rousseeuw

It is well-known that real data often contain outliers. The term outlier typically refers to a case, that is, a row of the $n \times d$ data matrix. In recent times a different type has come into focus, the cellwise outliers. These are…

Methodology · Statistics 2024-07-08 Jakob Raymaekers , Peter J. Rousseeuw

Cellwise outliers are likely to occur together with casewise outliers in modern data sets with relatively large dimension. Recent work has shown that traditional robust regression methods may fail for data sets in this paradigm. The…

Statistics Theory · Mathematics 2016-12-28 Andy Leung , Hongyang Zhang , Ruben H. Zamar

When applying a statistical method in practice it often occurs that some observations deviate from the usual assumptions. However, many classical methods are sensitive to outliers. The goal of robust statistics is to develop methods that…

Methodology · Statistics 2008-08-06 Mia Hubert , Peter J. Rousseeuw , Stefan Van Aelst

The sample covariance matrix is a cornerstone of multivariate statistics, but it is highly sensitive to outliers. These can be casewise outliers, such as cases belonging to a different population, or cellwise outliers, which are deviating…

Methodology · Statistics 2025-05-27 Fabio Centofanti , Mia Hubert , Peter J. Rousseeuw

Quadratic discriminant analysis (QDA) is a widely used classification technique. Based on a training dataset, each class in the data is characterized by an estimate of its center and shape, which can then be used to assign unseen…

Methodology · Statistics 2021-01-13 Iwein Vranckx , Jakob Raymaekers , Bart De Ketelaere , Peter J. Rousseeuw , Mia Hubert

Multivariate location and scatter matrix estimation is a cornerstone in multivariate data analysis. We consider this problem when the data may contain independent cellwise and casewise outliers. Flat data sets with a large number of…

Statistics Theory · Mathematics 2014-06-24 Claudio Agostinelli , Andy Leung , Victor J. Yohai , Ruben H. Zamar

In this paper, we propose a new variant of Linear Discriminant Analysis to overcome underlying drawbacks of traditional LDA and other LDA variants targeting problems involving imbalanced classes. Traditional LDA sets assumptions related to…

Computer Vision and Pattern Recognition · Computer Science 2018-02-20 Lei Xu , Alexandros Iosifidis , Moncef Gabbouj

Cellwise outliers are widespread in data and traditional robust methods may fail when applied to datasets under such contamination. We propose a variable selection procedure, that uses a pairwise robust estimator to obtain an initial…

Methodology · Statistics 2023-09-06 Peng Su , Garth Tarr , Samuel Muller

Real data often contain anomalous cases, also known as outliers. These may spoil the resulting analysis but they may also contain valuable information. In either case, the ability to detect such anomalies is essential. A useful tool for…

Machine Learning · Statistics 2021-01-13 Peter J. Rousseeuw , Mia Hubert

A popular approach for comparing gene expression levels between (replicated) conditions of RNA sequencing data relies on counting reads that map to features of interest. Within such count-based methods, many flexible and advanced…

Quantitative Methods · Quantitative Biology 2014-03-17 Xiaobei Zhou , Helen Lindsay , Mark D. Robinson

Linear Discriminant Analysis (LDA) is a fundamental method for classification. Its simple linear structure facilitates interpretation, and it is naturally suited to multi-class settings. LDA is also closely connected to several classical…

Methodology · Statistics 2026-04-09 Xin Bing , Bingqing Li , Marten Wegkamp

In diagnostic test accuracy meta-analysis (DTA-MA), standard inference methods using bivariate random-effects models for jointly synthesizing sensitivity and specificity can be sensitive to outlying studies and may yield misleading…

Methodology · Statistics 2026-05-01 Kotaro Sasaki , Hisashi Noma , Theodoros Evrenoglou

Multilinear Principal Component Analysis (MPCA) is an important tool for analyzing tensor data. It performs dimension reduction similar to PCA for multivariate data. However, standard MPCA is sensitive to outliers. It is highly influenced…

Methodology · Statistics 2026-03-18 Mehdi Hirari , Fabio Centofanti , Mia Hubert , Stefan Van Aelst
‹ Prev 1 2 3 10 Next ›