Related papers: Robust mixtures in the presence of measurement err…

Cellwise outlier detection in heterogeneous populations

Real-world applications may be affected by outlying values. In the model-based clustering literature, several methodologies have been proposed to detect units that deviate from the majority of the data (rowwise outliers) and trim them from…

Methodology · Statistics 2025-05-14 Giorgia Zaccaria , Luis A. García-Escudero , Francesca Greselin , Agustín Mayo-Íscar

Weighted likelihood mixture modeling and model based clustering

A weighted likelihood approach for robust fitting of a mixture of multivariate Gaussian components is developed in this work. Two approaches have been proposed that are driven by a suitable modification of the standard EM and CEM…

Methodology · Statistics 2018-11-19 Luca Greco , Claudio Agostinelli

Robust estimation of mixtures of regressions with random covariates, via trimming and constraints

A robust estimator for a wide family of mixtures of linear regression is presented. Robustness is based on the joint adoption of the Cluster Weighted Model and of an estimator based on trimming and restrictions. The selected model provides…

Methodology · Statistics 2015-02-05 L. A. Garcia-Escudero , A. Gordaliza , F. Greselin , S. Ingrassia , A. Mayo-Iscar

Outlier Detection on Mixed-Type Data: An Energy-based Approach

Outlier detection amounts to finding data points that differ significantly from the norm. Classic outlier detection methods are largely designed for single data type such as continuous or discrete. However, real world data is increasingly…

Machine Learning · Statistics 2016-08-18 Kien Do , Truyen Tran , Dinh Phung , Svetha Venkatesh

Robust Clustering with Normal Mixture Models: A Pseudo $\beta$-Likelihood Approach

As in other estimation scenarios, likelihood based estimation in the normal mixture set-up is highly non-robust against model misspecification and presence of outliers (apart from being an ill-posed optimization problem). A robust…

Methodology · Statistics 2023-12-20 Soumya Chakraborty , Ayanendranath Basu , Abhik Ghosh

Robust Fitting of Mixture Models using Weighted Complete Estimating Equations

Mixture modeling, which considers the potential heterogeneity in data, is widely adopted for classification and clustering problems. Mixture models can be estimated using the Expectation-Maximization algorithm, which works with the complete…

Methodology · Statistics 2022-03-18 Shonosuke Sugasawa , Genya Kobayashi

Random Similarity Isolation Forests

With predictive models becoming prevalent, companies are expanding the types of data they gather. As a result, the collected datasets consist not only of simple numerical features but also more complex objects such as time series, images,…

Machine Learning · Computer Science 2025-07-01 Sebastian Chwilczyński , Dariusz Brzezinski

Detecting Outliers in High-dimensional Data with Mixed Variable Types using Conditional Gaussian Regression Models

Outlier detection has gained increasing interest in recent years, due to newly emerging technologies and the huge amount of high-dimensional data that are now available. Outlier detection can help practitioners to identify unwanted noise…

Statistics Theory · Mathematics 2021-05-20 Mads Lindskou , Torben Tvedebrink , Poul Svante Eriksen , Niels Morling

Robust Estimation in Finite Mixture Models

We observe a $n$-sample, the distribution of which is assumed to belong, or at least to be close enough, to a given mixture model. We propose an estimator of this distribution that belongs to our model and possesses some robustness…

Statistics Theory · Mathematics 2025-02-06 Alexandre Lecestre

Simultaneous Feature Selection and Outlier Detection with Optimality Guarantees

Sparse estimation methods capable of tolerating outliers have been broadly investigated in the last decade. We contribute to this research considering high-dimensional regression problems contaminated by multiple mean-shift outliers which…

Methodology · Statistics 2025-10-21 Luca Insolia , Ana Kenney , Francesca Chiaromonte , Giovanni Felici

Robust Estimation for Multivariate Wrapped Models

A weighted likelihood technique for robust estimation of a multivariate Wrapped Normal distribution for data points scattered on a p-dimensional torus is proposed. The occurrence of outliers in the sample at hand can badly compromise…

Methodology · Statistics 2021-07-01 Giovanni Saraceno , Claudio Agostinelli , Luca Greco

A Non-Iterative Quantile Change Detection Method in Mixture Model with Heavy-Tailed Components

Estimating parameters of mixture model has wide applications ranging from classification problems to estimating of complex distributions. Most of the current literature on estimating the parameters of the mixture densities are based on…

Machine Learning · Statistics 2020-06-23 Yuantong Li , Qi Ma , Sujit K. Ghosh

Robust semi-parametric mixtures of linear experts using the contaminated Gaussian distribution

Semi- and non-parametric mixture of regressions are a very useful flexible class of mixture of regressions in which some or all of the parameters are non-parametric functions of the covariates. These models are, however, based on the…

Methodology · Statistics 2026-01-21 Peterson Mambondimumwe , Sphiwe B. Skhosana , Najmeh Nakhaei Rad

Robust Linear Mixed Models using Hierarchical Gamma-Divergence

Linear mixed models (LMMs) are a popular class of methods for analyzing longitudinal and clustered data. However, such models can be sensitive to outliers, and this can lead to biased inference on model parameters and inaccurate prediction…

Methodology · Statistics 2025-03-28 Shonosuke Sugasawa , Francis K. C. Hui , Alan H. Welsh

Outlier detection for mixed-type data: A novel approach

Outlier detection can serve as an extremely important tool for researchers from a wide range of fields. From the sectors of banking and marketing to the social sciences and healthcare sectors, outlier detection techniques are very useful…

Methodology · Statistics 2023-12-12 Efthymios Costa , Ioanna Papatsouma

A Robust Regression Approach for Robot Model Learning

Machine learning and data analysis have been used in many robotics fields, especially for modelling. Data are usually the result of sensor measurements and, as such, they might be subjected to noise and outliers. The presence of outliers…

Robotics · Computer Science 2019-08-26 Francesco Cursi , Guang-Zhong Yang

High-dimensional outlier detection and variable selection via adaptive weighted mean regression

This paper proposes an adaptive penalized weighted mean regression for outlier detection of high-dimensional data. In comparison to existing approaches based on the mean shift model, the proposed estimators demonstrate robustness against…

Statistics Theory · Mathematics 2023-06-27 Jiaqi Li , Linglong Kong , Bei Jiang , Wei Tu

Outlier-Robust Multi-Group Gaussian Mixture Modeling with Flexible Group Reassignment

Do expert-defined or diagnostically-labeled data groups align with clusters inferred through statistical modeling? If not, where do discrepancies between predefined labels and model-based groupings occur and why? In this work, we introduce…

Methodology · Statistics 2026-03-18 Patricia Puchhammer , Ines Wilms , Peter Filzmoser

Robust estimation of mixing measures in finite mixture models

In finite mixture models, apart from underlying mixing measure, true kernel density function of each subpopulation in the data is, in many scenarios, unknown. Perhaps the most popular approach is to choose some kernel functions that we…

Statistics Theory · Mathematics 2017-09-26 Nhat Ho , XuanLong Nguyen , Ya'acov Ritov

Nonparametric contaminated Gaussian mixture of regressions

Semi- and non-parametric mixture of regressions are a very useful flexible class of mixture of regressions in which some or all of the parameters are non-parametric functions of the covariates. These models are, however, based on the…

Methodology · Statistics 2026-01-13 Sphiwe B. Skhosana , Weixin Yao