English
Related papers

Related papers: Robust score matching for compositional data

200 papers

Missing data is frequently encountered in many areas of statistics. Propensity score weighting is a popular method for handling missing data. The propensity score method employs a response propensity model, but correct specification of the…

Methodology · Statistics 2024-03-28 Hengfang Wang , Jae Kwang Kim , Jeongseop Han , Youngjo Lee

Compositional data and multivariate count data with known totals are challenging to analyse due to the non-negativity and sum-to-one constraints on the sample space. It is often the case that many of the compositional components are highly…

Methodology · Statistics 2020-12-24 Janice L. Scealy , Andrew T. A. Wood

Applications such as the analysis of microbiome data have led to renewed interest in statistical methods for compositional data, i.e., multivariate data in the form of probability vectors that contain relative proportions. In particular,…

Methodology · Statistics 2021-09-13 Shiqing Yu , Mathias Drton , Ali Shojaie

Statistical analysis on compositional data has gained a lot of attention due to their great potential of applications. A feature of these data is that they are multivariate vectors that lie in the simplex, that is, the components of each…

Many biological high-throughput data sets, such as targeted amplicon-based and metagenomic sequencing data, are compositional in nature. A common exploratory data analysis task is to infer statistical associations between the…

Methodology · Statistics 2020-07-28 Aditya Mishra , Christian L. Muller

We develop a method to generate prediction sets with a guaranteed coverage rate that is robust to corruptions in the training data, such as missing or noisy variables. Our approach builds on conformal prediction, a powerful framework to…

Machine Learning · Computer Science 2025-01-10 Shai Feldman , Yaniv Romano

A class of robust estimators of scatter applied to information-plus-impulsive noise samples is studied, where the sample information matrix is assumed of low rank; this generalizes the study of (Couillet et al., 2013b) to spiked random…

Probability · Mathematics 2014-05-01 Romain Couillet

Conformal prediction provides finite-sample, distribution-free coverage under exchangeability, but standard constructions may lack robustness in the presence of outliers or heavy tails. We propose a robust conformal method based on a…

Statistics Theory · Mathematics 2026-04-21 Alejandro Cholaquidis , Emilien Joly , Leonardo Moreno

Proposed in Hyv\"arinen (2005), score matching is a parameter estimation procedure that does not require computation of distributional normalizing constants. In this work we utilize the geometric median of means to develop a robust score…

Machine Learning · Statistics 2025-06-23 Richard Schwank , Andrew McCormack , Mathias Drton

In data analysis, contamination caused by outliers is inevitable, and robust statistical methods are strongly demanded. In this paper, our concern is to develop a new approach for robust data analysis based on scoring rules. The scoring…

Statistics Theory · Mathematics 2013-11-22 Takafumi Kanamori , Hironori Fujisawa

Using offline observational data for policy evaluation and learning allows decision-makers to evaluate and learn a policy that connects characteristics and interventions. Most existing literature has focused on either discrete treatment…

Artificial Intelligence · Computer Science 2025-01-22 Cheuk Hang Leung , Yiyan Huang , Yijun Li , Qi Wu

Distributionally robust policy learning aims to find a policy that performs well under the worst-case distributional shift, and yet most existing methods for robust policy learning consider the worst-case joint distribution of the covariate…

Machine Learning · Computer Science 2025-06-03 Jingyuan Wang , Zhimei Ren , Ruohan Zhan , Zhengyuan Zhou

Robust estimation provides essential tools for analyzing data that contain outliers, ensuring that statistical models remain reliable even in the presence of some anomalous data. While robust methods have long been available in R, users of…

Computation · Statistics 2024-11-05 Sarah Leyder , Jakob Raymaekers , Peter J. Rousseeuw , Thomas Servotte , Tim Verdonck

Additive regression models have a long history in multivariate nonparametric regression. They provide a model in which each regression function depends only on a single explanatory variable allowing to obtain estimators at the optimal…

Methodology · Statistics 2015-09-16 Graciela Boente , Alejandra Martinez

We propose a robust variable selection procedure using a divergence based M-estimator combined with a penalty function. It produces robust estimates of the regression parameters and simultaneously selects the important explanatory…

Methodology · Statistics 2020-01-01 Abhijit Mandal , Samiran Ghosh

The best subset selection (or "best subsets") estimator is a classic tool for sparse regression, and developments in mathematical optimization over the past decade have made it more computationally tractable than ever. Notwithstanding its…

Methodology · Statistics 2022-01-11 Ryan Thompson

We formalize notions of robustness for composite estimators via the notion of a breakdown point. A composite estimator successively applies two (or more) estimators: on data decomposed into disjoint parts, it applies the first estimator on…

Machine Learning · Computer Science 2016-09-06 Pingfan Tang , Jeff M. Phillips

In many supervised learning applications, the response consists of both continuous and binary outcomes. Studies have shown that jointly modeling such mixed-type responses can substantially improve predictive performance compared to separate…

Methodology · Statistics 2026-03-13 Yu Wang , Ran Jin , Lulu Kang

Learning compact and interpretable representations is a very natural task, which has not been solved satisfactorily even for simple binary datasets. In this paper, we review various ways of composing experts for binary data and argue that…

Computer Vision and Pattern Recognition · Computer Science 2016-11-01 Marc Goessling , Yali Amit

We observe a $n$-sample, the distribution of which is assumed to belong, or at least to be close enough, to a given mixture model. We propose an estimator of this distribution that belongs to our model and possesses some robustness…

Statistics Theory · Mathematics 2025-02-06 Alexandre Lecestre
‹ Prev 1 2 3 10 Next ›