Related papers: Trimmed Density Ratio Estimation

Interpreting Outliers: Localized Logistic Regression for Density Ratio Estimation

We propose an inlier-based outlier detection method capable of both identifying the outliers and explaining why they are outliers, by identifying the outlier-specific features. Specifically, we employ an inlier-based outlier detection…

Machine Learning · Statistics 2017-02-22 Makoto Yamada , Song Liu , Samuel Kaski

Gaining Outlier Resistance with Progressive Quantiles: Fast Algorithms and Theoretical Studies

Outliers widely occur in big-data applications and may severely affect statistical estimation and inference. In this paper, a framework of outlier-resistant estimation is introduced to robustify an arbitrarily given loss function. It has a…

Methodology · Statistics 2023-04-20 Yiyuan She , Zhifeng Wang , Jiahui Shen

A General Family of Trimmed Estimators for Robust High-dimensional Data Analysis

We consider the problem of robustifying high-dimensional structured estimation. Robust techniques are key in real-world applications which often involve outliers and data corruption. We focus on trimmed versions of structurally regularized…

Machine Learning · Statistics 2017-08-22 Eunho Yang , Aurelie Lozano , Aleksandr Aravkin

High Dimensional Multivariate Regression and Precision Matrix Estimation via Nonconvex Optimization

We propose a nonconvex estimator for joint multivariate regression and precision matrix estimation in the high dimensional regime, under sparsity constraints. A gradient descent algorithm with hard thresholding is developed to solve the…

Machine Learning · Statistics 2016-06-03 Jinghui Chen , Quanquan Gu

High-Dimensional Robust Mean Estimation via Gradient Descent

We study the problem of high-dimensional robust mean estimation in the presence of a constant fraction of adversarial outliers. A recent line of work has provided sophisticated polynomial-time algorithms for this problem with…

Machine Learning · Computer Science 2020-05-05 Yu Cheng , Ilias Diakonikolas , Rong Ge , Mahdi Soltanolkotabi

Robust subset selection

The best subset selection (or "best subsets") estimator is a classic tool for sparse regression, and developments in mathematical optimization over the past decade have made it more computationally tractable than ever. Notwithstanding its…

Methodology · Statistics 2022-01-11 Ryan Thompson

Robust Mean Estimation in High Dimensions: An Outlier Fraction Agnostic and Efficient Algorithm

The problem of robust mean estimation in high dimensions is studied, in which a certain fraction (less than half) of the datapoints can be arbitrarily corrupted. Motivated by compressive sensing, the robust mean estimation problem is…

Applications · Statistics 2022-12-08 Aditya Deshmukh , Jing Liu , Venugopal V. Veeravalli

Data-adaptive trimming of the Hill estimator and detection of outliers in the extremes of heavy-tailed data

We introduce a trimmed version of the Hill estimator for the index of a heavy-tailed distribution, which is robust to perturbations in the extreme order statistics. In the ideal Pareto setting, the estimator is essentially finite-sample…

Methodology · Statistics 2018-08-24 Shrijita Bhattacharya , Michael Kallitsis , Stilian Stoev

Robust Learning of Trimmed Estimators via Manifold Sampling

We adapt a manifold sampling algorithm for the nonsmooth, nonconvex formulations of learning that arise when imposing robustness to outliers present in the training data. We demonstrate the approach on objectives based on trimmed loss.…

Optimization and Control · Mathematics 2018-07-10 Matt Menickelly , Stefan M. Wild

Minimum Local Distance Density Estimation

We present a local density estimator based on first order statistics. To estimate the density at a point, $x$, the original sample is divided into subsets and the average minimum sample distance to $x$ over all such subsets is used to…

Methodology · Statistics 2014-12-10 Vikram V. Garg , Luis Tenorio , Karen Willcox

Nonparametric density estimation by histogram trend filtering

We propose a novel approach for density estimation called histogram trend filtering. Our estimator arises from looking at surrogate Poisson model for counts of observations in a partition of the support of the data. We begin by showing…

Methodology · Statistics 2016-02-09 Oscar Hernan Madrid Padilla , James G. Scott

Deep density ratio estimation for change point detection

In this work, we propose new objective functions to train deep neural network based density ratio estimators and apply it to a change point detection problem. Existing methods use linear combinations of kernels to approximate the density…

Machine Learning · Computer Science 2019-05-27 Haidar Khan , Lara Marcuse , Bülent Yener

Meta-Learning for Relative Density-Ratio Estimation

The ratio of two probability densities, called a density-ratio, is a vital quantity in machine learning. In particular, a relative density-ratio, which is a bounded extension of the density-ratio, has received much attention due to its…

Machine Learning · Statistics 2021-07-05 Atsutoshi Kumagai , Tomoharu Iwata , Yasuhiro Fujiwara

Estimating Unbounded Density Ratios: Applications in Error Control under Covariate Shift

The density ratio is an important metric for evaluating the relative likelihood of two probability distributions, with extensive applications in statistics and machine learning. However, existing estimation theories for density ratios often…

Machine Learning · Statistics 2025-04-03 Shuntuo Xu , Zhou Yu , Jian Huang

Nonconvex Low-Rank Matrix Recovery with Arbitrary Outliers via Median-Truncated Gradient Descent

Recent work has demonstrated the effectiveness of gradient descent for directly recovering the factors of low-rank matrices from random linear measurements in a globally convergent manner when initialized properly. However, the performance…

Information Theory · Computer Science 2017-09-26 Yuanxin Li , Yuejie Chi , Huishuai Zhang , Yingbin Liang

Outlier-Robust Convex Segmentation

We derive a convex optimization problem for the task of segmenting sequential data, which explicitly treats presence of outliers. We describe two algorithms for solving this problem, one exact and one a top-down novel approach, and we…

Machine Learning · Computer Science 2014-11-19 Itamar Katz , Koby Crammer

A new method for estimation and model selection: $\rho$-estimation

The aim of this paper is to present a new estimation procedure that can be applied in many statistical frameworks including density and regression and which leads to both robust and optimal (or nearly optimal) estimators. In density…

Statistics Theory · Mathematics 2017-01-23 Yannick Baraud , Lucien Birgé , Mathieu Sart

Density estimation with quadratic loss: a confidence intervals method

In a previous article, a least square regression estimation procedure was proposed: first, we condiser a family of functions and study the properties of an estimator in every unidimensionnal model defined by one of these functions; we then…

Statistics Theory · Mathematics 2007-06-13 Pierre Alquier

Robust density estimation with the $\mathbb{L}_{1}$-loss. Applications to the estimation of a density on the line satisfying a shape constraint

We solve the problem of estimating the distribution of presumed i.i.d. observations for the total variation loss. Our approach is based on density models and is versatile enough to cope with many different ones, including some density…

Statistics Theory · Mathematics 2024-01-05 Y. Baraud , H. Halconruy , G. Maillard

High-Dimensional Density Ratio Estimation with Extensions to Approximate Likelihood Computation

The ratio between two probability density functions is an important component of various tasks, including selection bias correction, novelty detection and classification. Recently, several estimators of this ratio have been proposed. Most…

Methodology · Statistics 2014-04-30 Rafael Izbicki , Ann B. Lee , Chad M. Schafer