Related papers: Interpretable Distribution Features with Maximum T…

Fast Two-Sample Testing with Analytic Representations of Probability Measures

We propose a class of nonparametric two-sample tests with a cost linear in the sample size. Two tests are given, both based on an ensemble of distances between analytic functions representing each of the distributions. The first test uses…

Machine Learning · Statistics 2015-06-16 Kacper Chwialkowski , Aaditya Ramdas , Dino Sejdinovic , Arthur Gretton

Semiparametric inference on general functionals of two semicontinuous populations

In this paper, we propose new semiparametric procedures for making inference on linear functionals and their functions of two semicontinuous populations. The distribution of each population is usually characterized by a mixture of a…

Methodology · Statistics 2020-12-21 Meng Yuan , Chunlin Wang , Boxi Lin , Pengfei Li

On the High-dimensional Power of Linear-time Kernel Two-Sample Testing under Mean-difference Alternatives

Nonparametric two sample testing deals with the question of consistently deciding if two distributions are different, given samples from both, without making any parametric assumptions about the form of the distributions. The current…

Statistics Theory · Mathematics 2014-11-25 Aaditya Ramdas , Sashank J. Reddi , Barnabas Poczos , Aarti Singh , Larry Wasserman

Easy Maximum Empirical Likelihood Estimation of Linear Functionals Of A Probability Measure With Infinitely Many Constraints

In this article, we construct semiparametrically efficient estimators of linear functionals of a probability measure in the presence of side information using an easy empirical likelihood approach. We use estimated constraint functions and…

Methodology · Statistics 2023-03-01 Shan Wang , Hanxiang Peng

On quantitative aspects of model interpretability

Despite the growing body of work in interpretable machine learning, it remains unclear how to evaluate different explainability methods without resorting to qualitative assessment and user-studies. While interpretability is an inherently…

Machine Learning · Computer Science 2020-07-16 An-phi Nguyen , María Rodríguez Martínez

The Perturbed Variation

We introduce a new discrepancy score between two distributions that gives an indication on their similarity. While much research has been done to determine if two samples come from exactly the same distribution, much less research…

Machine Learning · Computer Science 2012-10-16 Maayan Harel , Shie Mannor

Testing semiparametric model-equivalence hypotheses based on the characteristic function

We propose three test criteria each of which is appropriate for testing, respectively, the equivalence hypotheses of symmetry, of homogeneity, and of independence, with multivariate data. All quantities have the common feature of involving…

Methodology · Statistics 2023-11-09 Feifei Chen , Simos G. Meintanis , Lixing Zhu

A Characterization of Most(More) Powerful Test Statistics with Simple Nonparametric Applications

Data-driven most powerful tests are statistical hypothesis decision-making tools that deliver the greatest power against a fixed null hypothesis among all corresponding data-based tests of a given size. When the underlying data…

Statistics Theory · Mathematics 2023-03-15 Albert Vexler , Alan D. Hutson

A Unified Maximum Likelihood Approach for Optimal Distribution Property Estimation

The advent of data science has spurred interest in estimating properties of distributions over large alphabets. Fundamental symmetric properties such as support size, support coverage, entropy, and proximity to uniformity, received most…

Information Theory · Computer Science 2016-11-29 Jayadev Acharya , Hirakendu Das , Alon Orlitsky , Ananda Theertha Suresh

Power Maxwell distribution: Statistical Properties, Estimation and Application

In this article, we proposed a new probability distribution named as power Maxwell distribution (PMaD). It is another extension of Maxwell distribution (MaD) which would lead more flexibility to analyze the data with non-monotone failure…

Applications · Statistics 2018-07-04 Abhimanyu Singh Yadav , Hassan S. Bakouch , Sanjay Kumar Singh , Umesh Singh

Inequalities for m-Divisible Distributions and Testing of Infinite Divisibility

We state some inequalities for m-divisible and infinite divisible characteristic functions. Basing on them we propose a statistical test for a distribution to be infinitely divisible. Keywords: infinite divisible distributions; statistical…

Probability · Mathematics 2019-04-17 Lev B. Klebanov , Ashot V. Kakosyan , Irina V. Volchenkova

A Semiparametric Approach to Interpretable Machine Learning

Black box models in machine learning have demonstrated excellent predictive performance in complex problems and high-dimensional settings. However, their lack of transparency and interpretability restrict the applicability of such models in…

Machine Learning · Computer Science 2020-06-09 Numair Sani , Jaron Lee , Razieh Nabi , Ilya Shpitser

Robust estimations from distribution structures: III. Invariant Moments

Descriptive statistics for parametric models are currently highly sensative to departures, gross errors, and/or random errors. Here, leveraging the structures of parametric distributions and their central moment kernel distributions, a…

Statistics Theory · Mathematics 2024-09-11 Li Tuobang

Inspecting discrepancy between multivariate distributions using half-space depth based information criteria

This article inspects whether a multivariate distribution is different from a specified distribution or not, and it also tests the equality of two multivariate distributions. In the course of this study, a graphical tool-kit using…

Methodology · Statistics 2024-08-19 Pratim Guha Niyogi , Subhra Sankar Dhar

Permutation Tests at Nonparametric Rates

Classical two-sample permutation tests for equality of distributions have exact size in finite samples, but they fail to control size for testing equality of parameters that summarize each distribution. This paper proposes permutation tests…

Econometrics · Economics 2022-04-22 Marinho Bertanha , EunYi Chung

Improved Density and Distribution Function Estimation

Given additional distributional information in the form of moment restrictions, kernel density and distribution function estimators with implied generalised empirical likelihood probabilities as weights achieve a reduction in variance due…

Methodology · Statistics 2019-10-08 Vitaliy Oryshchenko , Richard J. Smith

Semiparametric Efficient Test for Interpretable Distributional Treatment Effects

Distributional treatment effects can be invisible to means: a treatment may preserve average outcomes while changing tails, modes, dispersion, or rare-event probabilities. Kernel tests can detect discrepancies between interventional outcome…

Machine Learning · Statistics 2026-05-11 Houssam Zenati , Arthur Gretton

A Scalable Nystrom-Based Kernel Two-Sample Test with Permutations

Two-sample hypothesis testing-determining whether two sets of data are drawn from the same distribution-is a fundamental problem in statistics and machine learning with broad scientific applications. In the context of nonparametric testing,…

Machine Learning · Statistics 2026-04-21 Antoine Chatalic , Marco Letizia , Nicolas Schreuder , Lorenzo Rosasco

Explainability as statistical inference

A wide variety of model explanation approaches have been proposed in recent years, all guided by very different rationales and heuristics. In this paper, we take a new route and cast interpretability as a statistical inference problem. We…

Machine Learning · Computer Science 2024-01-01 Hugo Henri Joseph Senetaire , Damien Garreau , Jes Frellsen , Pierre-Alexandre Mattei

Semiparametric Testing with Highly Persistent Predictors

We address the issue of semiparametric efficiency in the bivariate regression problem with a highly persistent predictor, where the joint distribution of the innovations is regarded an infinite-dimensional nuisance parameter. Using a…

Econometrics · Economics 2020-09-18 Bas Werker , Bo Zhou