Related papers: Power Studies For Two-sample Methods For Multivari…

Power Studies For Two-Sample and Goodness-of-Fit Methods For Multivariate Data

We present the results of a large number of simulation studies regarding the power of various goodness-of-fit as well as non-parametric two-sample tests for multivariate data. In two dimensions this includes both continuous and discrete…

Methodology · Statistics 2026-05-13 Wolfgang Rolke

Simulation Studies For Goodness-of-Fit and Two-Sample Methods For Univariate Data

We present the results of a large number of simulation studies regarding the power of various goodness-of-fit as well as nonparametric two-sample tests for univariate data. This includes both continuous and discrete data. In general no…

Methodology · Statistics 2024-11-13 Wolfgang Rolke

On the High-dimensional Power of Linear-time Kernel Two-Sample Testing under Mean-difference Alternatives

Nonparametric two sample testing deals with the question of consistently deciding if two distributions are different, given samples from both, without making any parametric assumptions about the form of the distributions. The current…

Statistics Theory · Mathematics 2014-11-25 Aaditya Ramdas, Sashank J. Reddi, Barnabas Poczos, Aarti Singh, Larry Wasserman

A new graph-based two-sample test for multivariate and object data

Two-sample tests for multivariate data and especially for non-Euclidean data are not well explored. This paper presents a novel test statistic based on a similarity graph constructed on the pooled observations from the two samples. It can…

Methodology · Statistics 2024-08-12 Hao Chen, Jerome H. Friedman

A weighted edge-count two-sample test for multivariate and object data

Two-sample tests for multivariate data and non-Euclidean data are widely used in many fields. Parametric tests are mostly restrained to certain types of data that meet the assumptions of the parametric models. In this paper, we study a…

Methodology · Statistics 2018-05-01 Hao Chen, Xu Chen, Yi Su

Power-Enhanced Two-Sample Mean Tests for High-Dimensional Compositional Data with Application to Microbiome Data Analysis

Testing differences in mean vectors is a fundamental task in the analysis of high-dimensional compositional data. Existing methods may suffer from low power if the underlying signal pattern is in a situation that does not favor the deployed…

Methodology · Statistics 2025-03-11 Danning Li, Lingzhou Xue, Haoyi Yang, Xiufan Yu

A fast and effective kernel two-sample test for large-scale data

Kernel two-sample tests have been widely used, and the development of efficient methods for high-dimensional, large-scale data is receiving increasing attention in the big data era. However, existing methods, such as the maximum mean…

Methodology · Statistics 2025-10-03 Hoseung Song, Hao Chen

Two-Sample Test Based on Classification Probability

Robust classification algorithms have been developed in recent years with great success. We take advantage of this development and recast the classical two-sample test problem in the framework of classification. Based on the estimates of…

Statistics Theory · Mathematics 2019-09-18 Haiyan Cai, Bryan Goggin, Qingtang Jiang

A Normality Test for Multivariate Dependent Samples

Most normality tests in the literature are performed for scalar and independent samples. Thus, they become unreliable when applied to colored processes, hampering their use in realistic scenarios. We focus on Mardia's multivariate kurtosis,…

Methodology · Statistics 2022-03-02 Sara Elbouch, Olivier Michel, Pierre Comon

Two-Sample Testing with Missing Data via Energy Distance: Weighting and Imputation Approaches

In this paper, we address the problem of two-sample testing in the presence of missing data under a variety of missingness mechanisms. Our focus is on the well-known energy distance-based two-sample test. In addition to the standard…

Methodology · Statistics 2025-08-18 Danijel G. Aleksić, Bojana Milošević

Power analysis for a linear regression model when regressors are matrix sampled

Multiple matrix sampling is a survey methodology technique that randomly chooses a relatively small subset of items to be presented to survey respondents for the purpose of reducing respondent burden. The data produced are missing…

Methodology · Statistics 2017-10-03 Stanislav Kolenikov, Heather Hammer

Two-Sample Tests for High Dimensional Means with Thresholding and Data Transformation

We consider testing for two-sample means of high dimensional populations by thresholding. Two tests are investigated, which are designed for better power performance when the two population mean vectors differ only in sparsely populated…

Methodology · Statistics 2014-10-13 Song Xi Chen, Jun Li, Ping-Shou Zhong

Robust Tests for the Equality of Two Normal Means based on the Density Power Divergence

Statistical techniques are used in all branches of science to determine the feasibility of quantitative hypotheses. One of the most basic applications of statistical techniques in comparative analysis is the test of equality of two…

Methodology · Statistics 2018-05-01 Ayanendranath Basu, Abhijit Mandal, Nirian Martin, Leandro Pardo

A new test for the multivariate two-sample problem based on the concept of minimum energy

We introduce a new statistical quantity, the energy, to test whether two samples originate from the same distributions. The energy is a simple logarithmic function of the distances of the observations in the variate space. The distribution of…

Probability · Mathematics 2007-05-23 Guenter Zech, Berkan Aslan

Distribution and correlation free two-sample test of high-dimensional means

We propose a two-sample test for high-dimensional means that requires neither distributional nor correlational assumptions, besides some weak conditions on the moments and tail properties of the elements in the random vectors. This…

Methodology · Statistics 2019-04-17 Kaijie Xue, Fang Yao

A new class of robust two-sample Wald-type tests

Parametric hypothesis testing associated with two independent samples arises frequently in several applications in biology, medical sciences, epidemiology, reliability and many more. In this paper, we propose robust Wald-type tests for…

Methodology · Statistics 2019-05-09 Abhik Ghosh, Nirian Martin, Ayanendranath Basu, Leandro Pardo

Nonparametric Regression using the Concept of Minimum Energy

It has recently been shown that an unbinned distance-based statistic, the energy, can be used to construct an extremely powerful nonparametric multivariate two sample goodness-of-fit test. An extension to this method that makes it possible…

Data Analysis, Statistics and Probability · Physics 2011-10-11 Mike Williams

A Characterization of Most(More) Powerful Test Statistics with Simple Nonparametric Applications

Data-driven most powerful tests are statistical hypothesis decision-making tools that deliver the greatest power against a fixed null hypothesis among all corresponding data-based tests of a given size. When the underlying data…

Statistics Theory · Mathematics 2023-03-15 Albert Vexler, Alan D. Hutson

Simulating the Power of Statistical Tests: A Collection of R Examples

This paper illustrates how to calculate the power of a statistical test by computer simulation. It provides R code for power simulations of several classical inference procedures including one- and two-sample t tests, chi-squared tests,…

Applications · Statistics 2026-02-16 Florian Wickelmaier

MMD Two-sample Testing in the Presence of Arbitrarily Missing Data

In many real-world applications, it is common that a proportion of the data may be missing or only partially observed. We develop a novel two-sample testing method based on the Maximum Mean Discrepancy (MMD) which accounts for missing data…

Methodology · Statistics 2024-05-27 Yijin Zeng, Niall M. Adams, Dean A. Bodenham