Related papers: Testing Consistency of Two Histograms

200 papers

We propose an approach for testing the hypothesis that two realizations of the random variables in the form of histograms are taken from the same statistical population (i.e. that two histograms are drawn from the same distribution). The…

Data Analysis, Statistics and Probability · Physics 2013-05-22 Sergey Bityukov, Nikolai Krasnikov, Alexander Nikitenko, Vera Smirnova

Determining if two histograms are consistent, whether they have been drawn from the same underlying distribution or not, is a common problem in physics. Existing approaches are not only limited in power but also inapplicable to histograms…

Data Analysis, Statistics and Probability · Physics 2010-09-29 M. J. Betancourt

We consider an approach for testing the hypothesis that two realizations of the random variables in the form of histograms are taken from the same statistical population (i.e. two histograms are drawn from the same distribution). The…

Data Analysis, Statistics and Probability · Physics 2013-11-26 S. Bityukov, N. Krasnikov, A. Nikitenko, V. Smirnova

Hypothesis testing for graphs has been an important tool in applied research fields for more than two decades, and still remains a challenging problem as one often needs to draw inference from few replicates of large graphs. Recent studies…

Machine Learning · Statistics 2018-12-03 Debarghya Ghoshdastidar, Ulrike von Luxburg

We present a method which tests whether or not two datasets (one of which could be Monte Carlo generated) might come from the same distribution. Our method works in arbitrarily high dimensions.

Data Analysis, Statistics and Probability · Physics 2010-08-18 Wolfgang Rolke, Angel Lopez

Two-sample hypothesis testing for random graphs arises naturally in neuroscience, social networks, and machine learning. In this paper, we consider a semiparametric problem of two-sample hypothesis testing for a class of latent position…

Methodology · Statistics 2015-06-19 Minh Tang, Avanti Athreya, Daniel L. Sussman, Vince Lyzinski, Carey E. Priebe

Data depth has been applied as a nonparametric measurement for ranking multivariate samples. In this paper, we focus on homogeneity tests to assess whether two multivariate samples are from the same distribution. There are many data…

Statistics Theory · Mathematics 2023-06-09 Yiting Chen, Wei Lin, Xiaoping Shi

We consider a two-sample hypothesis testing problem, where the distributions are defined on the space of undirected graphs, and one has access to only one observation from each model. A motivating example for this problem is comparing the…

Consider two random variables contaminated by two unknown transformations. The aim of this paper is to test the equality of those transformations. Two cases are distinguished: first, the two random variables have known distributions.…

Methodology · Statistics 2011-11-01 Mohamed Boutahar, Denys Pommeret

In this work, we revisit the one- and two-sample testing problems: binary hypothesis testing in which one or both distributions are unknown. For the one-sample test, we provide a more streamlined proof of the asymptotic optimality of…

Information Theory · Computer Science 2026-04-21 Arick Grootveld, Biao Chen, Venkata Gandikota

Network (graph) data analysis is a popular research topic in statistics and machine learning. In application, one is frequently confronted with graph two-sample hypothesis testing where the goal is to test the difference between two graph…

Methodology · Statistics 2021-02-01 Mingao Yuan, Qian Wen

Two-sample tests evaluate whether two samples are realizations of the same distribution (the null hypothesis) or two different distributions (the alternative hypothesis). We consider a new setting for this problem where sample features are…

Machine Learning · Computer Science 2022-07-20 Weizhi Li, Gautam Dasarathy, Karthikeyan Natesan Ramamurthy, Visar Berisha

In this article, we propose a one-sample test to check whether the support of the unknown distribution generating the data is homologically equivalent to the support of some specified distribution or not OR using the corresponding…

Methodology · Statistics 2023-12-01 Satish Kumar, Subhra Sankar Dhar

Hypothesis testing in high dimensional data is a notoriously difficult problem without direct access to competing models' likelihood functions. This paper argues that statistical divergences can be used to quantify the difference between…

Data Analysis, Statistics and Probability · Physics 2024-08-02 Jeremy J. H. Wilkinson, Christopher G. Lester

Rejecting the null hypothesis in two-sample testing is a fundamental tool for scientific discovery. Yet, aside from concluding that two samples do not come from the same probability distribution, it is often of interest to characterize how…

Statistics Theory · Mathematics 2021-09-08 Boris Landa, Rihao Qu, Joseph Chang, Yuval Kluger

Testing to see whether a given data set comes from some specified distribution is among the oldest types of problems in Statistics. Many such tests have been developed and their performance studied. The general result has been that while a…

Applications · Statistics 2020-12-07 Wolfgang Rolke

We consider the problem of testing whether two finite-dimensional random dot product graphs have generating latent positions that are independently drawn from the same distribution, or distributions that are related via scaling or…

Statistics Theory · Mathematics 2015-11-13 Minh Tang, Avanti Athreya, Daniel L. Sussman, Vince Lyzinski, Carey E. Priebe

Two-sample tests utilizing a similarity graph on observations are useful for high-dimensional and non-Euclidean data due to their flexibility and good performance under a wide range of alternatives. Existing works mainly focused on sparse…

Statistics Theory · Mathematics 2023-11-14 Yejiong Zhu, Hao Chen

The theory of random graphs is being applied in recent years to model neural interactions in the brain. While the probabilistic properties of random graphs have been extensively studied in the literature, the development of statistical…

Applications · Statistics 2015-04-27 Andressa Cerqueira, Daniel Fraiman, Claudia D. Vargas, Florencia Leonardi

Testing for the equality of two high-dimensional distributions is a challenging problem, and this becomes even more challenging when the sample size is small. Over the last few decades, several graph-based two-sample tests have been…

Methodology · Statistics 2019-11-22 Soham Sarkar, Rahul Biswas, Anil K. Ghosh