English
Related papers

Related papers: A distance based test on random trees

200 papers

We propose a statistical method to test whether two phylogenetic trees with given alignments are significantly incongruent. Our method compares the two distributions of phylogenetic trees given by the input alignments, instead of comparing…

Populations and Evolution · Quantitative Biology 2010-04-14 Elissaveta Arnaoudova , David Haws , Peter Huggins , Jerzy W. Jaromczyk , Neil Moore , Chris Schardl , Ruriko Yoshida

In order to conduct a statistical analysis on a given set of phylogenetic gene trees, we often use a distance measure between two trees. In a statistical distance-based method to analyze discordance between gene trees, it is a key to decide…

Populations and Evolution · Quantitative Biology 2016-02-05 Jing Xi , Jin Xie , Ruriko Yoshida

We develop a general statistical framework for the analysis and inference of large tree-structured data, with a focus on developing asymptotic goodness-of-fit tests. We first propose a consistent statistical model for binary trees, from…

We consider a random tree and introduce a metric in the space of trees to define the ``mean tree'' as the tree minimizing the average distance to the random tree. When the resulting metric space is compact we have laws of large numbers and…

Probability · Mathematics 2007-05-23 David Balding , Pablo A. Ferrari , Ricardo Fraiman , Mariela Sued

The distribution function of a random distance in three dimensions is given and some new three-dimensional d2-tests of randomness are suggested. We show that our test statistics are not correlated with the usual test statistics and are…

Applications · Statistics 2014-02-24 Sergii Koliada

We study a rank based univariate two-sample distribution-free test. The test statistic is the difference between the average of between-group rank distances and the average of within-group rank distances. This test statistic is closely…

Methodology · Statistics 2018-02-28 Jamye Curry , Xin Dang , Hailin Sang

Categorical variables are of uttermost importance in biomedical research. When two of them are considered, it is often the case that one wants to test whether or not they are statistically dependent. We show weaknesses of classical methods…

Generative models are invaluable in many fields of science because of their ability to capture high-dimensional and complicated distributions, such as photo-realistic images, protein structures, and connectomes. How do we evaluate the…

Efficient automatic protein classification is of central importance in genomic annotation. As an independent way to check the reliability of the classification, we propose a statistical approach to test if two sets of protein domain…

A rigorous methodology is proposed to study cell division data consisting in several observed genealogical trees of possibly different shapes. The procedure takes into account missing observations, data from different trees, as well as the…

Applications · Statistics 2013-04-15 Benoîte de Saporta , Anne Gégout Petit , Laurence Marsalle

We study the fundamental question of how likely it is that two randomly chosen trees are isomorphic to each other for different models of random trees. We show that the probability decays exponentially for rooted labeled trees as well as…

Probability · Mathematics 2023-04-11 Christoffer Olsson

It is well-known that the height profile of a critical conditioned Galton-Watson tree with finite offspring variance converges, after a suitable normalization, to the local time of a standard Brownian excursion. In this work, we study the…

Probability · Mathematics 2021-06-22 Gabriel Berzunza Ojeda , Svante Janson

In this paper we consider random walks on Galton-Watson trees with random conductances. On these trees, the distance of the walker to the root satisfies a law of large numbers with limit the effective velocity, or speed of the walk. We…

Probability · Mathematics 2020-11-23 Tabea Glatzel , Jan Nagel

Following the line of classification-based two-sample testing, tests based on the Random Forest classifier are proposed. The developed tests are easy to use, require almost no tuning, and are applicable for any distribution on…

Methodology · Statistics 2021-05-07 Simon Hediger , Loris Michel , Jeffrey Näf

Hypothesis testing in high dimensional data is a notoriously difficult problem without direct access to competing models' likelihood functions. This paper argues that statistical divergences can be used to quantify the difference between…

Data Analysis, Statistics and Probability · Physics 2024-08-02 Jeremy J. H. Wilkinson , Christopher G. Lester

The delimitation of biological species, i.e., deciding which individuals belong to the same species and whether and how many different species are represented in a data set, is key to the conservation of biodiversity. Much existing work…

Populations and Evolution · Quantitative Biology 2025-12-15 Gabriele d'Angella , Christian Hennig

We investigate the statistics of trees grown from some initial tree by attaching links to preexisting vertices, with attachment probabilities depending only on the valence of these vertices. We consider the asymptotic mass distribution that…

Statistical Mechanics · Physics 2007-05-23 François David , Philippe Di Francesco , Emmanuel Guitter , Thordur Jonsson

We propose an approach for testing the hypothesis that two realizations of the random variables in the form of histograms are taken from the same statistical population (i.e. that two histograms are drawn from the same distribution). The…

Data Analysis, Statistics and Probability · Physics 2013-05-22 Sergey Bityukov , Nikolai Krasnikov , Alexander Nikitenko , Vera Smirnova

In this paper, we propose a new spectral-based approach to hypothesis testing for populations of networks. The primary goal is to develop a test to determine whether two given samples of networks come from the same random model or…

Methodology · Statistics 2020-11-26 Li Chen , Nathaniel Josephs , Lizhen Lin , Jie Zhou , Eric D. Kolaczyk

In this paper, we study a parallel version of Galton-Watson processes for the random generation of tree-shaped structures. Random trees are useful in many situations (testing, binary search, simulation of physics phenomena,...) as attests…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-06-22 Olivier Bodini , Camille Coti , Julien David
‹ Prev 1 2 3 10 Next ›