English
Related papers

Related papers: Computing the Distribution of a Tree Metric

200 papers

The Robinson-Foulds (RF) metric is arguably the most widely used measure of phylogenetic tree similarity, despite its well-known shortcomings: For example, moving a single taxon in a tree can result in a tree that has maximum distance to…

Data Structures and Algorithms · Computer Science 2013-08-02 Sebastian Böcker , Stefan Canzar , Gunnar W. Klau

Motivated by applications in medical bioinformatics, Khayatian et al. (2024) introduced a family of metrics on Cayley trees (the $k$-RF distance, for $k=0, \ldots, n-2$) and explored their distribution on pairs of random Cayley trees via…

Probability · Mathematics 2024-12-31 Michael Fuchs , Mike Steel

There are several tools available to infer phylogenetic trees, which depict the evolutionary relationships among biological entities such as viral and bacterial strains in infectious outbreaks, or cancerous cells in tumor progression trees.…

Data Structures and Algorithms · Computer Science 2023-12-22 António Pedro Branco , Cátia Vaz , Alexandre P. Francisco

Understanding the mutational history of tumor cells is a critical endeavor in unraveling the mechanisms underlying cancer. Since the modeling of tumor cell evolution employs labeled trees, researchers are motivated to develop different…

Populations and Evolution · Quantitative Biology 2023-11-17 Elahe Khayatian , Gabriel Valiente , Louxin Zhang

The Hausdorff distance is a relatively new measure of similarity of graphs. The notion of the Hausdorff distance considers a special kind of a common subgraph of the compared graphs and depends on the structural properties outside of the…

Combinatorics · Mathematics 2023-06-22 Aleksander Kelenc

We present a new method for inferring species trees from multi-copy gene trees. Our method is based on a generalization of the Robinson-Foulds (RF) distance to multi-labeled trees (mul-trees), i.e., gene trees in which multiple leaves can…

Data Structures and Algorithms · Computer Science 2012-10-10 Ruchi Chaudhary , J. Gordon Burleigh , David Fernández-Baca

We consider the numerical taxonomy problem of fitting a positive distance function ${D:{S\choose 2}\rightarrow \mathbb R_{>0}}$ by a tree metric. We want a tree $T$ with positive edge weights and including $S$ among the vertices so that…

Data Structures and Algorithms · Computer Science 2022-03-14 Vincent Cohen-Addad , Debarati Das , Evangelos Kipouridis , Nikos Parotsidis , Mikkel Thorup

Random Forests (RF) is a popular machine learning method for classification and regression problems. It involves a bagging application to decision tree models. One of the primary advantages of the Random Forests model is the reduction in…

Machine Learning · Statistics 2022-07-06 Sai K Popuri

Rotation distance between rooted binary trees measures the number of simple operations it takes to transform one tree into another. There are no known polynomial-time algorithms for computing rotation distance. We give an efficient,…

Data Structures and Algorithms · Computer Science 2018-03-19 Sean Cleary , Katherine St. John

This paper extends the study of fringe trees in random plane trees with a given degree statistic. While previous work established the asymptotic normality of the count of fringe trees isomorphic to a fixed tree, we investigate the case…

Probability · Mathematics 2026-04-08 Gabriel Berzunza Ojeda , Cecilia Holmgren , Svante Janson

The Distributional Random Forest (DRF) is a recently introduced Random Forest algorithm to estimate multivariate conditional distributions. Due to its general estimation procedure, it can be employed to estimate a wide range of targets such…

Statistics Theory · Mathematics 2023-12-20 Jeffrey Näf , Corinne Emmenegger , Peter Bühlmann , Nicolai Meinshausen

The Poisson-binomial distribution is useful in many applied problems in engineering, actuarial science, and data mining. The Poisson-binomial distribution models the distribution of the sum of independent but not identically distributed…

Computation · Statistics 2017-02-07 Man Zhang , Yili Hong , Narayanaswamy Balakrishnan

We consider a probability measure on cycle-rooted spanning forests (CRSFs) introduced by Kenyon. CRSFs are spanning subgraphs, each connected component of which has a unique cycle; they generalize spanning trees. A generalization of…

Data Structures and Algorithms · Computer Science 2025-07-10 Michaël Fanuel , Rémi Bardenet

We investigate the distribution of the depth of a node containing a specific key or, equivalently, the number of steps needed to retrieve an item stored in a randomly grown binary search tree. Using a representation in terms of mixed and…

Probability · Mathematics 2007-05-23 Rudolf Grubel , Nikolce Stefanoski

We propose a tree-based semi-varying coefficient model for the Conway-Maxwell- Poisson (CMP or COM-Poisson) distribution which is a two-parameter generalization of the Poisson distribution and is flexible enough to capture both…

Methodology · Statistics 2020-04-27 Suneel Babu Chatla , Galit Shmueli

Random forest (RF) stands out as a highly favored machine learning approach for classification problems. The effectiveness of RF hinges on two key factors: the accuracy of individual trees and the diversity among them. In this study, we…

Machine Learning · Computer Science 2024-10-28 Ye-eun Kim , Seoung Yun Kim , Hyunjoong Kim

Maximum parsimony distance is a measure used to quantify the dissimilarity of two unrooted phylogenetic trees. It is NP-hard to compute, and very few positive algorithmic results are known due to its complex combinatorial structure. Here we…

Data Structures and Algorithms · Computer Science 2020-04-07 Mark Jones , Steven Kelk , Leen Stougie

In order to conduct a statistical analysis on a given set of phylogenetic gene trees, we often use a distance measure between two trees. In a statistical distance-based method to analyze discordance between gene trees, it is a key to decide…

Populations and Evolution · Quantitative Biology 2016-02-05 Jing Xi , Jin Xie , Ruriko Yoshida

We investigate the statistics of trees grown from some initial tree by attaching links to preexisting vertices, with attachment probabilities depending only on the valence of these vertices. We consider the asymptotic mass distribution that…

Statistical Mechanics · Physics 2007-05-23 François David , Philippe Di Francesco , Emmanuel Guitter , Thordur Jonsson

The recursive and hierarchical structure of full rooted trees is applicable to represent statistical models in various areas, such as data compression, image processing, and machine learning. In most of these cases, the full rooted tree is…

Machine Learning · Statistics 2022-03-24 Yuta Nakahara , Shota Saito , Akira Kamatsuka , Toshiyasu Matsushima
‹ Prev 1 2 3 10 Next ›