Related papers: Learning about Spatial and Temporal Proximity usin…

Simple Distances for Trajectories via Landmarks

We develop a new class of distances for objects including lines, hyperplanes, and trajectories, based on the distance to a set of landmarks. These distances easily and interpretably map objects to a Euclidean space, are simple to compute,…

Computational Geometry · Computer Science 2019-06-13 Jeff M. Phillips , Pingfan Tang

Comparative Analysis of Merge Trees using Local Tree Edit Distance

Comparative analysis of scalar fields is an important problem with various applications including feature-directed visualization and feature tracking in time-varying data. Comparing topological structures that are abstract and succinct…

Graphics · Computer Science 2024-06-06 Raghavendra Sridharamurthy , Vijay Natarajan

Geometry-Aware Merge Tree Comparisons for Time-Varying Data with Interleaving Distances

Merge trees, a type of topological descriptor, serve to identify and summarize the topological characteristics associated with scalar fields. They present a great potential for the analysis and visualization of time-varying data. First,…

Human-Computer Interaction · Computer Science 2021-08-02 Lin Yan , Talha Bin Masood , Farhan Rasheed , Ingrid Hotz , Bei Wang

Matrices of Forests and the Analysis of Digraphs

The matrices of spanning rooted forests are studied as a tool for analysing the structure of digraphs and measuring their characteristics. The problems of revealing the basis bicomponents, measuring vertex proximity, and ranking from…

Combinatorics · Mathematics 2007-05-23 Pavel Chebotarev , Rafig Agaev

Matrices of forests, analysis of networks, and ranking problems

The matrices of spanning rooted forests are studied as a tool for analysing the structure of networks and measuring their properties. The problems of revealing the basic bicomponents, measuring vertex proximity, and ranking from preference…

Combinatorics · Mathematics 2013-05-29 Pavel Chebotarev , Rafig Agaev

Dive into Decision Trees and Forests: A Theoretical Demonstration

Based on decision trees, many fields have arguably made tremendous progress in recent years. In simple words, decision trees use the strategy of "divide-and-conquer" to divide the complex problem on the dependency between input features and…

Machine Learning · Computer Science 2021-01-22 Jinxiong Zhang

Canonical distances such as Euclidean distance often fail to capture the appropriate relationships between items, subsequently leading to subpar inference and prediction. Many algorithms have been proposed for automated learning of suitable…

Machine Learning · Statistics 2020-08-24 Tyler M. Tomita , Joshua T. Vogelstein

Parameters Tied to Treewidth

Treewidth is a graph parameter of fundamental importance to algorithmic and structural graph theory. This paper surveys several graph parameters tied to treewidth, including separation number, tangle number, well-linked number and Cartesian…

Combinatorics · Mathematics 2016-01-29 Daniel J. Harvey , David R. Wood

Tree Edit Distance with Variables. Measuring the Similarity between Mathematical Formulas

In this article, we propose tree edit distance with variables, which is an extension of the tree edit distance to handle trees with variables and has a potential application to measuring the similarity between mathematical formulas,…

Data Structures and Algorithms · Computer Science 2021-05-12 Tatsuya Akutsu , Tomoya Mori , Naotoshi Nakamura , Satoshi Kozawa , Yuhei Ueno , Thomas N. Sato

FREEtree: A Tree-based Approach for High Dimensional Longitudinal Data With Correlated Features

This paper proposes FREEtree, a tree-based method for high dimensional longitudinal data with correlated features. Popular machine learning approaches, like Random Forests, commonly used for variable selection do not perform well when there…

Machine Learning · Statistics 2020-06-18 Yuancheng Xu , Athanasse Zafirov , R. Michael Alvarez , Dan Kojis , Min Tan , Christina M. Ramirez

Attack Tree Distance: a practical examination of tree difference measurement within cyber security

CONTEXT. Attack treesare a recommended threat modeling tool, but there is no established method to compare them. OBJECTIVE. We aim to establish a method to compare "real" attack trees, based on both the structure of the tree itself and the…

Cryptography and Security · Computer Science 2025-03-05 Nathan D. Schiele , Olga Gadyatskaya

Distance approximation using Isolation Forests

This work briefly explores the possibility of approximating spatial distance (alternatively, similarity) between data points using the Isolation Forest method envisioned for outlier detection. The logic is similar to that of isolation: the…

Machine Learning · Statistics 2019-11-26 David Cortes

Metric Learning for Ordered Labeled Trees with pq-grams

Computing the similarity between two data points plays a vital role in many machine learning algorithms. Metric learning has the aim of learning a good metric automatically from data. Most existing studies on metric learning for…

Machine Learning · Computer Science 2020-03-10 Hikaru Shindo , Masaaki Nishino , Yasuaki Kobayashi , Akihiro Yamamoto

Comparison-Based Random Forests

Assume we are given a set of items from a general metric space, but we neither have access to the representation of the data nor to the distances between data points. Instead, suppose that we can actively choose a triplet of items (A,B,C)…

Machine Learning · Statistics 2018-06-19 Siavash Haghiri , Damien Garreau , Ulrike von Luxburg

How far is my network from being edge-based? Proximity measures for edge-basedness of unrooted phylogenetic networks

Phylogenetic networks which are, as opposed to trees, suitable to describe processes like hybridization and horizontal gene transfer, play a substantial role in evolutionary research. However, while non-treelike events need to be taken into…

Populations and Evolution · Quantitative Biology 2022-07-06 Mareike Fischer , Tom Niklas Hamann , Kristina Wicke

A Random Forest Approach for Modeling Bounded Outcomes

Random forests have become an established tool for classification and regression, in particular in high-dimensional settings and in the presence of complex predictor-response relationships. For bounded outcome variables restricted to the…

Methodology · Statistics 2019-01-21 Leonie Weinhold , Matthias Schmid , Marvin N. Wright , Moritz Berger

Ranking Perspective for Tree-based Methods with Applications to Symbolic Feature Selection

Tree-based methods are powerful nonparametric techniques in statistics and machine learning. However, their effectiveness, particularly in finite-sample settings, is not fully understood. Recent applications have revealed their surprising…

Statistics Theory · Mathematics 2024-10-04 Hengrui Luo , Meng Li

Non-Parametric Learning of Gaifman Models

We consider the problem of structure learning for Gaifman models and learn relational features that can be used to derive feature representations from a knowledge base. These relational features are first-order rules that are then partially…

Machine Learning · Computer Science 2020-01-17 Devendra Singh Dhami , Siwen Yan , Gautam Kunapuli , Sriraam Natarajan

Regression Trees Know Calculus

Regression trees have emerged as a preeminent tool for solving real-world regression problems due to their ability to deal with nonlinearities, interaction effects and sharp discontinuities. In this article, we rather study regression trees…

Machine Learning · Statistics 2025-11-14 Nathan Wycoff

A basic limitation on inferring phylogenies by pairwise sequence comparisons

Distance-based approaches in phylogenetics such as Neighbor-Joining are a fast and popular approach for building trees. These methods take pairs of sequences from them construct a value that, in expectation, is additive under a stochastic…

Populations and Evolution · Quantitative Biology 2008-08-21 Mike Steel