Related papers: The Weight Function in the Subtree Kernel is Decis…

Fast Computation of Subpath Kernel for Trees

The kernel method is a potential approach to analyzing structured data such as sequences, trees, and graphs; however, unordered trees have not been investigated extensively. Kimura et al. (2011) proposed a kernel function for unordered…

Data Structures and Algorithms · Computer Science 2012-06-22 Daisuke Kimura , Hisashi Kashima

Forest Kernel Balancing Weights: Outcome-Guided Features for Causal Inference

While balancing covariates between groups is central for observational causal inference, selecting which features to balance remains a challenging problem. Kernel balancing is a promising approach that first estimates a kernel that captures…

Methodology · Statistics 2025-12-15 Andy A. Shen , Eli Ben-Michael , Avi Feller , Luke Keele , Jared Murray

Measure Inducing Classification and Regression Trees for Functional Data

We propose a tree-based algorithm for classification and regression problems in the context of functional data analysis, which allows to leverage representation learning and multiple splitting rules at the node level, reducing…

Machine Learning · Statistics 2020-11-03 Edoardo Belli , Simone Vantini

Learning subtree pattern importance for Weisfeiler-Lehmanbased graph kernels

Graph is an usual representation of relational data, which are ubiquitous in manydomains such as molecules, biological and social networks. A popular approach to learningwith graph structured data is to make use of graph kernels, which…

Machine Learning · Computer Science 2022-08-02 Dai Hai Nguyen , Canh Hao Nguyen , Hiroshi Mamitsuka

New Linear-time Algorithm for SubTree Kernel Computation based on Root-Weighted Tree Automata

Tree kernels have been proposed to be used in many areas as the automatic learning of natural language applications. In this paper, we propose a new linear time algorithm based on the concept of weighted tree automata for SubTree kernel…

Computation and Language · Computer Science 2023-02-03 Ludovic Mignot , Faissal Ouardi , Djelloul Ziadi

Parallel Tree Kernel Computation

Tree kernels are fundamental tools that have been leveraged in many applications, particularly those based on machine learning for Natural Language Processing tasks. In this paper, we devise a parallel implementation of the sequential…

Computation and Language · Computer Science 2023-05-16 Souad Taouti , Hadda Cherroun , Djelloul Ziadi

Bayesian Kernel Regression for Functional Data

In supervised learning, the output variable to be predicted is often represented as a function, such as a spectrum or probability distribution. Despite its importance, functional output regression remains relatively unexplored. In this…

Machine Learning · Statistics 2025-03-19 Minoru Kusaba , Megumi Iwayama , Ryo Yoshida

Distributional Random Forests: Heterogeneity Adjustment and Multivariate Distributional Regression

Random Forest (Breiman, 2001) is a successful and widely used regression and classification algorithm. Part of its appeal and reason for its versatility is its (implicit) construction of a kernel-type weighting function on training data,…

Machine Learning · Statistics 2022-10-13 Domagoj Ćevid , Loris Michel , Jeffrey Näf , Nicolai Meinshausen , Peter Bühlmann

A Subpath Kernel for Learning Hierarchical Image Representations

Tree kernels have demonstrated their ability to deal with hierarchical data, as the intrinsic tree structure often plays a discriminative role. While such kernels have been successfully applied to various domains such as nature language…

Computer Vision and Pattern Recognition · Computer Science 2016-04-08 Yanwei Cui , Laetitia Chapel , Sébastien Lefèvre

Weighted Sum-of-Trees Model for Clustered Data

Clustered data, which arise when observations are nested within groups, are incredibly common in clinical, education, and social science research. Traditionally, a linear mixed model, which includes random effects to account for…

Methodology · Statistics 2026-02-04 Kevin McCoy , Zachary Wooten , Katarzyna Tomczak , Christine B. Peterson

Energy Trees: Regression and Classification With Structured and Mixed-Type Covariates

The increasing complexity of data requires methods and models that can effectively handle intricate structures, as simplifying them would result in loss of information. While several analytical tools have been developed to work with complex…

Methodology · Statistics 2023-06-16 Riccardo Giubilei , Tullia Padellini , Pierpaolo Brutti

Better Model Selection with a new Definition of Feature Importance

Feature importance aims at measuring how crucial each input feature is for model prediction. It is widely used in feature engineering, model selection and explainable artificial intelligence (XAI). In this paper, we propose a new tree-model…

Machine Learning · Statistics 2020-09-17 Fan Fang , Carmine Ventre , Lingbo Li , Leslie Kanthan , Fan Wu , Michail Basios

Random Machines: A bagged-weighted support vector model with free kernel choice

Improvement of statistical learning models in order to increase efficiency in solving classification or regression problems is still a goal pursued by the scientific community. In this way, the support vector machine model is one of the…

Machine Learning · Statistics 2019-11-22 Anderson Ara , Mateus Maia , Samuel Macêdo , Francisco Louzada

On the utility of feature selection in building two-tier decision trees

Nowadays, feature selection is frequently used in machine learning when there is a risk of performance degradation due to overfitting or when computational resources are limited. During the feature selection process, the subset of features…

Machine Learning · Computer Science 2023-01-02 Sergey A. Saltykov

Analyzing Tree Architectures in Ensembles via Neural Tangent Kernel

A soft tree is an actively studied variant of a decision tree that updates splitting rules using the gradient method. Although soft trees can take various architectures, their impact is not theoretically well known. In this paper, we…

Machine Learning · Computer Science 2023-02-10 Ryuichi Kanoh , Mahito Sugiyama

Decision trees as partitioning machines to characterize their generalization properties

Decision trees are popular machine learning models that are simple to build and easy to interpret. Even though algorithms to learn decision trees date back to almost 50 years, key properties affecting their generalization error are still…

Machine Learning · Computer Science 2020-10-16 Jean-Samuel Leboeuf , Frédéric LeBlanc , Mario Marchand

Example-Based Explanations of Random Forest Predictions

A random forest prediction can be computed by the scalar product of the labels of the training examples and a set of weights that are determined by the leafs of the forest into which the test object falls; each prediction can hence be…

Machine Learning · Computer Science 2023-11-27 Henrik Boström

Structural Optimization Makes Graph Classification Simpler and Better

In deep neural networks, better results can often be obtained by increasing the complexity of previously developed basic models. However, it is unclear whether there is a way to boost performance by decreasing the complexity of such models.…

Machine Learning · Computer Science 2021-09-07 Junran Wu , Jianhao Li , Yicheng Pan , Ke Xu

A Simple yet Effective Method for Graph Classification

In deep neural networks, better results can often be obtained by increasing the complexity of previously developed basic models. However, it is unclear whether there is a way to boost performance by decreasing the complexity of such models.…

Machine Learning · Computer Science 2022-06-07 Junran Wu , Shangzhe Li , Jianhao Li , Yicheng Pan , Ke Xu

On Random Tree Structures, Their Entropy, and Compression

Measuring the complexity of tree structures can be beneficial in areas that use tree data structures for storage, communication, and processing purposes. This complexity can then be used to compress tree data structures to their…

Information Theory · Computer Science 2023-09-19 Amirmohammad Farzaneh , Mihai-Alin Badiu , Justin P. Coon