English

Path-Based Gradient Boosting for Graph-Level Prediction

Machine Learning 2026-05-12 v1 Machine Learning

Abstract

We propose PathBoost, a gradient tree boosting method for graph-level classification and regression that learns discriminative path-based features directly from the input graph structure. Building on a previous work, which was tailored to a specific chemistry application, PathBoost introduces three key extensions: (i) adaptation to binary classification through gradient boosting with a logistic loss, (ii) incorporation of multiple node and edge attributes into the path feature space via a prefix-based decomposition, and (iii) automatic anchor node selection based on categorical attribute diversity, eliminating the need for the user to specify the starting point of the considered path features. We compared PathBoost to graph neural networks and graph kernel approaches on several benchmark datasets, obtaining better results in half of them, and comparable results in the rest. PathBoost shows better performances on graphs with larger average node counts. Overall, the results demonstrate that path-based boosting methods can be competitive with more complex black-box approaches.

Keywords

Cite

@article{arxiv.2605.08102,
  title  = {Path-Based Gradient Boosting for Graph-Level Prediction},
  author = {Claudio Meggio and Johan Pensar and Riccardo De Bin},
  journal= {arXiv preprint arXiv:2605.08102},
  year   = {2026}
}

Comments

20 Pages, 1 figure