English

What makes a good feedforward computational graph?

Machine Learning 2025-06-03 v2 Artificial Intelligence Social and Information Networks Machine Learning

Abstract

As implied by the plethora of literature on graph rewiring, the choice of computational graph employed by a neural network can make a significant impact on its downstream performance. Certain effects related to the computational graph, such as under-reaching and over-squashing, may even render the model incapable of learning certain functions. Most of these effects have only been thoroughly studied in the domain of undirected graphs; however, recent years have seen a significant rise in interest in feedforward computational graphs: directed graphs without any back edges. In this paper, we study the desirable properties of a feedforward computational graph, discovering two important complementary measures: fidelity and mixing time, and evaluating a few popular choices of graphs through the lens of these measures. Our study is backed by both theoretical analyses of the metrics' asymptotic behaviour for various graphs, as well as correlating these metrics to the performance of trained neural network models using the corresponding graphs.

Keywords

Cite

@article{arxiv.2502.06751,
  title  = {What makes a good feedforward computational graph?},
  author = {Alex Vitvitskyi and João G. M. Araújo and Marc Lackenby and Petar Veličković},
  journal= {arXiv preprint arXiv:2502.06751},
  year   = {2025}
}

Comments

To appear at ICML 2025. 17 pages, 7 figures