Related papers: Graded Transformers

Artificial Neural Networks on Graded Vector Spaces

This paper presents a transformative framework for artificial neural networks over graded vector spaces, tailored to model hierarchical and structured data in fields like algebraic geometry and physics. By exploiting the algebraic…

Artificial Intelligence · Computer Science 2026-01-07 Tony Shaska

Graded Neural Networks

This paper presents a novel framework for graded neural networks (GNNs) built over graded vector spaces $\V_\w^n$, extending classical neural architectures by incorporating algebraic grading. Leveraging a coordinate-wise grading structure…

Machine Learning · Computer Science 2026-04-24 Tony Shaska

Learning Graph Quantized Tokenizers

Transformers serve as the backbone architectures of Foundational Models, where domain-specific tokenizers allow them to adapt to various domains. Graph Transformers (GTs) have recently emerged as leading models in geometric deep learning,…

Neural and Evolutionary Computing · Computer Science 2025-04-03 Limei Wang , Kaveh Hassani , Si Zhang , Dongqi Fu , Baichuan Yuan , Weilin Cong , Zhigang Hua , Hao Wu , Ning Yao , Bo Long

Hypergraph Transformer for Semi-Supervised Classification

Hypergraphs play a pivotal role in the modelling of data featuring higher-order relations involving more than two entities. Hypergraph neural networks emerge as a powerful tool for processing hypergraph-structured data, delivering…

Machine Learning · Computer Science 2024-06-04 Zexi Liu , Bohan Tang , Ziyuan Ye , Xiaowen Dong , Siheng Chen , Yanfeng Wang

GPT-FT: An Efficient Automated Feature Transformation Using GPT for Sequence Reconstruction and Performance Enhancement

Feature transformation plays a critical role in enhancing machine learning model performance by optimizing data representations. Recent state-of-the-art approaches address this task as a continuous embedding optimization problem, converting…

Machine Learning · Computer Science 2025-08-29 Yang Gao , Dongjie Wang , Scott Piersall , Ye Zhang , Liqiang Wang

Gated Graph Recurrent Neural Networks

Graph processes exhibit a temporal structure determined by the sequence index and a spatial structure determined by the graph support. To learn from graph processes, an information processing architecture must then be able to exploit…

Signal Processing · Electrical Eng. & Systems 2020-12-02 Luana Ruiz , Fernando Gama , Alejandro Ribeiro

Graph Transformers for Large Graphs

Transformers have recently emerged as powerful neural networks for graph learning, showcasing state-of-the-art performance on several graph property prediction tasks. However, these results have been limited to small-scale graphs, where the…

Machine Learning · Computer Science 2023-12-19 Vijay Prakash Dwivedi , Yozen Liu , Anh Tuan Luu , Xavier Bresson , Neil Shah , Tong Zhao

GradMetaNet: An Equivariant Architecture for Learning on Gradients

Gradients of neural networks encode valuable information for optimization, editing, and analysis of models. Therefore, practitioners often treat gradients as inputs to task-specific algorithms, e.g. for pruning or optimization. Recent works…

Machine Learning · Computer Science 2025-10-14 Yoav Gelberg , Yam Eitan , Aviv Navon , Aviv Shamsian , Theo Putterman , Michael Bronstein , Haggai Maron

A Pure Transformer Pretraining Framework on Text-attributed Graphs

Pretraining plays a pivotal role in acquiring generalized knowledge from large-scale data, achieving remarkable successes as evidenced by large models in CV and NLP. However, progress in the graph domain remains limited due to fundamental…

Artificial Intelligence · Computer Science 2024-06-21 Yu Song , Haitao Mao , Jiachen Xiao , Jingzhe Liu , Zhikai Chen , Wei Jin , Carl Yang , Jiliang Tang , Hui Liu

Geometry-Informed Neural Operator Transformer

Machine-learning-based surrogate models offer significant computational efficiency and faster simulations compared to traditional numerical methods, especially for problems requiring repeated evaluations of partial differential equations.…

Machine Learning · Computer Science 2025-12-15 Qibang Liu , Weiheng Zhong , Hadi Meidani , Diab Abueidda , Seid Koric , Philippe Geubelle

Language Embedding Meets Dynamic Graph: A New Exploration for Neural Architecture Representation Learning

Neural Architecture Representation Learning aims to transform network models into feature representations for predicting network attributes, playing a crucial role in deploying and designing networks for real-world applications. Recently,…

Machine Learning · Computer Science 2025-06-10 Haizhao Jing , Haokui Zhang , Zhenhao Shang , Rong Xiao , Peng Wang , Yanning Zhang

Topology-Informed Graph Transformer

Transformers have revolutionized performance in Natural Language Processing and Vision, paving the way for their integration with Graph Neural Networks (GNNs). One key challenge in enhancing graph transformers is strengthening the…

Machine Learning · Computer Science 2026-01-09 Yun Young Choi , Sun Woo Park , Minho Lee , Youngho Woo

Large Sequence Models for Sequential Decision-Making: A Survey

Transformer architectures have facilitated the development of large-scale and general-purpose sequence models for prediction tasks in natural language processing and computer vision, e.g., GPT-3 and Swin Transformer. Although originally…

Machine Learning · Computer Science 2023-06-27 Muning Wen , Runji Lin , Hanjing Wang , Yaodong Yang , Ying Wen , Luo Mai , Jun Wang , Haifeng Zhang , Weinan Zhang

TransGAT: Transformer-Based Graph Neural Networks for Multi-Dimensional Automated Essay Scoring

Essay writing is a critical component of student assessment, yet manual scoring is labor-intensive and inconsistent. Automated Essay Scoring (AES) offers a promising alternative, but current approaches face limitations. Recent studies have…

Computation and Language · Computer Science 2025-09-03 Hind Aljuaid , Areej Alhothali , Ohoud Al-Zamzami , Hussein Assalahi

Transformers as Graph-to-Graph Models

We argue that Transformers are essentially graph-to-graph models, with sequences just being a special case. Attention weights are functionally equivalent to graph edges. Our Graph-to-Graph Transformer architecture makes this ability…

Computation and Language · Computer Science 2023-10-30 James Henderson , Alireza Mohammadshahi , Andrei C. Coman , Lesly Miculicich

Transformers are Graph Neural Networks

We establish connections between the Transformer architecture, originally introduced for natural language processing, and Graph Neural Networks (GNNs) for representation learning on graphs. We show how Transformers can be viewed as message…

Machine Learning · Computer Science 2025-06-30 Chaitanya K. Joshi

Nonparametric Teaching for Graph Property Learners

Inferring properties of graph-structured data, e.g., the solubility of molecules, essentially involves learning the implicit mapping from graphs to their properties. This learning process is often costly for graph property learners like…

Machine Learning · Computer Science 2025-05-22 Chen Zhang , Weixin Bu , Zeyi Ren , Zhengwu Liu , Yik-Chung Wu , Ngai Wong

Convolutional Sequence to Sequence Learning

The prevalent approach to sequence to sequence learning maps an input sequence to a variable length output sequence via recurrent neural networks. We introduce an architecture based entirely on convolutional neural networks. Compared to…

Computation and Language · Computer Science 2017-07-26 Jonas Gehring , Michael Auli , David Grangier , Denis Yarats , Yann N. Dauphin

Best of Both Worlds: Advantages of Hybrid Graph Sequence Models

Modern sequence models (e.g., Transformers, linear RNNs, etc.) emerged as dominant backbones of recent deep learning frameworks, mainly due to their efficiency, representational power, and/or ability to capture long-range dependencies.…

Machine Learning · Computer Science 2024-11-26 Ali Behrouz , Ali Parviz , Mahdi Karami , Clayton Sanford , Bryan Perozzi , Vahab Mirrokni

Generalized Transferable Neural Networks for Steady-State Partial Differential Equations

Deep learning has emerged as a compelling framework for scientific and engineering computing, motivating growing interest in neural network-based solvers for partial differential equations (PDEs). Within this landscape, network…

Numerical Analysis · Mathematics 2026-04-06 Tao Cheng , Lili Ju , Zhonghua Qiao , Xiaoping Zhang