English
Related papers

Related papers: Graded Transformers

200 papers

This paper presents a transformative framework for artificial neural networks over graded vector spaces, tailored to model hierarchical and structured data in fields like algebraic geometry and physics. By exploiting the algebraic…

Artificial Intelligence · Computer Science 2026-01-07 Tony Shaska

This paper presents a novel framework for graded neural networks (GNNs) built over graded vector spaces $\V_\w^n$, extending classical neural architectures by incorporating algebraic grading. Leveraging a coordinate-wise grading structure…

Machine Learning · Computer Science 2026-04-24 Tony Shaska

Transformers serve as the backbone architectures of Foundational Models, where domain-specific tokenizers allow them to adapt to various domains. Graph Transformers (GTs) have recently emerged as leading models in geometric deep learning,…

Neural and Evolutionary Computing · Computer Science 2025-04-03 Limei Wang , Kaveh Hassani , Si Zhang , Dongqi Fu , Baichuan Yuan , Weilin Cong , Zhigang Hua , Hao Wu , Ning Yao , Bo Long

Hypergraphs play a pivotal role in the modelling of data featuring higher-order relations involving more than two entities. Hypergraph neural networks emerge as a powerful tool for processing hypergraph-structured data, delivering…

Machine Learning · Computer Science 2024-06-04 Zexi Liu , Bohan Tang , Ziyuan Ye , Xiaowen Dong , Siheng Chen , Yanfeng Wang

Feature transformation plays a critical role in enhancing machine learning model performance by optimizing data representations. Recent state-of-the-art approaches address this task as a continuous embedding optimization problem, converting…

Machine Learning · Computer Science 2025-08-29 Yang Gao , Dongjie Wang , Scott Piersall , Ye Zhang , Liqiang Wang

Graph processes exhibit a temporal structure determined by the sequence index and and a spatial structure determined by the graph support. To learn from graph processes, an information processing architecture must then be able to exploit…

Signal Processing · Electrical Eng. & Systems 2020-12-02 Luana Ruiz , Fernando Gama , Alejandro Ribeiro

Transformers have recently emerged as powerful neural networks for graph learning, showcasing state-of-the-art performance on several graph property prediction tasks. However, these results have been limited to small-scale graphs, where the…

Machine Learning · Computer Science 2023-12-19 Vijay Prakash Dwivedi , Yozen Liu , Anh Tuan Luu , Xavier Bresson , Neil Shah , Tong Zhao

Gradients of neural networks encode valuable information for optimization, editing, and analysis of models. Therefore, practitioners often treat gradients as inputs to task-specific algorithms, e.g. for pruning or optimization. Recent works…

Machine Learning · Computer Science 2025-10-14 Yoav Gelberg , Yam Eitan , Aviv Navon , Aviv Shamsian , Theo , Putterman , Michael Bronstein , Haggai Maron

Pretraining plays a pivotal role in acquiring generalized knowledge from large-scale data, achieving remarkable successes as evidenced by large models in CV and NLP. However, progress in the graph domain remains limited due to fundamental…

Artificial Intelligence · Computer Science 2024-06-21 Yu Song , Haitao Mao , Jiachen Xiao , Jingzhe Liu , Zhikai Chen , Wei Jin , Carl Yang , Jiliang Tang , Hui Liu

Machine-learning-based surrogate models offer significant computational efficiency and faster simulations compared to traditional numerical methods, especially for problems requiring repeated evaluations of partial differential equations.…

Machine Learning · Computer Science 2025-12-15 Qibang Liu , Weiheng Zhong , Hadi Meidani , Diab Abueidda , Seid Koric , Philippe Geubelle

Neural Architecture Representation Learning aims to transform network models into feature representations for predicting network attributes, playing a crucial role in deploying and designing networks for real-world applications. Recently,…

Machine Learning · Computer Science 2025-06-10 Haizhao Jing , Haokui Zhang , Zhenhao Shang , Rong Xiao , Peng Wang , Yanning Zhang

Transformers have revolutionized performance in Natural Language Processing and Vision, paving the way for their integration with Graph Neural Networks (GNNs). One key challenge in enhancing graph transformers is strengthening the…

Machine Learning · Computer Science 2026-01-09 Yun Young Choi , Sun Woo Park , Minho Lee , Youngho Woo

Transformer architectures have facilitated the development of large-scale and general-purpose sequence models for prediction tasks in natural language processing and computer vision, e.g., GPT-3 and Swin Transformer. Although originally…

Machine Learning · Computer Science 2023-06-27 Muning Wen , Runji Lin , Hanjing Wang , Yaodong Yang , Ying Wen , Luo Mai , Jun Wang , Haifeng Zhang , Weinan Zhang

Essay writing is a critical component of student assessment, yet manual scoring is labor-intensive and inconsistent. Automated Essay Scoring (AES) offers a promising alternative, but current approaches face limitations. Recent studies have…

Computation and Language · Computer Science 2025-09-03 Hind Aljuaid , Areej Alhothali , Ohoud Al-Zamzami , Hussein Assalahi

We argue that Transformers are essentially graph-to-graph models, with sequences just being a special case. Attention weights are functionally equivalent to graph edges. Our Graph-to-Graph Transformer architecture makes this ability…

Computation and Language · Computer Science 2023-10-30 James Henderson , Alireza Mohammadshahi , Andrei C. Coman , Lesly Miculicich

We establish connections between the Transformer architecture, originally introduced for natural language processing, and Graph Neural Networks (GNNs) for representation learning on graphs. We show how Transformers can be viewed as message…

Machine Learning · Computer Science 2025-06-30 Chaitanya K. Joshi

Inferring properties of graph-structured data, e.g., the solubility of molecules, essentially involves learning the implicit mapping from graphs to their properties. This learning process is often costly for graph property learners like…

Machine Learning · Computer Science 2025-05-22 Chen Zhang , Weixin Bu , Zeyi Ren , Zhengwu Liu , Yik-Chung Wu , Ngai Wong

The prevalent approach to sequence to sequence learning maps an input sequence to a variable length output sequence via recurrent neural networks. We introduce an architecture based entirely on convolutional neural networks. Compared to…

Computation and Language · Computer Science 2017-07-26 Jonas Gehring , Michael Auli , David Grangier , Denis Yarats , Yann N. Dauphin

Modern sequence models (e.g., Transformers, linear RNNs, etc.) emerged as dominant backbones of recent deep learning frameworks, mainly due to their efficiency, representational power, and/or ability to capture long-range dependencies.…

Machine Learning · Computer Science 2024-11-26 Ali Behrouz , Ali Parviz , Mahdi Karami , Clayton Sanford , Bryan Perozzi , Vahab Mirrokni

Deep learning has emerged as a compelling framework for scientific and engineering computing, motivating growing interest in neural network-based solvers for partial differential equations (PDEs). Within this landscape, network…

Numerical Analysis · Mathematics 2026-04-06 Tao Cheng , Lili Ju , Zhonghua Qiao , Xiaoping Zhang
‹ Prev 1 2 3 10 Next ›