Related papers: Can Transformers Do Enumerative Geometry?

Transformer Meets Boundary Value Inverse Problems

A Transformer-based deep direct sampling method is proposed for electrical impedance tomography, a well-known severely ill-posed nonlinear boundary value inverse problem. A real-time reconstruction is achieved by evaluating the learned…

Machine Learning · Computer Science 2023-03-07 Ruchi Guo, Shuhao Cao, Long Chen

Persformer: A Transformer Architecture for Topological Machine Learning

One of the main challenges of Topological Data Analysis (TDA) is to extract features from persistence diagrams directly usable by machine learning algorithms. Indeed, persistence diagrams are intrinsically (multi-)sets of points in…

Machine Learning · Computer Science 2022-09-27 Raphael Reinauer, Matteo Caorsi, Nicolas Berkouk

Bringing Attention to CAD: Boundary Representation Learning via Transformer

The recent rise of generative artificial intelligence (AI), powered by Transformer networks, has achieved remarkable success in natural language processing, computer vision, and graphics. However, the application of Transformers in…

Graphics · Computer Science 2025-09-01 Qiang Zou, Lizhen Zhu

Life Sequence Transformer: Generative Modelling of Socio-Economic Trajectories from Administrative Data

Generative modelling with Transformer architectures can simulate complex sequential structures across various applications. We extend this line of work to the social sciences by introducing a Transformer-based generative model tailored to…

Econometrics · Economics 2026-01-23 Alberto Cabezas, Carlotta Montorsi

inTformer: A Time-Embedded Attention-Based Transformer for Crash Likelihood Prediction at Intersections Using Connected Vehicle Data

The real-time crash likelihood prediction model is an essential component of the proactive traffic safety management system. Over the years, numerous studies have attempted to construct a crash likelihood prediction model in order to…

Machine Learning · Computer Science 2023-08-30 B M Tazbiul Hassan Anik, Zubayer Islam, Mohamed Abdel-Aty

(How) Can Transformers Predict Pseudo-Random Numbers?

Transformers excel at discovering patterns in sequential data, yet their fundamental limitations and learning mechanisms remain crucial topics of investigation. In this paper, we study the ability of Transformers to learn pseudo-random…

Machine Learning · Computer Science 2025-07-10 Tao Tao, Darshil Doshi, Dayal Singh Kalra, Tianyu He, Maissam Barkeshli

Measure-to-measure interpolation using Transformers

Transformers are deep neural network architectures that underpin the recent successes of large language models. Unlike more classical architectures that can be viewed as point-to-point maps, a Transformer acts as a measure-to-measure map…

Optimization and Control · Mathematics 2026-02-16 Borjan Geshkovski, Philippe Rigollet, Domènec Ruiz-Balet

Learning Modular Exponentiation with Transformers

Modular exponentiation is crucial to number theory and cryptography, yet remains largely unexplored from a mechanistic interpretability standpoint. We train a 4-layer encoder-decoder Transformer model to perform this operation and…

Machine Learning · Computer Science 2025-10-24 David Demitri Africa, Sara M. Kapoor, Theo Simon Sorg, Challenger Mishra

NLAFormer: Transformers Learn Numerical Linear Algebra Operations

Transformers are effective and efficient at modeling complex relationships and learning patterns from structured data in many applications. The main aim of this paper is to propose and design NLAFormer, which is a transformer-based…

Numerical Analysis · Mathematics 2025-08-28 Zhantao Ma, Yihang Gao, Michael K. Ng

Cross-Representation Knowledge Transfer for Improved Sequential Recommendations

Transformer architectures, capable of capturing sequential dependencies in the history of user interactions, have become the dominant approach in sequential recommender systems. Despite their success, such models consider sequence elements…

Information Retrieval · Computer Science 2026-03-02 Artur Gimranov, Viacheslav Yusupov, Elfat Sabitov, Tatyana Matveeva, Anton Lysenko, Ruslan Israfilov, Evgeny Frolov

Transformers as Meta-Learners for Implicit Neural Representations

Implicit Neural Representations (INRs) have emerged and shown their benefits over discrete representations in recent years. However, fitting an INR to the given observations usually requires optimization with gradient descent from scratch,…

Machine Learning · Computer Science 2022-08-08 Yinbo Chen, Xiaolong Wang

Transformers Learn Shortcuts to Automata

Algorithmic reasoning requires capabilities which are most naturally understood through recurrent models of computation, like the Turing machine. However, Transformer models, while lacking recurrence, are able to perform such reasoning…

Machine Learning · Computer Science 2023-05-03 Bingbin Liu, Jordan T. Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang

Geometric Transformer for End-to-End Molecule Properties Prediction

Transformers have become methods of choice in many applications thanks to their ability to represent complex interactions between elements. However, extending the Transformer architecture to non-sequential data such as molecules and…

Machine Learning · Computer Science 2022-04-27 Yoni Choukroun, Lior Wolf

A Practical Survey on Faster and Lighter Transformers

Recurrent neural networks are effective models to process sequences. However, they are unable to learn long-term dependencies because of their inherent sequential nature. As a solution, Vaswani et al. introduced the Transformer, a model…

Machine Learning · Computer Science 2023-03-28 Quentin Fournier, Gaétan Marceau Caron, Daniel Aloise

IntTrajSim: Trajectory Prediction for Simulating Multi-Vehicle driving at Signalized Intersections

Traffic simulators are widely used to study the operational efficiency of road infrastructure, but their rule-based approach limits their ability to mimic real-world driving behavior. Traffic intersections are critical components of the…

Artificial Intelligence · Computer Science 2025-06-11 Yash Ranjan, Rahul Sengupta, Anand Rangarajan, Sanjay Ranka

Learning to Recall with Transformers Beyond Orthogonal Embeddings

Modern large language models (LLMs) excel at tasks that require storing and retrieving knowledge, such as factual recall and question answering. Transformers are central to this capability because they can encode information during training…

Machine Learning · Statistics 2026-03-18 Nuri Mert Vural, Alberto Bietti, Mahdi Soltanolkotabi, Denny Wu

Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation

Transformer-based detection and segmentation methods use a list of learned detection queries to retrieve information from the transformer network and learn to predict the location and category of one specific object from each query. We…

Computer Vision and Pattern Recognition · Computer Science 2023-07-31 Yiming Cui, Linjie Yang, Haichao Yu

LMFormer: Lane based Motion Prediction Transformer

Motion prediction plays an important role in autonomous driving. This study presents LMFormer, a lane-aware transformer network for trajectory prediction tasks. In contrast to previous studies, our work provides a simple mechanism to…

Computer Vision and Pattern Recognition · Computer Science 2025-04-15 Harsh Yadav, Maximilian Schaefer, Kun Zhao, Tobias Meisen

Continual Learning with Transformers for Image Classification

In many real-world scenarios, data to train machine learning models become available over time. However, neural network models struggle to continually learn new concepts without forgetting what has been learnt in the past. This phenomenon…

Machine Learning · Computer Science 2022-06-29 Beyza Ermis, Giovanni Zappella, Martin Wistuba, Aditya Rawal, Cedric Archambeau

Transformers can optimally learn regression mixture models

Mixture models arise in many regression problems, but most methods have seen limited adoption partly due to these algorithms' highly-tailored and model-specific nature. On the other hand, transformers are flexible, neural sequence models…

Machine Learning · Computer Science 2023-11-15 Reese Pathak, Rajat Sen, Weihao Kong, Abhimanyu Das