Related papers: Spline-based Transformers

Learning to Encode Position for Transformer with Continuous Dynamical Model

We introduce a new way of learning to encode position information for non-recurrent models, such as Transformer models. Unlike RNN and LSTM, which contain inductive bias by loading the input tokens sequentially, non-recurrent models are…

Machine Learning · Computer Science 2020-03-23 Xuanqing Liu , Hsiang-Fu Yu , Inderjit Dhillon , Cho-Jui Hsieh

Improving Transformers using Faithful Positional Encoding

We propose a new positional encoding method for a neural network architecture called the Transformer. Unlike the standard sinusoidal positional encoding, our approach is based on solid mathematical grounds and has a guarantee of not losing…

Machine Learning · Computer Science 2024-05-17 Tsuyoshi Idé , Jokin Labaien , Pin-Yu Chen

Spline Positional Encoding for Learning 3D Implicit Signed Distance Fields

Multilayer perceptrons (MLPs) have been successfully used to represent 3D shapes implicitly and compactly, by mapping 3D coordinates to the corresponding signed distance values or occupancy values. In this paper, we propose a novel…

Computer Vision and Pattern Recognition · Computer Science 2021-10-29 Peng-Shuai Wang , Yang Liu , Yu-Qi Yang , Xin Tong

Spline Deformation Field

Trajectory modeling of dense points usually employs implicit deformation fields, represented as neural networks that map coordinates to relate canonical spatial positions to temporal offsets. However, the inductive biases inherent in neural…

Computer Vision and Pattern Recognition · Computer Science 2025-07-14 Mingyang Song , Yang Zhang , Marko Mihajlovic , Siyu Tang , Markus Gross , Tunç Ozan Aydın

Convolutional Transformer-Based Image Compression

In this paper, we present a novel transformer-based architecture for end-to-end image compression. Our architecture incorporates blocks that effectively capture local dependencies between tokens, eliminating the need for positional encoding…

Image and Video Processing · Electrical Eng. & Systems 2024-09-09 Bouzid Arezki , Fangchen Feng , Anissa Mokraoui

Multi-space Variational Encoder-Decoders for Semi-supervised Labeled Sequence Transduction

Labeled sequence transduction is a task of transforming one sequence into another sequence that satisfies desiderata specified by a set of labels. In this paper we propose multi-space variational encoder-decoders, a new model for labeled…

Computation and Language · Computer Science 2019-10-08 Chunting Zhou , Graham Neubig

Position Information Emerges in Causal Transformers Without Positional Encodings via Similarity of Nearby Embeddings

Transformers with causal attention can solve tasks that require positional information without using positional encodings. In this work, we propose and investigate a new hypothesis about how positional information can be stored without…

Computation and Language · Computer Science 2025-01-03 Chunsheng Zuo , Pavel Guerzhoy , Michael Guerzhoy

Algebraic Positional Encodings

We introduce a novel positional encoding strategy for Transformer-style models, addressing the shortcomings of existing, often ad hoc, approaches. Our framework provides a flexible mapping from the algebraic specification of a domain to an…

Machine Learning · Computer Science 2024-11-01 Konstantinos Kogkalidis , Jean-Philippe Bernardy , Vikas Garg

CoPE: A Lightweight Complex Positional Encoding

Recent studies have demonstrated the effectiveness of position encoding in transformer architectures. By incorporating positional information, this approach provides essential guidance for modeling dependencies between elements across…

Machine Learning · Computer Science 2025-08-27 Avinash Amballa

DeformSyncNet: Deformation Transfer via Synchronized Shape Deformation Spaces

Shape deformation is an important component in any geometry processing toolbox. The goal is to enable intuitive deformations of single or multiple shapes or to transfer example deformations to new shapes while preserving the plausibility of…

Graphics · Computer Science 2020-09-04 Minhyuk Sung , Zhenyu Jiang , Panos Achlioptas , Niloy J. Mitra , Leonidas J. Guibas

PermuteFormer: Efficient Relative Position Encoding for Long Sequences

A recent variation of Transformer, Performer, scales Transformer to longer sequences with a linear attention mechanism. However, it is not compatible with relative position encoding, which has advantages over absolute position encoding. In…

Computation and Language · Computer Science 2021-09-09 Peng Chen

SeqPE: Transformer with Sequential Position Encoding

Since self-attention layers in Transformers are permutation invariant by design, positional encodings must be explicitly incorporated to enable spatial understanding. However, fixed-size lookup tables used in traditional learnable position…

Machine Learning · Computer Science 2025-06-18 Huayang Li , Yahui Liu , Hongyu Sun , Deng Cai , Leyang Cui , Wei Bi , Peilin Zhao , Taro Watanabe

Positional Encoding in Transformer-Based Time Series Models: A Survey

Recent advancements in transformer-based models have greatly improved time series analysis, providing robust solutions for tasks such as forecasting, anomaly detection, and classification. A crucial element of these models is positional…

Machine Learning · Computer Science 2026-05-07 Habib Irani , Vangelis Metsis

Thin-Plate Spline Motion Model for Image Animation

Image animation brings life to the static object in the source image according to the driving video. Recent works attempt to perform motion transfer on arbitrary objects through unsupervised methods without using a priori knowledge.…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Jian Zhao , Hui Zhang

Sketchformer: Transformer-based Representation for Sketched Structure

Sketchformer is a novel transformer-based representation for encoding free-hand sketches input in a vector form, i.e. as a sequence of strokes. Sketchformer effectively addresses multiple tasks: sketch classification, sketch based image…

Computer Vision and Pattern Recognition · Computer Science 2020-02-25 Leo Sampaio Ferraz Ribeiro , Tu Bui , John Collomosse , Moacir Ponti

Transflow Learning: Repurposing Flow Models Without Retraining

It is well known that deep generative models have a rich latent space, and that it is possible to smoothly manipulate their outputs by traversing this latent space. Recently, architectures have emerged that allow for more complex…

Machine Learning · Computer Science 2019-12-06 Andrew Gambardella , Atılım Güneş Baydin , Philip H. S. Torr

Nonlinear Computation with Linear Optics via Source-Position Encoding

Optical computing systems provide an alternate hardware model which appears to be aligned with the demands of neural network workloads. However, the challenge of implementing energy efficient nonlinearities in optics -- a key requirement…

Optics · Physics 2025-08-04 N. Richardson , C. Bosch , R. P. Adams

Relative Positional Encoding for Transformers with Linear Complexity

Recent advances in Transformer models allow for unprecedented sequence lengths, due to linear space and time complexity. In the meantime, relative positional encoding (RPE) was proposed as beneficial for classical Transformers and consists…

Machine Learning · Computer Science 2021-06-11 Antoine Liutkus , Ondřej Cífka , Shih-Lun Wu , Umut Şimşekli , Yi-Hsuan Yang , Gaël Richard

No Representation without Transformation

We extend the framework of variational autoencoders to represent transformations explicitly in the latent space. In the family of hierarchical graphical models that emerges, the latent space is populated by higher order objects that are…

Machine Learning · Computer Science 2020-04-24 Giorgio Giannone , Saeed Saremi , Jonathan Masci , Christian Osendorfer

RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

We present RenderFormer, a neural rendering pipeline that directly renders an image from a triangle-based representation of a scene with full global illumination effects and that does not require per-scene training or fine-tuning. Instead…

Graphics · Computer Science 2025-05-29 Chong Zeng , Yue Dong , Pieter Peers , Hongzhi Wu , Xin Tong