English
Related papers

Related papers: Spline-based Transformers

200 papers

We introduce a new way of learning to encode position information for non-recurrent models, such as Transformer models. Unlike RNN and LSTM, which contain inductive bias by loading the input tokens sequentially, non-recurrent models are…

Machine Learning · Computer Science 2020-03-23 Xuanqing Liu , Hsiang-Fu Yu , Inderjit Dhillon , Cho-Jui Hsieh

We propose a new positional encoding method for a neural network architecture called the Transformer. Unlike the standard sinusoidal positional encoding, our approach is based on solid mathematical grounds and has a guarantee of not losing…

Machine Learning · Computer Science 2024-05-17 Tsuyoshi Idé , Jokin Labaien , Pin-Yu Chen

Multilayer perceptrons (MLPs) have been successfully used to represent 3D shapes implicitly and compactly, by mapping 3D coordinates to the corresponding signed distance values or occupancy values. In this paper, we propose a novel…

Computer Vision and Pattern Recognition · Computer Science 2021-10-29 Peng-Shuai Wang , Yang Liu , Yu-Qi Yang , Xin Tong

Trajectory modeling of dense points usually employs implicit deformation fields, represented as neural networks that map coordinates to relate canonical spatial positions to temporal offsets. However, the inductive biases inherent in neural…

Computer Vision and Pattern Recognition · Computer Science 2025-07-14 Mingyang Song , Yang Zhang , Marko Mihajlovic , Siyu Tang , Markus Gross , Tunç Ozan Aydın

In this paper, we present a novel transformer-based architecture for end-to-end image compression. Our architecture incorporates blocks that effectively capture local dependencies between tokens, eliminating the need for positional encoding…

Image and Video Processing · Electrical Eng. & Systems 2024-09-09 Bouzid Arezki , Fangchen Feng , Anissa Mokraoui

Labeled sequence transduction is a task of transforming one sequence into another sequence that satisfies desiderata specified by a set of labels. In this paper we propose multi-space variational encoder-decoders, a new model for labeled…

Computation and Language · Computer Science 2019-10-08 Chunting Zhou , Graham Neubig

Transformers with causal attention can solve tasks that require positional information without using positional encodings. In this work, we propose and investigate a new hypothesis about how positional information can be stored without…

Computation and Language · Computer Science 2025-01-03 Chunsheng Zuo , Pavel Guerzhoy , Michael Guerzhoy

We introduce a novel positional encoding strategy for Transformer-style models, addressing the shortcomings of existing, often ad hoc, approaches. Our framework provides a flexible mapping from the algebraic specification of a domain to an…

Machine Learning · Computer Science 2024-11-01 Konstantinos Kogkalidis , Jean-Philippe Bernardy , Vikas Garg

Recent studies have demonstrated the effectiveness of position encoding in transformer architectures. By incorporating positional information, this approach provides essential guidance for modeling dependencies between elements across…

Machine Learning · Computer Science 2025-08-27 Avinash Amballa

Shape deformation is an important component in any geometry processing toolbox. The goal is to enable intuitive deformations of single or multiple shapes or to transfer example deformations to new shapes while preserving the plausibility of…

Graphics · Computer Science 2020-09-04 Minhyuk Sung , Zhenyu Jiang , Panos Achlioptas , Niloy J. Mitra , Leonidas J. Guibas

A recent variation of Transformer, Performer, scales Transformer to longer sequences with a linear attention mechanism. However, it is not compatible with relative position encoding, which has advantages over absolute position encoding. In…

Computation and Language · Computer Science 2021-09-09 Peng Chen

Since self-attention layers in Transformers are permutation invariant by design, positional encodings must be explicitly incorporated to enable spatial understanding. However, fixed-size lookup tables used in traditional learnable position…

Machine Learning · Computer Science 2025-06-18 Huayang Li , Yahui Liu , Hongyu Sun , Deng Cai , Leyang Cui , Wei Bi , Peilin Zhao , Taro Watanabe

Recent advancements in transformer-based models have greatly improved time series analysis, providing robust solutions for tasks such as forecasting, anomaly detection, and classification. A crucial element of these models is positional…

Machine Learning · Computer Science 2026-05-07 Habib Irani , Vangelis Metsis

Image animation brings life to the static object in the source image according to the driving video. Recent works attempt to perform motion transfer on arbitrary objects through unsupervised methods without using a priori knowledge.…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Jian Zhao , Hui Zhang

Sketchformer is a novel transformer-based representation for encoding free-hand sketches input in a vector form, i.e. as a sequence of strokes. Sketchformer effectively addresses multiple tasks: sketch classification, sketch based image…

Computer Vision and Pattern Recognition · Computer Science 2020-02-25 Leo Sampaio Ferraz Ribeiro , Tu Bui , John Collomosse , Moacir Ponti

It is well known that deep generative models have a rich latent space, and that it is possible to smoothly manipulate their outputs by traversing this latent space. Recently, architectures have emerged that allow for more complex…

Machine Learning · Computer Science 2019-12-06 Andrew Gambardella , Atılım Güneş Baydin , Philip H. S. Torr

Optical computing systems provide an alternate hardware model which appears to be aligned with the demands of neural network workloads. However, the challenge of implementing energy efficient nonlinearities in optics -- a key requirement…

Optics · Physics 2025-08-04 N. Richardson , C. Bosch , R. P. Adams

Recent advances in Transformer models allow for unprecedented sequence lengths, due to linear space and time complexity. In the meantime, relative positional encoding (RPE) was proposed as beneficial for classical Transformers and consists…

Machine Learning · Computer Science 2021-06-11 Antoine Liutkus , Ondřej Cífka , Shih-Lun Wu , Umut Şimşekli , Yi-Hsuan Yang , Gaël Richard

We extend the framework of variational autoencoders to represent transformations explicitly in the latent space. In the family of hierarchical graphical models that emerges, the latent space is populated by higher order objects that are…

Machine Learning · Computer Science 2020-04-24 Giorgio Giannone , Saeed Saremi , Jonathan Masci , Christian Osendorfer

We present RenderFormer, a neural rendering pipeline that directly renders an image from a triangle-based representation of a scene with full global illumination effects and that does not require per-scene training or fine-tuning. Instead…

Graphics · Computer Science 2025-05-29 Chong Zeng , Yue Dong , Pieter Peers , Hongzhi Wu , Xin Tong
‹ Prev 1 2 3 10 Next ›