English

Improving Transformers using Faithful Positional Encoding

Machine Learning 2024-05-17 v2

Abstract

We propose a new positional encoding method for a neural network architecture called the Transformer. Unlike the standard sinusoidal positional encoding, our approach is based on solid mathematical grounds and has a guarantee of not losing information about the positional order of the input sequence. We show that the new encoding approach systematically improves the prediction performance in the time-series classification task.

Keywords

Cite

@article{arxiv.2405.09061,
  title  = {Improving Transformers using Faithful Positional Encoding},
  author = {Tsuyoshi Idé and Jokin Labaien and Pin-Yu Chen},
  journal= {arXiv preprint arXiv:2405.09061},
  year   = {2024}
}

Comments

arXiv admin note: text overlap with arXiv:2305.17149