English

Bidirectional Long-Range Parser for Sequential Data Understanding

Computer Vision and Pattern Recognition 2024-04-09 v1 Computation and Language Machine Learning

Abstract

The transformer is a powerful data modelling framework responsible for remarkable performance on a wide range of tasks. However, they are limited in terms of scalability as it is suboptimal and inefficient to process long-sequence data. To this purpose we introduce BLRP (Bidirectional Long-Range Parser), a novel and versatile attention mechanism designed to increase performance and efficiency on long-sequence tasks. It leverages short and long range heuristics in the form of a local sliding window approach combined with a global bidirectional latent space synthesis technique. We show the benefits and versatility of our approach on vision and language domains by demonstrating competitive results against state-of-the-art methods on the Long-Range-Arena and CIFAR benchmarks together with ablations demonstrating the computational efficiency.

Keywords

Cite

@article{arxiv.2404.05210,
  title  = {Bidirectional Long-Range Parser for Sequential Data Understanding},
  author = {George Leotescu and Daniel Voinea and Alin-Ionut Popa},
  journal= {arXiv preprint arXiv:2404.05210},
  year   = {2024}
}
R2 v1 2026-06-28T15:47:01.567Z