Machine Learning · Computer Science
NoiseFormer -- Noise Diffused Symmetric Attention Transformer
Phani Kumar, Nyshadham, Jyothendra Varma, Polisetty V R K +1
2026-01-21
Machine Learning · Computer Science
SparseBERT: Rethinking the Importance Analysis in Self-attention
Han Shi, Jiahui Gao, Xiaozhe Ren, Hang Xu +3
2021-07-02
Machine Learning · Computer Science
A Length Adaptive Algorithm-Hardware Co-design of Transformer on FPGA Through Sparse Attention and Dynamic Pipelining
Hongwu Peng, Shaoyi Huang, Shiyang Chen, Bingbing Li +7
2022-08-23
Computation and Language · Computer Science
Adaptive Sparse and Monotonic Attention for Transformer-based Automatic Speech Recognition
Chendong Zhao, Jianzong Wang, Wen qi Wei, Xiaoyang Qu +2
2022-10-03
Machine Learning · Computer Science
Transformer Acceleration with Dynamic Sparse Attention
Liu Liu, Zheng Qu, Zhaodong Chen, Yufei Ding +1
2021-10-22
Machine Learning · Statistics
The Kanerva Machine: A Generative Distributed Memory
Yan Wu, Greg Wayne, Alex Graves, Timothy Lillicrap
2018-06-19
Neural and Evolutionary Computing · Computer Science
Sparse Distributed Memory is a Continual Learner
Trenton Bricken, Xander Davies, Deepak Singh, Dmitry Krotov +1
2023-03-28
Computation and Language · Computer Science
Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers
Chao Lou, Zixia Jia, Zilong Zheng, Kewei Tu
2024-06-25
Machine Learning · Computer Science
Transformers meet Stochastic Block Models: Attention with Data-Adaptive Sparsity and Cost
Sungjun Cho, Seonwoo Min, Jinwoo Kim, Moontae Lee +2
2022-10-28
Machine Learning · Computer Science
Understanding Transformer from the Perspective of Associative Memory
Shu Zhong, Mingyu Xu, Tenglong Ao, Guang Shi
2025-05-27
Machine Learning · Computer Science
Efficient Content-Based Sparse Attention with Routing Transformers
Aurko Roy, Mohammad Saffar, Ashish Vaswani, David Grangier
2020-10-27
Computation and Language · Computer Science
Smart Bird: Learnable Sparse Attention for Efficient and Effective Transformer
Chuhan Wu, Fangzhao Wu, Tao Qi, Binxing Jiao +3
2021-09-03
Machine Learning · Computer Science
Trading with the Momentum Transformer: An Intelligent and Interpretable Architecture
Kieran Wood, Sven Giegerich, Stephen Roberts, Stefan Zohren
2022-11-24
Computer Vision and Pattern Recognition · Computer Science
An Empirical Study of Spatial Attention Mechanisms in Deep Networks
Xizhou Zhu, Dazhi Cheng, Zheng Zhang, Stephen Lin +1
2019-04-15
Machine Learning · Computer Science
Dimensional Collapse in Transformer Attention Outputs: A Challenge for Sparse Dictionary Learning
Junxuan Wang, Xuyang Ge, Wentao Shu, Zhengfu He +1
2026-02-12