Related papers: Convolutional Self-Attention Network

Convolutional Self-Attention Networks

Self-attention networks (SANs) have drawn increasing interest due to their high parallelization in computation and flexibility in modeling dependencies. SANs can be further enhanced with multi-head attention by allowing the model to attend…

Computation and Language · Computer Science 2019-04-08 Baosong Yang , Longyue Wang , Derek Wong , Lidia S. Chao , Zhaopeng Tu

Self-Attention Network for Skeleton-based Human Action Recognition

Skeleton-based action recognition has recently attracted a lot of attention. Researchers are coming up with new approaches for extracting spatio-temporal relations and making considerable progress on large-scale skeleton-based datasets.…

Computer Vision and Pattern Recognition · Computer Science 2019-12-19 Sangwoo Cho , Muhammad Hasan Maqbool , Fei Liu , Hassan Foroosh

Self-Attention: A Better Building Block for Sentiment Analysis Neural Network Classifiers

Sentiment Analysis has seen much progress in the past two decades. For the past few years, neural network approaches, primarily RNNs and CNNs, have been the most successful for this task. Recently, a new category of neural networks,…

Computation and Language · Computer Science 2018-12-20 Artaches Ambartsoumian , Fred Popowich

Deps-SAN: Neural Machine Translation with Dependency-Scaled Self-Attention Network

Syntax knowledge contributes its powerful strength in Neural machine translation (NMT) tasks. Early NMT works supposed that syntax details can be automatically learned from numerous texts via attention networks. However, succeeding…

Computation and Language · Computer Science 2022-10-05 Ru Peng , Nankai Lin , Yi Fang , Shengyi Jiang , Tianyong Hao , Boyu Chen , Junbo Zhao

DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding

Recurrent neural nets (RNN) and convolutional neural nets (CNN) are widely used on NLP tasks to capture the long-term and local dependencies, respectively. Attention mechanisms have recently attracted enormous interest due to their highly…

Computation and Language · Computer Science 2017-11-22 Tao Shen , Tianyi Zhou , Guodong Long , Jing Jiang , Shirui Pan , Chengqi Zhang

Context-Aware Self-Attention Networks

Self-attention model have shown its flexibility in parallel computation and the effectiveness on modeling both long- and short-term dependencies. However, it calculates the dependencies between representations without considering the…

Computation and Language · Computer Science 2019-02-18 Baosong Yang , Jian Li , Derek Wong , Lidia S. Chao , Xing Wang , Zhaopeng Tu

Spatial-Temporal Adaptive Graph Convolution with Attention Network for Traffic Forecasting

Traffic forecasting is one canonical example of spatial-temporal learning task in Intelligent Traffic System. Existing approaches capture spatial dependency with a pre-determined matrix in graph convolution neural operators. However, the…

Machine Learning · Computer Science 2022-06-08 Chen Weikang , Li Yawen , Xue Zhe , Li Ang , Wu Guobin

SSAN: Separable Self-Attention Network for Video Representation Learning

Self-attention has been successfully applied to video representation learning due to the effectiveness of modeling long range dependencies. Existing approaches build the dependencies merely by computing the pairwise correlations along…

Computer Vision and Pattern Recognition · Computer Science 2021-05-28 Xudong Guo , Xun Guo , Yan Lu

SAN: Learning Relationship between Convolutional Features for Multi-Scale Object Detection

Most of the recent successful methods in accurate object detection build on the convolutional neural networks (CNN). However, due to the lack of scale normalization in CNN-based detection methods, the activated channels in the feature space…

Computer Vision and Pattern Recognition · Computer Science 2018-08-16 Yonghyun Kim , Bong-Nam Kang , Daijin Kim

Simplicial Attention Neural Networks

The aim of this work is to introduce simplicial attention networks (SANs), i.e., novel neural architectures that operate on data defined on simplicial complexes leveraging masked self-attentional layers. Hinging on formal arguments from…

Machine Learning · Computer Science 2022-03-29 L. Giusti , C. Battiloro , P. Di Lorenzo , S. Sardellitti , S. Barbarossa

Hybrid Self-Attention Network for Machine Translation

The encoder-decoder is the typical framework for Neural Machine Translation (NMT), and different structures have been developed for improving the translation performance. Transformer is one of the most promising structures, which can…

Computation and Language · Computer Science 2018-12-11 Kaitao Song , Xu Tan , Furong Peng , Jianfeng Lu

Assessing the Ability of Self-Attention Networks to Learn Word Order

Self-attention networks (SAN) have attracted a lot of interests due to their high parallelization and strong performance on a variety of NLP tasks, e.g. machine translation. Due to the lack of recurrence structure such as recurrent neural…

Computation and Language · Computer Science 2019-06-04 Baosong Yang , Longyue Wang , Derek F. Wong , Lidia S. Chao , Zhaopeng Tu

Self-Attention Generative Adversarial Network for Speech Enhancement

Existing generative adversarial networks (GANs) for speech enhancement solely rely on the convolution operation, which may obscure temporal dependencies across the sequence input. To remedy this issue, we propose a self-attention layer…

Sound · Computer Science 2021-02-09 Huy Phan , Huy Le Nguyen , Oliver Y. Chén , Philipp Koch , Ngoc Q. K. Duong , Ian McLoughlin , Alfred Mertins

Bi-Directional Block Self-Attention for Fast and Memory-Efficient Sequence Modeling

Recurrent neural networks (RNN), convolutional neural networks (CNN) and self-attention networks (SAN) are commonly used to produce context-aware representations. RNN can capture long-range dependency but is hard to parallelize and not…

Computation and Language · Computer Science 2018-04-04 Tao Shen , Tianyi Zhou , Guodong Long , Jing Jiang , Chengqi Zhang

Modeling Localness for Self-Attention Networks

Self-attention networks have proven to be of profound value for its strength of capturing global dependencies. In this work, we propose to model localness for self-attention networks, which enhances the ability of capturing useful local…

Computation and Language · Computer Science 2018-10-25 Baosong Yang , Zhaopeng Tu , Derek F. Wong , Fandong Meng , Lidia S. Chao , Tong Zhang

Session-based Recommendation with Self-Attention Networks

Session-based recommendation aims to predict user's next behavior from current session and previous anonymous sessions. Capturing long-range dependencies between items is a vital challenge in session-based recommendation. A novel approach…

Information Retrieval · Computer Science 2021-02-04 Jun Fang

EAANet: Efficient Attention Augmented Convolutional Networks

Humans can effectively find salient regions in complex scenes. Self-attention mechanisms were introduced into Computer Vision (CV) to achieve this. Attention Augmented Convolutional Network (AANet) is a mixture of convolution and…

Computer Vision and Pattern Recognition · Computer Science 2022-06-07 Runqing Zhang , Tianshu Zhu

GAttANet: Global attention agreement for convolutional neural networks

Transformer attention architectures, similar to those developed for natural language processing, have recently proved efficient also in vision, either in conjunction with or as a replacement for convolutional layers. Typically, visual…

Computer Vision and Pattern Recognition · Computer Science 2021-07-01 Rufin VanRullen , Andrea Alamia

Temporal Convolutional Attention-based Network For Sequence Modeling

With the development of feed-forward models, the default model for sequence modeling has gradually evolved to replace recurrent networks. Many powerful feed-forward models based on convolutional networks and attention mechanism were…

Computation and Language · Computer Science 2023-10-17 Hongyan Hao , Yan Wang , Siqiao Xue , Yudi Xia , Jian Zhao , Furao Shen

Attention Augmented Convolutional Networks

Convolutional networks have been the paradigm of choice in many computer vision applications. The convolution operation however has a significant weakness in that it only operates on a local neighborhood, thus missing global information.…

Computer Vision and Pattern Recognition · Computer Science 2020-09-11 Irwan Bello , Barret Zoph , Ashish Vaswani , Jonathon Shlens , Quoc V. Le