Related papers: Self-Attention with Structural Position Representa…

Integrating Dependency Tree Into Self-attention for Sentence Representation

Recent progress on parse tree encoder for sentence representation learning is notable. However, these works mainly encode tree structures recursively, which is not conducive to parallelization. On the other hand, these works rarely take…

Computation and Language · Computer Science 2022-05-10 Junhua Ma , Jiajun Li , Yuxuan Liu , Shangbo Zhou , Xue Li

Self-Attention with Cross-Lingual Position Representation

Position encoding (PE), an essential part of self-attention networks (SANs), is used to preserve the word order information for natural language processing tasks, generating fixed position indices for input sequences. However, in…

Computation and Language · Computer Science 2020-11-24 Liang Ding , Longyue Wang , Dacheng Tao

Multiple Structural Priors Guided Self Attention Network for Language Understanding

Self attention networks (SANs) have been widely utilized in recent NLP studies. Unlike CNNs or RNNs, standard SANs are usually position-independent, and thus are incapable of capturing the structural priors between sequences of words.…

Computation and Language · Computer Science 2021-01-01 Le Qi , Yu Zhang , Qingyu Yin , Ting Liu

Assessing the Ability of Self-Attention Networks to Learn Word Order

Self-attention networks (SAN) have attracted a lot of interests due to their high parallelization and strong performance on a variety of NLP tasks, e.g. machine translation. Due to the lack of recurrence structure such as recurrent neural…

Computation and Language · Computer Science 2019-06-04 Baosong Yang , Longyue Wang , Derek F. Wong , Lidia S. Chao , Zhaopeng Tu

Learning to Embed Sentences Using Attentive Recursive Trees

Sentence embedding is an effective feature representation for most deep learning-based NLP tasks. One prevailing line of methods is using recursive latent tree-structured networks to embed sentences with task-specific structures. However,…

Computation and Language · Computer Science 2018-11-16 Jiaxin Shi , Lei Hou , Juanzi Li , Zhiyuan Liu , Hanwang Zhang

Evaluating Discourse in Structured Text Representations

Discourse structure is integral to understanding a text and is helpful in many NLP tasks. Learning latent representations of discourse is an attractive alternative to acquiring expensive labeled discourse data. Liu and Lapata (2018) propose…

Computation and Language · Computer Science 2019-06-11 Elisa Ferracane , Greg Durrett , Junyi Jessy Li , Katrin Erk

A Self-supervised Representation Learning of Sentence Structure for Authorship Attribution

Syntactic structure of sentences in a document substantially informs about its authorial writing style. Sentence representation learning has been widely explored in recent years and it has been shown that it improves the generalization of…

Computation and Language · Computer Science 2022-02-25 Fereshteh Jafariakinabad , Kien A. Hua

Convolutional Self-Attention Networks

Self-attention networks (SANs) have drawn increasing interest due to their high parallelization in computation and flexibility in modeling dependencies. SANs can be further enhanced with multi-head attention by allowing the model to attend…

Computation and Language · Computer Science 2019-04-08 Baosong Yang , Longyue Wang , Derek Wong , Lidia S. Chao , Zhaopeng Tu

StructLens: A Structural Lens for Language Models via Maximum Spanning Trees

Language exhibits inherent structures, a property that explains both language acquisition and language change. Given this characteristic, we expect language models to manifest their own internal structures as well. While interpretability…

Computation and Language · Computer Science 2026-05-19 Haruki Sakajo , Frederikus Hudi , Yusuke Sakai , Hidetaka Kamigaito , Taro Watanabe

Structural Attention Neural Networks for improved sentiment analysis

We introduce a tree-structured attention neural network for sentences and small phrases and apply it to the problem of sentiment classification. Our model expands the current recursive models by incorporating structural information around a…

Computation and Language · Computer Science 2017-01-10 Filippos Kokkinos , Alexandros Potamianos

Structured Self-Attention Weights Encode Semantics in Sentiment Analysis

Neural attention, especially the self-attention made popular by the Transformer, has become the workhorse of state-of-the-art natural language processing (NLP) models. Very recent work suggests that the self-attention in the Transformer…

Computation and Language · Computer Science 2020-10-16 Zhengxuan Wu , Thanh-Son Nguyen , Desmond C. Ong

Syntax-Enhanced Self-Attention-Based Semantic Role Labeling

As a fundamental NLP task, semantic role labeling (SRL) aims to discover the semantic roles for each predicate within one sentence. This paper investigates how to incorporate syntactic knowledge into the SRL task effectively. We present…

Computation and Language · Computer Science 2019-10-25 Yue Zhang , Rui Wang , Luo Si

Dependency-Aware Named Entity Recognition with Relative and Global Attentions

Named entity recognition is one of the core tasks in NLP. Although many improvements have been made on this task during the last years, the state-of-the-art systems do not explicitly take into account the recursive nature of language.…

Computation and Language · Computer Science 2019-09-12 Gustavo Aguilar , Thamar Solorio

Self-Attention: A Better Building Block for Sentiment Analysis Neural Network Classifiers

Sentiment Analysis has seen much progress in the past two decades. For the past few years, neural network approaches, primarily RNNs and CNNs, have been the most successful for this task. Recently, a new category of neural networks,…

Computation and Language · Computer Science 2018-12-20 Artaches Ambartsoumian , Fred Popowich

Context-Aware Self-Attention Networks

Self-attention model have shown its flexibility in parallel computation and the effectiveness on modeling both long- and short-term dependencies. However, it calculates the dependencies between representations without considering the…

Computation and Language · Computer Science 2019-02-18 Baosong Yang , Jian Li , Derek Wong , Lidia S. Chao , Xing Wang , Zhaopeng Tu

Combining Improvements for Exploiting Dependency Trees in Neural Semantic Parsing

The dependency tree of a natural language sentence can capture the interactions between semantics and words. However, it is unclear whether those methods which exploit such dependency information for semantic parsing can be combined to…

Computation and Language · Computer Science 2021-12-28 Defeng Xie , Jianmin Ji , Jiafei Xu , Ran Ji

How Does Selective Mechanism Improve Self-Attention Networks?

Self-attention networks (SANs) with selective mechanism has produced substantial improvements in various NLP tasks by concentrating on a subset of input words. However, the underlying reasons for their strong performance have not been well…

Computation and Language · Computer Science 2020-05-05 Xinwei Geng , Longyue Wang , Xing Wang , Bing Qin , Ting Liu , Zhaopeng Tu

Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction

Entities, as the essential elements in relation extraction tasks, exhibit certain structure. In this work, we formulate such structure as distinctive dependencies between mention pairs. We then propose SSAN, which incorporates these…

Computation and Language · Computer Science 2021-02-23 Benfeng Xu , Quan Wang , Yajuan Lyu , Yong Zhu , Zhendong Mao

Tree-structured Attention with Hierarchical Accumulation

Incorporating hierarchical structures like constituency trees has been shown to be effective for various natural language processing (NLP) tasks. However, it is evident that state-of-the-art (SOTA) sequence-based models like the Transformer…

Machine Learning · Computer Science 2020-02-20 Xuan-Phi Nguyen , Shafiq Joty , Steven C. H. Hoi , Richard Socher

Contextually Structured Token Dependency Encoding for Large Language Models

Token representation strategies within large-scale neural architectures often rely on contextually refined embeddings, yet conventional approaches seldom encode structured relationships explicitly within token interactions. Self-attention…

Computation and Language · Computer Science 2025-03-27 James Blades , Frederick Somerfield , William Langley , Susan Everingham , Maurice Witherington