Related papers: Influence Patterns for Explaining Information Flow…

Self-Attention Attribution: Interpreting Information Interactions Inside Transformer

The great success of Transformer-based models benefits from the powerful multi-head self-attention mechanism, which learns token dependencies and encodes contextual information from the input. Prior work strives to attribute model decisions…

Computation and Language · Computer Science 2021-02-26 Yaru Hao , Li Dong , Furu Wei , Ke Xu

Telling BERT's full story: from Local Attention to Global Aggregation

We take a deep look into the behavior of self-attention heads in the transformer architecture. In light of recent work discouraging the use of attention distributions for explaining a model's behavior, we show that attention distributions…

Machine Learning · Computer Science 2021-01-15 Damian Pascual , Gino Brunner , Roger Wattenhofer

Human Guided Exploitation of Interpretable Attention Patterns in Summarization and Topic Segmentation

The multi-head self-attention mechanism of the transformer model has been thoroughly investigated recently. In one vein of study, researchers are interested in understanding why and how transformers work. In another vein, researchers…

Computation and Language · Computer Science 2022-10-28 Raymond Li , Wen Xiao , Linzi Xing , Lanjun Wang , Gabriel Murray , Giuseppe Carenini

A Primer in BERTology: What we know about how BERT works

Transformer-based models have pushed state of the art in many areas of NLP, but our understanding of what is behind their success is still limited. This paper is the first survey of over 150 studies of the popular BERT model. We review the…

Computation and Language · Computer Science 2020-11-10 Anna Rogers , Olga Kovaleva , Anna Rumshisky

Of Non-Linearity and Commutativity in BERT

In this work we provide new insights into the transformer architecture, and in particular, its best-known variant, BERT. First, we propose a method to measure the degree of non-linearity of different elements of transformers. Next, we focus…

Computation and Language · Computer Science 2021-05-10 Sumu Zhao , Damian Pascual , Gino Brunner , Roger Wattenhofer

A Multiscale Visualization of Attention in the Transformer Model

The Transformer is a sequence model that forgoes traditional recurrent architectures in favor of a fully attention-based approach. Besides improving performance, an advantage of using attention is that it can also help to interpret a model…

Human-Computer Interaction · Computer Science 2019-06-14 Jesse Vig

Revealing the Dark Secrets of BERT

BERT-based architectures currently give state-of-the-art performance on many NLP tasks, but little is known about the exact mechanisms that contribute to its success. In the current work, we focus on the interpretation of self-attention,…

Computation and Language · Computer Science 2019-09-12 Olga Kovaleva , Alexey Romanov , Anna Rogers , Anna Rumshisky

From independent patches to coordinated attention: Controlling information flow in vision transformers

We make the information transmitted by attention an explicit, measurable quantity in vision transformers. By inserting variational information bottlenecks on all attention-mediated writes to the residual stream -- without other…

Machine Learning · Computer Science 2026-02-05 Kieran A. Murphy

What Does BERT Look At? An Analysis of BERT's Attention

Large pre-trained neural networks such as BERT have had great recent success in NLP, motivating a growing body of research investigating what aspects of language they are able to learn from unlabeled data. Most recent analysis has focused…

Computation and Language · Computer Science 2019-06-12 Kevin Clark , Urvashi Khandelwal , Omer Levy , Christopher D. Manning

Quantifying Attention Flow in Transformers

In the Transformer model, "self-attention" combines information from attended embeddings into the representation of the focal embedding in the next layer. Thus, across layers of the Transformer, information originating from different tokens…

Machine Learning · Computer Science 2020-06-02 Samira Abnar , Willem Zuidema

Are Sixteen Heads Really Better than One?

Attention is a powerful and ubiquitous mechanism for allowing neural models to focus on particular salient pieces of information by taking their weighted average when making predictions. In particular, multi-headed attention is a driving…

Computation and Language · Computer Science 2019-11-05 Paul Michel , Omer Levy , Graham Neubig

Input-length-shortening and text generation via attention values

Identifying words that impact a task's performance more than others is a challenge in natural language processing. Transformers models have recently addressed this issue by incorporating an attention mechanism that assigns greater attention…

Computation and Language · Computer Science 2023-03-15 Neşet Özkan Tan , Alex Yuxuan Peng , Joshua Bensemann , Qiming Bao , Tim Hartill , Mark Gahegan , Michael Witbrock

On the Surprising Effectiveness of Attention Transfer for Vision Transformers

Conventional wisdom suggests that pre-training Vision Transformers (ViT) improves downstream performance by learning useful representations. Is this actually true? We investigate this question and find that the features and representations…

Machine Learning · Computer Science 2024-11-15 Alexander C. Li , Yuandong Tian , Beidi Chen , Deepak Pathak , Xinlei Chen

Information Flow Routes: Automatically Interpreting Language Models at Scale

Information flows by routes inside the network via mechanisms implemented in the model. These routes can be represented as graphs where nodes correspond to token representations and edges to operations inside the network. We automatically…

Computation and Language · Computer Science 2024-04-18 Javier Ferrando , Elena Voita

Inserting Information Bottlenecks for Attribution in Transformers

Pretrained transformers achieve the state of the art across tasks in natural language processing, motivating researchers to investigate their inner mechanisms. One common direction is to understand what features are important for…

Computation and Language · Computer Science 2021-08-06 Zhiying Jiang , Raphael Tang , Ji Xin , Jimmy Lin

Breaking BERT: Evaluating and Optimizing Sparsified Attention

Transformers allow attention between all pairs of tokens, but there is reason to believe that most of these connections - and their quadratic time and memory - may not be necessary. But which ones? We evaluate the impact of sparsification…

Computation and Language · Computer Science 2022-10-11 Siddhartha Brahma , Polina Zablotskaia , David Mimno

Attention is Not Only a Weight: Analyzing Transformers with Vector Norms

Attention is a key component of Transformers, which have recently achieved considerable success in natural language processing. Hence, attention is being extensively studied to investigate various linguistic capabilities of Transformers,…

Computation and Language · Computer Science 2020-10-07 Goro Kobayashi , Tatsuki Kuribayashi , Sho Yokoi , Kentaro Inui

Does Self-Attention Need Separate Weights in Transformers?

The success of self-attention lies in its ability to capture long-range dependencies and enhance context understanding, but it is limited by its computational complexity and challenges in handling sequential data with inherent…

Computation and Language · Computer Science 2025-05-05 Md Kowsher , Nusrat Jahan Prottasha , Chun-Nam Yu , Ozlem Ozmen Garibay , Niloofar Yousefi

Modeling Attention Flow on Graphs

Real-world scenarios demand reasoning about process, more than final outcome prediction, to discover latent causal chains and better understand complex systems. It requires the learning algorithms to offer both accurate predictions and…

Artificial Intelligence · Computer Science 2019-01-09 Xiaoran Xu , Songpeng Zu , Chengliang Gao , Yuan Zhang , Wei Feng

Detecting Gender Bias in Transformer-based Models: A Case Study on BERT

In this paper, we propose a novel gender bias detection method by utilizing attention map for transformer-based models. We 1) give an intuitive gender bias judgement method by comparing the different relation degree between the genders and…

Computation and Language · Computer Science 2021-11-01 Bingbing Li , Hongwu Peng , Rajat Sainju , Junhuan Yang , Lei Yang , Yueying Liang , Weiwen Jiang , Binghui Wang , Hang Liu , Caiwen Ding