Related papers: Dodrio: Exploring Transformer Models with Interact…

Do Transformer Attention Heads Provide Transparency in Abstractive Summarization?

Learning algorithms become more powerful, often at the cost of increased complexity. In response, the demand for algorithms to be transparent is growing. In NLP tasks, attention distributions learned by attention-based deep learning models…

Computation and Language · Computer Science 2019-07-09 Joris Baan , Maartje ter Hoeve , Marlies van der Wees , Anne Schuth , Maarten de Rijke

Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach

Recent advancements in Multimodal Large Language Models (MLLMs) have demonstrated remarkable progress in visual understanding. This impressive leap raises a compelling question: how can language models, initially trained solely on…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Jing Bi , Junjia Guo , Yunlong Tang , Lianggong Bruce Wen , Zhang Liu , Chenliang Xu

VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers

Breakthroughs in transformer-based models have revolutionized not only the NLP field, but also vision and multimodal systems. However, although visualization and interpretability tools have become available for NLP models, internal…

Computer Vision and Pattern Recognition · Computer Science 2022-08-24 Estelle Aflalo , Meng Du , Shao-Yen Tseng , Yongfei Liu , Chenfei Wu , Nan Duan , Vasudev Lal

Attention Flows: Analyzing and Comparing Attention Mechanisms in Language Models

Advances in language modeling have led to the development of deep attention-based models that are performant across a wide variety of natural language processing (NLP) problems. These language models are typified by a pre-training process…

Human-Computer Interaction · Computer Science 2020-09-16 Joseph F DeRose , Jiayao Wang , Matthew Berger

VISIT: Visualizing and Interpreting the Semantic Information Flow of Transformers

Recent advances in interpretability suggest we can project weights and hidden states of transformer-based language models (LMs) to their vocabulary, a transformation that makes them more human interpretable. In this paper, we investigate LM…

Computation and Language · Computer Science 2023-11-27 Shahar Katz , Yonatan Belinkov

Neural Attention: A Novel Mechanism for Enhanced Expressive Power in Transformer Models

Transformer models typically calculate attention matrices using dot products, which have limitations when capturing nonlinear relationships between embedding vectors. We propose Neural Attention, a technique that replaces dot products with…

Machine Learning · Computer Science 2025-11-10 Andrew DiGiugno , Ausif Mahmood

A Multiscale Visualization of Attention in the Transformer Model

The Transformer is a sequence model that forgoes traditional recurrent architectures in favor of a fully attention-based approach. Besides improving performance, an advantage of using attention is that it can also help to interpret a model…

Human-Computer Interaction · Computer Science 2019-06-14 Jesse Vig

Is Random Attention Sufficient for Sequence Modeling? Disentangling Trainable Components in the Transformer

The transformer architecture is central to the success of modern Large Language Models (LLMs), in part due to its surprising ability to perform a wide range of tasks - including mathematical reasoning, memorization, and retrieval - using…

Machine Learning · Computer Science 2025-09-05 Yihe Dong , Lorenzo Noci , Mikhail Khodak , Mufan Li

exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformers Models

Large language models can produce powerful contextual representations that lead to improvements across many NLP tasks. Since these models are typically guided by a sequence of learned self attention mechanisms and may comprise undesired…

Computation and Language · Computer Science 2019-10-14 Benjamin Hoover , Hendrik Strobelt , Sebastian Gehrmann

Mechanistic Interpretability of GPT-like Models on Summarization Tasks

Mechanistic interpretability research seeks to reveal the inner workings of large language models, yet most work focuses on classification or generative tasks rather than summarization. This paper presents an interpretability framework for…

Computation and Language · Computer Science 2025-05-26 Anurag Mishra

Multi-Head Self-Attention with Role-Guided Masks

The state of the art in learning meaningful semantic representations of words is the Transformer model and its attention mechanisms. Simply put, the attention mechanisms learn to attend to specific parts of the input dispensing recurrence…

Computation and Language · Computer Science 2020-12-24 Dongsheng Wang , Casper Hansen , Lucas Chaves Lima , Christian Hansen , Maria Maistro , Jakob Grue Simonsen , Christina Lioma

Do We Really Need That Many Parameters In Transformer For Extractive Summarization? Discourse Can Help !

The multi-head self-attention of popular transformer models is widely used within Natural Language Processing (NLP), including for the task of extractive summarization. With the goal of analyzing and pruning the parameter-heavy…

Computation and Language · Computer Science 2020-12-04 Wen Xiao , Patrick Huber , Giuseppe Carenini

Head Pursuit: Probing Attention Specialization in Multimodal Transformers

Language and vision-language models have shown impressive performance across a wide range of tasks, but their internal mechanisms remain only partly understood. In this work, we study how individual attention heads in text-generative models…

Computer Vision and Pattern Recognition · Computer Science 2026-01-15 Lorenzo Basile , Valentino Maiorca , Diego Doimo , Francesco Locatello , Alberto Cazzaniga

Paying More Attention to Self-attention: Improving Pre-trained Language Models via Attention Guiding

Pre-trained language models (PLM) have demonstrated their effectiveness for a broad range of information retrieval and natural language processing tasks. As the core part of PLM, multi-head self-attention is appealing for its ability to…

Computation and Language · Computer Science 2022-04-07 Shanshan Wang , Zhumin Chen , Zhaochun Ren , Huasheng Liang , Qiang Yan , Pengjie Ren

Survey on Abstractive Text Summarization: Dataset, Models, and Metrics

The advancements in deep learning, particularly the introduction of transformers, have been pivotal in enhancing various natural language processing (NLP) tasks. These include text-to-text applications such as machine translation, text…

Artificial Intelligence · Computer Science 2024-12-24 Gospel Ozioma Nnadi , Flavio Bertini

Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models

Achieving human-level performance on some of the Machine Reading Comprehension (MRC) datasets is no longer challenging with the help of powerful Pre-trained Language Models (PLMs). However, the internal mechanism of these artifacts remains…

Computation and Language · Computer Science 2024-10-29 Yiming Cui , Wei-Nan Zhang , Wanxiang Che , Ting Liu , Zhigang Chen , Shijin Wang

Wide Attention Is The Way Forward For Transformers?

The Transformer is an extremely powerful and prominent deep learning architecture. In this work, we challenge the commonly held belief in deep learning that going deeper is better, and show an alternative design approach that is building…

Machine Learning · Computer Science 2022-11-10 Jason Ross Brown , Yiren Zhao , Ilia Shumailov , Robert D Mullins

Temporal Attention for Language Models

Pretrained language models based on the transformer architecture have shown great success in NLP. Textual training data often comes from the web and is thus tagged with time-specific information, but most language models ignore this…

Computation and Language · Computer Science 2022-05-05 Guy D. Rosin , Kira Radinsky

Zorro: the masked multimodal transformer

Attention-based models are appealing for multimodal processing because inputs from multiple modalities can be concatenated and fed to a single backbone network - thus requiring very little fusion engineering. The resulting representations…

Computer Vision and Pattern Recognition · Computer Science 2023-02-23 Adrià Recasens , Jason Lin , Joāo Carreira , Drew Jaegle , Luyu Wang , Jean-baptiste Alayrac , Pauline Luc , Antoine Miech , Lucas Smaira , Ross Hemsley , Andrew Zisserman

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

The dominating NLP paradigm of training a strong neural predictor to perform one task on a specific dataset has led to state-of-the-art performance in a variety of applications (eg. sentiment classification, span-prediction based question…

Computation and Language · Computer Science 2021-09-06 Paul Michel