Related papers: Visual Analytics for Generative Transformer Models

T3-Vis: a visual analytic framework for Training and fine-Tuning Transformers in NLP

Transformers are the dominant architecture in NLP, but their training and fine-tuning is still very challenging. In this paper, we present the design and implementation of a visual analytic framework for assisting researchers in such…

Computation and Language · Computer Science 2021-09-01 Raymond Li , Wen Xiao , Lanjun Wang , Hyeju Jang , Giuseppe Carenini

Transformer visualization via dictionary learning: contextualized embedding as a linear superposition of transformer factors

Transformer networks have revolutionized NLP representation learning since they were introduced. Though a great effort has been made to explain the representation in transformers, it is widely recognized that our understanding is not…

Computation and Language · Computer Science 2023-04-05 Zeyu Yun , Yubei Chen , Bruno A Olshausen , Yann LeCun

Analyzing Vision Transformers for Image Classification in Class Embedding Space

Despite the growing use of transformer models in computer vision, a mechanistic understanding of these networks is still needed. This work introduces a method to reverse-engineer Vision Transformers trained to solve image classification…

Computer Vision and Pattern Recognition · Computer Science 2023-10-31 Martina G. Vilas , Timothy Schaumlöffel , Gemma Roig

Generative Adversarial Transformers

We introduce the GANformer, a novel and efficient type of transformer, and explore it for the task of visual generative modeling. The network employs a bipartite structure that enables long-range interactions across the image, while…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Drew A. Hudson , C. Lawrence Zitnick

A Survey of Visual Transformers

Transformer, an attention-based encoder-decoder model, has already revolutionized the field of natural language processing (NLP). Inspired by such significant achievements, some pioneering works have recently been done on employing…

Computer Vision and Pattern Recognition · Computer Science 2022-12-07 Yang Liu , Yao Zhang , Yixin Wang , Feng Hou , Jin Yuan , Jiang Tian , Yang Zhang , Zhongchao Shi , Jianping Fan , Zhiqiang He

A Visual Analytics Framework for Explaining and Diagnosing Transfer Learning Processes

Many statistical learning models hold an assumption that the training data and the future unlabeled data are drawn from the same distribution. However, this assumption is difficult to fulfill in real-world scenarios and creates barriers in…

Human-Computer Interaction · Computer Science 2020-09-16 Yuxin Ma , Arlen Fan , Jingrui He , Arun Reddy Nelakurthi , Ross Maciejewski

Graph Transformers: A Survey

Graph transformers are a recent advancement in machine learning, offering a new class of neural network models for graph-structured data. The synergy between transformers and graph learning demonstrates strong performance and versatility…

Machine Learning · Computer Science 2025-12-23 Ahsan Shehzad , Feng Xia , Shagufta Abid , Ciyuan Peng , Shuo Yu , Dongyu Zhang , Karin Verspoor

Transformer-based Image Generation from Scene Graphs

Graph-structured scene descriptions can be efficiently used in generative models to control the composition of the generated image. Previous approaches are based on the combination of graph convolutional networks and adversarial methods for…

Computer Vision and Pattern Recognition · Computer Science 2023-03-09 Renato Sortino , Simone Palazzo , Concetto Spampinato

A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships

Transformer-based models have transformed the landscape of natural language processing (NLP) and are increasingly applied to computer vision tasks with remarkable success. These models, renowned for their ability to capture long-range…

Computer Vision and Pattern Recognition · Computer Science 2024-08-28 Gracile Astlin Pereira , Muhammad Hussain

Using Visual Analytics to Interpret Predictive Machine Learning Models

It is commonly believed that increasing the interpretability of a machine learning model may decrease its predictive power. However, inspecting input-output relationships of those models using visual analytics, while treating them as…

Machine Learning · Statistics 2016-06-22 Josua Krause , Adam Perer , Enrico Bertini

Efficient Visualization of Neural Networks with Generative Models and Adversarial Perturbations

This paper presents a novel approach for deep visualization via a generative network, offering an improvement over existing methods. Our model simplifies the architecture by reducing the number of networks used, requiring only a generator…

Computer Vision and Pattern Recognition · Computer Science 2024-09-23 Athanasios Karagounis

A Graph VAE and Graph Transformer Approach to Generating Molecular Graphs

We propose a combination of a variational autoencoder and a transformer based model which fully utilises graph convolutional and graph pooling layers to operate directly on graphs. The transformer model implements a novel node encoding…

Machine Learning · Computer Science 2021-04-12 Joshua Mitton , Hans M. Senn , Klaas Wynne , Roderick Murray-Smith

InteractiveGNNExplainer: A Visual Analytics Framework for Multi-Faceted Understanding and Probing of Graph Neural Network Predictions

Graph Neural Networks (GNNs) excel in graph-based learning tasks, but their complex, non-linear operations often render them as opaque "black boxes". This opacity hinders user trust, complicates debugging, bias detection, and adoption in…

Artificial Intelligence · Computer Science 2025-11-18 TC Singh , Sougata Mukherjea

Generative Modeling for Multi-task Visual Learning

Generative modeling has recently shown great promise in computer vision, but it has mostly focused on synthesizing visually realistic images. In this paper, motivated by multi-task learning of shareable feature representations, we consider…

Computer Vision and Pattern Recognition · Computer Science 2021-06-28 Zhipeng Bao , Martial Hebert , Yu-Xiong Wang

Boosting vision transformers for image retrieval

Vision transformers have achieved remarkable progress in vision tasks such as image classification and detection. However, in instance-level image retrieval, transformers have not yet shown good performance compared to convolutional…

Computer Vision and Pattern Recognition · Computer Science 2022-10-24 Chull Hwan Song , Jooyoung Yoon , Shunghyun Choi , Yannis Avrithis

Transformers in Vision: A Survey

Astounding results from Transformer models on natural language tasks have intrigued the vision community to study their application to computer vision problems. Among their salient benefits, Transformers enable modeling long dependencies…

Computer Vision and Pattern Recognition · Computer Science 2022-01-20 Salman Khan , Muzammal Naseer , Munawar Hayat , Syed Waqas Zamir , Fahad Shahbaz Khan , Mubarak Shah

On the Units of GANs (Extended Abstract)

Generative Adversarial Networks (GANs) have achieved impressive results for many real-world applications. As an active research topic, many GAN variants have emerged with improvements in sample quality and training stability. However,…

Machine Learning · Computer Science 2020-08-07 David Bau , Jun-Yan Zhu , Hendrik Strobelt , Bolei Zhou , Joshua B. Tenenbaum , William T. Freeman , Antonio Torralba

GIN-Graph: A Generative Interpretation Network for Model-Level Explanation of Graph Neural Networks

One significant challenge of exploiting Graph neural networks (GNNs) in real-life scenarios is that they are always treated as black boxes, therefore leading to the requirement of interpretability. To address this, model-level…

Machine Learning · Computer Science 2025-09-22 Xiao Yue , Guangzhi Qu , Lige Gan

A Generative Model for Volume Rendering

We present a technique to synthesize and analyze volume-rendered images using generative models. We use the Generative Adversarial Network (GAN) framework to compute a model from a large collection of volume renderings, conditioned on (1)…

Graphics · Computer Science 2019-07-18 Matthew Berger , Jixian Li , Joshua A. Levine

Explainability of Vision Transformers: A Comprehensive Review and New Perspectives

Transformers have had a significant impact on natural language processing and have recently demonstrated their potential in computer vision. They have shown promising results over convolution neural networks in fundamental computer vision…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Rojina Kashefi , Leili Barekatain , Mohammad Sabokrou , Fatemeh Aghaeipoor