Related papers: Using Captum to Explain Generative Language Models

Captum: A unified and generic model interpretability library for PyTorch

In this paper we introduce a novel, unified, open-source model interpretability library for PyTorch [12]. The library contains generic implementations of a number of gradient and perturbation-based attribution algorithms, also known as…

Machine Learning · Computer Science 2020-09-18 Narine Kokhlikyan , Vivek Miglani , Miguel Martin , Edward Wang , Bilal Alsallakh , Jonathan Reynolds , Alexander Melnikov , Natalia Kliushkina , Carlos Araya , Siqi Yan , Orion Reblitz-Richardson

Time Interpret: a Unified Model Interpretability Library for Time Series

We introduce $\texttt{time_interpret}$, a library designed as an extension of Captum, with a specific focus on temporal data. As such, this library implements several feature attribution methods that can be used to explain predictions made…

Machine Learning · Computer Science 2023-06-07 Joseph Enguehard

Conceptual Model Interpreter for Large Language Models

Large Language Models (LLMs) recently demonstrated capabilities for generating source code in common programming languages. Additionally, commercial products such as ChatGPT 4 started to provide code interpreters, allowing for the automatic…

Software Engineering · Computer Science 2023-11-15 Felix Härer

PyTorch, Explain! A Python library for Logic Explained Networks

"PyTorch, Explain!" is a Python module integrating a variety of state-of-the-art approaches to provide logic explanations from neural networks. This package focuses on bringing these methods to non-specialists. It has minimal dependencies…

Machine Learning · Computer Science 2021-07-26 Pietro Barbiero , Gabriele Ciravegna , Dobrik Georgiev , Franscesco Giannini

Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models

Understanding the internal representations of large language models (LLMs) can help explain models' behavior and verify their alignment with human values. Given the capabilities of LLMs in generating human-understandable text, we propose…

Computation and Language · Computer Science 2024-06-10 Asma Ghandeharioun , Avi Caciularu , Adam Pearce , Lucas Dixon , Mor Geva

Interactively Providing Explanations for Transformer Language Models

Transformer language models are state of the art in a multitude of NLP tasks. Despite these successes, their opaqueness remains problematic. Recent methods aiming to provide interpretability and explainability to black-box models primarily…

Computation and Language · Computer Science 2022-03-14 Felix Friedrich , Patrick Schramowski , Christopher Tauchmann , Kristian Kersting

GrASP: A Library for Extracting and Exploring Human-Interpretable Textual Patterns

Data exploration is an important step of every data science and machine learning project, including those involving textual data. We provide a novel language tool, in the form of a publicly available Python library for extracting patterns…

Computation and Language · Computer Science 2022-06-20 Piyawat Lertvittayakumjorn , Leshem Choshen , Eyal Shnarch , Francesca Toni

PyTorchVideo: A Deep Learning Library for Video Understanding

We introduce PyTorchVideo, an open-source deep-learning library that provides a rich set of modular, efficient, and reproducible components for a variety of video understanding tasks, including classification, detection, self-supervised…

Computer Vision and Pattern Recognition · Computer Science 2021-11-19 Haoqi Fan , Tullie Murrell , Heng Wang , Kalyan Vasudev Alwala , Yanghao Li , Yilei Li , Bo Xiong , Nikhila Ravi , Meng Li , Haichuan Yang , Jitendra Malik , Ross Girshick , Matt Feiszli , Aaron Adcock , Wan-Yen Lo , Christoph Feichtenhofer

PyTorchPipe: a framework for rapid prototyping of pipelines combining language and vision

Access to vast amounts of data along with affordable computational power stimulated the reincarnation of neural networks. The progress could not be achieved without adequate software tools, lowering the entry bar for the next generations of…

Machine Learning · Computer Science 2019-10-22 Tomasz Kornuta

Application of GPT Language Models for Innovation in Activities in University Teaching

The GPT (Generative Pre-trained Transformer) language models are an artificial intelligence and natural language processing technology that enables automatic text generation. There is a growing interest in applying GPT language models to…

Computers and Society · Computer Science 2024-03-25 Manuel de Buenaga , Francisco Javier Bueno

iCapsNets: Towards Interpretable Capsule Networks for Text Classification

Many text classification applications require models with satisfying performance as well as good interpretability. Traditional machine learning methods are easy to interpret but have low accuracies. The development of deep learning models…

Computation and Language · Computer Science 2020-06-02 Zhengyang Wang , Xia Hu , Shuiwang Ji

On the Use of Large Language Models to Generate Capability Ontologies

Capability ontologies are increasingly used to model functionalities of systems or machines. The creation of such ontological models with all properties and constraints of capabilities is very complex and can only be done by ontology…

Artificial Intelligence · Computer Science 2024-10-21 Luis Miguel Vieira da Silva , Aljosha Köcher , Felix Gehlhoff , Alexander Fay

Understanding the Properties of Generated Corpora

Models for text generation have become focal for many research tasks and especially for the generation of sentence corpora. However, understanding the properties of an automatically generated text corpus remains challenging. We propose a…

Computation and Language · Computer Science 2022-10-28 Naama Zwerdling , Segev Shlomov , Esther Goldbraich , George Kour , Boaz Carmeli , Naama Tepper , Inbal Ronen , Vitaly Zabershinsky , Ateret Anaby-Tavor

Interpreting Language Models Through Concept Descriptions: A Survey

Understanding the decision-making processes of neural networks is a central goal of mechanistic interpretability. In the context of Large Language Models (LLMs), this involves uncovering the underlying mechanisms and identifying the roles…

Computation and Language · Computer Science 2026-04-21 Nils Feldhus , Laura Kopf

Enigme: Generative Text Puzzles for Evaluating Reasoning in Language Models

Transformer-decoder language models are a core innovation in text based generative artificial intelligence. These models are being deployed as general-purpose intelligence systems in many applications. Central to their utility is the…

Artificial Intelligence · Computer Science 2025-05-09 John Hawkins

Are NLP Models Good at Tracing Thoughts: An Overview of Narrative Understanding

Narrative understanding involves capturing the author's cognitive processes, providing insights into their knowledge, intentions, beliefs, and desires. Although large language models (LLMs) excel in generating grammatically coherent text,…

Computation and Language · Computer Science 2026-01-19 Lixing Zhu , Runcong Zhao , Lin Gui , Yulan He

Language Models: A Guide for the Perplexed

Given the growing importance of AI literacy, we decided to write this tutorial to help narrow the gap between the discourse among those who study language models -- the core technology underlying ChatGPT and similar products -- and those…

Computation and Language · Computer Science 2023-11-30 Sofia Serrano , Zander Brumbaugh , Noah A. Smith

pyvene: A Library for Understanding and Improving PyTorch Models via Interventions

Interventions on model-internal states are fundamental operations in many areas of AI, including model editing, steering, robustness, and interpretability. To facilitate such research, we introduce $\textbf{pyvene}$, an open-source Python…

Machine Learning · Computer Science 2024-03-13 Zhengxuan Wu , Atticus Geiger , Aryaman Arora , Jing Huang , Zheng Wang , Noah D. Goodman , Christopher D. Manning , Christopher Potts

Generative Example-Based Explanations: Bridging the Gap between Generative Modeling and Explainability

Recently, several methods have leveraged deep generative modeling to produce example-based explanations of image classifiers. Despite producing visually stunning results, these methods are largely disconnected from classical explainability…

Machine Learning · Computer Science 2025-09-11 Philipp Vaeth , Alexander M. Fruehwald , Benjamin Paassen , Magda Gregorova

FlexModel: A Framework for Interpretability of Distributed Large Language Models

With the growth of large language models, now incorporating billions of parameters, the hardware prerequisites for their training and deployment have seen a corresponding increase. Although existing tools facilitate model parallelization…

Machine Learning · Computer Science 2023-12-07 Matthew Choi , Muhammad Adil Asif , John Willes , David Emerson