English
Related papers

Related papers: Using Captum to Explain Generative Language Models

200 papers

In this paper we introduce a novel, unified, open-source model interpretability library for PyTorch [12]. The library contains generic implementations of a number of gradient and perturbation-based attribution algorithms, also known as…

We introduce $\texttt{time_interpret}$, a library designed as an extension of Captum, with a specific focus on temporal data. As such, this library implements several feature attribution methods that can be used to explain predictions made…

Machine Learning · Computer Science 2023-06-07 Joseph Enguehard

Large Language Models (LLMs) recently demonstrated capabilities for generating source code in common programming languages. Additionally, commercial products such as ChatGPT 4 started to provide code interpreters, allowing for the automatic…

Software Engineering · Computer Science 2023-11-15 Felix Härer

"PyTorch, Explain!" is a Python module integrating a variety of state-of-the-art approaches to provide logic explanations from neural networks. This package focuses on bringing these methods to non-specialists. It has minimal dependencies…

Machine Learning · Computer Science 2021-07-26 Pietro Barbiero , Gabriele Ciravegna , Dobrik Georgiev , Franscesco Giannini

Understanding the internal representations of large language models (LLMs) can help explain models' behavior and verify their alignment with human values. Given the capabilities of LLMs in generating human-understandable text, we propose…

Computation and Language · Computer Science 2024-06-10 Asma Ghandeharioun , Avi Caciularu , Adam Pearce , Lucas Dixon , Mor Geva

Transformer language models are state of the art in a multitude of NLP tasks. Despite these successes, their opaqueness remains problematic. Recent methods aiming to provide interpretability and explainability to black-box models primarily…

Computation and Language · Computer Science 2022-03-14 Felix Friedrich , Patrick Schramowski , Christopher Tauchmann , Kristian Kersting

Data exploration is an important step of every data science and machine learning project, including those involving textual data. We provide a novel language tool, in the form of a publicly available Python library for extracting patterns…

Computation and Language · Computer Science 2022-06-20 Piyawat Lertvittayakumjorn , Leshem Choshen , Eyal Shnarch , Francesca Toni

We introduce PyTorchVideo, an open-source deep-learning library that provides a rich set of modular, efficient, and reproducible components for a variety of video understanding tasks, including classification, detection, self-supervised…

Access to vast amounts of data along with affordable computational power stimulated the reincarnation of neural networks. The progress could not be achieved without adequate software tools, lowering the entry bar for the next generations of…

Machine Learning · Computer Science 2019-10-22 Tomasz Kornuta

The GPT (Generative Pre-trained Transformer) language models are an artificial intelligence and natural language processing technology that enables automatic text generation. There is a growing interest in applying GPT language models to…

Computers and Society · Computer Science 2024-03-25 Manuel de Buenaga , Francisco Javier Bueno

Many text classification applications require models with satisfying performance as well as good interpretability. Traditional machine learning methods are easy to interpret but have low accuracies. The development of deep learning models…

Computation and Language · Computer Science 2020-06-02 Zhengyang Wang , Xia Hu , Shuiwang Ji

Capability ontologies are increasingly used to model functionalities of systems or machines. The creation of such ontological models with all properties and constraints of capabilities is very complex and can only be done by ontology…

Artificial Intelligence · Computer Science 2024-10-21 Luis Miguel Vieira da Silva , Aljosha Köcher , Felix Gehlhoff , Alexander Fay

Models for text generation have become focal for many research tasks and especially for the generation of sentence corpora. However, understanding the properties of an automatically generated text corpus remains challenging. We propose a…

Understanding the decision-making processes of neural networks is a central goal of mechanistic interpretability. In the context of Large Language Models (LLMs), this involves uncovering the underlying mechanisms and identifying the roles…

Computation and Language · Computer Science 2026-04-21 Nils Feldhus , Laura Kopf

Transformer-decoder language models are a core innovation in text based generative artificial intelligence. These models are being deployed as general-purpose intelligence systems in many applications. Central to their utility is the…

Artificial Intelligence · Computer Science 2025-05-09 John Hawkins

Narrative understanding involves capturing the author's cognitive processes, providing insights into their knowledge, intentions, beliefs, and desires. Although large language models (LLMs) excel in generating grammatically coherent text,…

Computation and Language · Computer Science 2026-01-19 Lixing Zhu , Runcong Zhao , Lin Gui , Yulan He

Given the growing importance of AI literacy, we decided to write this tutorial to help narrow the gap between the discourse among those who study language models -- the core technology underlying ChatGPT and similar products -- and those…

Computation and Language · Computer Science 2023-11-30 Sofia Serrano , Zander Brumbaugh , Noah A. Smith

Interventions on model-internal states are fundamental operations in many areas of AI, including model editing, steering, robustness, and interpretability. To facilitate such research, we introduce $\textbf{pyvene}$, an open-source Python…

Recently, several methods have leveraged deep generative modeling to produce example-based explanations of image classifiers. Despite producing visually stunning results, these methods are largely disconnected from classical explainability…

Machine Learning · Computer Science 2025-09-11 Philipp Vaeth , Alexander M. Fruehwald , Benjamin Paassen , Magda Gregorova

With the growth of large language models, now incorporating billions of parameters, the hardware prerequisites for their training and deployment have seen a corresponding increase. Although existing tools facilitate model parallelization…

Machine Learning · Computer Science 2023-12-07 Matthew Choi , Muhammad Adil Asif , John Willes , David Emerson
‹ Prev 1 2 3 10 Next ›