Related papers: Revisiting Methods for Finding Influential Example…

Integrated Influence: Data Attribution with Baseline

As an effective approach to quantifying how training samples influence test samples, data attribution is crucial for understanding data and models and for further enhancing the transparency of machine learning models. We find that prevailing data…

Machine Learning · Computer Science 2025-08-08 Linxiao Yang, Xinyu Gu, Liang Sun

Supervising Feature Influence

Causal influence measures for machine learnt classifiers shed light on the reasons behind classification, and aid in identifying influential input features and revealing their biases. However, such analyses involve evaluating the classifier…

Machine Learning · Computer Science 2018-04-10 Shayak Sen, Piotr Mardziel, Anupam Datta, Matthew Fredrikson

Capturing the Temporal Dependence of Training Data Influence

Traditional data influence estimation methods, such as influence functions, assume that learning algorithms are permutation-invariant with respect to training data. However, modern training paradigms, especially for foundation models using…

Machine Learning · Computer Science 2024-12-13 Jiachen T. Wang, Dawn Song, James Zou, Prateek Mittal, Ruoxi Jia

An Empirical Comparison of Instance Attribution Methods for NLP

Widespread adoption of deep models has motivated a pressing need for approaches to interpret network outputs and to facilitate model debugging. Instance attribution methods constitute one means of accomplishing these goals by retrieving…

Computation and Language · Computer Science 2021-04-12 Pouya Pezeshkpour, Sarthak Jain, Byron C. Wallace, Sameer Singh

RelatIF: Identifying Explanatory Training Examples via Relative Influence

In this work, we focus on the use of influence functions to identify relevant training examples that one might hope "explain" the predictions of a machine learning model. One shortcoming of influence functions is that the training examples…

Machine Learning · Computer Science 2020-03-27 Elnaz Barshan, Marc-Etienne Brunet, Gintare Karolina Dziugaite

Influence Functions for Efficient Data Selection in Reasoning

Fine-tuning large language models (LLMs) on chain-of-thought (CoT) data shows that a small amount of high-quality data can outperform massive datasets. Yet, what constitutes "quality" remains ill-defined. Existing reasoning methods rely on…

Machine Learning · Computer Science 2025-12-02 Prateek Humane, Paolo Cudrano, Daniel Z. Kaplan, Matteo Matteucci, Supriyo Chakraborty, Irina Rish

f-INE: A Hypothesis Testing Framework for Estimating Influence under Training Randomness

Influence estimation methods promise to explain and debug machine learning by estimating the impact of individual samples on the final model. Yet, existing methods collapse under training randomness: the same example may appear critical in…

Machine Learning · Computer Science 2026-04-06 Subhodip Panda, Dhruv Tarsadiya, Shashwat Sourav, Prathosh A. P, Sai Praneeth Karimireddy

Efficient Estimation of Influence of a Training Instance

Understanding the influence of a training instance on a neural network model leads to improving interpretability. However, it is difficult and inefficient to evaluate the influence, which shows how a model's prediction would be changed if a…

Machine Learning · Computer Science 2021-11-22 Sosuke Kobayashi, Sho Yokoi, Jun Suzuki, Kentaro Inui

Z0-Inf: Zeroth Order Approximation for Data Influence

A critical aspect of analyzing and improving modern machine learning systems lies in understanding how individual training examples influence a model's predictive behavior. Estimating this influence enables critical applications, including…

Machine Learning · Computer Science 2025-10-15 Narine Kokhlikyan, Kamalika Chaudhuri, Saeed Mahloujifar

Compact Example-Based Explanations for Language Models

Training data influence estimation methods quantify the contribution of training documents to a model's output, making them a promising source of information for example-based explanations. As humans cannot interpret thousands of documents,…

Computation and Language · Computer Science 2026-04-10 Loris Schoenegger, Benjamin Roth

Training Data Influence Analysis and Estimation: A Survey

Good models require good training data. For overparameterized deep models, the causal relationship between training data and model predictions is increasingly opaque and poorly understood. Influence analysis partially demystifies training's…

Machine Learning · Computer Science 2024-04-02 Zayd Hammoudeh, Daniel Lowd

Faithful and Fast Influence Function via Advanced Sampling

How can we explain the influence of training data on black-box models? Influence functions (IFs) offer a post-hoc solution by utilizing gradients and Hessians. However, computing the Hessian for an entire dataset is resource-intensive,…

Machine Learning · Computer Science 2025-11-03 Jungyeon Koh, Hyeonsu Lyu, Jonggyu Jang, Hyun Jong Yang

Revisiting the Fragility of Influence Functions

In the last few years, many works have tried to explain the predictions of deep learning models. Few methods, however, have been proposed to verify the accuracy or faithfulness of these explanations. Recently, influence functions, which is…

Machine Learning · Computer Science 2023-04-10 Jacob R. Epifano, Ravi P. Ramachandran, Aaron J. Masino, Ghulam Rasool

Class based Influence Functions for Error Detection

Influence functions (IFs) are a powerful tool for detecting anomalous examples in large scale datasets. However, they are unstable when applied to deep networks. In this paper, we provide an explanation for the instability of IFs and…

Computation and Language · Computer Science 2023-05-03 Thang Nguyen-Duc, Hoang Thanh-Tung, Quan Hung Tran, Dang Huu-Tien, Hieu Ngoc Nguyen, Anh T. V. Dau, Nghi D. Q. Bui

DIVINE: Diverse Influential Training Points for Data Visualization and Model Refinement

As the complexity of machine learning (ML) models increases, resulting in a lack of prediction explainability, several methods have been developed to explain a model's behavior in terms of the training data points that most influence the…

Machine Learning · Computer Science 2021-07-14 Umang Bhatt, Isabel Chien, Muhammad Bilal Zafar, Adrian Weller

Interaction-Aware Influence Functions for Group Attribution

Influence functions approximate how removing a training example changes a quantity of interest, called the target function, such as a held-out loss. To estimate the influence of a group of examples, the standard practice is to sum the…

Machine Learning · Computer Science 2026-05-18 Jaeseung Heo, Kyeongheung Yun, Youngbin Choi, Sehyun Hwang, Jungseul Ok, Dongwoo Kim

Detecting Instruction Fine-tuning Attacks using Influence Function

Instruction fine-tuning attacks pose a serious threat to large language models (LLMs) by subtly embedding poisoned examples in fine-tuning datasets, leading to harmful or unintended behaviors in downstream applications. Detecting such…

Machine Learning · Computer Science 2026-02-02 Jiawei Li

Causal Influence Detection for Improving Efficiency in Reinforcement Learning

Many reinforcement learning (RL) environments consist of independent entities that interact sparsely. In such environments, RL agents have only limited influence over other entities in any particular situation. Our idea in this work is that…

Machine Learning · Computer Science 2021-12-03 Maximilian Seitzer, Bernhard Schölkopf, Georg Martius

Influence Functions in Deep Learning Are Fragile

Influence functions approximate the effect of training samples in test-time predictions and have a wide variety of applications in machine learning interpretability and uncertainty estimation. A commonly-used (first-order) influence…

Machine Learning · Computer Science 2021-02-12 Samyadeep Basu, Philip Pope, Soheil Feizi

Understanding Influence Functions and Datamodels via Harmonic Analysis

Influence functions estimate the effect of individual data points on a model's predictions on test data and were adapted to deep learning in Koh and Liang [2017]. They have been used for detecting data poisoning, detecting helpful and…

Machine Learning · Computer Science 2022-10-04 Nikunj Saunshi, Arushi Gupta, Mark Braverman, Sanjeev Arora