Related papers: FastIF: Scalable Influence Functions for Efficient…

Influence Functions in Deep Learning Are Fragile

Influence functions approximate the effect of training samples in test-time predictions and have a wide variety of applications in machine learning interpretability and uncertainty estimation. A commonly-used (first-order) influence…

Machine Learning · Computer Science 2021-02-12 Samyadeep Basu, Philip Pope, Soheil Feizi

Faithful and Fast Influence Function via Advanced Sampling

How can we explain the influence of training data on black-box models? Influence functions (IFs) offer a post-hoc solution by utilizing gradients and Hessians. However, computing the Hessian for an entire dataset is resource-intensive,…

Machine Learning · Computer Science 2025-11-03 Jungyeon Koh, Hyeonsu Lyu, Jonggyu Jang, Hyun Jong Yang

Neural Networks for Learnable and Scalable Influence Estimation of Instruction Fine-Tuning Data

Influence functions provide crucial insights into model training, but existing methods suffer from large computational costs and limited generalization. Particularly, recent works have proposed various metrics and algorithms to calculate…

Machine Learning · Computer Science 2025-10-31 Ishika Agarwal, Dilek Hakkani-Tür

DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models

Quantifying the impact of training data points is crucial for understanding the outputs of machine learning models and for improving the transparency of the AI pipeline. The influence function is a principled and popular data attribution…

Machine Learning · Computer Science 2024-03-14 Yongchan Kwon, Eric Wu, Kevin Wu, James Zou

HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation

Influence functions provide a principled method to assess the contribution of individual training samples to a specific target. Yet, their high computational costs limit their applications on large-scale models and datasets. Existing…

Machine Learning · Computer Science 2025-06-27 Xinyu Zhou, Simin Fan, Martin Jaggi

Towards Robust Influence Functions with Flat Validation Minima

The Influence Function (IF) is a widely used technique for assessing the impact of individual training samples on model predictions. However, existing IF methods often fail to provide reliable influence estimates in deep neural networks,…

Machine Learning · Computer Science 2025-12-02 Xichen Ye, Yifan Wu, Weizhong Zhang, Cheng Jin, Yifan Chen

Influence Functions for Scalable Data Attribution in Diffusion Models

Diffusion models have led to significant advancements in generative modelling. Yet their widespread adoption poses challenges regarding data attribution and interpretability. In this paper, we aim to help address such challenges in…

Machine Learning · Computer Science 2025-05-27 Bruno Mlodozeniec, Runa Eschenhagen, Juhan Bae, Alexander Immer, David Krueger, Richard Turner

The Approximate Fisher Influence Function: Faster Estimation of Data Influence in Statistical Models

Quantifying the influence of infinitesimal changes in training data on model performance is crucial for understanding and improving machine learning models. In this work, we reformulate this problem as a weighted empirical risk minimization…

Machine Learning · Computer Science 2025-04-11 Omri Lev, Ashia C. Wilson

Z0-Inf: Zeroth Order Approximation for Data Influence

A critical aspect of analyzing and improving modern machine learning systems lies in understanding how individual training examples influence a model's predictive behavior. Estimating this influence enables critical applications, including…

Machine Learning · Computer Science 2025-10-15 Narine Kokhlikyan, Kamalika Chaudhuri, Saeed Mahloujifar

Rescaled Influence Functions: Accurate Data Attribution in High Dimension

How does the training data affect a model's behavior? This is the question we seek to answer with data attribution. The leading practical approaches to data attribution are based on influence functions (IF). IFs utilize a first-order Taylor…

Machine Learning · Computer Science 2025-09-11 Ittai Rubinstein, Samuel B. Hopkins

Revisiting Data Attribution for Influence Functions

The goal of data attribution is to trace the model's predictions through the learning algorithm and back to its training data, thereby identifying the most influential training samples and understanding how the model's behavior leads to…

Machine Learning · Computer Science 2025-08-12 Hongbo Zhu, Angelo Cangelosi

Deeper Understanding of Black-box Predictions via Generalized Influence Functions

Influence functions (IFs) elucidate how training data changes model behavior. However, the increasing size and non-convexity in large-scale models make IFs inaccurate. We suspect that the fragility comes from the first-order approximation…

Machine Learning · Computer Science 2024-05-07 Hyeonsu Lyu, Jonggyu Jang, Sehyun Ryu, Hyun Jong Yang

RelatIF: Identifying Explanatory Training Examples via Relative Influence

In this work, we focus on the use of influence functions to identify relevant training examples that one might hope "explain" the predictions of a machine learning model. One shortcoming of influence functions is that the training examples…

Machine Learning · Computer Science 2020-03-27 Elnaz Barshan, Marc-Etienne Brunet, Gintare Karolina Dziugaite

Revisit, Extend, and Enhance Hessian-Free Influence Functions

Influence functions serve as crucial tools for assessing sample influence in model interpretation, subset training set selection, noisy label detection, and more. By employing the first-order Taylor expansion, influence functions can…

Machine Learning · Computer Science 2026-03-27 Ziao Yang, Han Yue, Jian Chen, Hongfu Liu

FairIF: Boosting Fairness in Deep Learning via Influence Functions with Validation Set Sensitive Attributes

Most fair machine learning methods either highly rely on the sensitive information of the training samples or require a large modification on the target models, which hinders their practical application. To address this issue, we propose a…

Machine Learning · Computer Science 2023-12-27 Haonan Wang, Ziwei Wu, Jingrui He

Theoretical and Practical Perspectives on what Influence Functions Do

Influence functions (IF) have been seen as a technique for explaining model predictions through the lens of the training data. Their utility is assumed to be in identifying training examples "responsible" for a prediction so that, for…

Machine Learning · Computer Science 2023-05-29 Andrea Schioppa, Katja Filippova, Ivan Titov, Polina Zablotskaia

A Free Lunch with Influence Functions? Improving Neural Network Estimates with Concepts from Semiparametric Statistics

Parameter estimation in empirical fields is usually undertaken using parametric models, and such models readily facilitate statistical inference. Unfortunately, they are unlikely to be sufficiently flexible to be able to adequately model…

Machine Learning · Computer Science 2022-06-13 Matthew J. Vowels, Sina Akbari, Necati Cihan Camgoz, Richard Bowden

Scaling Up Influence Functions

We address efficient calculation of influence functions for tracking predictions back to the training data. We propose and analyze a new approach to speeding up the inverse Hessian calculation based on Arnoldi iteration. With this…

Machine Learning · Computer Science 2021-12-07 Andrea Schioppa, Polina Zablotskaia, David Vilar, Artem Sokolov

Analyzing the Use of Influence Functions for Instance-Specific Data Filtering in Neural Machine Translation

Customer feedback can be an important signal for improving commercial machine translation systems. One solution for fixing specific translation errors is to remove the related erroneous training instances followed by re-training of the…

Computation and Language · Computer Science 2022-10-25 Tsz Kin Lam, Eva Hasler, Felix Hieber

InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks

Recently, influence functions present an apparatus for achieving explainability for deep neural models by quantifying the perturbation of individual train instances that might impact a test prediction. Our objectives in this paper are…

Computation and Language · Computer Science 2024-03-12 Somnath Banerjee, Maulindu Sarkar, Punyajoy Saha, Binny Mathew, Animesh Mukherjee