Related papers: Statistical and Computational Guarantees for Influ…

Revisiting Data Attribution for Influence Functions

The goal of data attribution is to trace the model's predictions through the learning algorithm and back to its training data. thereby identifying the most influential training samples and understanding how the model's behavior leads to…

Machine Learning · Computer Science 2025-08-12 Hongbo Zhu , Angelo Cangelosi

Understanding Influence Functions and Datamodels via Harmonic Analysis

Influence functions estimate effect of individual data points on predictions of the model on test data and were adapted to deep learning in Koh and Liang [2017]. They have been used for detecting data poisoning, detecting helpful and…

Machine Learning · Computer Science 2022-10-04 Nikunj Saunshi , Arushi Gupta , Mark Braverman , Sanjeev Arora

Influence Functions for Scalable Data Attribution in Diffusion Models

Diffusion models have led to significant advancements in generative modelling. Yet their widespread adoption poses challenges regarding data attribution and interpretability. In this paper, we aim to help address such challenges in…

Machine Learning · Computer Science 2025-05-27 Bruno Mlodozeniec , Runa Eschenhagen , Juhan Bae , Alexander Immer , David Krueger , Richard Turner

High-dimensional influence measure

Influence diagnosis is important since presence of influential observations could lead to distorted analysis and misleading interpretations. For high-dimensional data, it is particularly so, as the increased dimensionality and complexity…

Statistics Theory · Mathematics 2013-11-27 Junlong Zhao , Chenlei Leng , Lexin Li , Hansheng Wang

Newfluence: Boosting Model interpretability and Understanding in High Dimensions

The increasing complexity of machine learning (ML) and artificial intelligence (AI) models has created a pressing need for tools that help scientists, engineers, and policymakers interpret and refine model decisions and predictions.…

Machine Learning · Statistics 2025-07-17 Haolin Zou , Arnab Auddy , Yongchan Kwon , Kamiar Rahnama Rad , Arian Maleki

Understanding Black-box Predictions via Influence Functions

How can we explain the predictions of a black-box model? In this paper, we use influence functions -- a classic technique from robust statistics -- to trace a model's prediction through the learning algorithm and back to its training data,…

Machine Learning · Statistics 2021-01-01 Pang Wei Koh , Percy Liang

Graphical Influence Diagnostics for Changepoint Models

Changepoint models enjoy a wide appeal in a variety of disciplines to model the heterogeneity of ordered data. Graphical influence diagnostics to characterize the influence of single observations on changepoint models are, however, lacking.…

Methodology · Statistics 2021-07-23 Ines Wilms , Rebecca Killick , David S. Matteson

On the Accuracy of Influence Functions for Measuring Group Effects

Influence functions estimate the effect of removing a training point on a model without the need to retrain. They are based on a first-order Taylor approximation that is guaranteed to be accurate for sufficiently small changes to the model,…

Machine Learning · Computer Science 2019-11-22 Pang Wei Koh , Kai-Siang Ang , Hubert H. K. Teo , Percy Liang

Influence-based Attributions can be Manipulated

Influence Functions are a standard tool for attributing predictions to training data in a principled manner and are widely used in applications such as data valuation and fairness. In this work, we present realistic incentives to manipulate…

Machine Learning · Computer Science 2024-10-08 Chhavi Yadav , Ruihan Wu , Kamalika Chaudhuri

Intelligent Probabilistic Inference

The analysis of practical probabilistic models on the computer demands a convenient representation for the available knowledge and an efficient algorithm to perform inference. An appealing representation is the influence diagram, a network…

Artificial Intelligence · Computer Science 2013-04-15 Ross D. Shachter

Better Hessians Matter: Studying the Impact of Curvature Approximations in Influence Functions

Influence functions offer a principled way to trace model predictions back to training data, but their use in deep learning is hampered by the need to invert a large, ill-conditioned Hessian matrix. Approximations such as Generalised…

Machine Learning · Computer Science 2026-02-17 Steve Hong , Runa Eschenhagen , Bruno Mlodozeniec , Richard Turner

The Approximate Fisher Influence Function: Faster Estimation of Data Influence in Statistical Models

Quantifying the influence of infinitesimal changes in training data on model performance is crucial for understanding and improving machine learning models. In this work, we reformulate this problem as a weighted empirical risk minimization…

Machine Learning · Computer Science 2025-04-11 Omri Lev , Ashia C. Wilson

Cross-Loss Influence Functions to Explain Deep Network Representations

As machine learning is increasingly deployed in the real world, it is paramount that we develop the tools necessary to analyze the decision-making of the models we train and deploy to end-users. Recently, researchers have shown that…

Machine Learning · Computer Science 2022-05-05 Andrew Silva , Rohit Chopra , Matthew Gombolay

Multiple Influential Point Detection in High-Dimensional Spaces

Influence diagnosis is an integrated component of data analysis, but is severely under-investigated in a high-dimensional setting. One of the key challenges, even in a fixed-dimensional setting, is how to deal with multiple influential…

Methodology · Statistics 2017-02-07 Junlong Zhao , Chao Liu , Lu Niu , Chenlei Leng

Influence diagnostics in Birnbaum-Saunders nonlinear regression models

We consider the issue of assessing influence of observations in the class of Birnbaum-Saunders nonlinear regression models, which is useful in lifetime data analysis. Our results generalize those in Galea et al. [2004, Influence diagnostics…

Methodology · Statistics 2011-11-22 Artur J. Lemonte

FastIF: Scalable Influence Functions for Efficient Model Interpretation and Debugging

Influence functions approximate the "influences" of training data-points for test predictions and have a wide variety of applications. Despite the popularity, their computational cost does not scale well with model and training data size.…

Machine Learning · Computer Science 2021-09-13 Han Guo , Nazneen Fatema Rajani , Peter Hase , Mohit Bansal , Caiming Xiong

Scaling Up Influence Functions

We address efficient calculation of influence functions for tracking predictions back to the training data. We propose and analyze a new approach to speeding up the inverse Hessian calculation based on Arnoldi iteration. With this…

Machine Learning · Computer Science 2021-12-07 Andrea Schioppa , Polina Zablotskaia , David Vilar , Artem Sokolov

InfluenceNet: AI Models for Banzhaf and Shapley Value Prediction

Power indices are essential in assessing the contribution and influence of individual agents in multi-agent systems, providing crucial insights into collaborative dynamics and decision-making processes. While invaluable, traditional…

Multiagent Systems · Computer Science 2025-03-12 Benjamin Kempinski , Tal Kachman

Toward Efficient Influence Function: Dropout as a Compression Tool

Assessing the impact the training data on machine learning models is crucial for understanding the behavior of the model, enhancing the transparency, and selecting training data. Influence function provides a theoretical framework for…

Machine Learning · Computer Science 2026-04-21 Yuchen Zhang , Mohammad Mohammadi Amiri

Toward computerized efficient estimation in infinite-dimensional models

Despite the risk of misspecification they are tied to, parametric models continue to be used in statistical practice because they are accessible to all. In particular, efficient estimation procedures in parametric models are simple to…

Statistics Theory · Mathematics 2016-09-01 Marco Carone , Alexander R. Luedtke , Mark J. van der Laan