Related papers: Estimating Structural Target Functions using Machi…

Deeper Understanding of Black-box Predictions via Generalized Influence Functions

Influence functions (IFs) elucidate how training data changes model behavior. However, the increasing size and non-convexity in large-scale models make IFs inaccurate. We suspect that the fragility comes from the first-order approximation…

Machine Learning · Computer Science 2024-05-07 Hyeonsu Lyu , Jonggyu Jang , Sehyun Ryu , Hyun Jong Yang

Theoretical and Practical Perspectives on what Influence Functions Do

Influence functions (IF) have been seen as a technique for explaining model predictions through the lens of the training data. Their utility is assumed to be in identifying training examples "responsible" for a prediction so that, for…

Machine Learning · Computer Science 2023-05-29 Andrea Schioppa , Katja Filippova , Ivan Titov , Polina Zablotskaia

Visually Communicating and Teaching Intuition for Influence Functions

Estimators based on influence functions (IFs) have been shown to be effective in many settings, especially when combined with machine learning techniques. By focusing on estimating a specific target of interest (e.g., the average effect of…

Methodology · Statistics 2019-10-29 Aaron Fisher , Edward H. Kennedy

Demystifying statistical learning based on efficient influence functions

Evaluation of treatment effects and more general estimands is typically achieved via parametric modelling, which is unsatisfactory since model misspecification is likely. Data-adaptive model building (e.g. statistical/machine learning) is…

Statistics Theory · Mathematics 2022-01-14 Oliver Hines , Oliver Dukes , Karla Diaz-Ordaz , Stijn Vansteelandt

Towards Robust Influence Functions with Flat Validation Minima

The Influence Function (IF) is a widely used technique for assessing the impact of individual training samples on model predictions. However, existing IF methods often fail to provide reliable influence estimates in deep neural networks,…

Machine Learning · Computer Science 2025-12-02 Xichen Ye , Yifan Wu , Weizhong Zhang , Cheng Jin , Yifan Chen

A Free Lunch with Influence Functions? Improving Neural Network Estimates with Concepts from Semiparametric Statistics

Parameter estimation in empirical fields is usually undertaken using parametric models, and such models readily facilitate statistical inference. Unfortunately, they are unlikely to be sufficiently flexible to be able to adequately model…

Machine Learning · Computer Science 2022-06-13 Matthew J. Vowels , Sina Akbari , Necati Cihan Camgoz , Richard Bowden

How Biased are Your Features?: Computing Fairness Influence Functions with Global Sensitivity Analysis

Fairness in machine learning has attained significant focus due to the widespread application in high-stake decision-making tasks. Unregulated machine learning classifiers can exhibit bias towards certain demographic groups in data, thus…

Machine Learning · Computer Science 2023-07-04 Bishwamittra Ghosh , Debabrota Basu , Kuldeep S. Meel

Evaluating Data Influence in Meta Learning

As one of the most fundamental models, meta learning aims to effectively address few-shot learning challenges. However, it still faces significant issues related to the training data, such as training inefficiencies due to numerous…

Machine Learning · Computer Science 2025-01-28 Chenyang Ren , Huanyi Xie , Shu Yang , Meng Ding , Lijie Hu , Di Wang

Constructing targeted minimum loss/maximum likelihood estimators: a simple illustration to build intuition

Use of machine learning to estimate nuisance functions (e.g. outcomes models, propensity score models) in estimators used in causal inference is increasingly common, as it can mitigate bias due to model misspecification. However, it can be…

Methodology · Statistics 2025-07-17 Rachael K. Ross , Lina M. Montoya , Dana E. Goin , Ivan Diaz , Audrey Renson

Influence Functions in Deep Learning Are Fragile

Influence functions approximate the effect of training samples in test-time predictions and have a wide variety of applications in machine learning interpretability and uncertainty estimation. A commonly-used (first-order) influence…

Machine Learning · Computer Science 2021-02-12 Samyadeep Basu , Philip Pope , Soheil Feizi

Rescaled Influence Functions: Accurate Data Attribution in High Dimension

How does the training data affect a model's behavior? This is the question we seek to answer with data attribution. The leading practical approaches to data attribution are based on influence functions (IF). IFs utilize a first-order Taylor…

Machine Learning · Computer Science 2025-09-11 Ittai Rubinstein , Samuel B. Hopkins

Bias-Targeted Nonparametric Balancing for Stable Causal Mediation Analysis

Influence function (IF)-based estimators are widely used in mediation analysis due to their modeling flexibility, but standard implementations require direct estimation of the distribution functions of the mediator and treatment variables.…

Methodology · Statistics 2026-02-10 Chang Liu , AmirEmad Ghassami

Identification and Debiased Learning of Causal Effects with General Instrumental Variables

Instrumental variable methods are fundamental to causal inference when treatment assignment is confounded by unobserved variables. In this article, we develop a general nonparametric causal framework for identification and learning with…

Methodology · Statistics 2026-02-10 Shuyuan Chen , Peng Zhang , Yifan Cui

Understanding Influence Functions and Datamodels via Harmonic Analysis

Influence functions estimate effect of individual data points on predictions of the model on test data and were adapted to deep learning in Koh and Liang [2017]. They have been used for detecting data poisoning, detecting helpful and…

Machine Learning · Computer Science 2022-10-04 Nikunj Saunshi , Arushi Gupta , Mark Braverman , Sanjeev Arora

Revisiting Data Attribution for Influence Functions

The goal of data attribution is to trace the model's predictions through the learning algorithm and back to its training data. thereby identifying the most influential training samples and understanding how the model's behavior leads to…

Machine Learning · Computer Science 2025-08-12 Hongbo Zhu , Angelo Cangelosi

A Versatile Influence Function for Data Attribution with Non-Decomposable Loss

Influence function, a technique rooted in robust statistics, has been adapted in modern machine learning for a novel application: data attribution -- quantifying how individual training data points affect a model's predictions. However, the…

Machine Learning · Computer Science 2024-12-03 Junwei Deng , Weijing Tang , Jiaqi W. Ma

Information Theoretic Measures for Fairness-aware Feature Selection

Machine learning algorithms are increasingly used for consequential decision making regarding individuals based on their relevant features. Features that are relevant for accurate decisions may however lead to either explicit or implicit…

Machine Learning · Computer Science 2021-06-09 Sajad Khodadadian , Mohamed Nafea , AmirEmad Ghassami , Negar Kiyavash

Analyzing the Use of Influence Functions for Instance-Specific Data Filtering in Neural Machine Translation

Customer feedback can be an important signal for improving commercial machine translation systems. One solution for fixing specific translation errors is to remove the related erroneous training instances followed by re-training of the…

Computation and Language · Computer Science 2022-10-25 Tsz Kin Lam , Eva Hasler , Felix Hieber

Distributional Training Data Attribution: What do Influence Functions Sample?

Randomness is an unavoidable part of training deep learning models, yet something that traditional training data attribution algorithms fail to rigorously account for. They ignore the fact that, due to stochasticity in the initialisation…

Machine Learning · Computer Science 2025-10-28 Bruno Mlodozeniec , Isaac Reid , Sam Power , David Krueger , Murat Erdogdu , Richard E. Turner , Roger Grosse

Cross-Loss Influence Functions to Explain Deep Network Representations

As machine learning is increasingly deployed in the real world, it is paramount that we develop the tools necessary to analyze the decision-making of the models we train and deploy to end-users. Recently, researchers have shown that…

Machine Learning · Computer Science 2022-05-05 Andrew Silva , Rohit Chopra , Matthew Gombolay