Related papers: Datamodels: Predicting Predictions from Training D…

ModelPred: A Framework for Predicting Trained Model from Training Data

In this work, we propose ModelPred, a framework that helps to understand the impact of changes in training data on a trained model. This is critical for building trust in various stages of a machine learning pipeline: from cleaning…

Machine Learning · Computer Science 2022-12-27 Yingyan Zeng, Jiachen T. Wang, Si Chen, Hoang Anh Just, Ran Jin, Ruoxi Jia

Data Complexity-aware Deep Model Performance Forecasting

Deep learning models are widely used across computer vision and other domains. When working on the model induction, selecting the right architecture for a given dataset often relies on repetitive trial-and-error procedures. This procedure…

Machine Learning · Computer Science 2026-01-06 Yen-Chia Chen, Hsing-Kuo Pao, Hanjuan Huang

Training Data Influence Analysis and Estimation: A Survey

Good models require good training data. For overparameterized deep models, the causal relationship between training data and model predictions is increasingly opaque and poorly understood. Influence analysis partially demystifies training's…

Machine Learning · Computer Science 2024-04-02 Zayd Hammoudeh, Daniel Lowd

Understanding Influence Functions and Datamodels via Harmonic Analysis

Influence functions estimate the effect of individual data points on predictions of the model on test data and were adapted to deep learning in Koh and Liang [2017]. They have been used for detecting data poisoning, detecting helpful and…

Machine Learning · Computer Science 2022-10-04 Nikunj Saunshi, Arushi Gupta, Mark Braverman, Sanjeev Arora

DsDm: Model-Aware Dataset Selection with Datamodels

When selecting data for training large-scale models, standard practice is to filter for examples that match human notions of data quality. Such filtering yields qualitatively clean datapoints that intuitively should improve model behavior.…

Machine Learning · Computer Science 2024-01-24 Logan Engstrom, Axel Feldmann, Aleksander Madry

Tools for Verifying Neural Models' Training Data

It is important that consumers and regulators can verify the provenance of large neural models to evaluate their capabilities and risks. We introduce the concept of a "Proof-of-Training-Data": any protocol that allows a model trainer to…

Machine Learning · Computer Science 2023-07-04 Dami Choi, Yonadav Shavit, David Duvenaud

Selective Prediction via Training Dynamics

Selective Prediction is the task of rejecting inputs a model would predict incorrectly on. This involves a trade-off between input space coverage (how many data points are accepted) and model utility (how good is the performance on accepted…

Machine Learning · Computer Science 2025-07-08 Stephan Rabanser, Anvith Thudi, Kimia Hamidieh, Adam Dziedzic, Israfil Bahceci, Akram Bin Sediq, Hamza Sokun, Nicolas Papernot

Forecasting with Deep Learning

This paper presents a method for time series forecasting with deep learning and its assessment on two datasets. The method starts with data preparation, followed by model training and evaluation. The final step is a visual inspection.…

Machine Learning · Computer Science 2023-02-24 Gissel Velarde

Training Machine Learning Models by Regularizing their Explanations

Neural networks are among the most accurate supervised learning methods in use today. However, their opacity makes them difficult to trust in critical applications, especially when conditions in training may differ from those in practice.…

Machine Learning · Computer Science 2018-10-03 Andrew Slavin Ross

Patterns, predictions, and actions: A story about machine learning

This graduate textbook on machine learning tells a story of how patterns in data support predictions and consequential actions. Starting with the foundations of decision making, we cover representation, optimization, and generalization as…

Machine Learning · Computer Science 2021-10-27 Moritz Hardt, Benjamin Recht

Revisiting Data Attribution for Influence Functions

The goal of data attribution is to trace the model's predictions through the learning algorithm and back to its training data, thereby identifying the most influential training samples and understanding how the model's behavior leads to…

Machine Learning · Computer Science 2025-08-12 Hongbo Zhu, Angelo Cangelosi

Deep transformation models: Tackling complex regression problems with neural network based transformation models

We present a deep transformation model for probabilistic regression. Deep learning is known for outstandingly accurate predictions on complex data but in regression tasks, it is predominantly used to just predict a single number. This…

Machine Learning · Statistics 2020-04-02 Beate Sick, Torsten Hothorn, Oliver Dürr

Pretraining and the Lasso

Pretraining is a popular and powerful paradigm in machine learning to pass information from one model to another. As an example, suppose one has a modest-sized dataset of images of cats and dogs, and plans to fit a deep neural network to…

Methodology · Statistics 2024-10-31 Erin Craig, Mert Pilanci, Thomas Le Menestrel, Balasubramanian Narasimhan, Manuel Rivas, Stein-Erik Gullaksen, Roozbeh Dehghannasiri, Julia Salzman, Jonathan Taylor, Robert Tibshirani

How much data do you need? Part 2: Predicting DL class specific training dataset sizes

This paper targets the question of predicting machine learning classification model performance, when taking into account the number of training examples per class and not just the overall number of training examples. This leads to the a…

Machine Learning · Computer Science 2024-03-12 Thomas Mühlenstädt, Jelena Frtunikj

Generating Samples to Probe Trained Models

There is a growing need for investigating how machine learning models operate. With this work, we aim to understand trained machine learning models by questioning their data preferences. We propose a mathematical framework that allows us to…

Machine Learning · Computer Science 2025-12-22 Eren Mehmet Kıral, Nurşen Aydın, Ş. İlker Birbil

Revealing the Underlying Patterns: Investigating Dataset Similarity, Performance, and Generalization

Supervised deep learning models require a significant amount of labeled data to achieve acceptable performance on a specific task. However, when tested on unseen data, the models may not perform well. Therefore, the models need to be…

Computer Vision and Pattern Recognition · Computer Science 2024-01-01 Akshit Achara, Ram Krishna Pandey

Statistically Testing Training Data for Unwanted Error Patterns using Rule-Oriented Regression

Artificial intelligence models trained from data can only be as good as the underlying data is. Biases in training data propagating through to the output of a machine learning model are a well-documented and well-understood phenomenon, but…

Machine Learning · Computer Science 2025-04-02 Stefan Rass, Martin Dallinger

Measuring Causal Effects of Data Statistics on Language Model's `Factual' Predictions

Large amounts of training data are one of the major reasons for the high performance of state-of-the-art NLP models. But what exactly in the training data causes a model to make a certain prediction? We seek to answer this question by…

Computation and Language · Computer Science 2023-03-27 Yanai Elazar, Nora Kassner, Shauli Ravfogel, Amir Feder, Abhilasha Ravichander, Marius Mosbach, Yonatan Belinkov, Hinrich Schütze, Yoav Goldberg

Supervised Pretraining for Molecular Force Fields and Properties Prediction

Machine learning approaches have become popular for molecular modeling tasks, including molecular force fields and properties prediction. Traditional supervised learning methods suffer from scarcity of labeled data for particular tasks,…

Chemical Physics · Physics 2022-11-29 Xiang Gao, Weihao Gao, Wenzhi Xiao, Zhirui Wang, Chong Wang, Liang Xiang

Training Feature Attribution for Vision Models

Deep neural networks are often considered opaque systems, prompting the need for explainability methods to improve trust and accountability. Existing approaches typically attribute test-time predictions either to input features (e.g.,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-13 Aziz Bacha, Thomas George