Related papers: ModelDiff: A Framework for Comparing Learning Algo…

ModelDiff: Testing-Based DNN Similarity Comparison for Model Reuse Detection

The knowledge of a deep learning model may be transferred to a student model, leading to intellectual property infringement or vulnerability propagation. Detecting such knowledge reuse is nontrivial because the suspect models may not be…

Machine Learning · Computer Science 2021-06-17 Yuanchun Li, Ziqi Zhang, Bingyan Liu, Ziyue Yang, Yunxin Liu

Learning-From-Disagreement: A Model Comparison and Visual Analytics Framework

With the fast-growing number of classification models being produced every day, numerous model interpretation and comparison solutions have also been introduced. For example, LIME and SHAP can interpret what input features contribute more…

Machine Learning · Computer Science 2022-01-21 Junpeng Wang, Liang Wang, Yan Zheng, Chin-Chia Michael Yeh, Shubham Jain, Wei Zhang

ModelPred: A Framework for Predicting Trained Model from Training Data

In this work, we propose ModelPred, a framework that helps to understand the impact of changes in training data on a trained model. This is critical for building trust in various stages of a machine learning pipeline: from cleaning…

Machine Learning · Computer Science 2022-12-27 Yingyan Zeng, Jiachen T. Wang, Si Chen, Hoang Anh Just, Ran Jin, Ruoxi Jia

Agreement-based Learning

Model selection is a problem that has occupied machine learning researchers for a long time. Recently, its importance has become evident through applications in deep learning. We propose an agreement-based learning framework that prevents…

Machine Learning · Computer Science 2018-06-05 Emmanouil Antonios Platanios

Is your algorithm unlearning or untraining?

As models are getting larger and are trained on increasing amounts of data, there has been an explosion of interest into how we can "delete" specific data points or behaviours from a trained model, after the fact. This goal has been…

Machine Learning · Computer Science 2026-04-10 Eleni Triantafillou, Ahmed Imtiaz Humayun, Monica Ribero, Alexander Matt Turner, Michael C. Mozer, Georgios Kaissis

Differential testing for machine learning: an analysis for classification algorithms beyond deep learning

Context: Differential testing is a useful approach that uses different implementations of the same algorithms and compares the results for software testing. In recent years, this approach was successfully used for test campaigns of deep…

Software Engineering · Computer Science 2022-07-26 Steffen Herbold, Steffen Tunkel

StyleDiff: Attribute Comparison Between Unlabeled Datasets in Latent Disentangled Space

One major challenge in machine learning applications is coping with mismatches between the datasets used in the development and those obtained in real-world applications. These mismatches may lead to inaccurate predictions and errors,…

Machine Learning · Statistics 2023-09-01 Keisuke Kawano, Takuro Kutsuna, Ryoko Tokuhisa, Akihiro Nakamura, Yasushi Esaki

On the Feature Learning in Diffusion Models

The predominant success of diffusion models in generative modeling has spurred significant interest in understanding their theoretical foundations. In this work, we propose a feature learning framework aimed at analyzing and comparing the…

Machine Learning · Statistics 2025-03-04 Andi Han, Wei Huang, Yuan Cao, Difan Zou

"What is Different Between These Datasets?" A Framework for Explaining Data Distribution Shifts

The performance of machine learning models relies heavily on the quality of input data, yet real-world applications often face significant data-related challenges. A common issue arises when curating training data or deploying models: two…

Machine Learning · Computer Science 2025-09-24 Varun Babbar, Zhicheng Guo, Cynthia Rudin

A Survey and Empirical Evaluation of Parallel Deep Learning Frameworks

The field of deep learning has witnessed a remarkable shift towards extremely compute- and memory-intensive neural networks. These newer larger models have enabled researchers to advance state-of-the-art tools across a variety of fields.…

Machine Learning · Computer Science 2022-07-04 Daniel Nichols, Siddharth Singh, Shu-Huai Lin, Abhinav Bhatele

Diffeomorphic Learning

We introduce in this paper a learning paradigm in which the training data is transformed by a diffeomorphic transformation before prediction. The learning algorithm minimizes a cost function evaluating the prediction error on the training…

Machine Learning · Statistics 2023-12-05 Laurent Younes

Fairer and more accurate, but for whom?

Complex statistical machine learning models are increasingly being used or considered for use in high-stakes decision-making pipelines in domains such as financial services, health care, criminal justice and human services. These models are…

Applications · Statistics 2017-07-04 Alexandra Chouldechova, Max G'Sell

Bridging the Gap: Machine Learning to Resolve Improperly Modeled Dynamics

We present a data-driven modeling strategy to overcome improperly modeled dynamics for systems exhibiting complex spatio-temporal behaviors. We propose a Deep Learning framework to resolve the differences between the true dynamics of the…

Machine Learning · Computer Science 2020-10-28 Maan Qraitem, Dhanushka Kularatne, Eric Forgoston, M. Ani Hsieh

Datamodels: Predicting Predictions from Training Data

We present a conceptual framework, datamodeling, for analyzing the behavior of a model class in terms of the training data. For any fixed "target" example $x$, training set $S$, and learning algorithm, a datamodel is a parameterized…

Machine Learning · Statistics 2022-02-02 Andrew Ilyas, Sung Min Park, Logan Engstrom, Guillaume Leclerc, Aleksander Madry

Understanding Prediction Discrepancies in Machine Learning Classifiers

A multitude of classifiers can be trained on the same data to achieve similar performances during test time, while having learned significantly different classification patterns. This phenomenon, which we call prediction discrepancies, is…

Machine Learning · Computer Science 2024-08-01 Xavier Renard, Thibault Laugel, Marcin Detyniecki

SceneDiff: A Benchmark and Method for Multiview Object Change Detection

We investigate the problem of identifying objects that have been added, removed, or moved between a pair of captures (images or videos) of the same scene at different times. Accurately identifying verifiable changes is extremely challenging…

Computer Vision and Pattern Recognition · Computer Science 2026-04-01 Yuqun Wu, Chih-hao Lin, Henry Che, Aditi Tiwari, Chuhang Zou, Shenlong Wang, Derek Hoiem

Data Complexity-aware Deep Model Performance Forecasting

Deep learning models are widely used across computer vision and other domains. When working on the model induction, selecting the right architecture for a given dataset often relies on repetitive trial-and-error procedures. This procedure…

Machine Learning · Computer Science 2026-01-06 Yen-Chia Chen, Hsing-Kuo Pao, Hanjuan Huang

Unlearning as multi-task optimization: A normalized gradient difference approach with an adaptive learning rate

Machine unlearning has been used to remove unwanted knowledge acquired by large language models (LLMs). In this paper, we examine machine unlearning from an optimization perspective, framing it as a regularized multi-task optimization…

Machine Learning · Computer Science 2025-05-07 Zhiqi Bu, Xiaomeng Jin, Bhanukiran Vinzamuri, Anil Ramakrishna, Kai-Wei Chang, Volkan Cevher, Mingyi Hong

Attribute-to-Delete: Machine Unlearning via Datamodel Matching

Machine unlearning -- efficiently removing the effect of a small "forget set" of training data on a pre-trained machine learning model -- has recently attracted significant research interest. Despite this interest, however, recent work…

Machine Learning · Computer Science 2024-11-13 Kristian Georgiev, Roy Rinberg, Sung Min Park, Shivam Garg, Andrew Ilyas, Aleksander Madry, Seth Neel

Benchmarking Neural Network Training Algorithms

Training algorithms, broadly construed, are an essential part of every deep learning pipeline. Training algorithm improvements that speed up training across a wide variety of workloads (e.g., better update rules, tuning protocols, learning…

Machine Learning · Computer Science 2025-06-19 George E. Dahl, Frank Schneider, Zachary Nado, Naman Agarwal, Chandramouli Shama Sastry, Philipp Hennig, Sourabh Medapati, Runa Eschenhagen, Priya Kasimbeg, Daniel Suo, Juhan Bae, Justin Gilmer, Abel L. Peirson, Bilal Khan, Rohan Anil, Mike Rabbat, Shankar Krishnan, Daniel Snider, Ehsan Amid, Kongtao Chen, Chris J. Maddison, Rakshith Vasudev, Michal Badura, Ankush Garg, Peter Mattson