Related papers: Sample based Explanations via Generalized Represen…

Representer Point Selection for Explaining Regularized High-dimensional Models

We introduce a novel class of sample-based explanations we term high-dimensional representers, that can be used to explain the predictions of a regularized high-dimensional model in terms of importance weights for each of the training…

Machine Learning · Computer Science 2023-07-04 Che-Ping Tsai , Jiong Zhang , Eli Chien , Hsiang-Fu Yu , Cho-Jui Hsieh , Pradeep Ravikumar

A Generalized Representer Theorem for Hilbert Space - Valued Functions

The necessary and sufficient conditions for existence of a generalized representer theorem are presented for learning Hilbert space-valued functions. Representer theorems involving explicit basis functions and Reproducing Kernels are a…

Machine Learning · Computer Science 2018-09-21 Sanket Diwale , Colin Jones

Global-to-Local Support Spectrums for Language Model Explainability

Existing sample-based methods, like influence functions and representer points, measure the importance of a training point by approximating the effect of its removal from training. As such, they are skewed towards outliers and points that…

Machine Learning · Computer Science 2024-08-13 Lucas Agussurja , Xinyang Lu , Bryan Kian Hsiang Low

Generalized SHAP: Generating multiple types of explanations in machine learning

Many important questions about a model cannot be answered just by explaining how much each feature contributes to its output. To answer a broader set of questions, we generalize a popular, mathematically well-grounded explanation technique,…

Machine Learning · Computer Science 2020-06-16 Dillon Bowen , Lyle Ungar

Understanding the Generalization Benefit of Model Invariance from a Data Perspective

Machine learning models that are developed with invariance to certain types of data transformations have demonstrated superior generalization performance in practice. However, the underlying mechanism that explains why invariance leads to…

Machine Learning · Computer Science 2023-02-24 Sicheng Zhu , Bang An , Furong Huang

When is there a representer theorem? Vector versus matrix regularizers

We consider a general class of regularization methods which learn a vector of parameters on the basis of linear measurements. It is well known that if the regularizer is a nondecreasing function of the inner product then the learned vector…

Machine Learning · Computer Science 2012-08-21 Andreas Argyriou , Charles Micchelli , Massimiliano Pontil

More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize

Of theories for why large-scale machine learning models generalize despite being vastly overparameterized, which of their assumptions are needed to capture the qualitative phenomena of generalization in the real world? On one hand, we find…

Machine Learning · Computer Science 2022-03-14 Alexander Wei , Wei Hu , Jacob Steinhardt

Generalization Metrics for Practical Quantum Advantage in Generative Models

As the quantum computing community gravitates towards understanding the practical benefits of quantum computers, having a clear definition and evaluation scheme for assessing practical quantum advantage in the context of specific…

Machine Learning · Computer Science 2023-05-12 Kaitlin Gili , Marta Mauri , Alejandro Perdomo-Ortiz

Understanding Generalization via Set Theory

Generalization is at the core of machine learning models. However, the definition of generalization is not entirely clear. We employ set theory to introduce the concepts of algorithms, hypotheses, and dataset generalization. We analyze the…

Machine Learning · Computer Science 2023-11-14 Shiqi Liu

Modeling Generalization in Machine Learning: A Methodological and Computational Study

As machine learning becomes more and more available to the general public, theoretical questions are turning into pressing practical issues. Possibly, one of the most relevant concerns is the assessment of our confidence in trusting machine…

Machine Learning · Computer Science 2020-06-30 Pietro Barbiero , Giovanni Squillero , Alberto Tonda

Leveraging Conditional Generative Models in a General Explanation Framework of Classifier Decisions

Providing a human-understandable explanation of classifiers' decisions has become imperative to generate trust in their use for day-to-day tasks. Although many works have addressed this problem by generating visual explanation maps, they…

Machine Learning · Computer Science 2021-06-22 Martin Charachon , Paul-Henry Cournède , Céline Hudelot , Roberto Ardon

Biased Generalization in Diffusion Models

Generalization in generative modeling is defined as the ability to learn an underlying distribution from a finite dataset and produce novel samples, with evaluation largely driven by held-out performance and perceived sample quality. In…

Machine Learning · Computer Science 2026-03-05 Jerome Garnier-Brun , Luca Biggio , Davide Beltrame , Marc Mézard , Luca Saglietti

Robust Representation Learning through Explicit Environment Modeling

We consider learning from labeled data collected across multiple environments, where the data distribution may vary across these environments. This problem is commonly approached from a causal perspective, seeking invariant representations…

Machine Learning · Statistics 2026-04-30 Yuli Slavutsky , David M. Blei

Generalized Energy Based Models

We introduce the Generalized Energy Based Model (GEBM) for generative modelling. These models combine two trained components: a base distribution (generally an implicit model), which can learn the support of data with low intrinsic…

Machine Learning · Statistics 2021-12-22 Michael Arbel , Liang Zhou , Arthur Gretton

Generalizability of experimental studies

Experimental studies are a cornerstone of Machine Learning (ML) research. A common and often implicit assumption is that the study's results will generalize beyond the study itself, e.g., to new data. That is, repeating the same study under…

Machine Learning · Computer Science 2025-12-05 Federico Matteucci , Vadim Arzamasov , Jose Cribeiro-Ramallo , Marco Heyden , Konstantin Ntounas , Klemens Böhm

Learning Internal Representations (PhD Thesis)

Most machine learning theory and practice is concerned with learning a single task. In this thesis it is argued that in general there is insufficient information in a single task for a learner to generalise well and that what is required…

Machine Learning · Computer Science 2019-11-25 Jonathan Baxter

Generalization Properties of Retrieval-based Models

Many modern high-performing machine learning models such as GPT-3 primarily rely on scaling up models, e.g., transformer networks. Simultaneously, a parallel line of work aims to improve the model performance by augmenting an input instance…

Machine Learning · Computer Science 2022-10-07 Soumya Basu , Ankit Singh Rawat , Manzil Zaheer

Provable benefits of representation learning

There is general consensus that learning representations is useful for a variety of reasons, e.g. efficient use of labeled data (semi-supervised learning), transfer learning and understanding hidden structure of data. Popular techniques for…

Machine Learning · Computer Science 2017-06-15 Sanjeev Arora , Andrej Risteski

Visual Representation Learning Does Not Generalize Strongly Within the Same Domain

An important component for generalization in machine learning is to uncover underlying latent factors of variation as well as the mechanism through which each factor acts in the world. In this paper, we test whether 17 unsupervised, weakly…

Machine Learning · Computer Science 2022-02-15 Lukas Schott , Julius von Kügelgen , Frederik Träuble , Peter Gehler , Chris Russell , Matthias Bethge , Bernhard Schölkopf , Francesco Locatello , Wieland Brendel

Compact Example-Based Explanations for Language Models

Training data influence estimation methods quantify the contribution of training documents to a model's output, making them a promising source of information for example-based explanations. As humans cannot interpret thousands of documents,…

Computation and Language · Computer Science 2026-04-10 Loris Schoenegger , Benjamin Roth