Related papers: Localized Complexities for Transductive Learning

Improved Generalization Bounds for Transductive Learning by Transductive Local Complexity and Its Applications

We introduce Transductive Local Complexity (TLC) to extend the classical Local Rademacher Complexity (LRC) to the transductive setting, incorporating substantial and novel components. Although LRC has been used to obtain sharp…

Machine Learning · Statistics 2026-02-06 Yingzhen Yang

Tessellation Localized Transfer learning for nonparametric regression

Transfer learning aims to improve performance on a target task by leveraging information from related source tasks. We propose a nonparametric regression transfer learning framework that explicitly models heterogeneity in the source-target…

Statistics Theory · Mathematics 2026-03-19 Hélène Halconruy , Benjamin Bobbia , Paul Lejamtel

On Localized Discrepancy for Domain Adaptation

We propose the discrepancy-based generalization theories for unsupervised domain adaptation. Previous theories introduced distribution discrepancies defined as the supremum over complete hypothesis space. The hypothesis space may contain…

Machine Learning · Computer Science 2020-08-17 Yuchen Zhang , Mingsheng Long , Jianmin Wang , Michael I. Jordan

Permutational Rademacher Complexity: a New Complexity Measure for Transductive Learning

Transductive learning considers situations when a learner observes $m$ labelled training points and $u$ unlabelled test points with the final goal of giving correct answers for the test points. This paper introduces a new complexity measure…

Machine Learning · Statistics 2016-02-24 Ilya Tolstikhin , Nikita Zhivotovskiy , Gilles Blanchard

Transductive Model Selection under Prior Probability Shift

Transductive learning is a supervised machine learning task in which, unlike in traditional inductive learning, the unlabelled data that require labelling are a finite set and are available at training time. Similarly to inductive learning…

Machine Learning · Computer Science 2025-07-31 Lorenzo Volpi , Alejandro Moreo , Fabrizio Sebastiani

New concentration inequalities for suprema of empirical processes

While effective concentration inequalities for suprema of empirical processes exist under boundedness or strict tail assumptions, no comparable results have been available under considerably weaker assumptions. In this paper, we derive…

Probability · Mathematics 2014-10-23 Johannes Lederer , Sara van de Geer

On Inductive Biases for Machine Learning in Data Constrained Settings

Learning with limited data is one of the biggest problems of machine learning. Current approaches to this issue consist in learning general representations from huge amounts of data before fine-tuning the model on a small dataset of…

Machine Learning · Computer Science 2023-02-22 Grégoire Mialon

Topologically Densified Distributions

We study regularization in the context of small sample-size learning with over-parameterized neural networks. Specifically, we shift focus from architectural properties, such as norms on the network weights, to properties of the internal…

Machine Learning · Computer Science 2021-05-18 Christoph D. Hofer , Florian Graf , Marc Niethammer , Roland Kwitt

Contextual Counting: A Mechanistic Study of Transformers on a Quantitative Task

Transformers have revolutionized machine learning across diverse domains, yet understanding their behavior remains crucial, particularly in high-stakes applications. This paper introduces the contextual counting task, a novel toy problem…

Machine Learning · Computer Science 2024-06-06 Siavash Golkar , Alberto Bietti , Mariel Pettee , Michael Eickenberg , Miles Cranmer , Keiya Hirashima , Geraud Krawezik , Nicholas Lourie , Michael McCabe , Rudy Morel , Ruben Ohana , Liam Holden Parker , Bruno Régaldo-Saint Blancard , Kyunghyun Cho , Shirley Ho

Supervised Contextual Embeddings for Transfer Learning in Natural Language Processing Tasks

Pre-trained word embeddings are the primary method for transfer learning in several Natural Language Processing (NLP) tasks. Recent works have focused on using unsupervised techniques such as language modeling to obtain these embeddings. In…

Computation and Language · Computer Science 2019-07-01 Mihir Kale , Aditya Siddhant , Sreyashi Nag , Radhika Parik , Matthias Grabmair , Anthony Tomasic

Learning to Embed Distributions via Maximum Kernel Entropy

Empirical data can often be considered as samples from a set of probability distributions. Kernel methods have emerged as a natural approach for learning to classify these distributions. Although numerous kernels between distributions have…

Machine Learning · Computer Science 2024-12-02 Oleksii Kachaiev , Stefano Recanatesi

Transductive Learning with String Kernels for Cross-Domain Text Classification

For many text classification tasks, there is a major problem posed by the lack of labeled data in a target domain. Although classifiers for a target domain can be trained on labeled text data from a related source domain, the accuracy of…

Computation and Language · Computer Science 2018-11-06 Radu Tudor Ionescu , Andrei M. Butnaru

Invariant Models for Causal Transfer Learning

Methods of transfer learning try to combine knowledge from several related tasks (or domains) to improve performance on a test task. Inspired by causal methodology, we relax the usual covariate shift assumption and assume that it holds true…

Machine Learning · Statistics 2018-09-25 Mateo Rojas-Carulla , Bernhard Schölkopf , Richard Turner , Jonas Peters

The Local Rademacher Complexity of Lp-Norm Multiple Kernel Learning

We derive an upper bound on the local Rademacher complexity of $\ell_p$-norm multiple kernel learning, which yields a tighter excess risk bound than global approaches. Previous local approaches aimed at analyzed the case $p=1$ only while…

Machine Learning · Statistics 2011-03-07 Marius Kloft , Gilles Blanchard

A Learning Theoretic Perspective on Local Explainability

In this paper, we explore connections between interpretable machine learning and learning theory through the lens of local approximation explanations. First, we tackle the traditional problem of performance generalization and bound the…

Machine Learning · Computer Science 2020-11-03 Jeffrey Li , Vaishnavh Nagarajan , Gregory Plumb , Ameet Talwalkar

Localizing Paragraph Memorization in Language Models

Can we localize the weights and mechanisms used by a language model to memorize and recite entire paragraphs of its training data? In this paper, we show that while memorization is spread across multiple layers and model components,…

Computation and Language · Computer Science 2024-04-01 Niklas Stoehr , Mitchell Gordon , Chiyuan Zhang , Owen Lewis

Learning Confidence Bounds for Classification with Imbalanced Data

Class imbalance poses a significant challenge in classification tasks, where traditional approaches often lead to biased models and unreliable predictions. Undersampling and oversampling techniques have been commonly employed to address…

Machine Learning · Computer Science 2025-10-22 Matt Clifford , Jonathan Erskine , Alexander Hepburn , Raúl Santos-Rodríguez , Dario Garcia-Garcia

On the Sample Complexity of Learning under Invariance and Geometric Stability

Many supervised learning problems involve high-dimensional data such as images, text, or graphs. In order to make efficient use of data, it is often useful to leverage certain geometric priors in the problem at hand, such as invariance to…

Machine Learning · Statistics 2021-11-08 Alberto Bietti , Luca Venturi , Joan Bruna

Model-Robust and Adaptive-Optimal Transfer Learning for Tackling Concept Shifts in Nonparametric Regression

When concept shifts and sample scarcity are present in the target domain of interest, nonparametric regression learners often struggle to generalize effectively. The technique of transfer learning remedies these issues by leveraging data or…

Machine Learning · Statistics 2025-01-22 Haotian Lin , Matthew Reimherr

Comparative Learning: A Sample Complexity Theory for Two Hypothesis Classes

In many learning theory problems, a central role is played by a hypothesis class: we might assume that the data is labeled according to a hypothesis in the class (usually referred to as the realizable setting), or we might evaluate the…

Machine Learning · Computer Science 2022-11-17 Lunjia Hu , Charlotte Peale