Related papers: Efficient Data-Dependent Learnability

Quantifying the Prediction Uncertainty of Machine Learning Models for Individual Data

Machine learning models have exhibited exceptional results in various domains. The most prevalent approach for learning is the empirical risk minimizer (ERM), which adapts the model's weights to reduce the loss on a training set and…

Machine Learning · Computer Science 2024-12-11 Koby Bibas

Deep pNML: Predictive Normalized Maximum Likelihood for Deep Neural Networks

The Predictive Normalized Maximum Likelihood (pNML) scheme has been recently suggested for universal learning in the individual setting, where both the training and test samples are individual data. The goal of universal learning is to…

Machine Learning · Computer Science 2020-01-09 Koby Bibas, Yaniv Fogel, Meir Feder

Universal Supervised Learning for Individual Data

Universal supervised learning is considered from an information theoretic point of view following the universal prediction approach, see Merhav and Feder (1998). We consider the standard supervised "batch" learning where prediction is done…

Information Theory · Computer Science 2018-12-27 Yaniv Fogel, Meir Feder

Distribution Free Uncertainty for the Minimum Norm Solution of Over-parameterized Linear Regression

A fundamental principle of learning theory is that there is a trade-off between the complexity of a prediction rule and its ability to generalize. Modern machine learning models do not obey this paradigm: They produce an accurate prediction…

Machine Learning · Computer Science 2021-06-18 Koby Bibas, Meir Feder

Beyond Ridge Regression for Distribution-Free Data

In supervised batch learning, the predictive normalized maximum likelihood (pNML) has been proposed as the min-max regret solution for the distribution-free setting, where no distributional assumptions are made on the data. However, the…

Machine Learning · Computer Science 2022-06-20 Koby Bibas, Meir Feder

Amortized Conditional Normalized Maximum Likelihood: Reliable Out of Distribution Uncertainty Estimation

While deep neural networks provide good performance for a range of challenging tasks, calibration and uncertainty estimation remain major challenges, especially under distribution shift. In this paper, we propose the amortized conditional…

Machine Learning · Computer Science 2021-03-03 Aurick Zhou, Sergey Levine

Offline Model-Based Optimization via Normalized Maximum Likelihood Estimation

In this work we consider data-driven optimization problems where one must maximize a function given only queries at a fixed set of points. This problem setting emerges in many domains where function evaluation is a complex and expensive…

Machine Learning · Computer Science 2021-02-17 Justin Fu, Sergey Levine

Measuring Stochastic Data Complexity with Boltzmann Influence Functions

Estimating the uncertainty of a model's prediction on a test point is a crucial part of ensuring reliability and calibration under distribution shifts. A minimum description length approach to this problem uses the predictive normalized…

Machine Learning · Computer Science 2024-07-22 Nathan Ng, Roger Grosse, Marzyeh Ghassemi

Learn Like The Pro: Norms from Theory to Size Neural Computation

The optimal design of neural networks is a critical problem in many applications. Here, we investigate how dynamical systems with polynomial nonlinearities can inform the design of neural systems that seek to emulate them. We propose a…

Machine Learning · Computer Science 2021-06-23 Margaret Trautner, Ziwei Li, Sai Ravela

Anomaly-aware summary statistic from data batches

Signal-agnostic data exploration based on machine learning could unveil very subtle statistical deviations of collider data from the expected Standard Model of particle physics. The beneficial impact of a large training sample on machine…

High Energy Physics - Experiment · Physics 2024-09-17 Gaia Grosso

Single Layer Predictive Normalized Maximum Likelihood for Out-of-Distribution Detection

Detecting out-of-distribution (OOD) samples is vital for developing machine learning based models for critical safety systems. Common approaches for OOD detection assume access to some OOD samples during training which may not be available…

Machine Learning · Computer Science 2021-10-19 Koby Bibas, Meir Feder, Tal Hassner

Unlearning in- vs. out-of-distribution data in LLMs under gradient-based method

Machine unlearning aims to solve the problem of removing the influence of selected training examples from a learned model. Despite the increasing attention to this problem, it remains an open research question how to evaluate unlearning in…

Machine Learning · Computer Science 2024-11-08 Teodora Baluta, Pascal Lamblin, Daniel Tarlow, Fabian Pedregosa, Gintare Karolina Dziugaite

Learning from Conditional Distributions via Dual Embeddings

Many machine learning tasks, such as learning with invariance and policy evaluation in reinforcement learning, can be characterized as problems of learning from conditional distributions. In such problems, each sample $x$ itself is…

Machine Learning · Computer Science 2017-01-03 Bo Dai, Niao He, Yunpeng Pan, Byron Boots, Le Song

Optimal Multi-Distribution Learning

Multi-distribution learning (MDL), which seeks to learn a shared model that minimizes the worst-case risk across $k$ distinct data distributions, has emerged as a unified framework in response to the evolving demand for robustness,…

Machine Learning · Computer Science 2025-08-12 Zihan Zhang, Wenhao Zhan, Yuxin Chen, Simon S. Du, Jason D. Lee

Instance Optimal Learning

We consider the following basic learning task: given independent draws from an unknown distribution over a discrete support, output an approximation of the distribution that is as accurate as possible in $\ell_1$ distance (i.e. total…

Machine Learning · Computer Science 2015-11-12 Gregory Valiant, Paul Valiant

Maximum Likelihood Estimation for Learning Populations of Parameters

Consider a setting with $N$ independent individuals, each with an unknown parameter, $p_i \in [0, 1]$ drawn from some unknown distribution $P^\star$. After observing the outcomes of $t$ independent Bernoulli trials, i.e., $X_i \sim…

Statistics Theory · Mathematics 2019-02-13 Ramya Korlakai Vinayak, Weihao Kong, Gregory Valiant, Sham M. Kakade

The Broad Optimality of Profile Maximum Likelihood

We study three fundamental statistical-learning problems: distribution estimation, property estimation, and property testing. We establish the profile maximum likelihood (PML) estimator as the first unified sample-optimal approach to a wide…

Machine Learning · Statistics 2019-07-12 Yi Hao, Alon Orlitsky

Dependable Distributed Training of Compressed Machine Learning Models

The existing work on the distributed training of machine learning (ML) models has consistently overlooked the distribution of the achieved learning quality, focusing instead on its average value. This leads to a poor dependability of the…

Machine Learning · Computer Science 2024-02-23 Francesco Malandrino, Giuseppe Di Giacomo, Marco Levorato, Carla Fabiana Chiasserini

Estimating Learnability in the Sublinear Data Regime

We consider the problem of estimating how well a model class is capable of fitting a distribution of labeled data. We show that it is often possible to accurately estimate this "learnability" even when given an amount of data that is too…

Machine Learning · Computer Science 2019-03-26 Weihao Kong, Gregory Valiant

Measure Theoretic Approach to Nonuniform Learnability

An earlier introduced characterization of nonuniform learnability that allows the sample size to depend on the hypothesis to which the learner is compared has been redefined using the measure theoretic approach. Where nonuniform…

Machine Learning · Computer Science 2020-11-03 Ankit Bandyopadhyay