Related papers: Universal Batch Learning Under The Misspecificatio…

Misspecified Universal Learning

This paper addresses the problem of universal learning under model misspecification with log-loss. In this setting, the learner operates with a hypothesis class of models denoted by $\Theta$, while the true data-generating process belongs…

Information Theory · Computer Science 2026-05-12 Shlomi Vituri, Meir Feder

Leave-One-Out Learning with Log-Loss

We study batch learning with log-loss in the individual setting, where the outcome sequence is deterministic. Because empirical statistics are not directly applicable in this regime, obtaining regret guarantees for batch learning has long…

Information Theory · Computer Science 2025-11-18 Yaniv Fogel, Meir Feder

Sequential prediction under log-loss and misspecification

We consider the question of sequential prediction under the log-loss in terms of cumulative regret. Namely, given a hypothesis class of distributions, the learner sequentially predicts the (distribution of the) next letter in the sequence and its…

Machine Learning · Computer Science 2021-09-16 Meir Feder, Yury Polyanskiy

The Conditional Regret-Capacity Theorem for Batch Universal Prediction

We derive a conditional version of the classical regret-capacity theorem. This result can be used in universal prediction to find lower bounds on the minimal batch regret, which is a recently introduced generalization of the average regret,…

Information Theory · Computer Science 2025-08-15 Marco Bondaschi, Michael Gastpar

On Misspecification in Prediction Problems and Robustness via Improper Learning

We study probabilistic prediction games when the underlying model is misspecified, investigating the consequences of predicting using an incorrect parametric model. We show that for a broad class of loss functions and parametric families of…

Machine Learning · Statistics 2021-02-02 Annie Marsden, John Duchi, Gregory Valiant

Minimax Regret Optimization for Robust Machine Learning under Distribution Shift

In this paper, we consider learning scenarios where the learned model is evaluated under an unknown test distribution which potentially differs from the training distribution (i.e. distribution shift). The learner has access to a family of…

Machine Learning · Computer Science 2022-02-14 Alekh Agarwal, Tong Zhang

Distribution Free Uncertainty for the Minimum Norm Solution of Over-parameterized Linear Regression

A fundamental principle of learning theory is that there is a trade-off between the complexity of a prediction rule and its ability to generalize. Modern machine learning models do not obey this paradigm: They produce an accurate prediction…

Machine Learning · Computer Science 2021-06-18 Koby Bibas, Meir Feder

Nonasymptotic Regret Analysis of Adaptive Linear Quadratic Control with Model Misspecification

The strategy of pre-training a large model on a diverse dataset, then fine-tuning for a particular application has yielded impressive results in computer vision, natural language processing, and robotic control. This strategy has vast…

Systems and Control · Electrical Eng. & Systems 2024-07-30 Bruce D. Lee, Anders Rantzer, Nikolai Matni

The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

In reinforcement learning, specifying reward functions that capture the intended task can be very challenging. Reward learning aims to address this issue by learning the reward function. However, a learned reward model may have a low error…

Machine Learning · Computer Science 2025-07-09 Lukas Fluri, Leon Lang, Alessandro Abate, Patrick Forré, David Krueger, Joar Skalse

Unconstrained Online Linear Learning in Hilbert Spaces: Minimax Algorithms and Normal Approximations

We study algorithms for online linear optimization in Hilbert spaces, focusing on the case where the player is unconstrained. We develop a novel characterization of a large class of minimax algorithms, recovering, and even improving,…

Machine Learning · Computer Science 2014-05-22 H. Brendan McMahan, Francesco Orabona

Minimax Regret Learning for Data with Heterogeneous Subgroups

Modern complex datasets often consist of various sub-populations with known group information. In the presence of sub-population heterogeneity, it is crucial to develop robust and generalizable learning methods that (1) can enjoy robust…

Methodology · Statistics 2025-09-30 Weibin Mo, Weijing Tang, Songkai Xue, Yufeng Liu, Ji Zhu

Learning under Distribution Mismatch and Model Misspecification

We study learning algorithms when there is a mismatch between the distributions of the training and test datasets of a learning algorithm. The effect of this mismatch on the generalization error and model misspecification are quantified.…

Information Theory · Computer Science 2022-08-11 Saeed Masiha, Amin Gohari, Mohammad Hossein Yassaee, Mohammad Reza Aref

Dissecting the Impact of Model Misspecification in Data-driven Optimization

Data-driven optimization aims to translate a machine learning model into decision-making by optimizing decisions on estimated costs. Such a pipeline can be conducted by fitting a distributional model which is then plugged into the target…

Machine Learning · Computer Science 2025-03-17 Adam N. Elmachtoub, Henry Lam, Haixiang Lan, Haofeng Zhang

On Optimal Learning Under Targeted Data Poisoning

Consider the task of learning a hypothesis class $\mathcal{H}$ in the presence of an adversary that can replace up to an $\eta$ fraction of the examples in the training set with arbitrary adversarial examples. The adversary aims to fail the…

Machine Learning · Computer Science 2022-10-13 Steve Hanneke, Amin Karbasi, Mohammad Mahmoody, Idan Mehalel, Shay Moran

An Online Learning Analysis of Minimax Adaptive Control

We present an online learning analysis of minimax adaptive control for the case where the uncertainty includes a finite set of linear dynamical systems. Precisely, for each system inside the uncertainty set, we define the model-based regret…

Systems and Control · Electrical Eng. & Systems 2023-09-12 Venkatraman Renganathan, Andrea Iannelli, Anders Rantzer

Efficient and Near-Optimal Smoothed Online Learning for Generalized Linear Functions

Due to the drastic gap in complexity between sequential and batch statistical learning, recent work has studied a smoothed sequential learning setting, where Nature is constrained to select contexts with density bounded by $1/\sigma$ with…

Machine Learning · Statistics 2022-05-27 Adam Block, Max Simchowitz

Test-Time Regret Minimization in Meta Reinforcement Learning

Meta reinforcement learning sets a distribution over a set of tasks on which the agent can train at will, then is asked to learn an optimal policy for any test task efficiently. In this paper, we consider a finite set of tasks modeled…

Machine Learning · Computer Science 2024-06-05 Mirco Mutti, Aviv Tamar

Precise Regret Bounds for Log-loss via a Truncated Bayesian Algorithm

We study the sequential general online regression, known also as the sequential probability assignments, under logarithmic loss when compared against a broad class of experts. We focus on obtaining tight, often matching, lower and upper…

Machine Learning · Computer Science 2023-02-02 Changlong Wu, Mohsen Heidari, Ananth Grama, Wojciech Szpankowski

A Regret Minimization Approach to Iterative Learning Control

We consider the setting of iterative learning control, or model-based policy learning in the presence of uncertain, time-varying dynamics. In this setting, we propose a new performance metric, planning regret, which replaces the standard…

Machine Learning · Computer Science 2021-03-01 Naman Agarwal, Elad Hazan, Anirudha Majumdar, Karan Singh

No-Regret Linear Bandits under Gap-Adjusted Misspecification

This work studies linear bandits under a new notion of gap-adjusted misspecification and is an extension of Liu et al. (2023). When the underlying reward function is not linear, existing linear bandits work usually relies on a uniform…

Machine Learning · Computer Science 2025-01-10 Chong Liu, Dan Qiao, Ming Yin, Ilija Bogunovic, Yu-Xiang Wang