Related papers: Competitive Distribution Estimation

Oracle-Efficient Hybrid Online Learning with Unknown Distribution

We study the problem of oracle-efficient hybrid online learning when the features are generated by an unknown i.i.d. process and the labels are generated adversarially. Assuming access to an (offline) ERM oracle, we show that there exists a…

Machine Learning · Computer Science 2025-02-13 Changlong Wu, Jin Sima, Wojciech Szpankowski

Instance-Optimality for Private KL Distribution Estimation

We study the fundamental problem of estimating an unknown discrete distribution $p$ over $d$ symbols, given $n$ i.i.d. samples from the distribution. We are interested in minimizing the KL divergence between the true distribution and the…

Machine Learning · Statistics 2025-05-30 Jiayuan Ye, Vitaly Feldman, Kunal Talwar

Distributed Non-Stochastic Experts

We consider the online distributed non-stochastic experts problem, where the distributed system consists of one coordinator node that is connected to $k$ sites, and the sites are required to communicate with each other via the coordinator.…

Machine Learning · Computer Science 2012-11-15 Varun Kanade, Zhenming Liu, Bozidar Radunovic

Distribution Testing with a Confused Collector

We are interested in testing properties of distributions with systematically mislabeled samples. Our goal is to make decisions about unknown probability distributions, using a sample that has been collected by a confused collector, such as…

Data Structures and Algorithms · Computer Science 2023-11-27 Renato Ferreira Pinto, Nathaniel Harms

The Computational Power of Optimization in Online Learning

We consider the fundamental problem of prediction with expert advice where the experts are "optimizable": there is a black-box optimization oracle that can be used to compute, in constant time, the leading expert in retrospect at any point…

Machine Learning · Computer Science 2016-01-28 Elad Hazan, Tomer Koren

Unconstrained Online Linear Learning in Hilbert Spaces: Minimax Algorithms and Normal Approximations

We study algorithms for online linear optimization in Hilbert spaces, focusing on the case where the player is unconstrained. We develop a novel characterization of a large class of minimax algorithms, recovering, and even improving,…

Machine Learning · Computer Science 2014-05-22 H. Brendan McMahan, Francesco Orabona

The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

In reinforcement learning, specifying reward functions that capture the intended task can be very challenging. Reward learning aims to address this issue by learning the reward function. However, a learned reward model may have a low error…

Machine Learning · Computer Science 2025-07-09 Lukas Fluri, Leon Lang, Alessandro Abate, Patrick Forré, David Krueger, Joar Skalse

Regret Minimization for Piecewise Linear Rewards: Contracts, Auctions, and Beyond

Most microeconomic models of interest involve optimizing a piecewise linear function. These include contract design in hidden-action principal-agent problems, selling an item in posted-price auctions, and bidding in first-price auctions.…

Computer Science and Game Theory · Computer Science 2025-03-04 Francesco Bacchiocchi, Matteo Castiglioni, Alberto Marchesi, Nicola Gatti

A graph-informed regret metric for optimal distributed control

We consider the optimal control of large-scale systems using distributed controllers with a network topology that mirrors the coupling graph between subsystems. In this work, we introduce spatial regret, a graph-informed metric that…

Systems and Control · Electrical Eng. & Systems 2025-11-19 Daniele Martinelli, Andrea Martin, Giancarlo Ferrari-Trecate, Luca Furieri

Distribution Free Uncertainty for the Minimum Norm Solution of Over-parameterized Linear Regression

A fundamental principle of learning theory is that there is a trade-off between the complexity of a prediction rule and its ability to generalize. Modern machine learning models do not obey this paradigm: They produce an accurate prediction…

Machine Learning · Computer Science 2021-06-18 Koby Bibas, Meir Feder

Estimation of KL Divergence: Optimal Minimax Rate

The problem of estimating the Kullback-Leibler divergence $D(P\|Q)$ between two unknown distributions $P$ and $Q$ is studied, under the assumption that the alphabet size $k$ of the distributions can scale to infinity. The estimation is…

Information Theory · Computer Science 2018-02-22 Yuheng Bu, Shaofeng Zou, Yingbin Liang, Venugopal V. Veeravalli

Online MinCut: Competitive and Regret Analysis

In this paper we study the mincut problem in the online setting. We consider two distinct models: A) competitive analysis and B) regret analysis. In the competitive setting we consider the vertex arrival model; whenever a new vertex arrives…

Data Structures and Algorithms · Computer Science 2020-08-17 Avah Banerjee, Guoli Ding

Order Optimal One-Shot Distributed Learning

We consider distributed statistical optimization in one-shot setting, where there are $m$ machines each observing $n$ i.i.d. samples. Based on its observed samples, each machine then sends an $O(\log(mn))$-length message to a server, at…

Machine Learning · Computer Science 2019-11-12 Arsalan Sharifnassab, Saber Salehkaleybar, S. Jamaloddin Golestani

More Supervision, Less Computation: Statistical-Computational Tradeoffs in Weakly Supervised Learning

We consider the weakly supervised binary classification problem where the labels are randomly flipped with probability $1 - \alpha$. Although there exist numerous algorithms for this problem, it remains theoretically unexplored how the…

Machine Learning · Computer Science 2019-07-16 Xinyang Yi, Zhaoran Wang, Zhuoran Yang, Constantine Caramanis, Han Liu

Minimax Regret Optimization for Robust Machine Learning under Distribution Shift

In this paper, we consider learning scenarios where the learned model is evaluated under an unknown test distribution which potentially differs from the training distribution (i.e. distribution shift). The learner has access to a family of…

Machine Learning · Computer Science 2022-02-14 Alekh Agarwal, Tong Zhang

No-Regret Learning in Partially-Informed Auctions

Auctions with partially-revealed information about items are broadly employed in real-world applications, but the underlying mechanisms have limited theoretical support. In this work, we study a machine learning formulation of these types…

Machine Learning · Computer Science 2022-07-06 Wenshuo Guo, Michael I. Jordan, Ellen Vitercik

Optimal amortized regret in every interval

Consider the classical problem of predicting the next bit in a sequence of bits. A standard performance measure is regret (loss in payoff) with respect to a set of experts. For example, if we measure performance with respect to two…

Machine Learning · Computer Science 2013-04-30 Rina Panigrahy, Preyas Popat

Online Learning of Optimal Sequential Testing Policies

This paper studies an online learning problem that seeks optimal testing policies for a stream of subjects, each of whom can be evaluated through a sequence of candidate tests drawn from a common pool. We refer to this problem as the Online…

Machine Learning · Computer Science 2025-09-05 Qiyuan Chen, Raed Al Kontar

Label optimal regret bounds for online local learning

We resolve an open question from (Christiano, 2014b) posed in COLT'14 regarding the optimal dependency of the regret achievable for online local learning on the size of the label set. In this framework the algorithm is shown a pair of items…

Machine Learning · Computer Science 2015-08-25 Pranjal Awasthi, Moses Charikar, Kevin A. Lai, Andrej Risteski

Minimal Expected Regret in Linear Quadratic Control

We consider the problem of online learning in Linear Quadratic Control systems whose state transition and state-action transition matrices $A$ and $B$ may be initially unknown. We devise an online learning algorithm and provide guarantees…

Machine Learning · Computer Science 2021-09-30 Yassir Jedra, Alexandre Proutiere