Related papers: Derandomizing Multi-Distribution Learning

Collaborative Learning with Different Labeling Functions

We study a variant of Collaborative PAC Learning, in which we aim to learn an accurate classifier for each of the $n$ data distributions, while minimizing the number of samples drawn from them in total. Unlike in the usual collaborative…

Machine Learning · Computer Science 2024-05-24 Yuyang Deng , Mingda Qiao

Optimal Multi-Distribution Learning

Multi-distribution learning (MDL), which seeks to learn a shared model that minimizes the worst-case risk across $k$ distinct data distributions, has emerged as a unified framework in response to the evolving demand for robustness,…

Machine Learning · Computer Science 2025-08-12 Zihan Zhang , Wenhao Zhan , Yuxin Chen , Simon S. Du , Jason D. Lee

On Calibration in Multi-Distribution Learning

Modern challenges of robustness, fairness, and decision-making in machine learning have led to the formulation of multi-distribution learning (MDL) frameworks in which a predictor is optimized across multiple distributions. We study the…

Machine Learning · Computer Science 2024-12-19 Rajeev Verma , Volker Fischer , Eric Nalisnick

On Policy Evaluation Algorithms in Distributional Reinforcement Learning

We introduce a novel class of algorithms to efficiently approximate the unknown return distributions in policy evaluation problems from distributional reinforcement learning (DRL). The proposed distributional dynamic programming algorithms…

Machine Learning · Statistics 2024-07-22 Julian Gerstenberg , Ralph Neininger , Denis Spiegel

On-Demand Sampling: Learning Optimally from Multiple Distributions

Social and real-world considerations such as robustness, fairness, social welfare and multi-agent tradeoffs have given rise to multi-distribution learning paradigms, such as collaborative learning, group distributionally robust…

Machine Learning · Computer Science 2024-04-04 Nika Haghtalab , Michael I. Jordan , Eric Zhao

How Does Independence Help Generalization? Sample Complexity of ERM on Product Distributions

While many classical notions of learnability (e.g., PAC learnability) are distribution-free, utilizing the specific structures of an input distribution may improve learning performance. For example, a product distribution on a…

Machine Learning · Computer Science 2022-12-14 Tao Lin

Lifting uniform learners via distributional decomposition

We show how any PAC learning algorithm that works under the uniform distribution can be transformed, in a blackbox fashion, into one that works under an arbitrary and unknown distribution $\mathcal{D}$. The efficiency of our transformation…

Machine Learning · Statistics 2023-03-31 Guy Blanc , Jane Lange , Ali Malik , Li-Yang Tan

Calibrating Predictions to Decisions: A Novel Approach to Multi-Class Calibration

When facing uncertainty, decision-makers want predictions they can trust. A machine learning provider can convey confidence to decision-makers by guaranteeing their predictions are distribution calibrated -- amongst the inputs that receive…

Machine Learning · Statistics 2021-07-14 Shengjia Zhao , Michael P. Kim , Roshni Sahoo , Tengyu Ma , Stefano Ermon

Distribution Shift Is Key to Learning Invariant Prediction

An interesting phenomenon arises: Empirical Risk Minimization (ERM) sometimes outperforms methods specifically designed for out-of-distribution tasks. This motivates an investigation into the reasons behind such behavior beyond algorithmic…

Machine Learning · Computer Science 2026-01-21 Hong Zheng , Fei Teng

Discretization-free Multicalibration through Loss Minimization over Tree Ensembles

In recent years, multicalibration has emerged as a desirable learning objective for ensuring that a predictor is calibrated across a rich collection of overlapping subpopulations. Existing approaches typically achieve multicalibration by…

Machine Learning · Computer Science 2025-05-26 Hongyi Henry Jin , Zijun Ding , Dung Daniel Ngo , Zhiwei Steven Wu

Distributed Multitask Learning

We consider the problem of distributed multi-task learning, where each machine learns a separate, but related, task. Specifically, each machine learns a linear predictor in high-dimensional space,where all tasks share the same small…

Machine Learning · Statistics 2015-10-05 Jialei Wang , Mladen Kolar , Nathan Srebro

Towards Fundamental Limits for Active Multi-distribution Learning

Multi-distribution learning extends agnostic Probably Approximately Correct (PAC) learning to the setting in which a family of $k$ distributions, $\{D_i\}_{i\in[k]}$, is considered and a classifier's performance is measured by its error…

Machine Learning · Computer Science 2025-06-24 Chicheng Zhang , Yihan Zhou

Distral: Robust Multitask Reinforcement Learning

Most deep reinforcement learning algorithms are data inefficient in complex and rich environments, limiting their applicability to many scenarios. One direction for improving data efficiency is multitask learning with shared neural network…

Machine Learning · Computer Science 2017-07-14 Yee Whye Teh , Victor Bapst , Wojciech Marian Czarnecki , John Quan , James Kirkpatrick , Raia Hadsell , Nicolas Heess , Razvan Pascanu

Distributed Block-diagonal Approximation Methods for Regularized Empirical Risk Minimization

In recent years, there is a growing need to train machine learning models on a huge volume of data. Designing efficient distributed optimization algorithms for empirical risk minimization (ERM) has therefore become an active and challenging…

Optimization and Control · Mathematics 2019-11-19 Ching-pei Lee , Kai-Wei Chang

Distributionally Robust Learning for Multi-source Unsupervised Domain Adaptation

Empirical risk minimization often performs poorly when the distribution of the target domain differs from those of source domains. To address such potential distribution shifts, we develop an unsupervised domain adaptation approach that…

Machine Learning · Statistics 2025-03-25 Zhenyu Wang , Peter Bühlmann , Zijian Guo

A Deep Reinforcement Learning Approach to Efficient Distributed Optimization

In distributed optimization, the practical problem-solving performance is essentially sensitive to algorithm selection, parameter setting, problem type and data pattern. Thus, it is often laborious to acquire a highly efficient method for a…

Optimization and Control · Mathematics 2024-01-04 Daokuan Zhu , Tianqi Xu , Jie Lu

Optimal Learners for Multiclass Problems

The fundamental theorem of statistical learning states that for binary classification problems, any Empirical Risk Minimization (ERM) learning rule has close to optimal sample complexity. In this paper we seek for a generic optimal learner…

Machine Learning · Computer Science 2014-05-13 Amit Daniely , Shai Shalev-Shwartz

Optimal Robust Learning of Discrete Distributions from Batches

Many applications, including natural language processing, sensor networks, collaborative filtering, and federated learning, call for estimating discrete distributions from data collected in batches, some of which may be untrustworthy,…

Machine Learning · Computer Science 2020-02-26 Ayush Jain , Alon Orlitsky

Discriminative Learning via Semidefinite Probabilistic Models

Discriminative linear models are a popular tool in machine learning. These can be generally divided into two types: The first is linear classifiers, such as support vector machines, which are well studied and provide state-of-the-art…

Machine Learning · Computer Science 2012-07-02 Koby Crammer , Amir Globerson

Meta-Learned Invariant Risk Minimization

Empirical Risk Minimization (ERM) based machine learning algorithms have suffered from weak generalization performance on data obtained from out-of-distribution (OOD). To address this problem, Invariant Risk Minimization (IRM) objective was…

Machine Learning · Computer Science 2021-03-25 Jun-Hyun Bae , Inchul Choi , Minho Lee