Related papers: Learning from compressed observations

Achievability results for statistical learning under communication constraints

The problem of statistical learning is to construct an accurate predictor of a random variable as a function of a correlated random variable on the basis of an i.i.d. training sample from their joint distribution. Allowable predictors are…

Information Theory · Computer Science 2009-04-30 Maxim Raginsky

Conditional Risk Minimization for Stochastic Processes

We study the task of learning from non-i.i.d. data. In particular, we aim at learning predictors that minimize the conditional risk for a stochastic process, i.e. the expected loss of the predictor on the next point conditioned on the set…

Machine Learning · Statistics 2016-03-15 Alexander Zimin, Christoph H. Lampert

Statistical learning with indirect observations

Let $(X,Y)\in\mathcal{X}\times \mathcal{Y}$ be a random couple with unknown distribution $P$. Let $\mathcal{G}$ be a class of measurable functions and $\ell$ a loss function. The problem of statistical learning deals with the estimation of the…

Statistics Theory · Mathematics 2012-07-12 Sébastien Loustau

Convergence Rate of a Functional Learning Method for Contextual Stochastic Optimization

We consider a stochastic optimization problem involving two random variables: a context variable $X$ and a dependent variable $Y$. The objective is to minimize the expected value of a nonlinear loss functional applied to the conditional…

Optimization and Control · Mathematics 2026-03-16 Noel Smith, Andrzej Ruszczynski

Compression, Generalization and Learning

A compression function is a map that slims down an observational set into a subset of reduced size, while preserving its informational content. In multiple applications, the condition that one new observation makes the compressed set change…

Machine Learning · Computer Science 2024-01-09 Marco C. Campi, Simone Garatti

A statistical perspective of sampling scores for linear regression

In this paper, we consider a statistical problem of learning a linear model from noisy samples. Existing work has focused on approximating the least squares solution by using leverage-based scores as an importance sampling distribution.…

Machine Learning · Statistics 2016-02-11 Siheng Chen, Rohan Varma, Aarti Singh, Jelena Kovačević

Minimax Learning for Remote Prediction

The classical problem of supervised learning is to infer an accurate predictor of a target variable $Y$ from a measured variable $X$ by using a finite number of labeled training samples. Motivated by the increasingly distributed nature of…

Information Theory · Computer Science 2021-01-19 Cheuk Ting Li, Xiugang Wu, Ayfer Ozgur, Abbas El Gamal

Learning High-dimensional Gaussians from Censored Data

We provide efficient algorithms for the problem of distribution learning from high-dimensional Gaussian data where in each sample, some of the variable values are missing. We suppose that the variables are missing not at random (MNAR). The…

Machine Learning · Computer Science 2025-04-29 Arnab Bhattacharyya, Constantinos Daskalakis, Themis Gouleakis, Yuhao Wang

On statistical learning via the lens of compression

This work continues the study of the relationship between sample compression schemes and statistical learning, which has been mostly investigated within the framework of binary classification. The central theme of this work is establishing…

Machine Learning · Computer Science 2017-01-02 Ofir David, Shay Moran, Amir Yehudayoff

Learning to Abstain From Uninformative Data

Learning and decision-making in domains with naturally high noise-to-signal ratio, such as Finance or Healthcare, is often challenging, while the stakes are very high. In this paper, we study the problem of learning and acting under a…

Machine Learning · Computer Science 2023-09-26 Yikai Zhang, Songzhu Zheng, Mina Dalirrooyfard, Pengxiang Wu, Anderson Schneider, Anant Raj, Yuriy Nevmyvaka, Chao Chen

Estimating Learnability in the Sublinear Data Regime

We consider the problem of estimating how well a model class is capable of fitting a distribution of labeled data. We show that it is often possible to accurately estimate this "learnability" even when given an amount of data that is too…

Machine Learning · Computer Science 2019-03-26 Weihao Kong, Gregory Valiant

Learning Whenever Learning is Possible: Universal Learning under General Stochastic Processes

This work initiates a general study of learning and generalization without the i.i.d. assumption, starting from first principles. While the traditional approach to statistical learning theory typically relies on standard assumptions from…

Machine Learning · Statistics 2020-10-21 Steve Hanneke

The Stochastic Replica Approach to Machine Learning: Stability and Parameter Optimization

We introduce a statistical physics inspired supervised machine learning algorithm for classification and regression problems. The method is based on the invariances or stability of predicted results when known data is represented as…

Machine Learning · Statistics 2018-11-19 Patrick Chao, Tahereh Mazaheri, Bo Sun, Nicholas B. Weingartner, Zohar Nussinov

Learning from Conditional Distributions via Dual Embeddings

Many machine learning tasks, such as learning with invariance and policy evaluation in reinforcement learning, can be characterized as problems of learning from conditional distributions. In such problems, each sample $x$ itself is…

Machine Learning · Computer Science 2017-01-03 Bo Dai, Niao He, Yunpeng Pan, Byron Boots, Le Song

Valid inferential models for prediction in supervised learning problems

Prediction, where observed data is used to quantify uncertainty about a future observation, is a fundamental problem in statistics. Prediction sets with coverage probability guarantees are a common solution, but these do not provide…

Statistics Theory · Mathematics 2022-11-22 Leonardo Cella, Ryan Martin

Learning invariant representations of time-homogeneous stochastic dynamical systems

We consider the general class of time-homogeneous stochastic dynamical systems, both discrete and continuous, and study the problem of learning a representation of the state that faithfully captures its dynamics. This is instrumental to…

Machine Learning · Computer Science 2024-03-15 Vladimir R. Kostic, Pietro Novelli, Riccardo Grazzi, Karim Lounici, Massimiliano Pontil

Statistical Learning of Arbitrary Computable Classifiers

Statistical learning theory chiefly studies restricted hypothesis classes, particularly those with finite Vapnik-Chervonenkis (VC) dimension. The fundamental quantity of interest is the sample complexity: the number of samples required to…

Machine Learning · Computer Science 2008-07-10 David Soloveichik

Tracking Changing Probabilities via Dynamic Learners

Consider a predictor, a learner, whose input is a stream of discrete items. The predictor's task, at every time point, is probabilistic multiclass prediction, i.e. to predict which item may occur next by outputting zero or more candidate…

Machine Learning · Computer Science 2024-12-25 Omid Madani

Statistical Learning Guarantees for Compressive Clustering and Compressive Mixture Modeling

We provide statistical learning guarantees for two unsupervised learning tasks in the context of compressive statistical learning, a general framework for resource-efficient large-scale learning that we introduced in a companion paper. The…

Machine Learning · Computer Science 2021-08-18 Rémi Gribonval, Gilles Blanchard, Nicolas Keriven, Yann Traonmilin

Statistical Mechanics of Learning: A Variational Approach for Real Data

Using a variational technique, we generalize the statistical physics approach of learning from random examples to make it applicable to real data. We demonstrate the validity and relevance of our method by computing approximate estimators…

Disordered Systems and Neural Networks · Physics 2009-11-07 D. Malzahn, M. Opper