Related papers: A Statistical Learning Theory Framework for Superv…

Modeling Latent Variable Uncertainty for Loss-based Learning

We consider the problem of parameter estimation using weakly supervised datasets, where a training sample consists of the input and a partially specified annotation, which we refer to as the output. The missing information in the annotation…

Machine Learning · Computer Science 2012-06-22 M. Pawan Kumar, Ben Packer, Daphne Koller

Rademacher Complexity Bounds for a Penalized Multiclass Semi-Supervised Algorithm

We propose Rademacher complexity bounds for multiclass classifiers trained with a two-step semi-supervised model. In the first step, the algorithm partitions the partially labeled data and then identifies dense clusters containing $\kappa$…

Machine Learning · Statistics 2021-09-15 Yury Maximov, Massih-Reza Amini, Zaid Harchaoui

Supervised Learning as Lossy Compression: Characterizing Generalization and Sample Complexity via Finite Blocklength Analysis

This paper presents a novel information-theoretic perspective on generalization in machine learning by framing the learning problem within the context of lossy compression and applying finite blocklength analysis. In our approach, the…

Machine Learning · Computer Science 2026-02-05 Kosuke Sugiyama, Masato Uchida

Rademacher learning rates for iterated random functions

Most existing literature on supervised machine learning assumes that the training dataset is drawn from an i.i.d. sample. However, many real-world problems exhibit temporal dependence and strong correlations between the marginal…

Machine Learning · Statistics 2025-06-18 Nikola Sandrić

Pattern Recognition for Conditionally Independent Data

In this work we consider the task of relaxing the i.i.d assumption in pattern recognition (or classification), aiming to make existing learning algorithms applicable to a wider range of tasks. Pattern recognition is guessing a discrete…

Machine Learning · Computer Science 2012-02-28 Daniil Ryabko

Single Index Latent Variable Models for Network Topology Inference

A semi-parametric, non-linear regression model in the presence of latent variables is applied towards learning network graph structure. These latent variables can correspond to unmodeled phenomena or unmeasured agents in a complex system of…

Machine Learning · Statistics 2018-07-03 Jonathan Mei, José M. F. Moura

A Bayesian Approach to Restricted Latent Class Models for Scientifically-Structured Clustering of Multivariate Binary Outcomes

In this paper, we propose a general framework for combining evidence of varying quality to estimate underlying binary latent variables in the presence of restrictions imposed to respect the scientific context. The resulting algorithms…

Methodology · Statistics 2018-08-28 Zhenke Wu, Livia Casciola-Rosen, Antony Rosen, Scott L. Zeger

Unsupervised representation learning with recognition-parametrised probabilistic models

We introduce a new approach to probabilistic unsupervised learning based on the recognition-parametrised model (RPM): a normalised semi-parametric hypothesis class for joint distributions over observed and latent variables. Under the key…

Machine Learning · Computer Science 2023-04-21 William I. Walker, Hugo Soulat, Changmin Yu, Maneesh Sahani

Learning in the Presence of Corruption

In supervised learning one wishes to identify a pattern present in a joint distribution $P$, of instances, label pairs, by providing a function $f$ from instances to labels that has low risk $\mathbb{E}_{P}\ell(y,f(x))$. To do so, the…

Machine Learning · Statistics 2015-07-07 Brendan van Rooyen, Robert C. Williamson

Toward Supervised Anomaly Detection

Anomaly detection is being regarded as an unsupervised learning task as anomalies stem from adversarial or unlikely events with unknown distributions. However, the predictive performance of purely unsupervised anomaly detection often fails…

Machine Learning · Computer Science 2014-01-27 Nico Goernitz, Marius Micha Kloft, Konrad Rieck, Ulf Brefeld

A Tutorial on Statistically Sound Pattern Discovery

Statistically sound pattern discovery harnesses the rigour of statistical hypothesis testing to overcome many of the issues that have hampered standard data mining approaches to pattern discovery. Most importantly, application of…

Methodology · Statistics 2019-01-07 Wilhelmiina Hämäläinen, Geoffrey I. Webb

A Tutorial on Online Supervised Learning with Applications to Node Classification in Social Networks

We revisit the elegant observation of T. Cover '65 which, perhaps, is not as well-known to the broader community as it should be. The first goal of the tutorial is to explain, through the prism of this elementary result, how to solve…

Machine Learning · Computer Science 2016-09-01 Alexander Rakhlin, Karthik Sridharan

Uniform Risk Bounds for Learning with Dependent Data Sequences

This paper extends standard results from learning theory with independent data to sequences of dependent data. Contrary to most of the literature, we do not rely on mixing arguments or sequential measures of complexity and derive uniform…

Machine Learning · Computer Science 2023-03-22 Fabien Lauer

Guaranteed Classification via Regularized Similarity Learning

Learning an appropriate (dis)similarity function from the available data is a central problem in machine learning, since the success of many machine learning algorithms critically depends on the choice of a similarity function to compare…

Machine Learning · Computer Science 2013-08-30 Zheng-Chu Guo, Yiming Ying

Bayesian Estimation of Multidimensional Latent Variables and Its Asymptotic Accuracy

Hierarchical learning models, such as mixture models and Bayesian networks, are widely employed for unsupervised learning tasks, such as clustering analysis. They consist of observable and hidden variables, which represent the given data…

Machine Learning · Statistics 2018-01-08 Keisuke Yamazaki

Statistical Learning of Arbitrary Computable Classifiers

Statistical learning theory chiefly studies restricted hypothesis classes, particularly those with finite Vapnik-Chervonenkis (VC) dimension. The fundamental quantity of interest is the sample complexity: the number of samples required to…

Machine Learning · Computer Science 2008-07-10 David Soloveichik

Generalization bounds for learning under graph-dependence: A survey

Traditional statistical learning theory relies on the assumption that data are identically and independently distributed (i.i.d.). However, this assumption often does not hold in many real-life applications. In this survey, we explore…

Machine Learning · Computer Science 2024-04-09 Rui-Ray Zhang, Massih-Reza Amini

Weakly Supervised Representation Learning with Sparse Perturbations

The theory of representation learning aims to build methods that provably invert the data generating process with minimal domain knowledge or any source of supervision. Most prior approaches require strong distributional assumptions on the…

Machine Learning · Computer Science 2022-06-03 Kartik Ahuja, Jason Hartford, Yoshua Bengio

Supervised Learning Under Distributed Features

This work studies the problem of learning under both large datasets and large-dimensional feature space scenarios. The feature information is assumed to be spread across agents in a network, where each agent observes some of the features.…

Multiagent Systems · Computer Science 2020-05-26 Bicheng Ying, Kun Yuan, Ali H. Sayed

Semi-Supervised Class Discovery

One promising approach to dealing with datapoints that are outside of the initial training distribution (OOD) is to create new classes that capture similarities in the datapoints previously rejected as uncategorizable. Systems that generate…

Machine Learning · Computer Science 2020-02-25 Jeremy Nixon, Jeremiah Liu, David Berthelot