Related papers: Predictive Inference with Weak Supervision

Query-Adaptive Predictive Inference with Partial Labels

The cost and scarcity of fully supervised labels in statistical machine learning encourage using partially labeled data for model validation as a cheaper and more accessible alternative. Effectively collecting and leveraging weakly…

Machine Learning · Statistics 2022-06-16 Maxime Cauchois, John Duchi

Reliable Programmatic Weak Supervision with Confidence Intervals for Label Probabilities

The accurate labeling of datasets is often both costly and time-consuming. Given an unlabeled dataset, programmatic weak supervision obtains probabilistic predictions for the labels by leveraging multiple weak labeling functions (LFs) that…

Machine Learning · Statistics 2025-08-07 Verónica Álvarez, Santiago Mazuelas, Steven An, Sanjoy Dasgupta

Structured Prediction with Partial Labelling through the Infimum Loss

Annotating datasets is one of the main costs in nowadays supervised learning. The goal of weak supervision is to enable models to learn using only forms of labelling which are cheaper to collect, as partial labelling. This is a type of…

Machine Learning · Computer Science 2021-02-02 Vivien Cabannes, Alessandro Rudi, Francis Bach

Learning with Proper Partial Labels

Partial-label learning is a kind of weakly-supervised learning with inexact labels, where for each training example, we are given a set of candidate labels instead of only one true label. Recently, various approaches on partial-label…

Machine Learning · Computer Science 2022-08-30 Zhenguo Wu, Jiaqi Lv, Masashi Sugiyama

Knowing what you know: valid and validated confidence sets in multiclass and multilabel prediction

We develop conformal prediction methods for constructing valid predictive confidence sets in multiclass and multilabel problems without assumptions on the data generating distribution. A challenge here is that typical conformal prediction…

Machine Learning · Statistics 2020-07-14 Maxime Cauchois, Suyash Gupta, John Duchi

Reliable Weakly Supervised Learning: Maximize Gain and Maintain Safeness

Weakly supervised data are widespread and have attracted much attention. However, since label quality is often difficult to guarantee, sometimes the use of weakly supervised data will lead to unsatisfactory performance, i.e., performance…

Machine Learning · Computer Science 2019-04-23 Lan-Zhe Guo, Yu-Feng Li, Ming Li, Jin-Feng Yi, Bo-Wen Zhou, Zhi-Hua Zhou

Training Subset Selection for Weak Supervision

Existing weak supervision approaches use all the data covered by weak signals to train a classifier. We show both theoretically and empirically that this is not always optimal. Intuitively, there is a tradeoff between the amount of…

Machine Learning · Statistics 2023-03-08 Hunter Lang, Aravindan Vijayaraghavan, David Sontag

Conformal Inference for Open-Set and Imbalanced Classification

This paper presents a conformal prediction method for classification in highly imbalanced and open-set settings, where there are many possible classes and not all may be represented in the data. Existing approaches require a finite, known…

Machine Learning · Statistics 2025-10-16 Tianmin Xie, Yanfei Zhou, Ziyi Liang, Stefano Favaro, Matteo Sesia

Data Consistency for Weakly Supervised Learning

In many applications, training machine learning models involves using large amounts of human-annotated data. Obtaining precise labels for the data is expensive. Instead, training with weak supervision provides a low-cost alternative. We…

Machine Learning · Computer Science 2022-02-09 Chidubem Arachie, Bert Huang

Conformal Structured Prediction

Conformal prediction has recently emerged as a promising strategy for quantifying the uncertainty of a predictive model; these algorithms modify the model to output sets of labels that are guaranteed to contain the true label with high…

Machine Learning · Computer Science 2025-03-11 Botong Zhang, Shuo Li, Osbert Bastani

Noise-Adaptive Conformal Classification with Marginal Coverage

Conformal inference provides a rigorous statistical framework for uncertainty quantification in machine learning, enabling well-calibrated prediction sets with precise coverage guarantees for any classification model. However, its reliance…

Methodology · Statistics 2025-01-31 Teresa Bortolotti, Y. X. Rachel Wang, Xin Tong, Alessandra Menafoglio, Simone Vantini, Matteo Sesia

Conformal Prediction with Partially Labeled Data

While the predictions produced by conformal prediction are set-valued, the data used for training and calibration is supposed to be precise. In the setting of superset learning or learning from partial labels, a variant of weakly supervised…

Machine Learning · Computer Science 2023-06-05 Alireza Javanmardi, Yusuf Sale, Paul Hofman, Eyke Hüllermeier

Cross-Prediction-Powered Inference

While reliable data-driven decision-making hinges on high-quality labeled data, the acquisition of quality labels often involves laborious human annotations or slow and expensive scientific measurements. Machine learning is becoming an…

Machine Learning · Statistics 2024-03-01 Tijana Zrnic, Emmanuel J. Candès

Trustworthy Classification through Rank-Based Conformal Prediction Sets

Machine learning classification tasks often benefit from predicting a set of possible labels with confidence scores to capture uncertainty. However, existing methods struggle with the high-dimensional nature of the data and the lack of…

Machine Learning · Computer Science 2024-07-08 Rui Luo, Zhixin Zhou

Weak Supervision with Incremental Source Accuracy Estimation

Motivated by the desire to generate labels for real-time data we develop a method to estimate the dependency structure and accuracy of weak supervision sources incrementally. Our method first estimates the dependency structure associated…

Machine Learning · Computer Science 2022-05-12 Richard Gresham Correro

Resolving label uncertainty with implicit posterior models

We propose a method for jointly inferring labels across a collection of data samples, where each sample consists of an observation and a prior belief about the label. By implicitly assuming the existence of a generative model for which a…

Machine Learning · Computer Science 2022-06-22 Esther Rolf, Nikolay Malkin, Alexandros Graikos, Ana Jojic, Caleb Robinson, Nebojsa Jojic

Conformal Prediction with Corrupted Labels: Uncertain Imputation and Robust Re-weighting

We introduce a framework for robust uncertainty quantification in situations where labeled training data are corrupted, through noisy or missing labels. We build on conformal prediction, a statistical tool for generating prediction sets…

Machine Learning · Computer Science 2026-02-27 Shai Feldman, Stephen Bates, Yaniv Romano

Conformal Credal Self-Supervised Learning

In semi-supervised learning, the paradigm of self-training refers to the idea of learning from pseudo-labels suggested by the learner itself. Across various domains, corresponding methods have proven effective and achieve state-of-the-art…

Machine Learning · Statistics 2023-06-12 Julian Lienen, Caglar Demir, Eyke Hüllermeier

Valid inferential models for prediction in supervised learning problems

Prediction, where observed data is used to quantify uncertainty about a future observation, is a fundamental problem in statistics. Prediction sets with coverage probability guarantees are a common solution, but these do not provide…

Statistics Theory · Mathematics 2022-11-22 Leonardo Cella, Ryan Martin

Conformal Prediction Sets with Limited False Positives

We develop a new approach to multi-label conformal prediction in which we aim to output a precise set of promising prediction candidates with a bounded number of incorrect answers. Standard conformal prediction provides the ability to adapt…

Machine Learning · Computer Science 2022-02-16 Adam Fisch, Tal Schuster, Tommi Jaakkola, Regina Barzilay