Related papers: Mixture Proportion Estimation and Weakly-supervise…

A Flexible Procedure for Mixture Proportion Estimation in Positive-Unlabeled Learning

Positive--unlabeled (PU) learning considers two samples, a positive set P with observations from only one class and an unlabeled set U with observations from two classes. The goal is to classify observations in U. Class mixture proportion…

Methodology · Statistics 2020-01-13 Zhenfeng Lin , James P. Long

Mixture Proportion Estimation Beyond Irreducibility

The task of mixture proportion estimation (MPE) is to estimate the weight of a component distribution in a mixture, given observations from both the component and mixture. Previous work on MPE adopts the irreducibility assumption, which…

Machine Learning · Statistics 2023-08-01 Yilun Zhu , Aaron Fjeldsted , Darren Holland , George Landon , Azaree Lintereur , Clayton Scott

Mixture Proportion Estimation via Kernel Embedding of Distributions

Mixture proportion estimation (MPE) is the problem of estimating the weight of a component distribution in a mixture, given samples from the mixture and component. This problem constitutes a key part in many "weakly supervised learning"…

Machine Learning · Computer Science 2016-06-01 Harish G. Ramaswamy , Clayton Scott , Ambuj Tewari

Mixture Proportion Estimation and PU Learning: A Modern Approach

Given only positive examples and unlabeled examples (from both positive and negative classes), we might hope nevertheless to estimate an accurate positive-versus-negative classifier. Formally, this task is broken down into two subtasks: (i)…

Machine Learning · Computer Science 2021-11-02 Saurabh Garg , Yifan Wu , Alex Smola , Sivaraman Balakrishnan , Zachary C. Lipton

Testing Conditional Independence in Supervised Learning Algorithms

We propose the conditional predictive impact (CPI), a consistent and unbiased estimator of the association between one or several features and a given outcome, conditional on a reduced feature set. Building on the knockoff framework of…

Methodology · Statistics 2021-05-14 David S. Watson , Marvin N. Wright

A Measure-Theoretic Approach to Kernel Conditional Mean Embeddings

We present an operator-free, measure-theoretic approach to the conditional mean embedding (CME) as a random variable taking values in a reproducing kernel Hilbert space. While the kernel mean embedding of unconditional distributions has…

Machine Learning · Computer Science 2021-01-11 Junhyung Park , Krikamol Muandet

Toward Scalable and Valid Conditional Independence Testing with Spectral Representations

Conditional independence (CI) is central to causal inference, feature selection, and graphical modeling, yet it is untestable in many settings without additional assumptions. Existing CI tests often rely on restrictive structural…

Machine Learning · Computer Science 2025-12-23 Alek Frohlich , Vladimir Kostic , Karim Lounici , Daniel Perazzo , Massimiliano Pontil

Unsupervised Ensemble Learning with Dependent Classifiers

In unsupervised ensemble learning, one obtains predictions from multiple sources or classifiers, yet without knowing the reliability and expertise of each source, and with no labeled data to assess it. The task is to combine these possibly…

Machine Learning · Computer Science 2016-02-24 Ariel Jaffe , Ethan Fetaya , Boaz Nadler , Tingting Jiang , Yuval Kluger

Prediction-Powered Conditional Inference

We study prediction-powered conditional inference in the setting where labeled data are scarce, unlabeled covariates are abundant, and a black-box machine-learning predictor is available. The goal is to perform statistical inference on…

Machine Learning · Statistics 2026-03-09 Yang Sui , Jin Zhou , Hua Zhou , Xiaowu Dai

An Asymptotic Test for Conditional Independence using Analytic Kernel Embeddings

We propose a new conditional dependence measure and a statistical test for conditional independence. The measure is based on the difference between analytic kernel embeddings of two well-suited distributions evaluated at a finite set of…

Machine Learning · Statistics 2022-06-17 Meyer Scetbon , Laurent Meunier , Yaniv Romano

On the Consistency of Kernel Methods with Dependent Observations

The consistency of a learning method is usually established under the assumption that the observations are a realization of an independent and identically distributed (i.i.d.) or mixing process. Yet, kernel methods such as support vector…

Machine Learning · Computer Science 2024-06-11 Pierre-François Massiani , Sebastian Trimpe , Friedrich Solowjow

Performance Estimation in Binary Classification Using Calibrated Confidence

Model monitoring is a critical component of the machine learning lifecycle, safeguarding against undetected drops in the model's performance after deployment. Traditionally, performance monitoring has required access to ground truth labels,…

Machine Learning · Computer Science 2026-03-10 Juhani Kivimäki , Jakub Białek , Wojtek Kuberski , Jukka K. Nurminen

Calibrate to Discriminate: Improve In-Context Learning with Label-Free Comparative Inference

While in-context learning with large language models (LLMs) has shown impressive performance, we have discovered a unique miscalibration behavior where both correct and incorrect predictions are assigned the same level of confidence. We…

Computation and Language · Computer Science 2024-10-04 Wei Cheng , Tianlu Wang , Yanmin Ji , Fan Yang , Keren Tan , Yiyu Zheng

Moment-based Estimation of Mixtures of Regression Models

Finite mixtures of regression models provide a flexible modeling framework for many phenomena. Using moment-based estimation of the regression parameters, we develop unbiased estimators with a minimum of assumptions on the mixture…

Statistics Theory · Mathematics 2019-05-17 Claus Thorn Ekstrøm , Christian Bressen Pipper

Unsupervised Risk Estimation Using Only Conditional Independence Structure

We show how to estimate a model's test error from unlabeled data, on distributions very different from the training distribution, while assuming only that certain conditional independencies are preserved between train and test. We do not…

Machine Learning · Computer Science 2016-06-17 Jacob Steinhardt , Percy Liang

On sample complexity of conditional independence testing with Von Mises estimator with application to causal discovery

Motivated by conditional independence testing, an essential step in constraint-based causal discovery algorithms, we study the nonparametric Von Mises estimator for the entropy of multivariate distributions built on a kernel density…

Machine Learning · Computer Science 2023-10-23 Fateme Jamshidi , Luca Ganassali , Negar Kiyavash

CCMI : Classifier based Conditional Mutual Information Estimation

Conditional Mutual Information (CMI) is a measure of conditional dependence between random variables X and Y, given another random variable Z. It can be used to quantify conditional dependence among variables in many data-driven inference…

Machine Learning · Computer Science 2019-06-10 Sudipto Mukherjee , Himanshu Asnani , Sreeram Kannan

Contrastive Mixture of Posteriors for Counterfactual Inference, Data Integration and Fairness

Learning meaningful representations of data that can address challenges such as batch effect correction and counterfactual inference is a central problem in many domains including computational biology. Adopting a Conditional VAE framework,…

Machine Learning · Statistics 2022-06-28 Adam Foster , Árpi Vezér , Craig A Glastonbury , Páidí Creed , Sam Abujudeh , Aaron Sim

ProPML: Probability Partial Multi-label Learning

Partial Multi-label Learning (PML) is a type of weakly supervised learning where each training instance corresponds to a set of candidate labels, among which only some are true. In this paper, we introduce \our{}, a novel probabilistic…

Machine Learning · Computer Science 2024-03-13 Łukasz Struski , Adam Pardyl , Jacek Tabor , Bartosz Zieliński

CAME: Contrastive Automated Model Evaluation

The Automated Model Evaluation (AutoEval) framework entertains the possibility of evaluating a trained machine learning model without resorting to a labeled testing set. Despite the promise and some decent results, the existing AutoEval…

Computer Vision and Pattern Recognition · Computer Science 2023-08-23 Ru Peng , Qiuyang Duan , Haobo Wang , Jiachen Ma , Yanbo Jiang , Yongjun Tu , Xiu Jiang , Junbo Zhao