English
Related papers

Related papers: Mixture Proportion Estimation and Weakly-supervise…

200 papers

Positive--unlabeled (PU) learning considers two samples, a positive set P with observations from only one class and an unlabeled set U with observations from two classes. The goal is to classify observations in U. Class mixture proportion…

Methodology · Statistics 2020-01-13 Zhenfeng Lin , James P. Long

The task of mixture proportion estimation (MPE) is to estimate the weight of a component distribution in a mixture, given observations from both the component and mixture. Previous work on MPE adopts the irreducibility assumption, which…

Machine Learning · Statistics 2023-08-01 Yilun Zhu , Aaron Fjeldsted , Darren Holland , George Landon , Azaree Lintereur , Clayton Scott

Mixture proportion estimation (MPE) is the problem of estimating the weight of a component distribution in a mixture, given samples from the mixture and component. This problem constitutes a key part in many "weakly supervised learning"…

Machine Learning · Computer Science 2016-06-01 Harish G. Ramaswamy , Clayton Scott , Ambuj Tewari

Given only positive examples and unlabeled examples (from both positive and negative classes), we might hope nevertheless to estimate an accurate positive-versus-negative classifier. Formally, this task is broken down into two subtasks: (i)…

Machine Learning · Computer Science 2021-11-02 Saurabh Garg , Yifan Wu , Alex Smola , Sivaraman Balakrishnan , Zachary C. Lipton

We propose the conditional predictive impact (CPI), a consistent and unbiased estimator of the association between one or several features and a given outcome, conditional on a reduced feature set. Building on the knockoff framework of…

Methodology · Statistics 2021-05-14 David S. Watson , Marvin N. Wright

We present an operator-free, measure-theoretic approach to the conditional mean embedding (CME) as a random variable taking values in a reproducing kernel Hilbert space. While the kernel mean embedding of unconditional distributions has…

Machine Learning · Computer Science 2021-01-11 Junhyung Park , Krikamol Muandet

Conditional independence (CI) is central to causal inference, feature selection, and graphical modeling, yet it is untestable in many settings without additional assumptions. Existing CI tests often rely on restrictive structural…

Machine Learning · Computer Science 2025-12-23 Alek Frohlich , Vladimir Kostic , Karim Lounici , Daniel Perazzo , Massimiliano Pontil

In unsupervised ensemble learning, one obtains predictions from multiple sources or classifiers, yet without knowing the reliability and expertise of each source, and with no labeled data to assess it. The task is to combine these possibly…

Machine Learning · Computer Science 2016-02-24 Ariel Jaffe , Ethan Fetaya , Boaz Nadler , Tingting Jiang , Yuval Kluger

We study prediction-powered conditional inference in the setting where labeled data are scarce, unlabeled covariates are abundant, and a black-box machine-learning predictor is available. The goal is to perform statistical inference on…

Machine Learning · Statistics 2026-03-09 Yang Sui , Jin Zhou , Hua Zhou , Xiaowu Dai

We propose a new conditional dependence measure and a statistical test for conditional independence. The measure is based on the difference between analytic kernel embeddings of two well-suited distributions evaluated at a finite set of…

Machine Learning · Statistics 2022-06-17 Meyer Scetbon , Laurent Meunier , Yaniv Romano

The consistency of a learning method is usually established under the assumption that the observations are a realization of an independent and identically distributed (i.i.d.) or mixing process. Yet, kernel methods such as support vector…

Machine Learning · Computer Science 2024-06-11 Pierre-François Massiani , Sebastian Trimpe , Friedrich Solowjow

Model monitoring is a critical component of the machine learning lifecycle, safeguarding against undetected drops in the model's performance after deployment. Traditionally, performance monitoring has required access to ground truth labels,…

Machine Learning · Computer Science 2026-03-10 Juhani Kivimäki , Jakub Białek , Wojtek Kuberski , Jukka K. Nurminen

While in-context learning with large language models (LLMs) has shown impressive performance, we have discovered a unique miscalibration behavior where both correct and incorrect predictions are assigned the same level of confidence. We…

Computation and Language · Computer Science 2024-10-04 Wei Cheng , Tianlu Wang , Yanmin Ji , Fan Yang , Keren Tan , Yiyu Zheng

Finite mixtures of regression models provide a flexible modeling framework for many phenomena. Using moment-based estimation of the regression parameters, we develop unbiased estimators with a minimum of assumptions on the mixture…

Statistics Theory · Mathematics 2019-05-17 Claus Thorn Ekstrøm , Christian Bressen Pipper

We show how to estimate a model's test error from unlabeled data, on distributions very different from the training distribution, while assuming only that certain conditional independencies are preserved between train and test. We do not…

Machine Learning · Computer Science 2016-06-17 Jacob Steinhardt , Percy Liang

Motivated by conditional independence testing, an essential step in constraint-based causal discovery algorithms, we study the nonparametric Von Mises estimator for the entropy of multivariate distributions built on a kernel density…

Machine Learning · Computer Science 2023-10-23 Fateme Jamshidi , Luca Ganassali , Negar Kiyavash

Conditional Mutual Information (CMI) is a measure of conditional dependence between random variables X and Y, given another random variable Z. It can be used to quantify conditional dependence among variables in many data-driven inference…

Machine Learning · Computer Science 2019-06-10 Sudipto Mukherjee , Himanshu Asnani , Sreeram Kannan

Learning meaningful representations of data that can address challenges such as batch effect correction and counterfactual inference is a central problem in many domains including computational biology. Adopting a Conditional VAE framework,…

Machine Learning · Statistics 2022-06-28 Adam Foster , Árpi Vezér , Craig A Glastonbury , Páidí Creed , Sam Abujudeh , Aaron Sim

Partial Multi-label Learning (PML) is a type of weakly supervised learning where each training instance corresponds to a set of candidate labels, among which only some are true. In this paper, we introduce \our{}, a novel probabilistic…

Machine Learning · Computer Science 2024-03-13 Łukasz Struski , Adam Pardyl , Jacek Tabor , Bartosz Zieliński

The Automated Model Evaluation (AutoEval) framework entertains the possibility of evaluating a trained machine learning model without resorting to a labeled testing set. Despite the promise and some decent results, the existing AutoEval…

Computer Vision and Pattern Recognition · Computer Science 2023-08-23 Ru Peng , Qiuyang Duan , Haobo Wang , Jiachen Ma , Yanbo Jiang , Yongjun Tu , Xiu Jiang , Junbo Zhao
‹ Prev 1 2 3 10 Next ›