Related papers: Predictive Value Generalization Bounds

Comparison of predictive values with paired samples

Positive predictive value and negative predictive value are two widely used parameters to assess the clinical usefulness of a medical diagnostic test. When there are two diagnostic tests, it is advisable to make a comparative assessment…

Methodology · Statistics 2024-05-29 Antonio Martín Andrés, Pedro Femia Marzo

Leveraging Uncertainty Estimates To Improve Classifier Performance

Binary classification involves predicting the label of an instance based on whether the model score for the positive class exceeds a threshold chosen based on the application requirements (e.g., maximizing recall for a precision bound).…

Machine Learning · Computer Science 2023-11-21 Gundeep Arora, Srujana Merugu, Anoop Saladi, Rajeev Rastogi

Calibration tests beyond classification

Most supervised machine learning tasks are subject to irreducible prediction errors. Probabilistic predictive models address this limitation by providing probability distributions that represent a belief over plausible targets, rather than…

Machine Learning · Statistics 2022-10-25 David Widmann, Fredrik Lindsten, Dave Zachariah

Conformal Predictions for Probabilistically Robust Scalable Machine Learning Classification

Conformal predictions make it possible to define reliable and robust learning algorithms. But they are essentially a method for evaluating whether an algorithm is good enough to be used in practice. To define a reliable learning framework…

Machine Learning · Statistics 2024-03-18 Alberto Carlevaro, Teodoro Alamo Cantarero, Fabrizio Dabbene, Maurizio Mongelli

How to Control the Error Rates of Binary Classifiers

The traditional binary classification framework constructs classifiers which may have good accuracy, but whose false positive and false negative error rates are not under users' control. In many cases, one of the errors is more severe and…

Machine Learning · Statistics 2020-10-22 Miloš Simić

Statistical learning and cross-validation for point processes

This paper presents the first general (supervised) statistical learning framework for point processes in general spaces. Our approach is based on the combination of two new concepts, which we define in the paper: i) bivariate innovations,…

Methodology · Statistics 2021-03-03 Ottmar Cronie, Mehdi Moradi, Christophe A. N. Biscio

On Orderings of Probability Vectors and Unsupervised Performance Estimation

Unsupervised performance estimation, or evaluating how well models perform on unlabeled data, is a difficult task. Recently, a method was proposed by Garg et al. [2022] which performs much better than previous methods. Their method relies on…

Machine Learning · Computer Science 2023-06-21 Muhammad Maaz, Rui Qiao, Yiheng Zhou, Renxian Zhang

Post-Processing Posterior Predictive P-values

This article addresses issues of model criticism and model comparison in Bayesian contexts, and focusses on the use of the so-called posterior predictive p-values (ppp values). These involve a general discrepancy or conflict measure and…

Methodology · Statistics 2026-05-26 Nils Lid Hjort, Fredrik A. Dahl, Gunnhildur Högnadóttir Steinbakk

Prudence When Assuming Normality: an advice for machine learning practitioners

In a binary classification problem the feature vector (predictor) is the input to a scoring function that produces a decision value (score), which is compared to a particular chosen threshold to provide a final class prediction (output).…

Machine Learning · Computer Science 2021-11-11 Waleed A. Yousef

Notes on Noise Contrastive Estimation and Negative Sampling

Estimating the parameters of probabilistic models of language such as maxent models and probabilistic neural models is computationally difficult since it involves evaluating partition functions by summing over an entire vocabulary, which…

Machine Learning · Computer Science 2014-10-31 Chris Dyer

Reliable Probabilistic Classification with Neural Networks

Venn Prediction (VP) is a new machine learning framework for producing well-calibrated probabilistic predictions. In particular it provides well-calibrated lower and upper bounds for the conditional probability of an example belonging to…

Machine Learning · Computer Science 2023-12-18 Harris Papadopoulos

Calibration of Machine Learning Classifiers for Probability of Default Modelling

Binary classification is widely used in credit scoring for estimating the probability of default. The validation of such predictive models is based both on rank ability, and also on calibration (i.e. how accurately the probabilities…

Econometrics · Economics 2017-10-25 Pedro G. Fonseca, Hugo D. Lopes

Using functional information for binary classifications

The adequate use of information measured in a continuous manner along a period of time represents a methodological challenge. In the last decades, most of traditional statistical procedures have been extended for accommodating these…

Methodology · Statistics 2025-12-04 Pablo Martinez-Camblor

From Uncertainty to Precision: Enhancing Binary Classifier Performance through Calibration

The assessment of binary classifier performance traditionally centers on discriminative ability using metrics, such as accuracy. However, these metrics often disregard the model's inherent uncertainty, especially when dealing with sensitive…

Machine Learning · Computer Science 2024-02-13 Agathe Fernandes Machado, Arthur Charpentier, Emmanuel Flachaire, Ewen Gallic, François Hu

Random positive operator valued measures

We introduce several notions of random positive operator valued measures (POVMs), and we prove that some of them are equivalent. We then study statistical properties of the effect operators for the canonical examples, obtaining limiting…

Quantum Physics · Physics 2020-04-27 Teiko Heinosaari, Maria Anastasia Jivulescu, Ion Nechita

On the Interpretability of Conditional Probability Estimates in the Agnostic Setting

We study the interpretability of conditional probability estimates for binary classification under the agnostic setting or scenario. Under the agnostic setting, conditional probability estimates do not necessarily reflect the true…

Machine Learning · Computer Science 2017-03-01 Yihan Gao, Aditya Parameswaran, Jian Peng

Evaluating model calibration in classification

Probabilistic classifiers output a probability distribution on target classes rather than just a class prediction. Besides providing a clear separation of prediction and decision making, the main advantage of probabilistic models is their…

Machine Learning · Computer Science 2019-02-20 Juozas Vaicenavicius, David Widmann, Carl Andersson, Fredrik Lindsten, Jacob Roll, Thomas B. Schön

P-values for classification

Let $(X,Y)$ be a random variable consisting of an observed feature vector $X\in \mathcal{X}$ and an unobserved class label $Y\in \{1,2,...,L\}$ with unknown joint distribution. In addition, let $\mathcal{D}$ be a training data set…

Statistics Theory · Mathematics 2008-06-26 Lutz Duembgen, Bernd-Wolfgang Igl, Axel Munk

From Classification Accuracy to Proper Scoring Rules: Elicitability of Probabilistic Top List Predictions

In the face of uncertainty, the need for probabilistic assessments has long been recognized in the literature on forecasting. In classification, however, comparative evaluation of classifiers often focuses on predictions specifying a single…

Methodology · Statistics 2023-05-31 Johannes Resin

Classification from Positive and Biased Negative Data with Skewed Labeled Posterior Probability

The binary classification problem has a situation where only biased data are observed in one of the classes. In this paper, we propose a new method to approach the positive and biased negative (PbN) classification problem, which is a weakly…

Methodology · Statistics 2025-10-28 Shotaro Watanabe, Hidetoshi Matsui