Related papers: Classifier comparison using precision

Statistical comparison of classifiers through Bayesian hierarchical modelling

Usually one compares the accuracy of two competing classifiers via null hypothesis significance tests (NHST). Yet NHST suffers from important shortcomings, which can be overcome by switching to Bayesian hypothesis testing. We…

Machine Learning · Computer Science 2016-11-23 Giorgio Corani, Alessio Benavoli, Janez Demšar, Francesca Mangili, Marco Zaffalon

Better-Than-Chance Classification for Signal Detection

The estimated accuracy of a classifier is a random quantity with variability. A common practice in supervised machine learning is thus to test whether the estimated accuracy is significantly better than chance level. This method of signal…

Methodology · Statistics 2020-01-28 Jonathan D. Rosenblatt, Yuval Benjamini, Roee Gilron, Roy Mukamel, Jelle J. Goeman

Classifier Calibration: A survey on how to assess and improve predicted class probabilities

This paper provides both an introduction to and a detailed overview of the principles and practice of classifier calibration. A well-calibrated classifier correctly quantifies the level of uncertainty or confidence associated with its…

Machine Learning · Computer Science 2023-06-16 Telmo Silva Filho, Hao Song, Miquel Perello-Nieto, Raul Santos-Rodriguez, Meelis Kull, Peter Flach

Methods for Quantifying Dataset Similarity: a Review, Taxonomy and Comparison

Quantifying the similarity between datasets has widespread applications in statistics and machine learning. The performance of a predictive model on novel datasets, referred to as generalizability, depends on how similar the training and…

Methodology · Statistics 2025-06-18 Marieke Stolte, Franziska Kappenberg, Jörg Rahnenführer, Andrea Bommert

Classifier uncertainty: evidence, potential impact, and probabilistic treatment

Classifiers are often tested on relatively small data sets, which should lead to uncertain performance metrics. Nevertheless, these metrics are usually taken at face value. We present an approach to quantify the uncertainty of…

Machine Learning · Statistics 2021-03-05 Niklas Tötsch, Daniel Hoffmann

Non-Parametric Calibration for Classification

Many applications of classification methods not only require high accuracy but also reliable estimation of predictive uncertainty. However, while many current classification frameworks, in particular deep neural networks, achieve high…

Machine Learning · Computer Science 2020-02-28 Jonathan Wenger, Hedvig Kjellström, Rudolph Triebel

Evaluation of Performance Measures for Classifiers Comparison

The selection of the best classification algorithm for a given dataset is a very widespread problem, occurring each time one has to choose a classifier to solve a real-world problem. It is also a complex task with many important…

Machine Learning · Computer Science 2012-08-16 Vincent Labatut, Hocine Cherifi

Accuracy Measures for the Comparison of Classifiers

The selection of the best classification algorithm for a given dataset is a very widespread problem. It is also a complex one, in the sense that it requires making several important methodological choices. Among them, in this work we focus on…

Machine Learning · Computer Science 2012-07-18 Vincent Labatut, Hocine Cherifi

Bayesian Estimation and Regularization Techniques in Categorical Data Analysis

This paper explores Bayesian estimation for categorical data, focusing on simple yet effective models that provide a foundation for applying more advanced methods accurately and reliably in real-world applications. We begin by revisiting…

Methodology · Statistics 2025-09-03 Jan Kalina

Fairer and more accurate, but for whom?

Complex statistical machine learning models are increasingly being used or considered for use in high-stakes decision-making pipelines in domains such as financial services, health care, criminal justice and human services. These models are…

Applications · Statistics 2017-07-04 Alexandra Chouldechova, Max G'Sell

To Trust Or Not To Trust A Classifier

Knowing when a classifier's prediction can be trusted is useful in many applications and critical for safely using AI. While the bulk of the effort in machine learning research has been towards improving classifier performance,…

Machine Learning · Statistics 2018-10-30 Heinrich Jiang, Been Kim, Melody Y. Guan, Maya Gupta

Diagnosing model misspecification and performing generalized Bayes' updates via probabilistic classifiers

Model misspecification is a long-standing enigma of the Bayesian inference framework as posteriors tend to get overly concentrated on ill-informed parameter values towards the large sample limit. Tempering of the likelihood has been…

Methodology · Statistics 2019-12-13 Owen Thomas, Jukka Corander

Evaluating model calibration in classification

Probabilistic classifiers output a probability distribution on target classes rather than just a class prediction. Besides providing a clear separation of prediction and decision making, the main advantage of probabilistic models is their…

Machine Learning · Computer Science 2019-02-20 Juozas Vaicenavicius, David Widmann, Carl Andersson, Fredrik Lindsten, Jacob Roll, Thomas B. Schön

A Goodness-of-Fit Test for Statistical Models

Statistical modeling plays a fundamental role in understanding the underlying mechanism of massive data (statistical inference) and predicting the future (statistical prediction). Although all models are wrong, researchers try their best to…

Methodology · Statistics 2020-06-17 Hangjin Jiang

One vs Previous and Similar Classes Learning -- A Comparative Study

When dealing with multi-class classification problems, it is common practice to build a model consisting of a series of binary classifiers using a learning paradigm which dictates how the classifiers are built and combined to discriminate…

Machine Learning · Computer Science 2021-01-06 Daniel Cauchi, Adrian Muscat

Equivalence testing for linear regression

We introduce equivalence testing procedures for linear regression analyses. Such tests can be very useful for confirming the lack of a meaningful association between a continuous outcome and a continuous or binary predictor. Specifically,…

Methodology · Statistics 2023-05-17 Harlan Campbell

A comprehensive review of classifier probability calibration metrics

Probabilities or confidence values produced by artificial intelligence (AI) and machine learning (ML) models often do not reflect their true accuracy, with some models being under- or overconfident in their predictions. For example, if a…

Machine Learning · Computer Science 2025-04-28 Richard Oliver Lane

Comparing Classifiers: A Case Study Using PyCM

Selecting an optimal classification model requires a robust and comprehensive understanding of the performance of the model. This paper provides a tutorial on the PyCM library, demonstrating its utility in conducting deep-dive evaluations…

Machine Learning · Computer Science 2026-02-17 Sadra Sabouri, Alireza Zolanvari, Sepand Haghighi

Conjunction Subspaces Test for Conformal and Selective Classification

In this paper, we present a new classifier, which integrates significance testing results over different random subspaces to yield consensus p-values for quantifying the uncertainty of classification decision. The null hypothesis is that…

Machine Learning · Computer Science 2024-10-17 Zengyou He, Zerun Li, Junjie Dong, Xinying Liu, Mudi Jiang, Lianyu Hu

A deterministic matching method for exact matchings to compare the outcome of different interventions

Statistical matching methods are widely used in the social and health sciences to estimate causal effects using observational data. Often the objective is to find comparable groups with similar covariate distributions in a dataset, with the…

Applications · Statistics 2021-01-19 Felix Bestehorn, Maike Bestehorn, Christian Kirches