Related papers: Classifier comparison using precision

200 papers

Usually one compares the accuracy of two competing classifiers via null hypothesis significance tests (NHST). Yet these tests suffer from important shortcomings, which can be overcome by switching to Bayesian hypothesis testing. We…

Machine Learning · Computer Science 2016-11-23 Giorgio Corani , Alessio Benavoli , Janez Demšar , Francesca Mangili , Marco Zaffalon

The estimated accuracy of a classifier is a random quantity with variability. A common practice in supervised machine learning is thus to test whether the estimated accuracy is significantly better than chance level. This method of signal…

Methodology · Statistics 2020-01-28 Jonathan D. Rosenblatt , Yuval Benjamini , Roee Gilron , Roy Mukamel , Jelle J. Goeman

This paper provides both an introduction to and a detailed overview of the principles and practice of classifier calibration. A well-calibrated classifier correctly quantifies the level of uncertainty or confidence associated with its…

Machine Learning · Computer Science 2023-06-16 Telmo Silva Filho , Hao Song , Miquel Perello-Nieto , Raul Santos-Rodriguez , Meelis Kull , Peter Flach

Quantifying the similarity between datasets has widespread applications in statistics and machine learning. The performance of a predictive model on novel datasets, referred to as generalizability, depends on how similar the training and…

Methodology · Statistics 2025-06-18 Marieke Stolte , Franziska Kappenberg , Jörg Rahnenführer , Andrea Bommert

Classifiers are often tested on relatively small data sets, which should lead to uncertain performance metrics. Nevertheless, these metrics are usually taken at face value. We present an approach to quantify the uncertainty of…

Machine Learning · Statistics 2021-03-05 Niklas Tötsch , Daniel Hoffmann

Many applications of classification methods not only require high accuracy but also reliable estimation of predictive uncertainty. However, while many current classification frameworks, in particular deep neural networks, achieve high…

Machine Learning · Computer Science 2020-02-28 Jonathan Wenger , Hedvig Kjellström , Rudolph Triebel

The selection of the best classification algorithm for a given dataset is a very widespread problem, occurring each time one has to choose a classifier to solve a real-world problem. It is also a complex task with many important…

Machine Learning · Computer Science 2012-08-16 Vincent Labatut , Hocine Cherifi

The selection of the best classification algorithm for a given dataset is a very widespread problem. It is also a complex one, in the sense that it requires making several important methodological choices. Among them, in this work we focus on…

Machine Learning · Computer Science 2012-07-18 Vincent Labatut , Hocine Cherifi

This paper explores Bayesian estimation for categorical data, focusing on simple yet effective models that provide a foundation for applying more advanced methods accurately and reliably in real-world applications. We begin by revisiting…

Methodology · Statistics 2025-09-03 Jan Kalina

Complex statistical machine learning models are increasingly being used or considered for use in high-stakes decision-making pipelines in domains such as financial services, health care, criminal justice and human services. These models are…

Applications · Statistics 2017-07-04 Alexandra Chouldechova , Max G'Sell

Knowing when a classifier's prediction can be trusted is useful in many applications and critical for safely using AI. While the bulk of the effort in machine learning research has been towards improving classifier performance,…

Machine Learning · Statistics 2018-10-30 Heinrich Jiang , Been Kim , Melody Y. Guan , Maya Gupta

Model misspecification is a long-standing enigma of the Bayesian inference framework as posteriors tend to get overly concentrated on ill-informed parameter values towards the large sample limit. Tempering of the likelihood has been…

Methodology · Statistics 2019-12-13 Owen Thomas , Jukka Corander

Probabilistic classifiers output a probability distribution on target classes rather than just a class prediction. Besides providing a clear separation of prediction and decision making, the main advantage of probabilistic models is their…

Machine Learning · Computer Science 2019-02-20 Juozas Vaicenavicius , David Widmann , Carl Andersson , Fredrik Lindsten , Jacob Roll , Thomas B. Schön

Statistical modeling plays a fundamental role in understanding the underlying mechanism of massive data (statistical inference) and predicting the future (statistical prediction). Although all models are wrong, researchers try their best to…

Methodology · Statistics 2020-06-17 Hangjin Jiang

When dealing with multi-class classification problems, it is common practice to build a model consisting of a series of binary classifiers using a learning paradigm which dictates how the classifiers are built and combined to discriminate…

Machine Learning · Computer Science 2021-01-06 Daniel Cauchi , Adrian Muscat

We introduce equivalence testing procedures for linear regression analyses. Such tests can be very useful for confirming the lack of a meaningful association between a continuous outcome and a continuous or binary predictor. Specifically,…

Methodology · Statistics 2023-05-17 Harlan Campbell

Probabilities or confidence values produced by artificial intelligence (AI) and machine learning (ML) models often do not reflect their true accuracy, with some models being under- or overconfident in their predictions. For example, if a…

Machine Learning · Computer Science 2025-04-28 Richard Oliver Lane

Selecting an optimal classification model requires a robust and comprehensive understanding of the performance of the model. This paper provides a tutorial on the PyCM library, demonstrating its utility in conducting deep-dive evaluations…

Machine Learning · Computer Science 2026-02-17 Sadra Sabouri , Alireza Zolanvari , Sepand Haghighi

In this paper, we present a new classifier, which integrates significance testing results over different random subspaces to yield consensus p-values for quantifying the uncertainty of classification decision. The null hypothesis is that…

Machine Learning · Computer Science 2024-10-17 Zengyou He , Zerun Li , Junjie Dong , Xinying Liu , Mudi Jiang , Lianyu Hu

Statistical matching methods are widely used in the social and health sciences to estimate causal effects using observational data. Often the objective is to find comparable groups with similar covariate distributions in a dataset, with the…

Applications · Statistics 2021-01-19 Felix Bestehorn , Maike Bestehorn , Christian Kirches