Related papers: Binary Classification Tests, Imperfect Standards, …

A method for classification of data with uncertainty using hypothesis testing

Binary classification is a task that involves the classification of data into one of two distinct classes. It is widely utilized in various fields. However, conventional classifiers tend to make overconfident predictions for data that…

Machine Learning · Computer Science 2025-03-13 Shoma Yokura, Akihisa Ichiki

Binary classification with ambiguous training data

In supervised learning, we often face ambiguous (A) samples that are difficult to label even by domain experts. In this paper, we consider a binary classification problem in the presence of such A samples. This problem is substantially…

Machine Learning · Computer Science 2020-11-25 Naoya Otani, Yosuke Otsubo, Tetsuya Koike, Masashi Sugiyama

Aggregating multiple test results to improve medical decision-making

Gathering observational data for medical decision-making often involves uncertainties arising from both type I (false positive) and type II (false negative) errors. In this work, we develop a statistical model to study how medical…

Applications · Statistics 2025-10-21 Lucas Böttcher, Maria R. D'Orsogna, Tom Chou

Confusion Matrices and Accuracy Statistics for Binary Classifiers Using Unlabeled Data: The Diagnostic Test Approach

Medical researchers have solved the problem of estimating the sensitivity and specificity of binary medical diagnostic tests without gold standard tests for comparison. That problem is the same as estimating confusion matrices for…

Machine Learning · Statistics 2022-12-29 Richard Evans

Classification Under Uncertainty: Data Analysis for Diagnostic Antibody Testing

Formulating accurate and robust classification strategies is a key challenge of developing diagnostic and antibody tests. Methods that do not explicitly account for disease prevalence and uncertainty therein can lead to significant…

Methodology · Statistics 2022-02-01 Paul N. Patrone, Anthony J. Kearsley

"Nothing Abnormal": Disambiguating Medical Reports via Contrastive Knowledge Infusion

Sharing medical reports is essential for patient-centered care. A recent line of work has focused on automatically generating reports with NLP methods. However, different audiences have different purposes when writing/reading medical…

Computation and Language · Computer Science 2023-05-16 Zexue He, An Yan, Amilcare Gentili, Julian McAuley, Chun-Nan Hsu

Testing (Conditional) Mutual Information

We investigate the sample complexity of mutual information and conditional mutual information testing. For conditional mutual information testing, given access to independent samples of a triple of random variables $(A, B, C)$ with unknown…

Data Structures and Algorithms · Computer Science 2025-06-05 Jan Seyfried, Sayantan Sen, Marco Tomamichel

Ambiguous signals and efficient codes

In many biological networks the responses of individual elements are ambiguous. We consider a scenario in which many sensors respond to a shared signal, each with limited information capacity, and ask that the outputs together convey as…

Biological Physics · Physics 2025-12-30 Marianne Bauer, William Bialek

Misinformation Has High Perplexity

Debunking misinformation is an important and time-critical task as there could be adverse consequences when misinformation is not quashed promptly. However, the usual supervised approach to debunking via misinformation classification…

Computation and Language · Computer Science 2020-06-11 Nayeon Lee, Yejin Bang, Andrea Madotto, Pascale Fung

On the Ubiquity of Information Inconsistency for Conjugate Priors

Informally, "Information Inconsistency" is the property that has been observed in many Bayesian hypothesis testing and model selection procedures whereby the Bayesian conclusion does not become definitive when the data seems to become…

Statistics Theory · Mathematics 2017-10-27 Joris Mulder, James O. Berger, Víctor Peña, M. J. Bayarri

Correcting for partial verification bias in diagnostic accuracy studies: A tutorial using R

Diagnostic tests play a crucial role in medical care. Thus any new diagnostic tests must undergo a thorough evaluation. New diagnostic tests are evaluated in comparison with the respective gold standard tests. The performance of binary…

Applications · Statistics 2025-09-17 Wan Nor Arifin, Umi Kalsom Yusof

The Sample Complexity of Distributed Simple Binary Hypothesis Testing under Information Constraints

This paper resolves two open problems from a recent paper, arXiv:2403.16981, concerning the sample complexity of distributed simple binary hypothesis testing under information constraints. The first open problem asks whether interaction…

Information Theory · Computer Science 2025-06-18 Hadi Kazemi, Ankit Pensia, Varun Jog

Hypothesis Testing over Observable Regimes in Singular Models

Hypothesis testing in singular statistical models is often regarded as inherently problematic due to non-identifiability and degeneracy of the Fisher information. We show that the fundamental obstruction to testing in such models is not…

Statistics Theory · Mathematics 2026-03-02 Sean Plummer

Uniform reliability tests for forecasting systems with small lead time

A long-noted difficulty when assessing the reliability (or calibration) of forecasting systems is that reliability, in general, is a hypothesis not about a finite-dimensional parameter but about an entire functional relationship. A…

Data Analysis, Statistics and Probability · Physics 2020-12-09 Jochen Bröcker

Incorporating external information in analyses of clinical trials with binary outcomes

External information, such as prior information or expert opinions, can play an important role in the design, analysis and interpretation of clinical trials. However, little attention has been devoted thus far to incorporating external…

Applications · Statistics 2013-04-24 Minge Xie, Regina Y. Liu, C. V. Damaraju, William H. Olson

Distributed Binary Detection with Lossy Data Compression

Consider the problem where a statistician in a two-node system receives rate-limited information from a transmitter about marginal observations of a memoryless process generated from two possible distributions. Using its own observations,…

Information Theory · Computer Science 2017-03-02 Gil Katz, Pablo Piantanida, Mérouane Debbah

Phi-Divergence test statistics for testing the validity of latent class models for binary data

The main purpose of this paper is to present new families of test statistics for studying the problem of goodness-of-fit of some data to a latent class model for binary data. The families of test statistics introduced are based on…

Methodology · Statistics 2014-07-09 Ángel Felipe, Nirian Martín, Pedro Miranda, Leandro Pardo

Quantifying the Fraction of Missing Information for Hypothesis Testing in Statistical and Genetic Studies

Many practical studies rely on hypothesis testing procedures applied to data sets with missing information. An important part of the analysis is to determine the impact of the missing data on the performance of the test, and this can be…

Methodology · Statistics 2011-02-15 Dan L. Nicolae, Xiao-Li Meng, Augustine Kong

Robust Mitigation of Age-Dependent Confounding Effects via Sample-Difficulty Decorrelation

Age-dependent performance disparities in medical image classification often arise because age acts as a confounder, linking imaging morphology with disease prevalence. In practice, disparities can manifest as overdiagnosis at ages where…

Computer Vision and Pattern Recognition · Computer Science 2026-05-20 Nikhil Cherian Kurian, Victor Caquilpan Parra, Abin Shoby, Luke Whitbread, Lyle J. Palmer

Finite-sample expansions for the optimal error probability in asymmetric binary hypothesis testing

The problem of binary hypothesis testing between two probability measures is considered. New sharp bounds are derived for the best achievable error probability of such tests based on independent and identically distributed observations.…

Information Theory · Computer Science 2024-05-30 Valentinian Lungu, Ioannis Kontoyiannis