Related papers: Classification using Ensemble Learning under Weigh…

Statistical inference for association studies in the presence of binary outcome misclassification

In biomedical and public health association studies, binary outcome variables may be subject to misclassification, resulting in substantial bias in effect estimates. The feasibility of addressing binary outcome misclassification in…

Methodology · Statistics 2024-03-19 Kimberly A. Hochstedler Webb, Martin T. Wells

Covariate Balancing Propensity Score by Tailored Loss Functions

In observational studies, propensity scores are commonly estimated by maximum likelihood but may fail to balance high-dimensional pre-treatment covariates even after specification search. We introduce a general framework that unifies and…

Methodology · Statistics 2017-03-22 Qingyuan Zhao

Leveraging Uncertainty Estimates To Improve Classifier Performance

Binary classification involves predicting the label of an instance based on whether the model score for the positive class exceeds a threshold chosen based on the application requirements (e.g., maximizing recall for a precision bound).…

Machine Learning · Computer Science 2023-11-21 Gundeep Arora, Srujana Merugu, Anoop Saladi, Rajeev Rastogi

Inference problems in binary regression model with misclassified responses

Misclassification of binary responses, if ignored, may severely bias the maximum likelihood estimators (MLE) of regression parameters. For such data, a binary regression model incorporating misclassification probabilities is extensively…

Statistics Theory · Mathematics 2020-09-28 Arindam Chatterjee, Tathagata Bandyopadhyay, Sumanta Adhya

Covariance-engaged Classification of Sets via Linear Programming

Set classification aims to classify a set of observations as a whole, as opposed to classifying individual observations separately. To formally understand the unfamiliar concept of binary set classification, we first investigate the optimal…

Machine Learning · Statistics 2020-06-29 Zhao Ren, Sungkyu Jung, Xingye Qiao

A Variational Approach for Learning from Positive and Unlabeled Data

Learning binary classifiers only from positive and unlabeled (PU) data is an important and challenging task in many real-world applications, including web text classification, disease gene identification and fraud detection, where negative…

Machine Learning · Computer Science 2020-12-01 Hui Chen, Fangqing Liu, Yin Wang, Liyue Zhao, Hao Wu

Class-Imbalanced Complementary-Label Learning via Weighted Loss

Complementary-label learning (CLL) is widely used in weakly supervised classification, but it faces a significant challenge in real-world datasets when confronted with class-imbalanced training samples. In such scenarios, the number of…

Machine Learning · Computer Science 2024-03-21 Meng Wei, Yong Zhou, Zhongnian Li, Xinzheng Xu

Classification approach based on association rules mining for unbalanced data

This paper deals with the binary classification task when the target class has the lower probability of occurrence. In such a situation, it is not possible to build a powerful classifier by using standard methods such as logistic regression,…

Machine Learning · Statistics 2015-02-26 Cheikh Ndour, Aliou Diop, Simplice Dossou-Gbété

Optimal Binary Classifier Aggregation for General Losses

We address the problem of aggregating an ensemble of predictors with known loss bounds in a semi-supervised binary classification setting, to minimize prediction loss incurred on the unlabeled data. We find the minimax optimal predictions…

Machine Learning · Computer Science 2016-11-08 Akshay Balsubramani, Yoav Freund

Weighting-Based Treatment Effect Estimation via Distribution Learning

Existing weighting methods for treatment effect estimation are often built upon the idea of propensity scores or covariate balance. They usually impose strong assumptions on treatment assignment or outcome model to obtain unbiased…

Machine Learning · Computer Science 2023-05-09 Dongcheng Zhang, Kunpeng Zhang

Sub-Classifier Construction for Error Correcting Output Code Using Minimum Weight Perfect Matching

Multi-class classification is mandatory for real-world problems, and one of the promising techniques for multi-class classification is Error Correcting Output Code. We propose a method for constructing the Error Correcting Output Code to obtain…

Machine Learning · Computer Science 2013-12-30 Patoomsiri Songsiri, Thimaporn Phetkaew, Ryutaro Ichise, Boonserm Kijsirikul

Unbiased Loss Functions for Multilabel Classification with Missing Labels

This paper considers binary and multilabel classification problems in a setting where labels are missing independently and with a known rate. Missing labels are a ubiquitous phenomenon in extreme multi-label classification (XMC) tasks, such…

Machine Learning · Computer Science 2021-09-24 Erik Schultheis, Rohit Babbar

Learning to Rank Binary Codes

Binary codes have been widely used in vision problems as a compact feature representation to achieve both space and time advantages. Various methods have been proposed to learn data-dependent hash functions which map a feature vector to a…

Computer Vision and Pattern Recognition · Computer Science 2014-10-22 Jie Feng, Wei Liu, Yan Wang

A weighting method for simultaneous adjustment for confounding and joint exposure-outcome misclassifications

Joint misclassification of exposure and outcome variables can lead to considerable bias in epidemiological studies of causal exposure-outcome effects. In this paper, we present a new maximum likelihood based estimator for the marginal…

Methodology · Statistics 2019-01-16 Bas B. L. Penning de Vries, Maarten van Smeden, Rolf H. H. Groenwold

Effect estimation in the presence of a misclassified binary mediator

Mediation analyses allow researchers to quantify the effect of an exposure variable on an outcome variable through a mediator variable. If a binary mediator variable is misclassified, the resulting analysis can be severely biased.…

Methodology · Statistics 2024-07-19 Kimberly A. Hochstedler Webb, Martin T. Wells

Optimal Covariate Weighting Increases Discoveries in High-throughput Biology

The large-scale multiple testing inherent to high throughput biological data necessitates very high statistical stringency and thus true effects in data are difficult to detect unless they have high effect sizes. One promising approach for…

Methodology · Statistics 2022-03-14 Mohamad Hasan, Paul Schliekelman

Permutation Weighting

In observational causal inference, in order to emulate a randomized experiment, weights are used to render treatments independent of observed covariates. This property is known as balance; in its absence, estimated causal effects may be…

Methodology · Statistics 2020-07-16 David Arbour, Drew Dimmery, Arjun Sondhi

General Framework for Binary Classification on Top Samples

Many binary classification problems minimize misclassification above (or below) a threshold. We show that instances of ranking problems, accuracy at the top or hypothesis testing may be written in this form. We propose a general framework…

Machine Learning · Computer Science 2020-02-26 Lukáš Adam, Václav Mácha, Václav Šmídl, Tomáš Pevný

Improving Variance Estimation for Covariate Adjustment with Binary Outcomes

Covariate adjustment is a general method for improving precision when estimating treatment effects in randomized trials and is recommended by the FDA in its 2023 guidance when baseline variables are prognostic for the primary outcome. We…

Methodology · Statistics 2026-05-08 Kaitlyn Lee, Alex Ocampo, Courtney Schiffman, Michael Friesenhahn, Christina Rabe, Michael Rosenblum

Binary classification with ambiguous training data

In supervised learning, we are often faced with ambiguous (A) samples that are difficult to label even by domain experts. In this paper, we consider a binary classification problem in the presence of such A samples. This problem is substantially…

Machine Learning · Computer Science 2020-11-25 Naoya Otani, Yosuke Otsubo, Tetsuya Koike, Masashi Sugiyama