Related papers: Classification using Ensemble Learning under Weigh…

200 papers

In biomedical and public health association studies, binary outcome variables may be subject to misclassification, resulting in substantial bias in effect estimates. The feasibility of addressing binary outcome misclassification in…

Methodology · Statistics 2024-03-19 Kimberly A. Hochstedler Webb, Martin T. Wells

In observational studies, propensity scores are commonly estimated by maximum likelihood but may fail to balance high-dimensional pre-treatment covariates even after specification search. We introduce a general framework that unifies and…

Methodology · Statistics 2017-03-22 Qingyuan Zhao

Binary classification involves predicting the label of an instance based on whether the model score for the positive class exceeds a threshold chosen based on the application requirements (e.g., maximizing recall for a precision bound).…

Machine Learning · Computer Science 2023-11-21 Gundeep Arora, Srujana Merugu, Anoop Saladi, Rajeev Rastogi

Misclassification of binary responses, if ignored, may severely bias the maximum likelihood estimators (MLE) of regression parameters. For such data, a binary regression model incorporating misclassification probabilities is extensively…

Statistics Theory · Mathematics 2020-09-28 Arindam Chatterjee, Tathagata Bandyopadhyay, Sumanta Adhya

Set classification aims to classify a set of observations as a whole, as opposed to classifying individual observations separately. To formally understand the unfamiliar concept of binary set classification, we first investigate the optimal…

Machine Learning · Statistics 2020-06-29 Zhao Ren, Sungkyu Jung, Xingye Qiao

Learning binary classifiers only from positive and unlabeled (PU) data is an important and challenging task in many real-world applications, including web text classification, disease gene identification and fraud detection, where negative…

Machine Learning · Computer Science 2020-12-01 Hui Chen, Fangqing Liu, Yin Wang, Liyue Zhao, Hao Wu

Complementary-label learning (CLL) is widely used in weakly supervised classification, but it faces a significant challenge in real-world datasets when confronted with class-imbalanced training samples. In such scenarios, the number of…

Machine Learning · Computer Science 2024-03-21 Meng Wei, Yong Zhou, Zhongnian Li, Xinzheng Xu

This paper deals with the binary classification task when the target class has the lower probability of occurrence. In such a situation, it is not possible to build a powerful classifier by using standard methods such as logistic regression,…

Machine Learning · Statistics 2015-02-26 Cheikh Ndour, Aliou Diop, Simplice Dossou-Gbété

We address the problem of aggregating an ensemble of predictors with known loss bounds in a semi-supervised binary classification setting, to minimize prediction loss incurred on the unlabeled data. We find the minimax optimal predictions…

Machine Learning · Computer Science 2016-11-08 Akshay Balsubramani, Yoav Freund

Existing weighting methods for treatment effect estimation are often built upon the idea of propensity scores or covariate balance. They usually impose strong assumptions on treatment assignment or outcome model to obtain unbiased…

Machine Learning · Computer Science 2023-05-09 Dongcheng Zhang, Kunpeng Zhang

Multi-class classification is mandatory for real-world problems, and one of the promising techniques for multi-class classification is Error Correcting Output Code. We propose a method for constructing the Error Correcting Output Code to obtain…

Machine Learning · Computer Science 2013-12-30 Patoomsiri Songsiri, Thimaporn Phetkaew, Ryutaro Ichise, Boonserm Kijsirikul

This paper considers binary and multilabel classification problems in a setting where labels are missing independently and with a known rate. Missing labels are a ubiquitous phenomenon in extreme multi-label classification (XMC) tasks, such…

Machine Learning · Computer Science 2021-09-24 Erik Schultheis, Rohit Babbar

Binary codes have been widely used in vision problems as a compact feature representation to achieve both space and time advantages. Various methods have been proposed to learn data-dependent hash functions which map a feature vector to a…

Computer Vision and Pattern Recognition · Computer Science 2014-10-22 Jie Feng, Wei Liu, Yan Wang

Joint misclassification of exposure and outcome variables can lead to considerable bias in epidemiological studies of causal exposure-outcome effects. In this paper, we present a new maximum likelihood based estimator for the marginal…

Methodology · Statistics 2019-01-16 Bas B. L. Penning de Vries, Maarten van Smeden, Rolf H. H. Groenwold

Mediation analyses allow researchers to quantify the effect of an exposure variable on an outcome variable through a mediator variable. If a binary mediator variable is misclassified, the resulting analysis can be severely biased.…

Methodology · Statistics 2024-07-19 Kimberly A. Hochstedler Webb, Martin T. Wells

The large-scale multiple testing inherent to high throughput biological data necessitates very high statistical stringency and thus true effects in data are difficult to detect unless they have high effect sizes. One promising approach for…

Methodology · Statistics 2022-03-14 Mohamad Hasan, Paul Schliekelman

In observational causal inference, in order to emulate a randomized experiment, weights are used to render treatments independent of observed covariates. This property is known as balance; in its absence, estimated causal effects may be…

Methodology · Statistics 2020-07-16 David Arbour, Drew Dimmery, Arjun Sondhi

Many binary classification problems minimize misclassification above (or below) a threshold. We show that instances of ranking problems, accuracy at the top or hypothesis testing may be written in this form. We propose a general framework…

Machine Learning · Computer Science 2020-02-26 Lukáš Adam, Václav Mácha, Václav Šmídl, Tomáš Pevný

Covariate adjustment is a general method for improving precision when estimating treatment effects in randomized trials and is recommended by the FDA in its 2023 guidance when baseline variables are prognostic for the primary outcome. We…

In supervised learning, we often face ambiguous (A) samples that are difficult to label even by domain experts. In this paper, we consider a binary classification problem in the presence of such A samples. This problem is substantially…

Machine Learning · Computer Science 2020-11-25 Naoya Otani, Yosuke Otsubo, Tetsuya Koike, Masashi Sugiyama