Related papers: Mixed-Integer Linear Optimization for Cardinality-…

Mixed-Integer Linear Optimization for Semi-Supervised Optimal Classification Trees

Decision trees are one of the most popular methods for solving classification problems, mainly because of their good interpretability properties. Moreover, due to advances in recent years in mixed-integer optimization, several models have…

Optimization and Control · Mathematics 2026-05-29 Jan Pablo Burgard , Maria Eduarda Pinheiro , Martin Schmidt

Feature Ranking for Semi-supervised Learning

The data made available for analysis are becoming more and more complex along several directions: high dimensionality, number of examples and the amount of labels per example. This poses a variety of challenges for the existing machine…

Machine Learning · Computer Science 2020-08-11 Matej Petković , Sašo Džeroski , Dragi Kocev

Semi-supervised Predictive Clustering Trees for (Hierarchical) Multi-label Classification

Semi-supervised learning (SSL) is a common approach to learning predictive models using not only labeled examples, but also unlabeled examples. While SSL for the simple tasks of classification and regression has received a lot of attention…

Machine Learning · Computer Science 2024-04-02 Jurica Levatić , Michelangelo Ceci , Dragi Kocev , Sašo Džeroski

Random Forest-Supervised Manifold Alignment

Manifold alignment is a type of data fusion technique that creates a shared low-dimensional representation of data collected from multiple domains, enabling cross-domain learning and improved performance in downstream tasks. This paper…

Machine Learning · Computer Science 2024-11-26 Jake S. Rhodes , Adam G. Rustad

MixMatch: A Holistic Approach to Semi-Supervised Learning

Semi-supervised learning has proven to be a powerful paradigm for leveraging unlabeled data to mitigate the reliance on large labeled datasets. In this work, we unify the current dominant approaches for semi-supervised learning to produce a…

Machine Learning · Computer Science 2019-10-25 David Berthelot , Nicholas Carlini , Ian Goodfellow , Nicolas Papernot , Avital Oliver , Colin Raffel

Large Margin Semi-supervised Structured Output Learning

In structured output learning, obtaining labelled data for real-world applications is usually costly, while unlabelled examples are available in abundance. Semi-supervised structured classification has been developed to handle large amounts…

Machine Learning · Computer Science 2013-11-12 P. Balamurugan , Shirish Shevade , Sundararajan Sellamanickam

Leveraging Structure for Improved Classification of Grouped Biased Data

We consider semi-supervised binary classification for applications in which data points are naturally grouped (e.g., survey responses grouped by state) and the labeled data is biased (e.g., survey respondents are not representative of the…

Machine Learning · Statistics 2022-12-08 Daniel Zeiberg , Shantanu Jain , Predrag Radivojac

Efficient semi-supervised inference for logistic regression under case-control studies

Semi-supervised learning has received increasingly attention in statistics and machine learning. In semi-supervised learning settings, a labeled data set with both outcomes and covariates and an unlabeled data set with covariates only are…

Machine Learning · Statistics 2024-02-26 Zhuojun Quan , Yuanyuan Lin , Kani Chen , Wen Yu

On missing label patterns in semi-supervised learning

We investigate model based classification with partially labelled training data. In many biostatistical applications, labels are manually assigned by experts, who may leave some observations unlabelled due to class uncertainty. We analyse…

Methodology · Statistics 2019-04-08 Daniel Ahfock , Geoffrey J. McLachlan

Semi-supervised learning

Semi-supervised learning deals with the problem of how, if possible, to take advantage of a huge amount of not classified data, to perform classification, in situations when, typically, the labelled data are few. Even though this is not…

Statistics Theory · Mathematics 2017-12-18 Alejandro Cholaquidis , Ricardo Fraiman , Mariela Sued

Random Forest for Label Ranking

Label ranking aims to learn a mapping from instances to rankings over a finite number of predefined labels. Random forest is a powerful and one of the most successful general-purpose machine learning algorithms of modern times. In this…

Machine Learning · Computer Science 2018-06-19 Yangming Zhou , Guoping Qiu

Fairness in Semi-supervised Learning: Unlabeled Data Help to Reduce Discrimination

A growing specter in the rise of machine learning is whether the decisions made by machine learning models are fair. While research is already underway to formalize a machine-learning concept of fairness and to design frameworks for…

Machine Learning · Computer Science 2020-09-28 Tao Zhang , Tianqing Zhu , Jing Li , Mengde Han , Wanlei Zhou , Philip S. Yu

Semi-supervised Learning with Deterministic Labeling and Large Margin Projection

The centrality and diversity of the labeled data are very influential to the performance of semi-supervised learning (SSL), but most SSL models select the labeled data randomly. This study first construct a leading forest that forms a…

Artificial Intelligence · Computer Science 2022-10-11 Ji Xu , Gang Ren , Yao Xiao , Shaobo Li , Guoyin Wang

Reliable Semi-Supervised Learning when Labels are Missing at Random

Semi-supervised learning methods are motivated by the availability of large datasets with unlabeled features in addition to labeled data. Unlabeled data is, however, not guaranteed to improve classification performance and has in fact been…

Machine Learning · Statistics 2019-10-25 Xiuming Liu , Dave Zachariah , Johan Wågberg , Thomas B. Schön

Best-scored Random Forest Classification

We propose an algorithm named best-scored random forest for binary classification problems. The terminology "best-scored" means to select the one with the best empirical performance out of a certain number of purely random tree candidates…

Machine Learning · Statistics 2019-05-28 Hanyuan Hang , Xiaoyu Liu , Ingo Steinwart

Cold PAWS: Unsupervised class discovery and addressing the cold-start problem for semi-supervised learning

In many machine learning applications, labeling datasets can be an arduous and time-consuming task. Although research has shown that semi-supervised learning techniques can achieve high accuracy with very few labels within the field of…

Computer Vision and Pattern Recognition · Computer Science 2023-06-07 Evelyn J. Mannix , Howard D. Bondell

Generalized Optimal Classification Trees: A Mixed-Integer Programming Approach

Global optimization of decision trees is a long-standing challenge in combinatorial optimization, yet such models play an important role in interpretable machine learning. Although the problem has been investigated for several decades, only…

Machine Learning · Computer Science 2026-02-03 Jiancheng Tu , Wenqi Fan , Zhibin Wu

Local Multi-Label Explanations for Random Forest

Multi-label classification is a challenging task, particularly in domains where the number of labels to be predicted is large. Deep neural networks are often effective at multi-label classification of images and textual data. When dealing…

Machine Learning · Computer Science 2023-03-30 Nikolaos Mylonas , Ioannis Mollas , Nick Bassiliades , Grigorios Tsoumakas

Bayesian Semi-supervised Multi-category Classification under Nonparanormality

Semi-supervised learning is a model training method that uses both labeled and unlabeled data. This paper proposes a fully Bayes semi-supervised learning algorithm that can be applied to any multi-category classification problem. We assume…

Machine Learning · Statistics 2024-07-22 Rui Zhu , Shuvrarghya Ghosh , Subhashis Ghosal

Stochastic Optimization for Machine Learning

It has been found that stochastic algorithms often find good solutions much more rapidly than inherently-batch approaches. Indeed, a very useful rule of thumb is that often, when solving a machine learning problem, an iterative technique…

Machine Learning · Computer Science 2013-08-19 Andrew Cotter