Related papers: Gradient Regularized Budgeted Boosting

Budget-constrained Active Learning to Effectively De-censor Survival Data

Standard supervised learners attempt to learn a model from a labeled dataset. Given a small set of labeled instances, and a pool of unlabeled instances, a budgeted learner can use its given budget to pay to acquire the labels of some…

Machine Learning · Computer Science 2025-10-15 Ali Parsaee , Bei Jiang , Zachary Friggstad , Russell Greiner

Semi-Unsupervised Learning: Clustering and Classifying using Ultra-Sparse Labels

In semi-supervised learning for classification, it is assumed that every ground truth class of data is present in the small labelled dataset. Many real-world sparsely-labelled datasets are plausibly not of this type. It could easily be the…

Machine Learning · Statistics 2021-01-11 Matthew Willetts , Stephen J Roberts , Christopher C Holmes

Semi-Supervised Learning with Meta-Gradient

In this work, we propose a simple yet effective meta-learning algorithm in semi-supervised learning. We notice that most existing consistency-based approaches suffer from overfitting and limited model generalization ability, especially when…

Machine Learning · Computer Science 2021-03-18 Xin-Yu Zhang , Taihong Xiao , Haolin Jia , Ming-Ming Cheng , Ming-Hsuan Yang

Block-distributed Gradient Boosted Trees

The Gradient Boosted Tree (GBT) algorithm is one of the most popular machine learning algorithms used in production, for tasks that include Click-Through Rate (CTR) prediction and learning-to-rank. To deal with the massive datasets…

Machine Learning · Computer Science 2019-05-30 Theodore Vasiloudis , Hyunsu Cho , Henrik Boström

Reducing the Cost of Training Security Classifier (via Optimized Semi-Supervised Learning)

Background: Most of the existing machine learning models for security tasks, such as spam detection, malware detection, or network intrusion detection, are built on supervised machine learning algorithms. In such a paradigm, models need a…

Cryptography and Security · Computer Science 2022-05-03 Rui Shu , Tianpei Xia , Huy Tu , Laurie Williams , Tim Menzies

Gradient-based Label Binning in Multi-label Classification

In multi-label classification, where a single example may be associated with several class labels at the same time, the ability to model dependencies between labels is considered crucial to effectively optimize non-decomposable evaluation…

Machine Learning · Computer Science 2021-06-23 Michael Rapp , Eneldo Loza Mencía , Johannes Fürnkranz , Eyke Hüllermeier

Learning Gradient Boosted Multi-label Classification Rules

In multi-label classification, where the evaluation of predictions is less straightforward than in single-label classification, various meaningful, though different, loss functions have been proposed. Ideally, the learning algorithm should…

Machine Learning · Computer Science 2020-06-25 Michael Rapp , Eneldo Loza Mencía , Johannes Fürnkranz , Vu-Linh Nguyen , Eyke Hüllermeier

Semi-supervised learning

Semi-supervised learning deals with the problem of how, if possible, to take advantage of a huge amount of not classified data, to perform classification, in situations when, typically, the labelled data are few. Even though this is not…

Statistics Theory · Mathematics 2017-12-18 Alejandro Cholaquidis , Ricardo Fraiman , Mariela Sued

Gradient Boosting Reinforcement Learning

We present Gradient Boosting Reinforcement Learning (GBRL), a framework that adapts the strengths of gradient boosting trees (GBT) to reinforcement learning (RL) tasks. While neural networks (NNs) have become the de facto choice for RL,…

Machine Learning · Computer Science 2025-10-21 Benjamin Fuhrer , Chen Tessler , Gal Dalal

Budgeted Training: Rethinking Deep Neural Network Training Under Resource Constraints

In most practical settings and theoretical analyses, one assumes that a model can be trained until convergence. However, the growing complexity of machine learning datasets and models may violate such assumptions. Indeed, current approaches…

Computer Vision and Pattern Recognition · Computer Science 2020-07-01 Mengtian Li , Ersin Yumer , Deva Ramanan

On semi-supervised learning

Semi-supervised learning deals with the problem of how, if possible, to take advantage of a huge amount of unclassified data, to perform a classification in situations when, typically, there is little labeled data. Even though this is not…

Machine Learning · Statistics 2020-12-11 Alejandro Cholaquidis , Ricardo Fraiman , Mariela Sued

Learning Safe Unlabeled Multi-Robot Planning with Motion Constraints

In this paper, we present a learning approach to goal assignment and trajectory planning for unlabeled robots operating in 2D, obstacle-filled workspaces. More specifically, we tackle the unlabeled multi-robot motion planning problem with…

Robotics · Computer Science 2019-07-12 Arbaaz Khan , Chi Zhang , Shuo Li , Jiayue Wu , Brent Schlotfeldt , Sarah Y. Tang , Alejandro Ribeiro , Osbert Bastani , Vijay Kumar

Gradient Descent in RKHS with Importance Labeling

Labeling cost is often expensive and is a fundamental limitation of supervised learning. In this paper, we study importance labeling problem, in which we are given many unlabeled data and select a limited number of data to be labeled from…

Machine Learning · Computer Science 2021-04-13 Tomoya Murata , Taiji Suzuki

Sample-Optimal Agnostic Boosting with Unlabeled Data

Boosting provides a practical and provably effective framework for constructing accurate learning algorithms from inaccurate rules of thumb. It extends the promise of sample-efficient learning to settings where direct Empirical Risk…

Machine Learning · Computer Science 2025-03-07 Udaya Ghai , Karan Singh

Gradient tree boosting with random output projections for multi-label classification and multi-output regression

In many applications of supervised learning, multiple classification or regression outputs have to be predicted jointly. We consider several extensions of gradient boosting to address such problems. We first propose a straightforward…

Machine Learning · Statistics 2019-05-21 Arnaud Joly , Louis Wehenkel , Pierre Geurts

Gradient Boosting With Piece-Wise Linear Regression Trees

Gradient Boosted Decision Trees (GBDT) is a very successful ensemble learning algorithm widely used across a variety of applications. Recently, several variants of GBDT training algorithms and implementations have been designed and heavily…

Machine Learning · Computer Science 2019-06-27 Yu Shi , Jian Li , Zhize Li

Semi-supervised Active Regression

Labelled data often comes at a high cost as it may require recruiting human labelers or running costly experiments. At the same time, in many practical scenarios, one already has access to a partially labelled, potentially biased dataset…

Machine Learning · Computer Science 2021-06-15 Fnu Devvrit , Nived Rajaraman , Pranjal Awasthi

Toward Robust Graph Semi-Supervised Learning against Extreme Data Scarcity

The success of graph neural networks on graph-based web mining highly relies on abundant human-annotated data, which is laborious to obtain in practice. When only few labeled nodes are available, how to improve their robustness is a key to…

Machine Learning · Computer Science 2022-12-13 Kaize Ding , Elnaz Nouri , Guoqing Zheng , Huan Liu , Ryen White

Gradient Boosting for Linear Mixed Models

Gradient boosting from the field of statistical learning is widely known as a powerful framework for estimation and selection of predictor effects in various regression models by adapting concepts from classification theory. Current…

Methodology · Statistics 2020-11-03 Colin Griesbach , Benjamin Säfken , Elisabeth Waldmann

ConstraintMatch for Semi-constrained Clustering

Constrained clustering allows the training of classification models using pairwise constraints only, which are weak and relatively easy to mine, while still yielding full-supervision-level model performance. While they perform well even in…

Machine Learning · Computer Science 2023-11-28 Jann Goschenhofer , Bernd Bischl , Zsolt Kira