Related papers: Tree Learning: Optimal Algorithms and Sample Compl…

Optimal Sample Complexity of Contrastive Learning

Contrastive learning is a highly successful technique for learning representations of data from labeled tuples, specifying the distance relations within the tuple. We study the sample complexity of contrastive learning, i.e. the minimum…

Machine Learning · Computer Science 2023-12-04 Noga Alon, Dmitrii Avdiukhin, Dor Elboim, Orr Fischer, Grigory Yaroslavtsev

Classification Tree-based Active Learning: A Wrapper Approach

Supervised machine learning often requires large training sets to train accurate models, yet obtaining large amounts of labeled data is not always feasible. Hence, it becomes crucial to explore active learning methods for reducing the size…

Machine Learning · Computer Science 2024-04-16 Ashna Jose, Emilie Devijver, Massih-Reza Amini, Noel Jakse, Roberta Poloni

On the computational complexity of the probabilistic label tree algorithms

Label tree-based algorithms are widely used to tackle multi-class and multi-label problems with a large number of labels. We focus on a particular subclass of these algorithms that use probabilistic classifiers in the tree nodes. Examples…

Machine Learning · Computer Science 2019-06-04 Robert Busa-Fekete, Krzysztof Dembczynski, Alexander Golovnev, Kalina Jasinska, Mikhail Kuznetsov, Maxim Sviridenko, Chao Xu

Nearly Optimal Sample Complexity for Learning with Label Proportions

We investigate Learning from Label Proportions (LLP), a partial information setting where examples in a training set are grouped into bags, and only aggregate label values in each bag are available. Despite the partial observability, the…

Machine Learning · Computer Science 2025-06-02 Robert Busa-Fekete, Travis Dick, Claudio Gentile, Haim Kaplan, Tomer Koren, Uri Stemmer

Computationally Efficient Wasserstein Loss for Structured Labels

The problem of estimating the probability distribution of labels has been widely studied as a label distribution learning (LDL) problem, whose applications include age estimation, emotion analysis, and semantic segmentation. We propose a…

Machine Learning · Computer Science 2021-03-02 Ayato Toyokuni, Sho Yokoi, Hisashi Kashima, Makoto Yamada

On the Robustness of Decision Tree Learning under Label Noise

In most practical problems of classifier learning, the training data suffers from label noise. Hence, it is important to understand how robust a learning algorithm is to such label noise. This paper presents some theoretical analysis to…

Machine Learning · Computer Science 2016-08-29 Aritra Ghosh, Naresh Manwani, P. S. Sastry

On Learning Vector Representations in Hierarchical Label Spaces

An important problem in multi-label classification is to capture label patterns or underlying structures that have an impact on such patterns. This paper addresses one such problem, namely how to exploit hierarchical structures over labels.…

Machine Learning · Computer Science 2015-04-17 Jinseok Nam, Johannes Fürnkranz

Learning to Branch

Tree search algorithms, such as branch-and-bound, are the most widely used tools for solving combinatorial and nonconvex problems. For example, they are the foremost method for solving (mixed) integer programs and constraint satisfaction…

Artificial Intelligence · Computer Science 2018-05-18 Maria-Florina Balcan, Travis Dick, Tuomas Sandholm, Ellen Vitercik

MurTree: Optimal Classification Trees via Dynamic Programming and Search

Decision tree learning is a widely used approach in machine learning, favoured in applications that require concise and interpretable models. Heuristic methods are traditionally used to quickly produce models with reasonably high accuracy.…

Machine Learning · Computer Science 2022-06-30 Emir Demirović, Anna Lukina, Emmanuel Hebrard, Jeffrey Chan, James Bailey, Christopher Leckie, Kotagiri Ramamohanarao, Peter J. Stuckey

Simultaneous Learning of Trees and Representations for Extreme Classification and Density Estimation

We consider multi-class classification where the predictor has a hierarchical structure that allows for a very large number of labels both at train and test time. The predictive power of such models can heavily depend on the structure of…

Machine Learning · Statistics 2017-03-06 Yacine Jernite, Anna Choromanska, David Sontag

Learning Optimal Tree Models Under Beam Search

Retrieving relevant targets from an extremely large target set under computational limits is a common challenge for information retrieval and recommendation systems. Tree models, which formulate targets as leaves of a tree with trainable…

Machine Learning · Statistics 2020-06-30 Jingwei Zhuo, Ziru Xu, Wei Dai, Han Zhu, Han Li, Jian Xu, Kun Gai

Random Subspace with Trees for Feature Selection Under Memory Constraints

Dealing with datasets of very high dimension is a major challenge in machine learning. In this paper, we consider the problem of feature selection in applications where the memory is not large enough to contain all features. In this…

Machine Learning · Statistics 2017-09-07 Antonio Sutera, Célia Châtel, Gilles Louppe, Louis Wehenkel, Pierre Geurts

Learning with tree-based tensor formats

This paper is concerned with the approximation of high-dimensional functions in a statistical learning setting, by empirical risk minimization over model classes of functions in tree-based tensor format. These are particular classes of…

Machine Learning · Statistics 2019-01-15 Erwan Grelier, Anthony Nouy, Mathilde Chevreuil

Joint Hierarchical Category Structure Learning and Large-Scale Image Classification

We investigate the scalable image classification problem with a large number of categories. Hierarchical visual data structures are helpful for improving the efficiency and performance of large-scale multi-class classification. We propose a…

Computer Vision and Pattern Recognition · Computer Science 2017-09-18 Yanyun Qu, Li Lin, Fumin Shen, Chang Lu, Yang Wu, Yuan Xie, Dacheng Tao

Local Multi-Label Explanations for Random Forest

Multi-label classification is a challenging task, particularly in domains where the number of labels to be predicted is large. Deep neural networks are often effective at multi-label classification of images and textual data. When dealing…

Machine Learning · Computer Science 2023-03-30 Nikolaos Mylonas, Ioannis Mollas, Nick Bassiliades, Grigorios Tsoumakas

Estimating decision tree learnability with polylogarithmic sample complexity

We show that top-down decision tree learning heuristics are amenable to highly efficient learnability estimation: for monotone target functions, the error of the decision tree hypothesis constructed by these heuristics can be estimated with…

Machine Learning · Computer Science 2020-11-04 Guy Blanc, Neha Gupta, Jane Lange, Li-Yang Tan

Learning Optimal Decision Trees from Large Datasets

Inferring a decision tree from a given dataset is one of the classic problems in machine learning. This problem consists of building, from a labelled dataset, a tree such that each node corresponds to a class and a path between the tree…

Machine Learning · Computer Science 2019-04-15 Florent Avellaneda

A Hierarchical Spectral Method for Extreme Classification

Extreme classification problems are multiclass and multilabel classification problems where the number of outputs is so large that straightforward strategies are neither statistically nor computationally viable. One strategy for dealing…

Machine Learning · Statistics 2016-02-05 Paul Mineiro, Nikos Karampatziakis

Probabilistic Label Trees for Extreme Multi-label Classification

Extreme multi-label classification (XMLC) is a learning task of tagging instances with a small subset of relevant labels chosen from an extremely large pool of possible labels. Problems of this scale can be efficiently handled by organizing…

Machine Learning · Computer Science 2020-09-24 Kalina Jasinska-Kobus, Marek Wydmuch, Krzysztof Dembczynski, Mikhail Kuznetsov, Robert Busa-Fekete

Multi-label Classification under Uncertainty: A Tree-based Conformal Prediction Approach

Multi-label classification is a common challenge in various machine learning applications, where a single data instance can be associated with multiple classes simultaneously. The current paper proposes a novel tree-based method for…

Methodology · Statistics 2024-05-01 Chhavi Tyagi, Wenge Guo