Related papers: A meta-algorithm for classification using random r…

Particle identification is one of the core tasks in the data analysis pipeline at the Large Hadron Collider (LHC). Statistically, this entails the identification of rare signal events buried in immense backgrounds that mimic the properties…

Machine Learning · Statistics 2020-01-20 Vidhi Lalchand

MetaBags: Bagged Meta-Decision Trees for Regression

Ensembles are popular methods for solving practical supervised learning problems. They reduce the risk of having underperforming models in production-grade software. Although critical, methods for learning heterogeneous regression ensembles…

Machine Learning · Computer Science 2018-04-18 Jihed Khiari, Luis Moreira-Matias, Ammar Shaker, Bernard Zenko, Saso Dzeroski

Boosting-Based Sequential Meta-Tree Ensemble Construction for Improved Decision Trees

A decision tree is one of the most popular approaches in machine learning. However, it suffers from overfitting caused by overly deep trees. The meta-tree was recently proposed. It solves the problem of…

Machine Learning · Statistics 2024-02-12 Ryota Maniwa, Naoki Ichijo, Yuta Nakahara, Toshiyasu Matsushima

Algorithm Selection on a Meta Level

The problem of selecting an algorithm that appears most suitable for a specific instance of an algorithmic problem class, such as the Boolean satisfiability problem, is called instance-specific algorithm selection. Over the past decade, the…

Machine Learning · Computer Science 2021-07-21 Alexander Tornede, Lukas Gehring, Tanja Tornede, Marcel Wever, Eyke Hüllermeier

Fast Meta-Learning for Adaptive Hierarchical Classifier Design

We propose a new splitting criterion for a meta-learning approach to multiclass classifier design that adaptively merges the classes into a tree-structured hierarchy of increasingly difficult binary classification problems. The…

Machine Learning · Computer Science 2017-11-10 Gerrit J. J. van den Burg, Alfred O. Hero

An Empirical Study of Boosted Neural Network for Particle Classification in High Energy Collisions

The possible application of boosted neural networks to particle classification in high energy physics is discussed. A two-dimensional toy model, where the boundary between signal and background is irregular but not overlapping, is…

High Energy Physics - Phenomenology · Physics 2007-05-23 Yu Meiling, Xu Mingmei, Liu Lianshou

Boosted decision trees

Boosted decision trees are a very powerful machine learning technique. After introducing specific concepts of machine learning in the high-energy physics context and describing ways to quantify the performance and training quality of…

Data Analysis, Statistics and Probability · Physics 2022-06-22 Yann Coadou

Ensemble Classifier Design Tuned to Dataset Characteristics for Network Intrusion Detection

Machine Learning-based supervised approaches require highly customized and fine-tuned methodologies to deliver outstanding performance. This paper presents a dataset-driven design and performance evaluation of a machine learning classifier…

Cryptography and Security · Computer Science 2022-05-13 Zeinab Zoghi, Gursel Serpen

A Novel Multi-clustering Method for Hierarchical Clusterings, Based on Boosting

Bagging and boosting have proven to be the best methods of building multiple classifiers in classification combination problems. In the area of "flat clustering" problems, it is also recognized that multi-clustering methods based on boosting…

Machine Learning · Computer Science 2018-05-31 Elaheh Rashedi, Abdolreza Mirzaei

Stacking machine learning classifiers to identify Higgs bosons at the LHC

Machine learning (ML) algorithms have been employed in the problem of classifying signal and background events with high accuracy in particle physics. In this paper, we compare the performance of a widespread ML technique, namely,…

High Energy Physics - Phenomenology · Physics 2017-06-01 Alexandre Alves

metboost: Exploratory regression analysis with hierarchically clustered data

As data collections become larger, exploratory regression analysis becomes more important but more challenging. When observations are hierarchically clustered the problem is even more challenging because model selection with mixed effect…

Machine Learning · Statistics 2017-02-15 Patrick J. Miller, Daniel B. McArtor, Gitta H. Lubke

Deep Learning Meets Oversampling: A Learning Framework to Handle Imbalanced Classification

Despite extensive research spanning several decades, class imbalance is still considered a profound difficulty for both machine learning and deep learning models. While data oversampling is the foremost technique to address this issue,…

Machine Learning · Computer Science 2025-02-12 Sukumar Kishanthan, Asela Hevapathige

On the impact of selected modern deep-learning techniques to the performance and celerity of classification models in an experimental high-energy physics use case

Beginning from a basic neural-network architecture, we test the potential benefits offered by a range of advanced techniques for machine learning, in particular deep learning, in the context of a typical classification problem encountered…

Data Analysis, Statistics and Probability · Physics 2020-06-03 Giles Chatham Strong

A Direct Approach to Multi-class Boosting and Extensions

Boosting methods combine a set of moderately accurate weak learners to form a highly accurate predictor. Despite the practical importance of multi-class boosting, it has received far less attention than its binary counterpart. In this work,…

Machine Learning · Computer Science 2012-10-18 Chunhua Shen, Sakrapee Paisitkriangkrai, Anton van den Hengel

Reconciling meta-learning and continual learning with online mixtures of tasks

Learning-to-learn or meta-learning leverages data-driven inductive bias to increase the efficiency of learning on a novel task. This approach encounters difficulty when transfer is not advantageous, for instance, when tasks are considerably…

Machine Learning · Computer Science 2019-06-20 Ghassen Jerfel, Erin Grant, Thomas L. Griffiths, Katherine Heller

A theory of multiclass boosting

Boosting combines weak classifiers to form highly accurate predictors. Although the case of binary classification is well understood, in the multiclass setting, the "correct" requirements on the weak classifier, or the notion of the most…

Machine Learning · Statistics 2011-08-16 Indraneel Mukherjee, Robert E. Schapire

Searches for the BSM scenarios at the LHC using decision tree based machine learning algorithms: A comparative study and review of Random Forest, Adaboost, XGboost and LightGBM frameworks

Machine learning algorithms are now being extensively used in our daily lives, spanning across diverse industries as well as academia. In the field of high energy physics (HEP), the most common and challenging task is separating a rare…

High Energy Physics - Phenomenology · Physics 2025-07-23 Arghya Choudhury, Arpita Mondal, Subhadeep Sarkar

Optimizing Ensemble Weights and Hyperparameters of Machine Learning Models for Regression Problems

Aggregating multiple learners through an ensemble of models aims to make better predictions by capturing the underlying distribution of the data more accurately. Different ensembling methods, such as bagging, boosting, and stacking/blending,…

Machine Learning · Statistics 2020-11-03 Mohsen Shahhosseini, Guiping Hu, Hieu Pham

Big Data Classification Using Augmented Decision Trees

We present an algorithm for classification tasks on big data. Experiments conducted as part of this study indicate that the algorithm can be as accurate as ensemble methods such as random forests or gradient boosted trees. Unlike ensemble…

Machine Learning · Statistics 2017-10-27 Rajiv Sambasivan, Sourish Das

Reviving Threshold-Moving: a Simple Plug-in Bagging Ensemble for Binary and Multiclass Imbalanced Data

Class imbalance presents a major hurdle in the application of data mining methods. A common practice to deal with it is to create ensembles of classifiers that learn from resampled balanced data. For example, bagged decision trees combined…

Machine Learning · Computer Science 2017-06-21 Guillem Collell, Drazen Prelec, Kaustubh Patil