English
Related papers

Related papers: Replica Analysis for Ensemble Techniques in Variab…

200 papers

Standard approaches for variable selection in linear models are not tailored to deal properly with high-dimensional and incomplete data. Currently, methods dedicated to high-dimensional data handle missing values by ad-hoc strategies, like…

Methodology · Statistics 2021-06-09 Avner Bar-Hen , Vincent Audigier

Training in machine learning generally consists in finding one model, whose parameters minimize a data-dependent loss. Yet, empirical work shows that ensemble learning, an approach in which multiple models are sampled, can improve…

Disordered Systems and Neural Networks · Physics 2026-04-28 Thomas Tulinski , Jorge Fernandez-De-Cossio-Diaz , Simona Cocco , Rémi Monasson

Deep reinforcement learning (RL) has achieved remarkable success in solving complex tasks through its integration with deep neural networks (DNNs) as function approximators. However, the reliance on DNNs has introduced a new challenge…

Machine Learning · Computer Science 2023-11-01 Woojun Kim , Yongjae Shin , Jongeui Park , Youngchul Sung

Variable selection for high-dimensional, highly correlated data has long been a challenging problem, often yielding unstable and unreliable models. We propose a resample-aggregate framework that exploits diffusion models' ability to…

Methodology · Statistics 2025-08-20 Minjie Wang , Xiaotong Shen , Wei Pan

In this work we study binary classification problems where we assume that our training data is subject to uncertainty, i.e. the precise data points are not known. To tackle this issue in the field of robust machine learning the aim is to…

Machine Learning · Computer Science 2022-03-04 Jannis Kurtz

Machine learning techniques always aim to reduce the generalized prediction error. In order to reduce it, ensemble methods present a good approach combining several models that results in a greater forecasting capacity. The Random Machines…

Machine Learning · Statistics 2020-03-31 Anderson Ara , Mateus Maia , Samuel Macêdo , Francisco Louzada

We study a random code ensemble with a hierarchical structure, which is closely related to the generalized random energy model with discrete energy values. Based on this correspondence, we analyze the hierarchical random code ensemble by…

Disordered Systems and Neural Networks · Physics 2011-02-08 Tomoyuki Obuchi , Kazutaka Takahashi , Koujin Takeda

Ensemble learning is a very prevalent method employed in machine learning. The relative success of ensemble methods is attributed to their ability to tackle a wide range of instances and complex problems that require different low-level…

Machine Learning · Computer Science 2021-05-18 Rohan Saphal , Balaraman Ravindran , Dheevatsa Mudigere , Sasikanth Avancha , Bharat Kaul

The popularity of data augmentation techniques in machine learning has increased in recent years, as they enable the creation of new samples from existing datasets. Rotational augmentation, in particular, has shown great promise by…

Computer Vision and Pattern Recognition · Computer Science 2023-06-13 Unai Muñoz-Aseguinolaza , Basilio Sierra , Naiara Aginako

Time-series data are one of the fundamental types of raw data representation used in data-driven techniques. In machine condition monitoring, time-series vibration data are overly used in data mining for deep neural networks. Typically,…

Machine Learning · Computer Science 2022-01-14 Atik Faysal , Ngui Wai Keng , M. H. Lim

Model selection is a strategy aimed at creating accurate and robust models. A key challenge in designing these algorithms is identifying the optimal model for classifying any particular input sample. This paper addresses this challenge and…

Machine Learning · Computer Science 2023-05-22 James Kotary , Vincenzo Di Vito , Ferdinando Fioretto

Stability selection is a widely adopted resampling-based framework for high-dimensional variable selection. This paper seeks to broaden the use of an established stability estimator to evaluate the overall stability of the stability…

Methodology · Statistics 2025-06-04 Mahdi Nouraie , Samuel Muller

It is challenging for reinforcement learning (RL) algorithms to succeed in real-world applications like financial trading and logistic system due to the noisy observation and environment shifting between training and evaluation. Thus, it…

Machine Learning · Computer Science 2022-05-20 Zhengyu Yang , Kan Ren , Xufang Luo , Minghuan Liu , Weiqing Liu , Jiang Bian , Weinan Zhang , Dongsheng Li

Reproducibility is imperative for any scientific discovery. More often than not, modern scientific findings rely on statistical analysis of high-dimensional data. At a minimum, reproducibility manifests itself in stability of statistical…

Statistics Theory · Mathematics 2013-10-02 Bin Yu

Since their emergence in the 1990's, the support vector machine and the AdaBoost algorithm have spawned a wave of research in statistical machine learning. Much of this new research falls into one of two broad categories: kernel methods and…

Methodology · Statistics 2008-04-15 Mu Zhu

Recent work has focused on data-driven learning of the evolution of unknown systems via deep neural networks (DNNs), with the goal of conducting long time prediction of the evolution of the unknown system. Training a DNN with low…

Machine Learning · Computer Science 2022-12-28 Victor Churchill , Steve Manns , Zhen Chen , Dongbin Xiu

Replicability requires that algorithmic conclusions remain consistent when rerun on independently drawn data. A central structural question is composition: given $k$ problems each admitting a $\rho$-replicable algorithm with sample…

Ensemble learning is a methodology that integrates multiple DNN learners for improving prediction performance of individual learners. Diversity is greater when the errors of the ensemble prediction is more uniformly distributed. Greater…

Machine Learning · Computer Science 2019-08-30 Ling Liu , Wenqi Wei , Ka-Ho Chow , Margaret Loper , Emre Gursoy , Stacey Truex , Yanzhao Wu

High-dimensional, low sample-size (HDLSS) data problems have been a topic of immense importance for the last couple of decades. There is a vast literature that proposed a wide variety of approaches to deal with this situation, among which…

Methodology · Statistics 2021-07-09 Kaixu Yang , Tapabrata Maiti

A system of nested dichotomies is a method of decomposing a multi-class problem into a collection of binary problems. Such a system recursively splits the set of classes into two subsets, and trains a binary classifier to distinguish…

Machine Learning · Statistics 2016-07-06 Tim Leathart , Bernhard Pfahringer , Eibe Frank
‹ Prev 1 2 3 10 Next ›