Related papers: Prediction-based Variable Selection for Component-…

200 papers

We present a new variable selection method based on model-based gradient boosting and randomly permuted variables. Model-based boosting is a tool to fit a statistical model while performing variable selection at the same time. A drawback of…

Machine Learning · Statistics 2017-02-16 Janek Thomas, Tobias Hepp, Andreas Mayr, Bernd Bischl

Boosting methods are widely used in statistical learning to deal with high-dimensional data due to their variable selection feature. However, those methods lack straightforward ways to construct estimators for the precision of the…

Methodology · Statistics 2021-06-10 Boyao Zhang, Colin Griesbach, Cora Kim, Nadia Müller-Voggel, Elisabeth Bergherr

We present a new procedure for enhanced variable selection for component-wise gradient boosting. Statistical boosting is a computational approach that emerged from machine learning, which allows fitting regression models in the presence of…

We present a statistical perspective on boosting. Special emphasis is given to estimating potentially complex parametric or nonparametric models, including generalized linear and additive models as well as regression models for survival…

Methodology · Statistics 2008-12-18 Peter Bühlmann, Torsten Hothorn

This paper introduces a boosted conformal procedure designed to tailor conformalized prediction intervals toward specific desired properties, such as enhanced conditional coverage or reduced interval length. We employ machine learning…

Methodology · Statistics 2024-11-12 Ran Xie, Rina Foygel Barber, Emmanuel J. Candès

Gradient boosting from the field of statistical learning is widely known as a powerful framework for estimation and selection of predictor effects in various regression models by adapting concepts from classification theory. Current…

Methodology · Statistics 2020-11-03 Colin Griesbach, Benjamin Säfken, Elisabeth Waldmann

Gradient boosting of regression trees is a competitive procedure for learning predictive models of continuous data that fits the data with an additive non-parametric model. The classic version of gradient boosting assumes that the data is…

Machine Learning · Computer Science 2016-07-04 Iman Alodah, Jennifer Neville

We develop an algorithm for model selection which allows for the consideration of a combinatorially large number of candidate models governing a dynamical system. The innovation circumvents a disadvantage of standard model selection which…

Data Analysis, Statistics and Probability · Physics 2017-11-01 Niall M. Mangan, J. Nathan Kutz, Steven L. Brunton, Joshua L. Proctor

In the problem of selecting variables in a multivariate linear regression model, we derive new Bayesian information criteria based on a prior mixing a smooth distribution and a delta distribution. Each of them can be interpreted as a fusion…

Statistics Theory · Mathematics 2022-09-29 Haruki Kono, Tatsuya Kubokawa

This paper investigates the integration of gradient boosted decision trees and varying coefficient models. We introduce the tree boosted varying coefficient framework which justifies the implementation of decision tree boosting as the…

Methodology · Statistics 2019-04-03 Yichen Zhou, Giles Hooker

Model selection in mixed models based on the conditional distribution is appropriate for many practical applications and has been a focus of recent statistical research. In this paper we introduce the R-package cAIC4 that allows for the…

Computation · Statistics 2018-03-20 Benjamin Säfken, David Rügamer, Thomas Kneib, Sonja Greven

Automatic machine learning performs predictive modeling with high performing machine learning tools without human interference. This is achieved by making machine learning applications parameter-free, i.e. only a dataset is provided while…

Machine Learning · Statistics 2018-07-16 Janek Thomas, Stefan Coors, Bernd Bischl

Gradient boosting algorithms construct a regression predictor using a linear combination of "base learners". Boosting also offers an approach to obtaining robust non-parametric regression estimators that are scalable to applications with…

Methodology · Statistics 2020-08-11 Xiaomeng Ju, Matías Salibián-Barrera

Recent advances in the literature have demonstrated that standard supervised learning algorithms are ill-suited for problems with endogenous explanatory variables. To correct for the endogeneity bias, many variants of nonparametric…

Econometrics · Economics 2021-01-18 Edvard Bakhitov, Amandeep Singh

While the Bayesian Information Criterion (BIC) and Akaike Information Criterion (AIC) are powerful tools for model selection in linear regression, they are built on different prior assumptions and thereby apply to different data generation…

Methodology · Statistics 2017-12-15 MB de Kock, HC Eggers

Gradient boosting is a prediction method that iteratively combines weak learners to produce a complex and accurate model. From an optimization point of view, the learning procedure of gradient boosting mimics a gradient descent on a…

Machine Learning · Computer Science 2022-11-30 Erwan Fouillen, Claire Boyer, Maxime Sangnier

Boosting techniques from the field of statistical learning have grown to be a popular tool for estimating and selecting predictor effects in various regression models and can roughly be separated in two general approaches, namely gradient…

Methodology · Statistics 2019-12-16 Colin Griesbach, Andreas Groll, Elisabeth Waldmann

Gradient boosting is a state-of-the-art prediction technique that sequentially produces a model in the form of linear combinations of simple predictors, typically decision trees, by solving an infinite-dimensional convex optimization…

Statistics Theory · Mathematics 2017-07-18 Gérard Biau, Benoît Cadre

Componentwise boosting (CWB), also known as model-based boosting, is a variant of gradient boosting that builds on additive models as base learners to ensure interpretability. CWB is thus often used in research areas where models are…

Computation · Statistics 2021-11-01 Daniel Schalk, Bernd Bischl, David Rügamer

We propose a robust variable selection procedure using a divergence based M-estimator combined with a penalty function. It produces robust estimates of the regression parameters and simultaneously selects the important explanatory…

Methodology · Statistics 2020-01-01 Abhijit Mandal, Samiran Ghosh