Related papers: Generalized Linear Rule Models

Predictive learning via rule ensembles

General regression and classification models are constructed as linear combinations of simple rules derived from the data. Each rule consists of a conjunction of a small number of simple statements concerning the values of individual input…

Applications · Statistics 2008-11-12 Jerome H. Friedman , Bogdan E. Popescu

Obtaining Explainable Classification Models using Distributionally Robust Optimization

Model explainability is crucial for human users to be able to interpret how a proposed classifier assigns labels to data based on its feature values. We study generalized linear models constructed using sets of feature value rules, which…

Machine Learning · Statistics 2023-11-06 Sanjeeb Dash , Soumyadip Ghosh , Joao Goncalves , Mark S. Squillante

Dimension Reduction Using Rule Ensemble Machine Learning Methods: A Numerical Study of Three Ensemble Methods

Ensemble methods for supervised machine learning have become popular due to their ability to accurately predict class labels with groups of simple, lightweight "base learners." While ensembles offer computationally efficient models that…

Machine Learning · Statistics 2011-09-01 Orianna DeMasi , Juan Meza , David H. Bailey

Better Short than Greedy: Interpretable Models through Optimal Rule Boosting

Rule ensembles are designed to provide a useful trade-off between predictive accuracy and model interpretability. However, the myopic and random search components of current rule ensemble methods can compromise this goal: they often need…

Machine Learning · Computer Science 2021-01-22 Mario Boley , Simon Teshuva , Pierre Le Bodic , Geoffrey I Webb

Learning Locally Interpretable Rule Ensemble

This paper proposes a new framework for learning a rule ensemble model that is both accurate and interpretable. A rule ensemble is an interpretable model based on the linear combination of weighted rules. In practice, we often face the…

Machine Learning · Computer Science 2023-06-21 Kentaro Kanamori

An improved column-generation-based matheuristic for learning classification trees

Decision trees are highly interpretable models for solving classification problems in machine learning (ML). The standard ML algorithms for training decision trees are fast but generate suboptimal trees in terms of accuracy. Other discrete…

Machine Learning · Computer Science 2024-01-24 Krunal Kishor Patel , Guy Desaulniers , Andrea Lodi

Generalized linearization in nonlinear modeling of data

The principal innovative idea in this paper is to transform the original complex nonlinear modeling problem into a combination of a linear problem and very simple nonlinear problems. The key step is the generalized linearization of nonlinear…

Computational Engineering, Finance, and Science · Computer Science 2024-09-21 W. Chen

Rule Generation for Classification: Scalability, Interpretability, and Fairness

We introduce a new rule-based optimization method for classification with constraints. The proposed method leverages column generation for linear programming, and hence, is scalable to large datasets. The resulting pricing subproblem is…

Machine Learning · Computer Science 2025-02-07 Tabea E. Röber , Adia C. Lumadjeng , M. Hakan Akyüz , Ş. İlker Birbil

A connection between the pattern classification problem and the General Linear Model for statistical inference

A connection between the General Linear Model (GLM) in combination with classical statistical inference and the machine learning (MLE)-based inference is described in this paper. Firstly, the estimation of the GLM parameters is expressed as…

Machine Learning · Statistics 2022-02-10 Juan Manuel Gorriz , SIPBA group , John Suckling

Nonparametric Functional Analysis of Generalized Linear Models Under Nonlinear Constraints

This article introduces a novel nonparametric methodology for Generalized Linear Models which combines the strengths of the binary regression and latent variable formulations for categorical data, while overcoming their disadvantages.…

Machine Learning · Statistics 2021-10-12 K. P. Chowdhury

Robust Estimation in Generalised Linear Models : The Density Power Divergence Approach

The generalised linear model (GLM) is a very important tool for analysing real data in biology, sociology, agriculture, engineering and many other application domains where the relationship between the response and explanatory variables may…

Methodology · Statistics 2016-07-04 Abhik Ghosh , Ayanendranath Basu

Uniform-in-Submodel Bounds for Linear Regression in a Model Free Framework

For the last two decades, high-dimensional data and methods have proliferated throughout the literature. Yet, the classical technique of linear regression has not lost its usefulness in applications. In fact, many high-dimensional…

Statistics Theory · Mathematics 2021-05-18 Arun Kumar Kuchibhotla , Lawrence D. Brown , Andreas Buja , Edward I. George , Linda Zhao

A Simple Correction Procedure for High-Dimensional Generalized Linear Models with Measurement Error

We consider high-dimensional generalized linear models when the covariates are contaminated by measurement error. Estimates from errors-in-variables regression models are well-known to be biased in traditional low-dimensional settings if…

Computation · Statistics 2020-01-06 Michael Byrd , Monnie McGee

Explainable Linear and Generalized Linear Models by the Predictions Plot

Multiple linear regression is a basic statistical tool, yielding a prediction formula with the input variables, slopes, and an intercept. But is it really easy to see which terms have the largest effect, or to explain why the prediction of…

Methodology · Statistics 2025-07-23 Peter J. Rousseeuw

Rule-based Bayesian regression

We introduce a novel rule-based approach for handling regression problems. The new methodology carries elements from two frameworks: (i) it provides information about the uncertainty of the parameters of interest using Bayesian inference,…

Machine Learning · Statistics 2021-10-11 Themistoklis Botsas , Lachlan R. Mason , Indranil Pan

Rule-Mining based classification: a benchmark study

This study proposed an exhaustive stable/reproducible rule-mining algorithm combined with a classifier to generate both accurate and interpretable models. Our method first extracts rules (i.e., a conjunction of conditions about the values of…

Machine Learning · Computer Science 2017-07-03 Margaux Luck , Nicolas Pallet , Cecilia Damon

Application of the Generalized Linear Models in Actuarial Framework

This paper reviews the methodology behind generalized linear models, which are used to analyze actuarial situations in place of ordinary multiple linear regression. We introduce how to assess the adequacy of the model…

Statistical Finance · Quantitative Finance 2016-11-09 Murwan H. M. A. Siddig

Column-Randomized Linear Programs: Performance Guarantees and Applications

We propose a randomized method for solving linear programs with a large number of columns but a relatively small number of constraints. Since enumerating all the columns is usually unrealistic, such linear programs are commonly solved by…

Optimization and Control · Mathematics 2023-11-29 Yi-Chun Akchen , Velibor V. Mišić

Subset Selection for Multiple Linear Regression via Optimization

Subset selection in multiple linear regression aims to choose a subset of candidate explanatory variables that trades off fitting error (explanatory power) against model complexity (number of variables selected). We build mathematical programming…

Machine Learning · Statistics 2020-09-04 Young Woong Park , Diego Klabjan

Robust and efficient estimation of nonparametric generalized linear models

Generalized linear models are flexible tools for the analysis of diverse datasets, but the classical formulation requires that the parametric component is correctly specified and the data contain no atypical observations. To address these…

Methodology · Statistics 2023-04-21 Ioannis Kalogridis , Gerda Claeskens , Stefan Van Aelst