Related papers: High-dimensional classification by sparse logistic…

Multiclass classification by sparse multinomial logistic regression

In this paper we consider high-dimensional multiclass classification by sparse multinomial logistic regression. We propose first a feature selection procedure based on penalized maximum likelihood with a complexity penalty on the model size…

Statistics Theory · Mathematics 2020-11-20 Felix Abramovich , Vadim Grinshtein , Tomer Levy

High Dimensional Classification through $\ell_0$-Penalized Empirical Risk Minimization

We consider a high dimensional binary classification problem and construct a classification procedure by minimizing the empirical misclassification risk with a penalty on the number of selected features. We derive non-asymptotic probability…

Methodology · Statistics 2018-11-26 Le-Yu Chen , Sokbae Lee

Generalization Error Bounds for Multiclass Sparse Linear Classifiers

We consider high-dimensional multiclass classification by sparse multinomial logistic regression. Unlike binary classification, in the multiclass setup one can think about an entire spectrum of possible notions of sparsity associated with…

Statistics Theory · Mathematics 2023-01-18 Tomer Levy , Felix Abramovich

On high-dimensional classification by sparse generalized Bayesian logistic regression

This work addresses the problem of high-dimensional classification by exploring the generalized Bayesian logistic regression method under a sparsity-inducing prior distribution. The method involves utilizing a fractional power of the…

Statistics Theory · Mathematics 2024-03-20 The Tien Mai

Selection of variables and decision boundaries for functional data via bi-level selection

Sparsity-inducing penalties are useful tools for variable selection and they are also effective for regression settings where the data are functions. We consider the problem of selecting not only variables but also decision boundaries in…

Methodology · Statistics 2020-06-01 Hidetoshi Matsui

High-Dimensional Sparse Additive Hazards Regression

High-dimensional sparse modeling with censored survival data is of great practical importance, as exemplified by modern applications in high-throughput genomic data analysis and credit risk analysis. In this article, we propose a class of…

Methodology · Statistics 2014-03-19 Wei Lin , Jinchi Lv

Model selection and minimax estimation in generalized linear models

We consider model selection in generalized linear models (GLM) for high-dimensional data and propose a wide class of model selection criteria based on penalized maximum likelihood with a complexity penalty on the model size. We derive a…

Statistics Theory · Mathematics 2016-03-31 Felix Abramovich , Vadim Grinshtein

High Dimensional Classification with combined Adaptive Sparse PLS and Logistic Regression

Motivation: The high dimensionality of genomic data calls for the development of specific classification methodologies, especially to prevent over-optimistic predictions. This challenge can be tackled by compression and variable selection,…

Methodology · Statistics 2021-04-10 G. Durif , L. Modolo , J. Michaelsson , J. E. Mold , S. Lambert-Lacroix , F. Picard

Efficient and robust high-dimensional sparse logistic regression via nonlinear primal-dual hybrid gradient algorithms

Logistic regression is a widely used statistical model to describe the relationship between a binary response variable and predictor variables in data sets. It is often used in machine learning to identify important predictor variables.…

Optimization and Control · Mathematics 2021-12-30 Jérôme Darbon , Gabriel P. Langlois

High-dimensional additive modeling

We propose a new sparsity-smoothness penalty for high-dimensional generalized additive models. The combination of sparsity and smoothness is crucial for mathematical theory as well as performance for finite-sample data. We present a…

Machine Learning · Statistics 2009-11-18 Lukas Meier , Sara van de Geer , Peter Bühlmann

High dimensional thresholded regression and shrinkage effect

High-dimensional sparse modeling via regularization provides a powerful tool for analyzing large-scale data sets and obtaining meaningful, interpretable models. The use of nonconvex penalty functions shows advantage in selecting important…

Methodology · Statistics 2016-05-12 Zemin Zheng , Yingying Fan , Jinchi Lv

Penalized Composite Quasi-Likelihood for Ultrahigh-Dimensional Variable Selection

In high-dimensional model selection problems, penalized simple least-square approaches have been extensively used. This paper addresses the question of both robustness and efficiency of penalized model selection methods, and proposes a…

Methodology · Statistics 2011-07-06 Jelena Bradic , Jianqing Fan , Weiwei Wang

High-dimensional regression with a count response

We consider high-dimensional regression with a count response modeled by Poisson or negative binomial generalized linear model (GLM). We propose a penalized maximum likelihood estimator with a properly chosen complexity penalty and…

Methodology · Statistics 2024-09-16 Or Zilberman , Felix Abramovich

Sparse classification with positive-confidence data in high dimensions

High-dimensional learning problems, where the number of features exceeds the sample size, often require sparse regularization for effective prediction and variable selection. While established for fully supervised data, these techniques…

Machine Learning · Computer Science 2026-01-01 The Tien Mai , Mai Anh Nguyen , Trung Nghia Nguyen

Model Selection Through Sparse Maximum Likelihood Estimation

We consider the problem of estimating the parameters of a Gaussian or binary distribution in such a way that the resulting undirected graphical model is sparse. Our approach is to solve a maximum likelihood problem with an added l_1-norm…

Artificial Intelligence · Computer Science 2007-07-06 Onureena Banerjee , Laurent El Ghaoui , Alexandre d'Aspremont

Robust adaptive Lasso in high-dimensional logistic regression

Penalized logistic regression is extremely useful for binary classification with large number of covariates (higher than the sample size), having several real life applications, including genomic disease classification. However, the…

Methodology · Statistics 2023-04-10 Ayanendranath Basu , Abhik Ghosh , María Jaenada , Leandro Pardo

Variable selection and basis learning for ordinal classification

We propose a method for variable selection and basis learning for high-dimensional classification with ordinal responses. The proposed method extends sparse multiclass linear discriminant analysis, with the aim of identifying not only the…

Methodology · Statistics 2025-02-17 Minwoo Kim , Sangil Han , Jeongyoun Ahn , Sungkyu Jung

Choosing a penalty for model selection in heteroscedastic regression

We consider the problem of choosing between several models in least-squares regression with heteroscedastic data. We prove that any penalization procedure is suboptimal when the penalty is a function of the dimension of the model, at least…

Statistics Theory · Mathematics 2010-07-28 Sylvain Arlot

Estimation and variable selection in high dimension in nonlinear mixed-effects models

We consider nonlinear mixed effects models including high-dimensional covariates to model individual parameters variability. The objective is to identify relevant covariates among a large set under sparsity assumption and to estimate model…

Statistics Theory · Mathematics 2025-08-06 Antoine Caillebotte , Estelle Kuhn , Sarah Lemler

Penalized robust estimators in logistic regression with applications to sparse models

Sparse covariates are frequent in classification and regression problems and in these settings the task of variable selection is usually of interest. As it is well known, sparse statistical models correspond to situations where there are…

Methodology · Statistics 2020-02-14 Ana M. Bianco , Graciela Boente , Gonzalo Chebi