Related papers: Dual Extrapolation for Sparse Generalized Linear M…

Celer: a Fast Solver for the Lasso with Dual Extrapolation

Convex sparsity-inducing regularizations are ubiquitous in high-dimensional machine learning, but solving the resulting optimization problems can be slow. To accelerate solvers, state-of-the-art approaches consist in reducing the size of…

Machine Learning · Statistics 2018-06-07 Mathurin Massias , Alexandre Gramfort , Joseph Salmon

Efficient Penalized Generalized Linear Mixed Models for Variable Selection and Genetic Risk Prediction in High-Dimensional Data

Sparse regularized regression methods are now widely used in genome-wide association studies (GWAS) to address the multiple testing burden that limits discovery of potentially important predictors. Linear mixed models (LMMs) have become an…

Methodology · Statistics 2022-06-27 Julien St-Pierre , Karim Oualkacha , Sahir Rai Bhatnagar

Robust and Sparse Regression in GLM by Stochastic Optimization

The generalized linear model (GLM) plays a key role in regression analyses. In high-dimensional data, the sparse GLM has been used but it is not robust against outliers. Recently, the robust methods have been proposed for the specific…

Machine Learning · Statistics 2026-05-15 Takayuki Kawashima , Hironori Fujisawa

Optimal Errors and Phase Transitions in High-Dimensional Generalized Linear Models

Generalized linear models (GLMs) arise in high-dimensional machine learning, statistics, communications and signal processing. In this paper we analyze GLMs when the data matrix is random, as relevant in problems such as compressed sensing,…

Information Theory · Computer Science 2019-04-01 Jean Barbier , Florent Krzakala , Nicolas Macris , Léo Miolane , Lenka Zdeborová

GAP Safe screening rules for sparse multi-task and multi-class models

High dimensional regression benefits from sparsity promoting regularizations. Screening rules leverage the known sparsity of the solution by ignoring some variables in the optimization, hence speeding up solvers. When the procedure is…

Machine Learning · Statistics 2015-11-19 Eugene Ndiaye , Olivier Fercoq , Alexandre Gramfort , Joseph Salmon

Fast Dual Variational Inference for Non-Conjugate LGMs

Latent Gaussian models (LGMs) are widely used in statistics and machine learning. Bayesian inference in non-conjugate LGMs is difficult due to intractable integrals involving the Gaussian prior and non-conjugate likelihoods. Algorithms…

Machine Learning · Statistics 2013-06-06 Mohammad Emtiyaz Khan , Aleksandr Y. Aravkin , Michael P. Friedlander , Matthias Seeger

Safe Screening Rules for Generalized Double Sparsity Learning

In a high-dimensional setting, sparse model has shown its power in computational and statistical efficiency. We consider variables selection problem with a broad class of simultaneous sparsity regularization, enforcing both feature-wise and…

Optimization and Control · Mathematics 2021-09-27 Xinyu Zhang

A connection between the pattern classification problem and the General Linear Model for statistical inference

A connection between the General Linear Model (GLM) in combination with classical statistical inference and the machine learning (MLE)-based inference is described in this paper. Firstly, the estimation of the GLM parameters is expressed as…

Machine Learning · Statistics 2022-02-10 Juan Manuel Gorriz , SIPBA group , John Suckling

Learning sparse generalized linear models with binary outcomes via iterative hard thresholding

In statistics, generalized linear models (GLMs) are widely used for modeling data and can expressively capture potential nonlinear dependence of the model's outcomes on its covariates. Within the broad family of GLMs, those with binary…

Statistics Theory · Mathematics 2025-09-04 Namiko Matsumoto , Arya Mazumdar

Sparse Techniques for Regression in Deep Gaussian Processes

Gaussian processes (GPs) have gained popularity as flexible machine learning models for regression and function approximation with an in-built method for uncertainty quantification. However, GPs suffer when the amount of training data is…

Machine Learning · Statistics 2025-11-26 Jonas Latz , Aretha L. Teckentrup , Simon Urbainczyk

Generalized Linear Model Regression under Distance-to-set Penalties

Estimation in generalized linear models (GLM) is complicated by the presence of constraints. One can handle constraints by maximizing a penalized log-likelihood. Penalties such as the lasso are effective in high dimensions, but often lead…

Machine Learning · Statistics 2017-11-07 Jason Xu , Eric C. Chi , Kenneth Lange

GPU-friendly and Linearly Convergent First-order Methods for Certifying Optimal $k$-sparse GLMs

We investigate the problem of certifying optimality for sparse generalized linear models (GLMs), where sparsity is enforced through a cardinality constraint. While Branch-and-Bound (BnB) frameworks can certify optimality using perspective…

Optimization and Control · Mathematics 2026-03-03 Jiachang Liu , Andrea Lodi , Soroosh Shafiee

Gap Safe screening rules for sparsity enforcing penalties

In high dimensional regression settings, sparsity enforcing penalties have proved useful to regularize the data-fitting term. A recently introduced technique called screening rules propose to ignore some variables in the optimization…

Machine Learning · Statistics 2017-12-29 Eugene Ndiaye , Olivier Fercoq , Alexandre Gramfort , Joseph Salmon

Learning a Class of Mixed Linear Regressions: Global Convergence under General Data Conditions

Mixed linear regression (MLR) has attracted increasing attention because of its great theoretical and practical importance in capturing nonlinear relationships by utilizing a mixture of linear regression sub-models. Although considerable…

Machine Learning · Statistics 2025-03-25 Yujing Liu , Zhixin Liu , Lei Guo

Covariance Estimation: The GLM and Regularization Perspectives

Finding an unconstrained and statistically interpretable reparameterization of a covariance matrix is still an open problem in statistics. Its solution is of central importance in covariance estimation, particularly in the recent…

Methodology · Statistics 2012-02-09 Mohsen Pourahmadi

GLMMLasso: An Algorithm for High-Dimensional Generalized Linear Mixed Models Using L1-Penalization

We propose an L1-penalized algorithm for fitting high-dimensional generalized linear mixed models. Generalized linear mixed models (GLMMs) can be viewed as an extension of generalized linear models for clustered observations. This…

Computation · Statistics 2014-06-03 Jürg Schelldorfer , Lukas Meier , Peter Bühlmann

Adaptive posterior convergence in sparse high dimensional clipped generalized linear models

We develop a framework to study posterior contraction rates in sparse high dimensional generalized linear models (GLM). We introduce a new family of GLMs, denoted by clipped GLM, which subsumes many standard GLMs and makes minor…

Statistics Theory · Mathematics 2021-03-16 Biraj Subhra Guha , Debdeep Pati

Robust and Sparse Generalized Linear Models for High-Dimensional Data via Maximum Mean Discrepancy

High-dimensional datasets are frequently subject to contamination by outliers and heavy-tailed noise, which can severely bias standard regularized estimators like the Lasso. While Maximum Mean Discrepancy (MMD) has recently been introduced…

Methodology · Statistics 2026-02-25 Xiaoning Kang , Lulu Kang

Generalisation error in learning with random features and the hidden manifold model

We study generalised linear regression and classification for a synthetically generated dataset encompassing different problems of interest, such as learning with random features, neural networks in the lazy training regime, and the hidden…

Statistics Theory · Mathematics 2022-03-28 Federica Gerace , Bruno Loureiro , Florent Krzakala , Marc Mézard , Lenka Zdeborová

Sibling Regression for Generalized Linear Models

Field observations form the basis of many scientific studies, especially in ecological and social sciences. Despite efforts to conduct such surveys in a standardized way, observations can be prone to systematic measurement errors. The…

Methodology · Statistics 2021-08-31 Shiv Shankar , Daniel Sheldon