Related papers: Fast Sparse Classification for Generalized Linear …

Beyond L1: Faster and Better Sparse Models with skglm

We propose a new fast algorithm to estimate any sparse generalized linear model with convex or non-convex separable penalties. Our algorithm is able to solve problems with millions of samples and features in seconds, by relying on…

Machine Learning · Statistics 2023-03-09 Quentin Bertrand, Quentin Klopfenstein, Pierre-Antoine Bannier, Gauthier Gidel, Mathurin Massias

Safe Feature Elimination in Sparse Supervised Learning

We investigate fast methods that quickly eliminate variables (features) in supervised learning problems involving a convex loss function and an $l_1$-norm penalty, leading to a potentially substantial reduction in the number of…

Machine Learning · Computer Science 2010-10-28 Laurent El Ghaoui, Vivian Viallon, Tarek Rabbani

Best-Subset Selection in Generalized Linear Models: A Fast and Consistent Algorithm via Splicing Technique

In high-dimensional generalized linear models, it is crucial to identify a sparse model that adequately accounts for response variation. Although the best subset selection has been widely regarded as the Holy Grail of problems of this type,…

Machine Learning · Statistics 2023-08-02 Junxian Zhu, Jin Zhu, Borui Tang, Xuanyu Chen, Hongmei Lin, Xueqin Wang

Fast Simultaneous Training of Generalized Linear Models (FaSTGLZ)

We present an efficient algorithm for simultaneously training sparse generalized linear models across many related problems, which may arise from bootstrapping, cross-validation and nonparametric permutation testing. Our approach leverages…

Machine Learning · Computer Science 2013-08-01 Bryan R. Conroy, Jennifer M. Walz, Brian Cheung, Paul Sajda

Sparse Classification: a scalable discrete optimization perspective

We formulate the sparse classification problem of $n$ samples with $p$ features as a binary convex optimization problem and propose a cutting-plane algorithm to solve it exactly. For sparse logistic regression and sparse SVM, our algorithm…

Optimization and Control · Mathematics 2025-01-08 Dimitris Bertsimas, Jean Pauphilet, Bart Van Parys

Classification by sparse generalized additive models

We consider (nonparametric) sparse (generalized) additive models (SpAM) for classification. The design of a SpAM classifier is based on minimizing the logistic loss with sparse group Lasso/Slope-type penalties on the coefficients of…

Statistics Theory · Mathematics 2024-05-16 Felix Abramovich

Multi-block linearized alternating direction method for sparse fused Lasso modeling problems

In many statistical modeling problems, such as classification and regression, it is common to encounter sparse and blocky coefficients. Sparse fused Lasso is specifically designed to recover these sparse and blocky structured features,…

Statistics Theory · Mathematics 2024-05-30 Xiaofei Wu, Rongmei Liang, Zhimin Zhang, Zhenyu Cui

Generalization Error Bounds for Multiclass Sparse Linear Classifiers

We consider high-dimensional multiclass classification by sparse multinomial logistic regression. Unlike binary classification, in the multiclass setup one can think about an entire spectrum of possible notions of sparsity associated with…

Statistics Theory · Mathematics 2023-01-18 Tomer Levy, Felix Abramovich

High-Dimensional Distributed Sparse Classification with Scalable Communication-Efficient Global Updates

As the size of datasets used in statistical learning continues to grow, distributed training of models has attracted increasing attention. These methods partition the data and exploit parallelism to reduce memory and runtime, but suffer…

Machine Learning · Computer Science 2024-07-10 Fred Lu, Ryan R. Curtin, Edward Raff, Francis Ferraro, James Holt

Extreme Sparse Multinomial Logistic Regression: A Fast and Robust Framework for Hyperspectral Image Classification

Although the sparse multinomial logistic regression (SMLR) has provided a useful tool for sparse classification, it suffers from inefficacy in dealing with high dimensional features and manually set initial regressor values. This has…

Computer Vision and Pattern Recognition · Computer Science 2017-12-05 Faxian Cao, Zhijing Yang, Jinchang Ren, Wing-Kuen Ling

Fast Sublinear Sparse Representation using Shallow Tree Matching Pursuit

Sparse approximation using highly over-complete dictionaries is a state-of-the-art tool for many imaging applications including denoising, super-resolution, compressive sensing, light-field analysis, and object recognition. Unfortunately,…

Computer Vision and Pattern Recognition · Computer Science 2014-12-03 Ali Ayremlou, Thomas Goldstein, Ashok Veeraraghavan, Richard Baraniuk

Low Complexity Approximate Bayesian Logistic Regression for Sparse Online Learning

Theoretical results show that Bayesian methods can achieve lower bounds on regret for online logistic regression. In practice, however, such techniques may not be feasible, especially for very large feature sets. Various approximations that,…

Machine Learning · Computer Science 2021-01-29 Gil I. Shamir, Wojciech Szpankowski

Sparse Linear Regression With Missing Data

This paper proposes a fast and accurate method for sparse regression in the presence of missing data. The underlying statistical model encapsulates the low-dimensional structure of the incomplete data matrix and the sparsity of the…

Machine Learning · Statistics 2015-03-31 Ravi Ganti, Rebecca M. Willett

Efficient Learning of Sparse Invariant Representations

We propose a simple and efficient algorithm for learning sparse invariant representations from unlabeled data with fast inference. When trained on short movie sequences, the learned features are selective to a range of orientations and…

Computer Vision and Pattern Recognition · Computer Science 2011-05-27 Karol Gregor, Yann LeCun

Fast sparse optimization via adaptive shrinkage

The need for fast sparse optimization is emerging, e.g., to deal with large-dimensional data-driven problems and to track time-varying systems. In the framework of linear sparse optimization, the iterative shrinkage-thresholding algorithm…

Optimization and Control · Mathematics 2025-01-22 Vito Cerone, Sophie M. Fosson, Diego Regruto

Sparse matrix linear models for structured high-throughput data

Recent technological advancements have led to the rapid generation of high-throughput biological data, which can be used to address novel scientific questions in broad areas of research. These data can be thought of as a large matrix with…

Computation · Statistics 2021-03-01 Jane W. Liang, Saunak Sen

High-dimensional sparse classification using exponential weighting with empirical hinge loss

In this study, we address the problem of high-dimensional binary classification. Our proposed solution involves employing an aggregation technique founded on exponential weights and empirical hinge loss. Through the employment of a suitable…

Methodology · Statistics 2024-10-02 The Tien Mai

Sparse Learning over Infinite Subgraph Features

We present a supervised-learning algorithm from graph data (a set of graphs) for arbitrary twice-differentiable loss functions and sparse linear models over all possible subgraph features. To date, it has been shown that under all possible…

Machine Learning · Statistics 2014-03-21 Ichigaku Takigawa, Hiroshi Mamitsuka

Improving Group Lasso for high-dimensional categorical data

Sparse modelling or model selection with categorical data is challenging even for a moderate number of variables, because roughly one parameter is needed to encode one category or level. The Group Lasso is a well-known efficient algorithm…

Methodology · Statistics 2022-11-14 Szymon Nowakowski, Piotr Pokarowski, Wojciech Rejchel, Agnieszka Sołtys

Sparse Model Identification and Learning for Ultra-high-dimensional Additive Partially Linear Models

The additive partially linear model (APLM) combines the flexibility of nonparametric regression with the parsimony of regression models, and has been widely used as a popular tool in multivariate nonparametric regression to alleviate the…

Methodology · Statistics 2019-03-19 Xinyi Li, Li Wang, Dan Nettleton