Related papers: Sparse Classification: a scalable discrete optimiz…

Feature Adaptation for Sparse Linear Regression

Sparse linear regression is a central problem in high-dimensional statistics. We study the correlated random design setting, where the covariates are drawn from a multivariate Gaussian $N(0,\Sigma)$, and we seek an estimator with small…

Data Structures and Algorithms · Computer Science 2023-05-29 Jonathan Kelner , Frederic Koehler , Raghu Meka , Dhruv Rohatgi

Sparse High-Dimensional Regression: Exact Scalable Algorithms and Phase Transitions

We present a novel binary convex reformulation of the sparse regression problem that constitutes a new duality perspective. We devise a new cutting plane method and provide evidence that it can solve to provable optimality the sparse…

Optimization and Control · Mathematics 2017-09-29 Dimitris Bertsimas , Bart Van Parys

Learning Sparse Classifiers: Continuous and Mixed Integer Optimization Perspectives

We consider a discrete optimization formulation for learning sparse classifiers, where the outcome depends upon a linear combination of a small subset of features. Recent work has shown that mixed integer programming (MIP) can be used to…

Machine Learning · Statistics 2021-06-08 Antoine Dedieu , Hussein Hazimeh , Rahul Mazumder

Sparse classification with positive-confidence data in high dimensions

High-dimensional learning problems, where the number of features exceeds the sample size, often require sparse regularization for effective prediction and variable selection. While established for fully supervised data, these techniques…

Machine Learning · Computer Science 2026-01-01 The Tien Mai , Mai Anh Nguyen , Trung Nghia Nguyen

Lasso-type recovery of sparse representations for high-dimensional data

The Lasso is an attractive technique for regularization and variable selection for high-dimensional data, where the number of predictor variables $p_n$ is potentially much larger than the number of samples $n$. However, it was recently…

Statistics Theory · Mathematics 2009-03-02 Nicolai Meinshausen , Bin Yu

Slowly Varying Regression under Sparsity

We present the framework of slowly varying regression under sparsity, allowing sparse regression models to exhibit slow and sparse variations. The problem of parameter estimation is formulated as a mixed-integer optimization problem. We…

Machine Learning · Computer Science 2023-11-14 Dimitris Bertsimas , Vassilis Digalakis , Michael Linghzi Li , Omar Skali Lami

Sparse Regression: Scalable algorithms and empirical performance

In this paper, we review state-of-the-art methods for feature selection in statistics with an application-oriented eye. Indeed, sparsity is a valuable property and the profusion of research on the topic might have provided little guidance…

Methodology · Statistics 2021-11-08 Dimitris Bertsimas , Jean Pauphilet , Bart Van Parys

Direct l_(2,p)-Norm Learning for Feature Selection

In this paper, we propose a novel sparse learning based feature selection method that directly optimizes a large margin linear classification model sparsity with l_(2,p)-norm (0 < p < 1)subject to data-fitting constraints, rather than using…

Machine Learning · Computer Science 2015-04-03 Hanyang Peng , Yong Fan

On best subset regression

In this paper we discuss the variable selection method from \ell0-norm constrained regression, which is equivalent to the problem of finding the best subset of a fixed size. Our study focuses on two aspects, consistency and computation. We…

Methodology · Statistics 2013-03-20 Shifeng Xiong

Sparse Recovery using Smoothed $\ell^0$ (SL0): Convergence Analysis

Finding the sparse solution of an underdetermined system of linear equations has many applications, especially, it is used in Compressed Sensing (CS), Sparse Component Analysis (SCA), and sparse decomposition of signals on overcomplete…

Information Theory · Computer Science 2010-01-29 Hosein Mohimani , Massoud Babaie-Zadeh , Irina Gorodnitsky , Christian Jutten

Sparse Solutions of a Class of Constrained Optimization Problems

In this paper, we consider a well-known sparse optimization problem that aims to find a sparse solution of a possibly noisy underdetermined system of linear equations. Mathematically, it can be modeled in a unified manner by minimizing…

Optimization and Control · Mathematics 2021-10-01 Lei Yang , Xiaojun Chen , Shuhuang Xiang

Safe Feature Elimination in Sparse Supervised Learning

We investigate fast methods that allow to quickly eliminate variables (features) in supervised learning problems involving a convex loss function and a $l_1$-norm penalty, leading to a potentially substantial reduction in the number of…

Machine Learning · Computer Science 2010-10-28 Laurent El Ghaoui , Vivian Viallon , Tarek Rabbani

The Statistics of Streaming Sparse Regression

We present a sparse analogue to stochastic gradient descent that is guaranteed to perform well under similar conditions to the lasso. In the linear regression setup with irrepresentable noise features, our algorithm recovers the support set…

Statistics Theory · Mathematics 2014-12-16 Jacob Steinhardt , Stefan Wager , Percy Liang

Sparse-Input Neural Network using Group Concave Regularization

Simultaneous feature selection and non-linear function estimation is challenging in modeling, especially in high-dimensional settings where the number of variables exceeds the available sample size. In this article, we investigate the…

Machine Learning · Statistics 2026-01-05 Bin Luo , Susan Halabi

A Coordinate-wise Optimization Algorithm for Sparse Inverse Covariance Selection

Sparse inverse covariance selection is a fundamental problem for analyzing dependencies in high dimensional data. However, such a problem is difficult to solve since it is NP-hard. Existing solutions are primarily based on convex…

Numerical Analysis · Computer Science 2018-04-05 Ganzhao Yuan , Haoxian Tan , Wei-Shi Zheng

Error bounds for sparse classifiers in high-dimensions

We prove an L2 recovery bound for a family of sparse estimators defined as minimizers of some empirical loss functions -- which include hinge loss and logistic loss. More precisely, we achieve an upper-bound for coefficients estimation…

Statistics Theory · Mathematics 2019-01-15 Antoine Dedieu

Sparse Regression via Range Counting

The sparse regression problem, also known as best subset selection problem, can be cast as follows: Given a set $S$ of $n$ points in $\mathbb{R}^d$, a point $y\in \mathbb{R}^d$, and an integer $2 \leq k \leq d$, find an affine combination…

Data Structures and Algorithms · Computer Science 2020-01-01 Jean Cardinal , Aurélien Ooms

Model Selection Through Sparse Maximum Likelihood Estimation

We consider the problem of estimating the parameters of a Gaussian or binary distribution in such a way that the resulting undirected graphical model is sparse. Our approach is to solve a maximum likelihood problem with an added l_1-norm…

Artificial Intelligence · Computer Science 2007-07-06 Onureena Banerjee , Laurent El Ghaoui , Alexandre d'Aspremont

Improving Group Lasso for high-dimensional categorical data

Sparse modelling or model selection with categorical data is challenging even for a moderate number of variables, because one parameter is roughly needed to encode one category or level. The Group Lasso is a well known efficient algorithm…

Methodology · Statistics 2022-11-14 Szymon Nowakowski , Piotr Pokarowski , Wojciech Rejchel , Agnieszka Sołtys

Sparse Sliced Inverse Regression Via Lasso

For multiple index models, it has recently been shown that the sliced inverse regression (SIR) is consistent for estimating the sufficient dimension reduction (SDR) space if and only if $\rho=\lim\frac{p}{n}=0$, where $p$ is the dimension…

Statistics Theory · Mathematics 2018-06-19 Qian Lin , Zhigen Zhao , Jun S. Liu