Related papers: Smoothing Proximal Gradient Method for General Str…

Smoothing proximal gradient method for general structured sparse regression

We study the problem of estimating high-dimensional regression models regularized by a structured sparsity-inducing penalty that encodes prior structural information on either the input or output variables. We consider two widely adopted…

Machine Learning · Statistics 2012-07-02 Xi Chen , Qihang Lin , Seyoung Kim , Jaime G. Carbonell , Eric P. Xing

Proximal methods for the latent group lasso penalty

We consider a regularized least squares problem, with regularization by structured sparsity-inducing norms, which extend the usual $\ell_1$ and the group lasso penalty, by allowing the subsets to overlap. Such regularizations lead to…

Optimization and Control · Mathematics 2012-09-04 Silvia Villa , Lorenzo Rosasco , Sofia Mosci , Alessandro Verri

Effective Proximal Methods for Non-convex Non-smooth Regularized Learning

Sparse learning is a very important tool for mining useful information and patterns from high dimensional data. Non-convex non-smooth regularized learning problems play essential roles in sparse learning, and have drawn extensive attentions…

Machine Learning · Computer Science 2020-10-22 Guannan Liang , Qianqian Tong , Jiahao Ding , Miao Pan , Jinbo Bi

New Penalized Stochastic Gradient Methods for Linearly Constrained Strongly Convex Optimization

For minimizing a strongly convex objective function subject to linear inequality constraints, we consider a penalty approach that allows one to utilize stochastic methods for problems with a large number of constraints and/or objective…

Optimization and Control · Mathematics 2022-02-16 Meng Li , Paul Grigas , Alper Atamturk

Continuation of Nesterov's Smoothing for Regression with Structured Sparsity in High-Dimensional Neuroimaging

Predictive models can be used on high-dimensional brain images for diagnosis of a clinical condition. Spatial regularization through structured sparsity offers new perspectives in this context and reduces the risk of overfitting the model…

Machine Learning · Statistics 2018-04-24 Fouad Hadj-Selem , Tommy Lofstedt , Elvis Dohmatob , Vincent Frouin , Mathieu Dubois , Vincent Guillemot , Edouard Duchesnay

Sparse Regression with Multi-type Regularized Feature Modeling

Within the statistical and machine learning literature, regularization techniques are often used to construct sparse (predictive) models. Most regularization strategies only work for data where all predictors are treated identically, such…

Computation · Statistics 2020-12-16 Sander Devriendt , Katrien Antonio , Tom Reynkens , Roel Verbelen

Proximal Iteration for Nonlinear Adaptive Lasso

Augmenting a smooth cost function with an $\ell_1$ penalty allows analysts to efficiently conduct estimation and variable selection simultaneously in sophisticated models and can be efficiently implemented using proximal gradient methods.…

Machine Learning · Statistics 2024-12-10 Nathan Wycoff , Lisa O. Singh , Ali Arab , Katharine M. Donato

Proximal Gradient methods with Adaptive Subspace Sampling

Many applications in machine learning or signal processing involve nonsmooth optimization problems. This nonsmoothness brings a low-dimensional structure to the optimal solutions. In this paper, we propose a randomized proximal gradient…

Optimization and Control · Mathematics 2020-04-29 Dmitry Grishchenko , Franck Iutzeler , Jérôme Malick

Stochastic Proximal Methods for Non-Smooth Non-Convex Constrained Sparse Optimization

This paper focuses on stochastic proximal gradient methods for optimizing a smooth non-convex loss function with a non-smooth non-convex regularizer and convex constraints. To the best of our knowledge we present the first non-asymptotic…

Optimization and Control · Mathematics 2019-05-27 Michael R. Metel , Akiko Takeda

Accelerated Gradient Methods for Sparse Statistical Learning with Nonconvex Penalties

Nesterov's accelerated gradient (AG) is a popular technique to optimize objective functions comprising two components: a convex loss and a penalty function. While AG methods perform well for convex penalties, such as the LASSO, convergence…

Optimization and Control · Mathematics 2024-01-04 Kai Yang , Masoud Asgharian , Sahir Bhatnagar

Classification and regression using an outer approximation projection-gradient method

This paper deals with sparse feature selection and grouping for classification and regression. The classification or regression problems under consideration consists in minimizing a convex empirical risk function subject to an $\ell^1$…

Statistics Theory · Mathematics 2017-03-27 Michel Barlaud , Wafa Belhajali , Patrick L. Combettes , Lionel Fillatre

Optimization with Sparsity-Inducing Penalties

Sparse estimation methods are aimed at using or obtaining parsimonious representations of data or models. They were first dedicated to linear variable selection but numerous extensions have now emerged such as structured sparsity or kernel…

Machine Learning · Computer Science 2011-11-24 Francis Bach , Rodolphe Jenatton , Julien Mairal , Guillaume Obozinski

Structured Sparsity Inducing Adaptive Optimizers for Deep Learning

The parameters of a neural network are naturally organized in groups, some of which might not contribute to its overall performance. To prune out unimportant groups of parameters, we can include some non-differentiable penalty to the…

Machine Learning · Computer Science 2023-01-06 Tristan Deleu , Yoshua Bengio

A General Framework for Structured Sparsity via Proximal Optimization

We study a generalized framework for structured sparsity. It extends the well-known methods of Lasso and Group Lasso by incorporating additional constraints on the variables as part of a convex optimization problem. This framework provides…

Machine Learning · Computer Science 2011-06-28 Andreas Argyriou , Luca Baldassarre , Jean Morales , Massimiliano Pontil

A Zeroth-order Proximal Stochastic Gradient Method for Weakly Convex Stochastic Optimization

In this paper we analyze a zeroth-order proximal stochastic gradient method suitable for the minimization of weakly convex stochastic optimization problems. We consider nonsmooth and nonlinear stochastic composite problems, for which…

Optimization and Control · Mathematics 2025-04-21 Spyridon Pougkakiotis , Dionysios S. Kalogerias

A geometric proximal gradient method for sparse least squares regression with probabilistic simplex constraint

In this paper, we consider the sparse least squares regression problem with probabilistic simplex constraint. Due to the probabilistic simplex constraint, one could not apply the L1 regularization to the considered regression model. To find…

Optimization and Control · Mathematics 2021-12-28 Guiyun Xiao , Zheng-Jian Bai

On the Convergence and Complexity of Proximal Gradient and Accelerated Proximal Gradient Methods under Adaptive Gradient Estimation

In this paper, we propose a proximal gradient method and an accelerated proximal gradient method for solving composite optimization problems, where the objective function is the sum of a smooth and a convex, possibly nonsmooth, function. We…

Optimization and Control · Mathematics 2025-07-22 Raghu Bollapragada , Shagun Gupta

A New Analysis of Compressive Sensing by Stochastic Proximal Gradient Descent

In this manuscript, we analyze the sparse signal recovery (compressive sensing) problem from the perspective of convex optimization by stochastic proximal gradient descent. This view allows us to significantly simplify the recovery analysis…

Data Structures and Algorithms · Computer Science 2013-04-19 Rong Jin , Tianbao Yang , Shenghuo Zhu

Smooth Optimization Approach for Sparse Covariance Selection

In this paper we first study a smooth optimization approach for solving a class of nonsmooth strictly concave maximization problems whose objective functions admit smooth convex minimization reformulations. In particular, we apply…

Methodology · Statistics 2009-04-07 Zhaosong Lu

spred: Solving $L_1$ Penalty with SGD

We propose to minimize a generic differentiable objective with $L_1$ constraint using a simple reparametrization and straightforward stochastic gradient descent. Our proposal is the direct generalization of previous ideas that the $L_1$…

Machine Learning · Computer Science 2023-07-13 Liu Ziyin , Zihao Wang