Related papers: Gradient-based Regularization Parameter Selection …

Regularization of Case-Specific Parameters for Robustness and Efficiency

Regularization methods allow one to handle a variety of inferential problems where there are more covariates than cases. This allows one to consider a potentially enormous number of covariates for a problem. We exploit the power of these…

Methodology · Statistics 2012-10-03 Yoonkyung Lee , Steven N. MacEachern , Yoonsuh Jung

A Survey of Tuning Parameter Selection for High-dimensional Regression

Penalized (or regularized) regression, as represented by Lasso and its variants, has become a standard technique for analyzing high-dimensional data when the number of variables substantially exceeds the sample size. The performance of…

Methodology · Statistics 2019-08-13 Yunan Wu , Lan Wang

On the structure of regularization paths for piecewise differentiable regularization terms

Regularization is used in many different areas of optimization when solutions are sought which not only minimize a given function, but also possess a certain degree of regularity. Popular applications are image denoising, sparse regression…

Optimization and Control · Mathematics 2021-11-15 Bennet Gebken , Katharina Bieker , Sebastian Peitz

On improving generalization in a class of learning problems with the method of small parameters for weakly-controlled optimal gradient systems

In this paper, we provide a mathematical framework for improving generalization in a class of learning problems which is related to point estimations for modeling of high-dimensional nonlinear functions. In particular, we consider a…

Optimization and Control · Mathematics 2024-12-13 Getachew K. Befekadu

Adaptive multi-penalty regularization based on a generalized Lasso path

For many algorithms, parameter tuning remains a challenging and critical task, which becomes tedious and infeasible in a multi-parameter setting. Multi-penalty regularization, successfully used for solving undetermined sparse regression of…

Machine Learning · Statistics 2017-10-12 Markus Grasmair , Timo Klock , Valeriya Naumova

Scalable Gradient-Based Tuning of Continuous Regularization Hyperparameters

Hyperparameter selection generally relies on running multiple full training trials, with selection based on validation set performance. We propose a gradient-based approach for locally adjusting hyperparameters during training of the model.…

Machine Learning · Computer Science 2016-06-20 Jelena Luketina , Mathias Berglund , Klaus Greff , Tapani Raiko

Cross-regularization: Adaptive Model Complexity through Validation Gradients

Model regularization requires extensive manual tuning to balance complexity against overfitting. Cross-regularization resolves this tradeoff by directly adapting regularization parameters through validation gradients during training. The…

Machine Learning · Computer Science 2025-06-25 Carlos Stein Brito

Penalizing Gradient Norm for Efficiently Improving Generalization in Deep Learning

How to train deep neural networks (DNNs) to generalize well is a central concern in deep learning, especially for severely overparameterized networks nowadays. In this paper, we propose an effective method to improve the model…

Machine Learning · Computer Science 2022-06-28 Yang Zhao , Hao Zhang , Xiuyuan Hu

Monotonicity Regularization: Improved Penalties and Novel Applications to Disentangled Representation Learning and Robust Classification

We study settings where gradient penalties are used alongside risk minimization with the goal of obtaining predictors satisfying different notions of monotonicity. Specifically, we present two sets of contributions. In the first part of the…

Machine Learning · Computer Science 2022-05-18 Joao Monteiro , Mohamed Osama Ahmed , Hossein Hajimirsadeghi , Greg Mori

Regularization-based Pruning of Irrelevant Weights in Deep Neural Architectures

Deep neural networks exploiting millions of parameters are nowadays the norm in deep learning applications. This is a potential issue because of the great amount of computational resources needed for training, and of the possible loss of…

Computation and Language · Computer Science 2022-10-31 Giovanni Bonetta , Matteo Ribero , Rossella Cancelliere

Gradient-based bilevel optimization for multi-penalty Ridge regression through matrix differential calculus

Common regularization algorithms for linear regression, such as LASSO and Ridge regression, rely on a regularization hyperparameter that balances the tradeoff between minimizing the fitting error and the norm of the learned model…

Machine Learning · Computer Science 2023-11-27 Gabriele Maroni , Loris Cannelli , Dario Piga

Regression on a Graph

The `Signal plus Noise' model for nonparametric regression can be extended to the case of observations taken at the vertices of a graph. This model includes many familiar regression problems. This article discusses the use of the edges of a…

Methodology · Statistics 2009-11-11 Arne Kovac , Andrew D. A. C. Smith

Neural Pruning via Growing Regularization

Regularization has long been utilized to learn sparsity in deep neural network pruning. However, its role is mainly explored in the small penalty strength regime. In this work, we extend its application to a new scenario where the…

Computer Vision and Pattern Recognition · Computer Science 2021-04-07 Huan Wang , Can Qin , Yulun Zhang , Yun Fu

Regularization Techniques for Learning with Matrices

There is growing body of learning problems for which it is natural to organize the parameters into matrix, so as to appropriately regularize the parameters under some matrix norm (in order to impose some more sophisticated prior knowledge).…

Machine Learning · Computer Science 2010-10-19 Sham M. Kakade , Shai Shalev-Shwartz , Ambuj Tewari

Optimizing generalization on the train set: a novel gradient-based framework to train parameters and hyperparameters simultaneously

Generalization is a central problem in Machine Learning. Most prediction methods require careful calibration of hyperparameters carried out on a hold-out \textit{validation} dataset to achieve generalization. The main goal of this paper is…

Machine Learning · Computer Science 2020-06-15 Karim Lounici , Katia Meziani , Benjamin Riu

Learning Sparse Neural Networks via Sensitivity-Driven Regularization

The ever-increasing number of parameters in deep neural networks poses challenges for memory-limited applications. Regularize-and-prune methods aim at meeting these challenges by sparsifying the network weights. In this context we quantify…

Machine Learning · Computer Science 2018-10-30 Enzo Tartaglione , Skjalg Lepsøy , Attilio Fiandrotti , Gianluca Francini

A Generic Path Algorithm for Regularized Statistical Estimation

Regularization is widely used in statistics and machine learning to prevent overfitting and gear solution towards prior information. In general, a regularized estimation problem minimizes the sum of a loss function and a penalty term. The…

Computation · Statistics 2012-01-18 Hua Zhou , Yichao Wu

Generalized Linear Model Regression under Distance-to-set Penalties

Estimation in generalized linear models (GLM) is complicated by the presence of constraints. One can handle constraints by maximizing a penalized log-likelihood. Penalties such as the lasso are effective in high dimensions, but often lead…

Machine Learning · Statistics 2017-11-07 Jason Xu , Eric C. Chi , Kenneth Lange

Selection of the Regularization Parameter in Graphical Models using Network Characteristics

Gaussian graphical models represent the underlying graph structure of conditional dependence between random variables which can be determined using their partial correlation or precision matrix. In a high-dimensional setting, the precision…

Applications · Statistics 2016-05-24 Adria Caballe , Natalia Bochkina , Claus Mayer

Improving Deep Learning Optimization through Constrained Parameter Regularization

Regularization is a critical component in deep learning. The most commonly used approach, weight decay, applies a constant penalty coefficient uniformly across all parameters. This may be overly restrictive for some parameters, while…

Machine Learning · Computer Science 2024-12-10 Jörg K. H. Franke , Michael Hefenbrock , Gregor Koehler , Frank Hutter