English
Related papers

Related papers: Gradient-based Regularization Parameter Selection …

200 papers

Regularization methods allow one to handle a variety of inferential problems where there are more covariates than cases. This allows one to consider a potentially enormous number of covariates for a problem. We exploit the power of these…

Methodology · Statistics 2012-10-03 Yoonkyung Lee , Steven N. MacEachern , Yoonsuh Jung

Penalized (or regularized) regression, as represented by Lasso and its variants, has become a standard technique for analyzing high-dimensional data when the number of variables substantially exceeds the sample size. The performance of…

Methodology · Statistics 2019-08-13 Yunan Wu , Lan Wang

Regularization is used in many different areas of optimization when solutions are sought which not only minimize a given function, but also possess a certain degree of regularity. Popular applications are image denoising, sparse regression…

Optimization and Control · Mathematics 2021-11-15 Bennet Gebken , Katharina Bieker , Sebastian Peitz

In this paper, we provide a mathematical framework for improving generalization in a class of learning problems which is related to point estimations for modeling of high-dimensional nonlinear functions. In particular, we consider a…

Optimization and Control · Mathematics 2024-12-13 Getachew K. Befekadu

For many algorithms, parameter tuning remains a challenging and critical task, which becomes tedious and infeasible in a multi-parameter setting. Multi-penalty regularization, successfully used for solving undetermined sparse regression of…

Machine Learning · Statistics 2017-10-12 Markus Grasmair , Timo Klock , Valeriya Naumova

Hyperparameter selection generally relies on running multiple full training trials, with selection based on validation set performance. We propose a gradient-based approach for locally adjusting hyperparameters during training of the model.…

Machine Learning · Computer Science 2016-06-20 Jelena Luketina , Mathias Berglund , Klaus Greff , Tapani Raiko

Model regularization requires extensive manual tuning to balance complexity against overfitting. Cross-regularization resolves this tradeoff by directly adapting regularization parameters through validation gradients during training. The…

Machine Learning · Computer Science 2025-06-25 Carlos Stein Brito

How to train deep neural networks (DNNs) to generalize well is a central concern in deep learning, especially for severely overparameterized networks nowadays. In this paper, we propose an effective method to improve the model…

Machine Learning · Computer Science 2022-06-28 Yang Zhao , Hao Zhang , Xiuyuan Hu

We study settings where gradient penalties are used alongside risk minimization with the goal of obtaining predictors satisfying different notions of monotonicity. Specifically, we present two sets of contributions. In the first part of the…

Machine Learning · Computer Science 2022-05-18 Joao Monteiro , Mohamed Osama Ahmed , Hossein Hajimirsadeghi , Greg Mori

Deep neural networks exploiting millions of parameters are nowadays the norm in deep learning applications. This is a potential issue because of the great amount of computational resources needed for training, and of the possible loss of…

Computation and Language · Computer Science 2022-10-31 Giovanni Bonetta , Matteo Ribero , Rossella Cancelliere

Common regularization algorithms for linear regression, such as LASSO and Ridge regression, rely on a regularization hyperparameter that balances the tradeoff between minimizing the fitting error and the norm of the learned model…

Machine Learning · Computer Science 2023-11-27 Gabriele Maroni , Loris Cannelli , Dario Piga

The `Signal plus Noise' model for nonparametric regression can be extended to the case of observations taken at the vertices of a graph. This model includes many familiar regression problems. This article discusses the use of the edges of a…

Methodology · Statistics 2009-11-11 Arne Kovac , Andrew D. A. C. Smith

Regularization has long been utilized to learn sparsity in deep neural network pruning. However, its role is mainly explored in the small penalty strength regime. In this work, we extend its application to a new scenario where the…

Computer Vision and Pattern Recognition · Computer Science 2021-04-07 Huan Wang , Can Qin , Yulun Zhang , Yun Fu

There is growing body of learning problems for which it is natural to organize the parameters into matrix, so as to appropriately regularize the parameters under some matrix norm (in order to impose some more sophisticated prior knowledge).…

Machine Learning · Computer Science 2010-10-19 Sham M. Kakade , Shai Shalev-Shwartz , Ambuj Tewari

Generalization is a central problem in Machine Learning. Most prediction methods require careful calibration of hyperparameters carried out on a hold-out \textit{validation} dataset to achieve generalization. The main goal of this paper is…

Machine Learning · Computer Science 2020-06-15 Karim Lounici , Katia Meziani , Benjamin Riu

The ever-increasing number of parameters in deep neural networks poses challenges for memory-limited applications. Regularize-and-prune methods aim at meeting these challenges by sparsifying the network weights. In this context we quantify…

Machine Learning · Computer Science 2018-10-30 Enzo Tartaglione , Skjalg Lepsøy , Attilio Fiandrotti , Gianluca Francini

Regularization is widely used in statistics and machine learning to prevent overfitting and gear solution towards prior information. In general, a regularized estimation problem minimizes the sum of a loss function and a penalty term. The…

Computation · Statistics 2012-01-18 Hua Zhou , Yichao Wu

Estimation in generalized linear models (GLM) is complicated by the presence of constraints. One can handle constraints by maximizing a penalized log-likelihood. Penalties such as the lasso are effective in high dimensions, but often lead…

Machine Learning · Statistics 2017-11-07 Jason Xu , Eric C. Chi , Kenneth Lange

Gaussian graphical models represent the underlying graph structure of conditional dependence between random variables which can be determined using their partial correlation or precision matrix. In a high-dimensional setting, the precision…

Applications · Statistics 2016-05-24 Adria Caballe , Natalia Bochkina , Claus Mayer

Regularization is a critical component in deep learning. The most commonly used approach, weight decay, applies a constant penalty coefficient uniformly across all parameters. This may be overly restrictive for some parameters, while…

Machine Learning · Computer Science 2024-12-10 Jörg K. H. Franke , Michael Hefenbrock , Gregor Koehler , Frank Hutter
‹ Prev 1 2 3 10 Next ›