Related papers: Learning with incremental iterative regularization

Optimal Rates for Multi-pass Stochastic Gradient Methods

We analyze the learning properties of the stochastic gradient method when multiple passes over the data and mini-batches are allowed. We study how regularization properties are controlled by the step-size, the number of passes and the…

Machine Learning · Computer Science 2019-03-18 Junhong Lin, Lorenzo Rosasco

Iterate averaging as regularization for stochastic gradient descent

We propose and analyze a variant of the classic Polyak-Ruppert averaging scheme, broadly used in stochastic gradient methods. Rather than a uniform average of the iterates, we consider a weighted average, with weights decaying in a…

Machine Learning · Computer Science 2018-02-23 Gergely Neu, Lorenzo Rosasco

Generalization Properties and Implicit Regularization for Multiple Passes SGM

We study the generalization properties of stochastic gradient methods for learning with convex loss functions and linearly parameterized functions. We show that, in the absence of penalizations or constraints, the stability and…

Machine Learning · Computer Science 2016-05-27 Junhong Lin, Raffaello Camoriano, Lorenzo Rosasco

Iterative Regularization for Learning with Convex Loss Functions

We consider the problem of supervised learning with convex loss functions and propose a new form of iterative regularization based on the subgradient method. Unlike other regularization approaches, in iterative regularization no constraint…

Machine Learning · Statistics 2015-04-02 Junhong Lin, Lorenzo Rosasco, Ding-Xuan Zhou

Iterative regularization via dual diagonal descent

In the context of linear inverse problems, we propose and study a general iterative regularization method that accommodates large classes of regularizers and data-fit terms. The algorithm we propose is based on a primal-dual diagonal…

Optimization and Control · Mathematics 2017-08-04 Guillaume Garrigos, Lorenzo Rosasco, Silvia Villa

Efficient learning with robust gradient descent

Minimizing the empirical risk is a popular training strategy, but for learning tasks where the data may be noisy or heavy-tailed, one may require many observations in order to generalize well. To achieve better performance under less…

Machine Learning · Statistics 2018-10-16 Matthew J. Holland, Kazushi Ikeda

Regularization-Induced Bias and Consistency in Recursive Least Squares

Within the context of recursive least squares (RLS) parameter estimation, the goal of the present paper is to study the effect of regularization-induced bias on the transient and asymptotic accuracy of the parameter estimates. We consider…

Systems and Control · Electrical Eng. & Systems 2021-09-01 Brian Lai, Syed Aseem Ul Islam, Dennis S. Bernstein

On the filtering effect of iterative regularization algorithms for linear least-squares problems

Many real-world applications are addressed through a linear least-squares problem formulation, whose solution is calculated by means of an iterative approach. A large number of studies have been carried out in the optimization field to…

Numerical Analysis · Mathematics 2013-11-25 Anastasia Cornelio, Federica Porta, Marco Prato, Luca Zanni

On the Regularizing Property of Stochastic Gradient Descent

Stochastic gradient descent is one of the most successful approaches for solving large-scale problems, especially in machine learning and statistics. At each iteration, it employs an unbiased estimator of the full gradient computed from one…

Numerical Analysis · Mathematics 2018-12-05 Bangti Jin, Xiliang Lu

Implicit Regularization of Discrete Gradient Dynamics in Linear Neural Networks

When optimizing over-parameterized models, such as deep neural networks, a large set of parameters can achieve zero training error. In such cases, the choice of the optimization algorithm and its respective hyper-parameters introduces…

Machine Learning · Computer Science 2019-12-06 Gauthier Gidel, Francis Bach, Simon Lacoste-Julien

Iterative Averaging in the Quest for Best Test Error

We analyse and explain the increased generalisation performance of iterate averaging using a Gaussian process perturbation model between the true and batch risk surface on the high dimensional quadratic. We derive three phenomena…

Machine Learning · Statistics 2021-11-02 Diego Granziol, Xingchen Wan, Samuel Albanie, Stephen Roberts

Effective New Methods for Automated Parameter Selection in Regularized Inverse Problems

The choice of the parameter value for regularized inverse problems is critical to the results and remains a topic of interest. This article explores a criterion for selecting a good parameter value by maximizing the probability of the data,…

Numerical Analysis · Mathematics 2020-02-11 Toby Sanders, Rodrigo B. Platte, Robert D. Skeel

Iterative regularization in classification via hinge loss diagonal descent

Iterative regularization is a classic idea in regularization theory that has recently become popular in machine learning. On the one hand, it allows one to design efficient algorithms controlling at the same time numerical and statistical…

Machine Learning · Statistics 2024-10-10 Vassilis Apidopoulos, Tomaso Poggio, Lorenzo Rosasco, Silvia Villa

Last Iterate Convergence of Incremental Methods and Applications in Continual Learning

Incremental gradient and incremental proximal methods are a fundamental class of optimization algorithms used for solving finite sum problems, broadly studied in the literature. Yet, without strong convexity, their convergence guarantees…

Optimization and Control · Mathematics 2024-07-01 Xufeng Cai, Jelena Diakonikolas

EILearn: Learning Incrementally Using Previous Knowledge Obtained From an Ensemble of Classifiers

We propose an algorithm for incremental learning of classifiers. The proposed method enables an ensemble of classifiers to learn incrementally by accommodating new training data. We use an effective mechanism to overcome the…

Machine Learning · Computer Science 2019-02-11 Shivang Agarwal, C. Ravindranath Chowdary, Shripriya Maheshwari

A randomized progressive iterative regularization method for data fitting problems

In this work, we investigate data fitting problems with random noises. A randomized progressive iterative regularization method is proposed. It works well for large-scale matrix computations and converges in expectation to the least-squares…

Numerical Analysis · Mathematics 2025-06-05 Dakang Cen, Wenlong Zhang, Junbin Zhong

The Statistical Complexity of Early-Stopped Mirror Descent

Recently there has been a surge of interest in understanding implicit regularization properties of iterative gradient-based optimization algorithms. In this paper, we study the statistical guarantees on the excess risk achieved by…

Machine Learning · Statistics 2020-08-28 Tomas Vaškevičius, Varun Kanade, Patrick Rebeschini

Iterative Reweighted Least Squares Networks With Convergence Guarantees for Solving Inverse Imaging Problems

In this work we present a novel optimization strategy for image reconstruction tasks under analysis-based image regularization, which promotes sparse and/or low-rank solutions in some learned transform domain. We parameterize such…

Computer Vision and Pattern Recognition · Computer Science 2023-08-11 Iaroslav Koshelev, Stamatios Lefkimmiatis

Regularization Techniques for Learning with Matrices

There is a growing body of learning problems for which it is natural to organize the parameters into a matrix, so as to appropriately regularize the parameters under some matrix norm (in order to impose some more sophisticated prior knowledge).…

Machine Learning · Computer Science 2010-10-19 Sham M. Kakade, Shai Shalev-Shwartz, Ambuj Tewari

An Iteratively Reweighted Least Squares Algorithm for Sparse Regularization

We present a new algorithm and the corresponding convergence analysis for the regularization of linear inverse problems with sparsity constraints, applied to a new generalized sparsity promoting functional. The algorithm is based on the…

Numerical Analysis · Mathematics 2016-12-30 Sergey Voronin, Ingrid Daubechies