Related papers: DR3: Value-Based Deep Reinforcement Learning Requi…

Implicit Regularization of Stochastic Gradient Descent in Natural Language Processing: Observations and Implications

Deep neural networks with remarkably strong generalization performances are usually over-parameterized. Despite explicit regularization strategies are used for practitioners to avoid over-fitting, the impacts are often small. Some…

Computation and Language · Computer Science 2018-11-05 Deren Lei , Zichen Sun , Yijun Xiao , William Yang Wang

Variational Deep Learning via Implicit Regularization

Modern deep learning models generalize remarkably well in-distribution, despite being overparametrized and trained with little to no explicit regularization. Instead, current theory credits implicit regularization imposed by the choice of…

Machine Learning · Computer Science 2026-03-17 Jonathan Wenger , Beau Coker , Juraj Marusic , John P. Cunningham

Bad Global Minima Exist and SGD Can Reach Them

Several works have aimed to explain why overparameterized neural networks generalize well when trained by Stochastic Gradient Descent (SGD). The consensus explanation that has emerged credits the randomized nature of SGD for the bias of the…

Machine Learning · Computer Science 2021-02-24 Shengchao Liu , Dimitris Papailiopoulos , Dimitris Achlioptas

Combining Explicit and Implicit Regularization for Efficient Learning in Deep Networks

Works on implicit regularization have studied gradient trajectories during the optimization process to explain why deep networks favor certain kinds of solutions over others. In deep linear networks, it has been shown that gradient descent…

Machine Learning · Computer Science 2023-06-02 Dan Zhao

Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning

We identify an implicit under-parameterization phenomenon in value-based deep RL methods that use bootstrapping: when value functions, approximated using deep neural networks, are trained with gradient descent using iterated regression onto…

Machine Learning · Computer Science 2021-10-26 Aviral Kumar , Rishabh Agarwal , Dibya Ghosh , Sergey Levine

Estimating Implicit Regularization in Deep Learning

Deep learning systems are known to exhibit implicit regularization (alt. implicit bias), favoring simple solutions instead of merely minimizing the loss function. In some cases, we can analytically derive the implicit regularization --…

Machine Learning · Statistics 2026-05-08 Joseph H. Rudoler , Kevin Tan , Giles Hooker , Konrad P. Kording

Explicit Regularization via Regularizer Mirror Descent

Despite perfectly interpolating the training data, deep neural networks (DNNs) can often generalize fairly well, in part due to the "implicit regularization" induced by the learning algorithm. Nonetheless, various forms of regularization,…

Machine Learning · Computer Science 2022-02-23 Navid Azizan , Sahin Lale , Babak Hassibi

Linear regression with overparameterized linear neural networks: Tight upper and lower bounds for implicit $\ell^1$-regularization

Modern machine learning models are often trained in a setting where the number of parameters exceeds the number of training samples. To understand the implicit bias of gradient descent in such overparameterized models, prior work has…

Machine Learning · Statistics 2025-10-29 Hannes Matt , Dominik Stöger

The Generalization Error of Stochastic Mirror Descent on Over-Parametrized Linear Models

Despite being highly over-parametrized, and having the ability to fully interpolate the training data, deep networks are known to generalize well to unseen data. It is now understood that part of the reason for this is that the training…

Machine Learning · Computer Science 2023-02-21 Danil Akhtiamov , Babak Hassibi

Gradient-Coherent Strong Regularization for Deep Neural Networks

Regularization plays an important role in generalization of deep neural networks, which are often prone to overfitting with their numerous parameters. L1 and L2 regularizers are common regularization tools in machine learning with their…

Machine Learning · Computer Science 2019-10-21 Dae Hoon Park , Chiu Man Ho , Yi Chang , Huaqing Zhang

Model-Based Regularization for Deep Reinforcement Learning with Transcoder Networks

This paper proposes a new optimization objective for value-based deep reinforcement learning. We extend conventional Deep Q-Networks (DQNs) by adding a model-learning component yielding a transcoder network. The prediction errors for the…

Machine Learning · Computer Science 2018-11-21 Felix Leibfried , Peter Vrancx

An Empirical Study of Implicit Regularization in Deep Offline RL

Deep neural networks are the most commonly used function approximators in offline reinforcement learning. Prior works have shown that neural nets trained with TD-learning and gradient descent can exhibit implicit regularization that can be…

Machine Learning · Computer Science 2022-07-08 Caglar Gulcehre , Srivatsan Srinivasan , Jakub Sygnowski , Georg Ostrovski , Mehrdad Farajtabar , Matt Hoffman , Razvan Pascanu , Arnaud Doucet

DL-Reg: A Deep Learning Regularization Technique using Linear Regression

Regularization plays a vital role in the context of deep learning by preventing deep neural networks from the danger of overfitting. This paper proposes a novel deep learning regularization method named as DL-Reg, which carefully reduces…

Machine Learning · Computer Science 2020-11-05 Maryam Dialameh , Ali Hamzeh , Hossein Rahmani

Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning

Deep Reinforcement Learning (RL) methods rely on experience replay to approximate the minibatched supervised learning setting; however, unlike supervised learning where access to lots of training data is crucial to generalization,…

Machine Learning · Computer Science 2021-02-24 Brett Daley , Cameron Hickert , Christopher Amato

Generalization and Regularization in DQN

Deep reinforcement learning algorithms have shown an impressive ability to learn complex control policies in high-dimensional tasks. However, despite the ever-increasing performance on popular benchmarks, policies learned by deep…

Machine Learning · Computer Science 2020-01-22 Jesse Farebrother , Marlos C. Machado , Michael Bowling

The Role of Deep Learning Regularizations on Actors in Offline RL

Deep learning regularization techniques, such as dropout, layer normalization, or weight decay, are widely adopted in the construction of modern artificial neural networks, often resulting in more robust training processes and improved…

Machine Learning · Computer Science 2024-11-22 Denis Tarasov , Anja Surina , Caglar Gulcehre

Efficient Deep Reinforcement Learning Requires Regulating Overfitting

Deep reinforcement learning algorithms that learn policies by trial-and-error must learn from limited amounts of data collected by actively interacting with the environment. While many prior works have shown that proper regularization…

Machine Learning · Computer Science 2023-04-21 Qiyang Li , Aviral Kumar , Ilya Kostrikov , Sergey Levine

Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning

Deep reinforcement learning (RL) agents often fail to generalize to unseen environments (yet semantically similar to trained agents), particularly when they are trained on high-dimensional state spaces, such as images. In this paper, we…

Machine Learning · Computer Science 2020-02-18 Kimin Lee , Kibok Lee , Jinwoo Shin , Honglak Lee

Deep Reinforcement Learning with Decorrelation

Learning an effective representation for high-dimensional data is a challenging problem in reinforcement learning (RL). Deep reinforcement learning (DRL) such as Deep Q networks (DQN) achieves remarkable success in computer games by…

Machine Learning · Computer Science 2019-05-10 Borislav Mavrin , Hengshuai Yao , Linglong Kong

Conflicting Biases at the Edge of Stability: Norm versus Sharpness Regularization

A widely believed explanation for the remarkable generalization capacities of overparameterized neural networks is that the optimization algorithms used for training induce an implicit bias towards benign solutions. To grasp this…

Machine Learning · Computer Science 2025-12-19 Maria Matveev , Vit Fojtik , Hung-Hsu Chou , Gitta Kutyniok , Johannes Maly