Related papers: Explicit Eigenvalue Regularization Improves Sharpn…

X-SAM: Boosting Sharpness-Aware Minimization with Dominant-Eigenvector Gradient Correction

Sharpness-Aware Minimization (SAM) aims to improve generalization by minimizing a worst-case perturbed loss over a small neighborhood of model parameters. However, during training, its optimization behavior does not always align with…

Machine Learning · Computer Science 2026-01-16 Hongru Duan , Yongle Chen , Lei Guan

How Does Sharpness-Aware Minimization Minimize Sharpness?

Sharpness-Aware Minimization (SAM) is a highly effective regularization technique for improving the generalization of deep neural networks for various settings. However, the underlying working of SAM remains elusive because of various…

Machine Learning · Computer Science 2023-01-06 Kaiyue Wen , Tengyu Ma , Zhiyuan Li

SAM operates far from home: eigenvalue regularization as a dynamical phenomenon

The Sharpness Aware Minimization (SAM) optimization algorithm has been shown to control large eigenvalues of the loss Hessian and provide generalization benefits in a variety of settings. The original motivation for SAM was a modified loss…

Machine Learning · Computer Science 2023-02-20 Atish Agarwala , Yann N. Dauphin

The Dynamics of Sharpness-Aware Minimization: Bouncing Across Ravines and Drifting Towards Wide Minima

We consider Sharpness-Aware Minimization (SAM), a gradient-based optimization method for deep networks that has exhibited performance improvements on image and language prediction problems. We show that when SAM is applied with a convex…

Machine Learning · Computer Science 2023-04-12 Peter L. Bartlett , Philip M. Long , Olivier Bousquet

A Universal Class of Sharpness-Aware Minimization Algorithms

Recently, there has been a surge in interest in developing optimization algorithms for overparameterized models as achieving generalization is believed to require algorithms with suitable biases. This interest centers on minimizing…

Machine Learning · Computer Science 2026-02-05 Behrooz Tahmasebi , Ashkan Soleymani , Dara Bahri , Stefanie Jegelka , Patrick Jaillet

Sharpness-Aware Minimization for Efficiently Improving Generalization

In today's heavily overparameterized models, the value of the training loss provides few guarantees on model generalization ability. Indeed, optimizing only the training loss value, as is commonly done, can easily lead to suboptimal model…

Machine Learning · Computer Science 2021-04-30 Pierre Foret , Ariel Kleiner , Hossein Mobahi , Behnam Neyshabur

Stabilizing Sharpness-aware Minimization Through A Simple Renormalization Strategy

Recently, sharpness-aware minimization (SAM) has attracted much attention because of its surprising effectiveness in improving generalization performance. However, compared to stochastic gradient descent (SGD), it is more prone to getting…

Machine Learning · Computer Science 2024-09-11 Chengli Tan , Jiangshe Zhang , Junmin Liu , Yicheng Wang , Yunda Hao

Efficient Sharpness-aware Minimization for Improved Training of Neural Networks

Overparametrized Deep Neural Networks (DNNs) often achieve astounding performances, but may potentially result in severe generalization error. Recently, the relation between the sharpness of the loss landscape and the generalization error…

Artificial Intelligence · Computer Science 2022-05-31 Jiawei Du , Hanshu Yan , Jiashi Feng , Joey Tianyi Zhou , Liangli Zhen , Rick Siow Mong Goh , Vincent Y. F. Tan

Sharpness-Aware Minimization with Adaptive Regularization for Training Deep Neural Networks

Sharpness-Aware Minimization (SAM) has proven highly effective in improving model generalization in machine learning tasks. However, SAM employs a fixed hyperparameter associated with the regularization to characterize the sharpness of the…

Machine Learning · Computer Science 2024-12-24 Jinping Zou , Xiaoge Deng , Tao Sun

Unpacking the Implicit Norm Dynamics of Sharpness-Aware Minimization in Tensorized Models

Sharpness-Aware Minimization (SAM) has been proven to be an effective optimization technique for improving generalization in overparameterized models. While prior works have explored the implicit regularization of SAM in simple two-core…

Machine Learning · Computer Science 2025-08-15 Tianxiao Cao , Kyohei Atarashi , Hisashi Kashima

Towards Understanding Sharpness-Aware Minimization

Sharpness-Aware Minimization (SAM) is a recent training method that relies on worst-case weight perturbations which significantly improves generalization in various settings. We argue that the existing justifications for the success of SAM…

Machine Learning · Computer Science 2022-06-14 Maksym Andriushchenko , Nicolas Flammarion

1st-Order Magic: Analysis of Sharpness-Aware Minimization

Sharpness-Aware Minimization (SAM) is an optimization technique designed to improve generalization by favoring flatter loss minima. To achieve this, SAM optimizes a modified objective that penalizes sharpness, using computationally…

Machine Learning · Computer Science 2024-11-05 Nalin Tiwary , Siddarth Aananth

Unveiling m-Sharpness Through the Structure of Stochastic Gradient Noise

Sharpness-aware minimization (SAM) has emerged as a highly effective technique to improve model generalization, but its underlying principles are not fully understood. We investigate m-sharpness, where SAM performance improves monotonically…

Machine Learning · Computer Science 2026-04-03 Haocheng Luo , Mehrtash Harandi , Dinh Phung , Trung Le

Sharpness-Aware Minimization Alone can Improve Adversarial Robustness

Sharpness-Aware Minimization (SAM) is an effective method for improving generalization ability by regularizing loss sharpness. In this paper, we explore SAM in the context of adversarial robustness. We find that using only SAM can achieve…

Machine Learning · Computer Science 2023-07-04 Zeming Wei , Jingyu Zhu , Yihao Zhang

Sharpness-Aware Minimization Leads to Low-Rank Features

Sharpness-aware minimization (SAM) is a recently proposed method that minimizes the sharpness of the training loss of a neural network. While its generalization improvement is well-known and is the primary motivation, we uncover an…

Machine Learning · Computer Science 2023-10-31 Maksym Andriushchenko , Dara Bahri , Hossein Mobahi , Nicolas Flammarion

The Crucial Role of Normalization in Sharpness-Aware Minimization

Sharpness-Aware Minimization (SAM) is a recently proposed gradient-based optimizer (Foret et al., ICLR 2021) that greatly improves the prediction performance of deep neural networks. Consequently, there has been a surge of interest in…

Machine Learning · Computer Science 2023-10-24 Yan Dai , Kwangjun Ahn , Suvrit Sra

Critical Influence of Overparameterization on Sharpness-aware Minimization

Sharpness-Aware Minimization (SAM) has attracted considerable attention for its effectiveness in improving generalization in deep neural network training by explicitly minimizing sharpness in the loss landscape. Its success, however, relies…

Machine Learning · Computer Science 2025-06-16 Sungbin Shin , Dongyeop Lee , Maksym Andriushchenko , Namhoon Lee

CR-SAM: Curvature Regularized Sharpness-Aware Minimization

The capacity to generalize to future unseen data stands as one of the utmost crucial attributes of deep neural networks. Sharpness-Aware Minimization (SAM) aims to enhance the generalizability by minimizing worst-case loss using one-step…

Machine Learning · Computer Science 2023-12-27 Tao Wu , Tie Luo , Donald C. Wunsch

Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization

Recently, flat minima are proven to be effective for improving generalization and sharpness-aware minimization (SAM) achieves state-of-the-art performance. Yet the current definition of flatness discussed in SAM and its follow-ups are…

Machine Learning · Computer Science 2023-07-07 Xingxuan Zhang , Renzhe Xu , Han Yu , Hao Zou , Peng Cui

Sharpness-Aware Minimization Revisited: Weighted Sharpness as a Regularization Term

Deep Neural Networks (DNNs) generalization is known to be closely related to the flatness of minima, leading to the development of Sharpness-Aware Minimization (SAM) for seeking flatter minima and better generalization. In this paper, we…

Machine Learning · Computer Science 2024-12-06 Yun Yue , Jiadi Jiang , Zhiling Ye , Ning Gao , Yongchao Liu , Ke Zhang