Related papers: How Does Sharpness-Aware Minimization Minimize Sha…

200 papers

Sharpness-Aware Minimization (SAM) is a recent training method that relies on worst-case weight perturbations which significantly improves generalization in various settings. We argue that the existing justifications for the success of SAM…

Machine Learning · Computer Science 2022-06-14 Maksym Andriushchenko, Nicolas Flammarion

Sharpness-Aware Minimization (SAM) is an optimization technique designed to improve generalization by favoring flatter loss minima. To achieve this, SAM optimizes a modified objective that penalizes sharpness, using computationally…

Machine Learning · Computer Science 2024-11-05 Nalin Tiwary, Siddarth Aananth

Sharpness-aware minimization (SAM) is a recently proposed method that minimizes the sharpness of the training loss of a neural network. While its generalization improvement is well-known and is the primary motivation, we uncover an…

Machine Learning · Computer Science 2023-10-31 Maksym Andriushchenko, Dara Bahri, Hossein Mobahi, Nicolas Flammarion

Sharpness-Aware Minimization (SAM) is a recently proposed gradient-based optimizer (Foret et al., ICLR 2021) that greatly improves the prediction performance of deep neural networks. Consequently, there has been a surge of interest in…

Machine Learning · Computer Science 2023-10-24 Yan Dai, Kwangjun Ahn, Suvrit Sra

Sharpness-Aware Minimization (SAM) is a recent optimization framework aiming to improve the deep neural network generalization, through obtaining flatter (i.e. less sharp) solutions. As SAM has been numerically successful, recent papers…

Machine Learning · Statistics 2023-05-22 Kayhan Behdin, Rahul Mazumder

Deep neural networks have been increasingly used in safety-critical applications such as medical diagnosis and autonomous driving. However, many studies suggest that they are prone to being poorly calibrated and have a propensity for…

Machine Learning · Computer Science 2025-06-02 Chengli Tan, Yubo Zhou, Haishan Ye, Guang Dai, Junmin Liu, Zengjie Song, Jiangshe Zhang, Zixiang Zhao, Yunda Hao, Yong Xu

Sharpness-Aware Minimization (SAM) aims to improve generalization by minimizing a worst-case perturbed loss over a small neighborhood of model parameters. However, during training, its optimization behavior does not always align with…

Machine Learning · Computer Science 2026-01-16 Hongru Duan, Yongle Chen, Lei Guan

We consider Sharpness-Aware Minimization (SAM), a gradient-based optimization method for deep networks that has exhibited performance improvements on image and language prediction problems. We show that when SAM is applied with a convex…

Machine Learning · Computer Science 2023-04-12 Peter L. Bartlett, Philip M. Long, Olivier Bousquet

Sharpness-Aware Minimization (SAM) has emerged as a promising alternative optimizer to stochastic gradient descent (SGD). The originally-proposed motivation behind SAM was to bias neural networks towards flatter minima that are believed to…

Machine Learning · Computer Science 2024-06-03 Jacob Mitchell Springer, Vaishnavh Nagarajan, Aditi Raghunathan

Sharpness-Aware Minimization (SAM) has attracted significant attention for its effectiveness in improving generalization across various tasks. However, its underlying principles remain poorly understood. In this work, we analyze SAM's…

Machine Learning · Computer Science 2025-01-23 Haocheng Luo, Tuan Truong, Tung Pham, Mehrtash Harandi, Dinh Phung, Trung Le

The paper investigates the fundamental convergence properties of Sharpness-Aware Minimization (SAM), a recently proposed gradient-based optimization method [Foret et al., 2021] that significantly improves the generalization of deep neural…

Optimization and Control · Mathematics 2024-10-22 Pham Duy Khanh, Hoang-Chau Luong, Boris S. Mordukhovich, Dat Ba Tran

Sharpness-Aware Minimization (SAM) has proven highly effective in improving model generalization in machine learning tasks. However, SAM employs a fixed hyperparameter associated with the regularization to characterize the sharpness of the…

Machine Learning · Computer Science 2024-12-24 Jinping Zou, Xiaoge Deng, Tao Sun

Sharpness-aware minimization (SAM) was proposed to reduce sharpness of minima and has been shown to enhance generalization performance in various settings. In this work we show that perturbing only the affine normalization parameters…

Machine Learning · Computer Science 2023-11-20 Maximilian Mueller, Tiffany Vlaar, David Rolnick, Matthias Hein

Deep Neural Networks (DNNs) generalization is known to be closely related to the flatness of minima, leading to the development of Sharpness-Aware Minimization (SAM) for seeking flatter minima and better generalization. In this paper, we…

Machine Learning · Computer Science 2024-12-06 Yun Yue, Jiadi Jiang, Zhiling Ye, Ning Gao, Yongchao Liu, Ke Zhang

In today's heavily overparameterized models, the value of the training loss provides few guarantees on model generalization ability. Indeed, optimizing only the training loss value, as is commonly done, can easily lead to suboptimal model…

Machine Learning · Computer Science 2021-04-30 Pierre Foret, Ariel Kleiner, Hossein Mobahi, Behnam Neyshabur

Sharpness-Aware Minimization (SAM) is an effective method for improving generalization ability by regularizing loss sharpness. In this paper, we explore SAM in the context of adversarial robustness. We find that using only SAM can achieve…

Machine Learning · Computer Science 2023-07-04 Zeming Wei, Jingyu Zhu, Yihao Zhang

Sharpness-Aware Minimization (SAM) has substantially improved the generalization of neural networks under various settings. Despite the success, its effectiveness remains poorly understood. In this work, we discover an intriguing phenomenon…

Machine Learning · Computer Science 2025-02-21 Zhanpeng Zhou, Mingze Wang, Yuchen Mao, Bingrui Li, Junchi Yan

Recently, sharpness-aware minimization (SAM) has attracted much attention because of its surprising effectiveness in improving generalization performance. However, compared to stochastic gradient descent (SGD), it is more prone to getting…

Machine Learning · Computer Science 2024-09-11 Chengli Tan, Jiangshe Zhang, Junmin Liu, Yicheng Wang, Yunda Hao

Sharpness-Aware Minimization (SAM) and adaptive sharpness-aware minimization (ASAM) aim to improve model generalization. In this project, we proposed three experiments to validate their generalization from the sharpness aware…

Machine Learning · Computer Science 2022-08-16 Jozef Marus Coldenhoff, Chengkun Li, Yurui Zhu

Overparametrized Deep Neural Networks (DNNs) often achieve astounding performances, but may potentially result in severe generalization error. Recently, the relation between the sharpness of the loss landscape and the generalization error…

Artificial Intelligence · Computer Science 2022-05-31 Jiawei Du, Hanshu Yan, Jiashi Feng, Joey Tianyi Zhou, Liangli Zhen, Rick Siow Mong Goh, Vincent Y. F. Tan