Related papers: Randomized Sharpness-Aware Training for Boosting C…

Efficient Sharpness-aware Minimization for Improved Training of Neural Networks

Overparametrized Deep Neural Networks (DNNs) often achieve astounding performances, but may potentially result in severe generalization error. Recently, the relation between the sharpness of the loss landscape and the generalization error…

Artificial Intelligence · Computer Science 2022-05-31 Jiawei Du , Hanshu Yan , Jiashi Feng , Joey Tianyi Zhou , Liangli Zhen , Rick Siow Mong Goh , Vincent Y. F. Tan

Efficient Generalization Improvement Guided by Random Weight Perturbation

To fully uncover the great potential of deep neural networks (DNNs), various learning algorithms have been developed to improve the model's generalization ability. Recently, sharpness-aware minimization (SAM) establishes a generic scheme…

Computer Vision and Pattern Recognition · Computer Science 2022-11-22 Tao Li , Weihao Yan , Zehao Lei , Yingwen Wu , Kun Fang , Ming Yang , Xiaolin Huang

Sharpness-Aware Training for Free

Modern deep neural networks (DNNs) have achieved state-of-the-art performances but are typically over-parameterized. The over-parameterization may result in undesirably large generalization error in the absence of other customized training…

Machine Learning · Computer Science 2023-03-03 Jiawei Du , Daquan Zhou , Jiashi Feng , Vincent Y. F. Tan , Joey Tianyi Zhou

Stabilizing Sharpness-aware Minimization Through A Simple Renormalization Strategy

Recently, sharpness-aware minimization (SAM) has attracted much attention because of its surprising effectiveness in improving generalization performance. However, compared to stochastic gradient descent (SGD), it is more prone to getting…

Machine Learning · Computer Science 2024-09-11 Chengli Tan , Jiangshe Zhang , Junmin Liu , Yicheng Wang , Yunda Hao

Sharpness Aware Surrogate Training for Spiking Neural Networks

Surrogate gradients are a standard tool for training spiking neural networks (SNNs), but conventional hard forward or surrogate backward training couples a nonsmooth forward model with a biased gradient estimator. We study sharpness aware…

Neural and Evolutionary Computing · Computer Science 2026-03-20 Maximilian Nicholson

Unveiling m-Sharpness Through the Structure of Stochastic Gradient Noise

Sharpness-aware minimization (SAM) has emerged as a highly effective technique to improve model generalization, but its underlying principles are not fully understood. We investigate m-sharpness, where SAM performance improves monotonically…

Machine Learning · Computer Science 2026-04-03 Haocheng Luo , Mehrtash Harandi , Dinh Phung , Trung Le

Convergence of Sharpness-Aware Minimization Algorithms using Increasing Batch Size and Decaying Learning Rate

The sharpness-aware minimization (SAM) algorithm and its variants, including gap guided SAM (GSAM), have been successful at improving the generalization capability of deep neural network models by finding flat local minima of the empirical…

Machine Learning · Computer Science 2024-09-17 Hinata Harada , Hideaki Iiduka

Sharpness-Aware Surrogate Training for On-Sensor Spiking Neural Networks

Spiking neural networks (SNNs) are a natural computational model for on-sensor and near-sensor vision, where event driven processors must operate under strict power budgets with hard binary spikes. However, models trained with surrogate…

Neural and Evolutionary Computing · Computer Science 2026-04-14 Maximilian Nicholson

Towards Understanding The Calibration Benefits of Sharpness-Aware Minimization

Deep neural networks have been increasingly used in safety-critical applications such as medical diagnosis and autonomous driving. However, many studies suggest that they are prone to being poorly calibrated and have a propensity for…

Machine Learning · Computer Science 2025-06-02 Chengli Tan , Yubo Zhou , Haishan Ye , Guang Dai , Junmin Liu , Zengjie Song , Jiangshe Zhang , Zixiang Zhao , Yunda Hao , Yong Xu

Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach

Deep neural networks often suffer from poor generalization caused by complex and non-convex loss landscapes. One of the popular solutions is Sharpness-Aware Minimization (SAM), which smooths the loss landscape via minimizing the maximized…

Machine Learning · Computer Science 2022-10-25 Peng Mi , Li Shen , Tianhe Ren , Yiyi Zhou , Xiaoshuai Sun , Rongrong Ji , Dacheng Tao

Adaptively Sampling-Reusing-Mixing Decomposed Gradients to Speed Up Sharpness Aware Minimization

Sharpness-Aware Minimization (SAM) improves model generalization but doubles the computational cost of Stochastic Gradient Descent (SGD) by requiring twice the gradient calculations per optimization step. To mitigate this, we propose…

Computer Vision and Pattern Recognition · Computer Science 2025-10-07 Jiaxin Deng , Junbiao Pang

Sharpness-Aware Minimization with Dynamic Reweighting

Deep neural networks are often overparameterized and may not easily achieve model generalization. Adversarial training has shown effectiveness in improving generalization by regularizing the change of loss on top of adversarially chosen…

Machine Learning · Computer Science 2022-12-07 Wenxuan Zhou , Fangyu Liu , Huan Zhang , Muhao Chen

Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization

Deep reinforcement learning (DRL) has significantly advanced the field of combinatorial optimization (CO). However, its practicality is hindered by the necessity for a large number of reward evaluations, especially in scenarios involving…

Machine Learning · Computer Science 2024-07-18 Hyeonah Kim , Minsu Kim , Sungsoo Ahn , Jinkyoo Park

Asynchronous Sharpness-Aware Minimization For Fast and Accurate Deep Learning

Sharpness-Aware Minimization (SAM) is an optimization method that improves generalization performance of machine learning models. Despite its superior generalization, SAM has not been actively used in real-world applications due to its…

Machine Learning · Computer Science 2025-03-17 Junhyuk Jo , Jihyun Lim , Sunwoo Lee

K-SAM: Sharpness-Aware Minimization at the Speed of SGD

Sharpness-Aware Minimization (SAM) has recently emerged as a robust technique for improving the accuracy of deep neural networks. However, SAM incurs a high computational cost in practice, requiring up to twice as much computation as…

Machine Learning · Computer Science 2022-10-25 Renkun Ni , Ping-yeh Chiang , Jonas Geiping , Micah Goldblum , Andrew Gordon Wilson , Tom Goldstein

Distributionally Robust Deep Learning using Hardness Weighted Sampling

Limiting failures of machine learning systems is of paramount importance for safety-critical applications. In order to improve the robustness of machine learning systems, Distributionally Robust Optimization (DRO) has been proposed as a…

Machine Learning · Computer Science 2023-07-04 Lucas Fidon , Michael Aertsen , Thomas Deprest , Doaa Emam , Frédéric Guffens , Nada Mufti , Esther Van Elslander , Ernst Schwartz , Michael Ebner , Daniela Prayer , Gregor Kasprian , Anna L. David , Andrew Melbourne , Sébastien Ourselin , Jan Deprest , Georg Langs , Tom Vercauteren

Sharpness-Aware Minimization in Genetic Programming

Sharpness-Aware Minimization (SAM) was recently introduced as a regularization procedure for training deep neural networks. It simultaneously minimizes the fitness (or loss) function and the so-called fitness sharpness. The latter serves as…

Neural and Evolutionary Computing · Computer Science 2024-05-20 Illya Bakurov , Nathan Haut , Wolfgang Banzhaf

Systematic Investigation of Sparse Perturbed Sharpness-Aware Minimization Optimizer

Deep neural networks often suffer from poor generalization due to complex and non-convex loss landscapes. Sharpness-Aware Minimization (SAM) is a popular solution that smooths the loss landscape by minimizing the maximized change of…

Artificial Intelligence · Computer Science 2023-07-03 Peng Mi , Li Shen , Tianhe Ren , Yiyi Zhou , Tianshuo Xu , Xiaoshuai Sun , Tongliang Liu , Rongrong Ji , Dacheng Tao

SALR: Sharpness-aware Learning Rate Scheduler for Improved Generalization

In an effort to improve generalization in deep learning and automate the process of learning rate scheduling, we propose SALR: a sharpness-aware learning rate update technique designed to recover flat minimizers. Our method dynamically…

Machine Learning · Computer Science 2023-07-04 Xubo Yue , Maher Nouiehed , Raed Al Kontar

Effective Gradient Sample Size via Variation Estimation for Accelerating Sharpness aware Minimization

Sharpness-aware Minimization (SAM) has been proposed recently to improve model generalization ability. However, SAM calculates the gradient twice in each optimization step, thereby doubling the computation costs compared to stochastic…

Computer Vision and Pattern Recognition · Computer Science 2024-03-15 Jiaxin Deng , Junbiao Pang , Baochang Zhang , Tian Wang