Related papers: Selective Attention-based Modulation for Continual…

SAM: A Self-adaptive Attention Module for Context-Aware Recommendation System

Recently, textual information has been proved to play a positive role in recommendation systems. However, most of the existing methods only focus on representation learning of textual information in ratings, while potential selection bias…

Information Retrieval · Computer Science 2021-10-14 Jiabin Liu , Zheng Wei , Zhengpin Li , Xiaojun Mao , Jian Wang , Zhongyu Wei , Qi Zhang

Self-Attention Meta-Learner for Continual Learning

Continual learning aims to provide intelligent agents capable of learning multiple tasks sequentially with neural networks. One of its main challenging, catastrophic forgetting, is caused by the neural networks non-optimal ability to learn…

Machine Learning · Computer Science 2021-01-29 Ghada Sokar , Decebal Constantin Mocanu , Mykola Pechenizkiy

Sequential Attention Module for Natural Language Processing

Recently, large pre-trained neural language models have attained remarkable performance on many downstream natural language processing (NLP) applications via fine-tuning. In this paper, we target at how to further improve the token…

Artificial Intelligence · Computer Science 2021-09-08 Mengyuan Zhou , Jian Ma , Haiqin Yang , Lianxin Jiang , Yang Mo

Modality-Aware SAM: Sharpness-Aware-Minimization Driven Gradient Modulation for Harmonized Multimodal Learning

In multimodal learning, dominant modalities often overshadow others, limiting generalization. We propose Modality-Aware Sharpness-Aware Minimization (M-SAM), a model-agnostic framework that applies to many modalities and supports early and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-30 Hossein R. Nowdeh , Jie Ji , Xiaolong Ma , Fatemeh Afghah

Improving Fine-Grained Visual Recognition in Low Data Regimes via Self-Boosting Attention Mechanism

The challenge of fine-grained visual recognition often lies in discovering the key discriminative regions. While such regions can be automatically identified from a large-scale labeled dataset, a similar method might become less effective…

Computer Vision and Pattern Recognition · Computer Science 2022-08-02 Yangyang Shu , Baosheng Yu , Haiming Xu , Lingqiao Liu

Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation

Foundational models such as the Segment Anything Model (SAM) are gaining traction in medical imaging segmentation, supporting multiple downstream tasks. However, such models are supervised in nature, still relying on large annotated…

Computer Vision and Pattern Recognition · Computer Science 2025-03-07 Aishik Konwer , Zhijian Yang , Erhan Bas , Cao Xiao , Prateek Prasanna , Parminder Bhatia , Taha Kass-Hout

BAM: Bottleneck Attention Module

Recent advances in deep neural networks have been developed via architecture search for stronger representational power. In this work, we focus on the effect of attention in general deep neural networks. We propose a simple and effective…

Computer Vision and Pattern Recognition · Computer Science 2018-07-19 Jongchan Park , Sanghyun Woo , Joon-Young Lee , In So Kweon

Sharpness-Aware Minimization Enhances Feature Quality via Balanced Learning

Sharpness-Aware Minimization (SAM) has emerged as a promising alternative optimizer to stochastic gradient descent (SGD). The originally-proposed motivation behind SAM was to bias neural networks towards flatter minima that are believed to…

Machine Learning · Computer Science 2024-06-03 Jacob Mitchell Springer , Vaishnavh Nagarajan , Aditi Raghunathan

Divisive Decisions: Improving Salience-Based Training for Generalization in Binary Classification Tasks

Existing saliency-guided training approaches improve model generalization by incorporating a loss term that compares the model's class activation map (CAM) for a sample's true-class ({\it i.e.}, correct-label class) against a human…

Computer Vision and Pattern Recognition · Computer Science 2025-07-24 Jacob Piland , Chris Sweet , Adam Czajka

SAM: Semantic Attribute Modulation for Language Modeling and Style Variation

This paper presents a Semantic Attribute Modulation (SAM) for language modeling and style variation. The semantic attribute modulation includes various document attributes, such as titles, authors, and document categories. We consider two…

Computation and Language · Computer Science 2017-09-15 Wenbo Hu , Lifeng Hua , Lei Li , Hang Su , Tian Wang , Ning Chen , Bo Zhang

Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation

User interests are usually dynamic in the real world, which poses both theoretical and practical challenges for learning accurate preferences from rich behavior data. Among existing user behavior modeling solutions, attention networks are…

Information Retrieval · Computer Science 2022-04-14 Chao Chen , Haoyu Geng , Nianzu Yang , Junchi Yan , Daiyue Xue , Jianping Yu , Xiaokang Yang

Saliency-Augmented Memory Completion for Continual Learning

Continual Learning is considered a key step toward next-generation Artificial Intelligence. Among various methods, replay-based approaches that maintain and replay a small episodic memory of previous samples are one of the most successful…

Machine Learning · Computer Science 2022-12-27 Guangji Bai , Chen Ling , Yuyang Gao , Liang Zhao

Learning to Select Like Humans: Explainable Active Learning for Medical Imaging

Medical image analysis requires substantial labeled data for model training, yet expert annotation is expensive and time-consuming. Active learning (AL) addresses this challenge by strategically selecting the most informative samples for…

Image and Video Processing · Electrical Eng. & Systems 2026-03-06 Ifrat Ikhtear Uddin , Longwei Wang , Xiao Qin , Yang Zhou , KC Santosh

Switchable Self-attention Module

Attention mechanism has gained great success in vision recognition. Many works are devoted to improving the effectiveness of attention mechanism, which finely design the structure of the attention operator. These works need lots of…

Computer Vision and Pattern Recognition · Computer Science 2022-09-14 Shanshan Zhong , Wushao Wen , Jinghui Qin

Pre-training Attention Mechanisms

Recurrent neural networks with differentiable attention mechanisms have had success in generative and classification tasks. We show that the classification performance of such models can be enhanced by guiding a randomly initialized model…

Machine Learning · Computer Science 2017-12-18 Jack Lindsey

Spectrum Attention Mechanism for Time Series Classification

Time series classification(TSC) has always been an important and challenging research task. With the wide application of deep learning, more and more researchers use deep learning models to solve TSC problems. Since time series always…

Machine Learning · Computer Science 2021-01-27 Shibo Zhou , Yu Pan

Towards Understanding Sharpness-Aware Minimization

Sharpness-Aware Minimization (SAM) is a recent training method that relies on worst-case weight perturbations which significantly improves generalization in various settings. We argue that the existing justifications for the success of SAM…

Machine Learning · Computer Science 2022-06-14 Maksym Andriushchenko , Nicolas Flammarion

Asynchronous Sharpness-Aware Minimization For Fast and Accurate Deep Learning

Sharpness-Aware Minimization (SAM) is an optimization method that improves generalization performance of machine learning models. Despite its superior generalization, SAM has not been actively used in real-world applications due to its…

Machine Learning · Computer Science 2025-03-17 Junhyuk Jo , Jihyun Lim , Sunwoo Lee

Scene-Aware Feature Matching

Current feature matching methods focus on point-level matching, pursuing better representation learning of individual features, but lacking further understanding of the scene. This results in significant performance degradation when…

Computer Vision and Pattern Recognition · Computer Science 2023-08-23 Xiaoyong Lu , Yaping Yan , Tong Wei , Songlin Du

Sharpen Focus: Learning with Attention Separability and Consistency

Recent developments in gradient-based attention modeling have seen attention maps emerge as a powerful tool for interpreting convolutional neural networks. Despite good localization for an individual class of interest, these techniques…

Computer Vision and Pattern Recognition · Computer Science 2019-08-09 Lezi Wang , Ziyan Wu , Srikrishna Karanam , Kuan-Chuan Peng , Rajat Vikram Singh , Bo Liu , Dimitris N. Metaxas