Related papers: SAHDL: Sparse Attention Hypergraph Regularized Dic…

DLDL: Dynamic Label Dictionary Learning via Hypergraph Regularization

For classification tasks, dictionary learning based methods have attracted lots of attention in recent years. One popular way to achieve this purpose is to introduce label information to generate a discriminative dictionary to represent…

Computer Vision and Pattern Recognition · Computer Science 2020-10-26 Shuai Shao , Mengke Wang , Rui Xu , Yan-Jiang Wang , Bao-Di Liu

SSDL: Self-Supervised Dictionary Learning

The label-embedded dictionary learning (DL) algorithms generate influential dictionaries by introducing discriminative information. However, there exists a limitation: All the label-embedded DL methods rely on the labels due that this way…

Machine Learning · Computer Science 2021-12-06 Shuai Shao , Lei Xing , Wei Yu , Rui Xu , Yanjiang Wang , Baodi Liu

A Regularized Framework for Sparse and Structured Neural Attention

Modern neural networks are often augmented with an attention mechanism, which tells the network where to focus within the input. We propose in this paper a new framework for sparse and structured attention, building upon a smoothed max…

Machine Learning · Statistics 2019-02-26 Vlad Niculae , Mathieu Blondel

Hypergraph based semi-supervised learning algorithms applied to speech recognition problem: a novel approach

Most network-based speech recognition methods are based on the assumption that the labels of two adjacent speech samples in the network are likely to be the same. However, assuming the pairwise relationship between speech samples is not…

Machine Learning · Statistics 2018-10-31 Loc Hoang Tran , Trang Hoang , Bui Hoang Nam Huynh

Learning Subspace-Preserving Sparse Attention Graphs from Heterogeneous Multiview Data

The high-dimensional features extracted from large-scale unlabeled data via various pretrained models with diverse architectures are referred to as heterogeneous multiview data. Most existing unsupervised transfer learning methods fail to…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Jie Chen , Yuanbiao Gou , Chuanbin Liu , Zhu Wang , Xi Peng

SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention

While sparse attention mitigates the computational bottleneck of long-context LLM training, its distributed training process exhibits extreme heterogeneity in both \textit{1)} sequence length and \textit{2)} sparsity sensitivity, leading to…

Machine Learning · Computer Science 2026-04-27 Hongtao Xu , Jianchao Tan , Yuxuan Hu , Pengju Lu , Hongyu Wang , Pingwei Sun , Yerui Sun , Yuchen Xie , Xunliang Cai , Mingzhen Li , Weile Jia

Learning parametric dictionaries for graph signals

In sparse signal representation, the choice of a dictionary often involves a tradeoff between two desirable properties -- the ability to adapt to specific signal data and a fast implementation of the dictionary. To sparsely represent…

Machine Learning · Computer Science 2015-06-18 Dorina Thanou , David I Shuman , Pascal Frossard

Elastic Net Hypergraph Learning for Image Clustering and Semi-supervised Classification

Graph model is emerging as a very effective tool for learning the complex structures and relationships hidden in data. Generally, the critical purpose of graph-oriented learning algorithms is to construct an informative graph for image…

Computer Vision and Pattern Recognition · Computer Science 2016-12-21 Qingshan Liu , Yubao Sun , Cantian Wang , Tongliang Liu , Dacheng Tao

Supervised learning pays attention

In-context learning with attention enables large neural networks to make context-specific predictions by selectively focusing on relevant examples. Here, we adapt this idea to supervised learning procedures such as lasso regression and…

Machine Learning · Statistics 2025-12-11 Erin Craig , Robert Tibshirani

LaplaceNet: A Hybrid Graph-Energy Neural Network for Deep Semi-Supervised Classification

Semi-supervised learning has received a lot of recent attention as it alleviates the need for large amounts of labelled data which can often be expensive, requires expert knowledge and be time consuming to collect. Recent developments in…

Machine Learning · Computer Science 2022-09-30 Philip Sellars , Angelica I. Aviles-Rivero , Carola-Bibiane Schönlieb

Improving Sparse Autoencoder with Dynamic Attention

Recently, sparse autoencoders (SAEs) have emerged as a promising technique for interpreting activations in foundation models by disentangling features into a sparse set of concepts. However, identifying the optimal level of sparsity for…

Machine Learning · Computer Science 2026-04-17 Dongsheng Wang , Jinsen Zhang , Dawei Su , Hui Huang

Accelerating Prefilling for Long-Context LLMs via Sparse Pattern Sharing

Sparse attention methods exploit the inherent sparsity in attention to speed up the prefilling phase of long-context inference, mitigating the quadratic complexity of full attention computation. While existing sparse attention methods rely…

Machine Learning · Computer Science 2025-05-27 Dan Peng , Zhihui Fu , Zewen Ye , Zhuoran Song , Jun Wang

A Regularized Attention Mechanism for Graph Attention Networks

Machine learning models that can exploit the inherent structure in data have gained prominence. In particular, there is a surge in deep learning solutions for graph-structured data, due to its wide-spread applicability in several fields.…

Machine Learning · Computer Science 2020-02-12 Uday Shankar Shanthamallu , Jayaraman J. Thiagarajan , Andreas Spanias

Sequential Attention: A Context-Aware Alignment Function for Machine Reading

In this paper we propose a neural network model with a novel Sequential Attention layer that extends soft attention by assigning weights to words in an input sequence in a way that takes into account not just how well that word matches a…

Computation and Language · Computer Science 2017-06-28 Sebastian Brarda , Philip Yeres , Samuel R. Bowman

Overlap-aware meta-learning attention to enhance hypergraph neural networks for node classification

Although hypergraph neural networks (HGNNs) have emerged as a powerful framework for analyzing complex datasets, their practical performance often remains limited. On one hand, existing networks typically employ a single type of attention…

Machine Learning · Computer Science 2025-11-14 Murong Yang , Shihui Ying , Yue Gao , Xin-Jian Xu

p-Laplacian Adaptation for Generative Pre-trained Vision-Language Models

Vision-Language models (VLMs) pre-trained on large corpora have demonstrated notable success across a range of downstream tasks. In light of the rapidly increasing size of pre-trained VLMs, parameter-efficient transfer learning (PETL) has…

Computer Vision and Pattern Recognition · Computer Science 2023-12-19 Haoyuan Wu , Xinyun Zhang , Peng Xu , Peiyu Liao , Xufeng Yao , Bei Yu

Attention-based Graph Neural Network for Semi-supervised Learning

Recently popularized graph neural networks achieve the state-of-the-art accuracy on a number of standard benchmark datasets for graph-based semi-supervised learning, improving significantly over existing approaches. These architectures…

Machine Learning · Statistics 2018-03-13 Kiran K. Thekumparampil , Chong Wang , Sewoong Oh , Li-Jia Li

Sparse Sinkhorn Attention

We propose Sparse Sinkhorn Attention, a new efficient and sparse method for learning to attend. Our method is based on differentiable sorting of internal representations. Concretely, we introduce a meta sorting network that learns to…

Machine Learning · Computer Science 2020-02-27 Yi Tay , Dara Bahri , Liu Yang , Donald Metzler , Da-Cheng Juan

Sparse Attention with Linear Units

Recently, it has been argued that encoder-decoder models can be made more interpretable by replacing the softmax function in the attention with its sparse variants. In this work, we introduce a novel, simple method for achieving sparsity in…

Computation and Language · Computer Science 2021-10-07 Biao Zhang , Ivan Titov , Rico Sennrich

Semi-supervised dictionary learning with graph regularization and active points

Supervised Dictionary Learning has gained much interest in the recent decade and has shown significant performance improvements in image classification. However, in general, supervised learning needs a large number of labelled samples per…

Computer Vision and Pattern Recognition · Computer Science 2020-09-15 Khanh-Hung Tran , Fred-Maurice Ngole-Mboula , Jean-Luc Starck , Vincent Prost