Related papers: Sparse Spatial Transformers for Few-Shot Learning

Few-Shot Learning Meets Transformer: Unified Query-Support Transformers for Few-Shot Classification

Few-shot classification which aims to recognize unseen classes using very limited samples has attracted more and more attention. Usually, it is formulated as a metric learning problem. The core issue of few-shot classification is how to…

Computer Vision and Pattern Recognition · Computer Science 2022-08-29 Xixi Wang , Xiao Wang , Bo Jiang , Bin Luo

SpatialFormer: Semantic and Target Aware Attentions for Few-Shot Learning

Recent Few-Shot Learning (FSL) methods put emphasis on generating a discriminative embedding features to precisely measure the similarity between support and query sets. Current CNN-based cross-attention approaches generate discriminative…

Computer Vision and Pattern Recognition · Computer Science 2024-07-18 Jinxiang Lai , Siqian Yang , Wenlong Wu , Tao Wu , Guannan Jiang , Xi Wang , Jun Liu , Bin-Bin Gao , Wei Zhang , Yuan Xie , Chengjie Wang

SparseFormer: Sparse Visual Recognition via Limited Latent Tokens

Human visual recognition is a sparse process, where only a few salient visual cues are attended to rather than traversing every detail uniformly. However, most current vision networks follow a dense paradigm, processing every single visual…

Computer Vision and Pattern Recognition · Computer Science 2023-04-10 Ziteng Gao , Zhan Tong , Limin Wang , Mike Zheng Shou

SASFormer: Transformers for Sparsely Annotated Semantic Segmentation

Semantic segmentation based on sparse annotation has advanced in recent years. It labels only part of each object in the image, leaving the remainder unlabeled. Most of the existing approaches are time-consuming and often necessitate a…

Computer Vision and Pattern Recognition · Computer Science 2023-02-28 Hui Su , Yue Ye , Wei Hua , Lechao Cheng , Mingli Song

Few-Shot Object Detection with Sparse Context Transformers

Few-shot detection is a major task in pattern recognition which seeks to localize objects using models trained with few labeled data. One of the mainstream few-shot methods is transfer learning which consists in pretraining a detection…

Computer Vision and Pattern Recognition · Computer Science 2024-02-15 Jie Mei , Mingyuan Jiu , Hichem Sahbi , Xiaoheng Jiang , Mingliang Xu

Prototype as Query for Few Shot Semantic Segmentation

Few-shot Semantic Segmentation (FSS) was proposed to segment unseen classes in a query image, referring to only a few annotated examples named support images. One of the characteristics of FSS is spatial inconsistency between query and…

Computer Vision and Pattern Recognition · Computer Science 2022-11-29 Leilei Cao , Yibo Guo , Ye Yuan , Qiangguo Jin

Few-shot Medical Image Segmentation via Cross-Reference Transformer

Deep learning models have become the mainstream method for medical image segmentation, but they require a large manually labeled dataset for training and are difficult to extend to unseen categories. Few-shot segmentation(FSS) has the…

Image and Video Processing · Electrical Eng. & Systems 2023-07-27 Yao Huang , Jianming Liu

HyperFormer: Learning Expressive Sparse Feature Representations via Hypergraph Transformer

Learning expressive representations for high-dimensional yet sparse features has been a longstanding problem in information retrieval. Though recent deep learning methods can partially solve the problem, they often fail to handle the…

Information Retrieval · Computer Science 2023-05-30 Kaize Ding , Albert Jiongqian Liang , Bryan Perrozi , Ting Chen , Ruoxi Wang , Lichan Hong , Ed H. Chi , Huan Liu , Derek Zhiyuan Cheng

Mask Matching Transformer for Few-Shot Segmentation

In this paper, we aim to tackle the challenging few-shot segmentation task from a new perspective. Typical methods follow the paradigm to firstly learn prototypical features from support images and then match query features in pixel-level…

Computer Vision and Pattern Recognition · Computer Science 2023-01-04 Siyu Jiao , Gengwei Zhang , Shant Navasardyan , Ling Chen , Yao Zhao , Yunchao Wei , Humphrey Shi

tSF: Transformer-based Semantic Filter for Few-Shot Learning

Few-Shot Learning (FSL) alleviates the data shortage challenge via embedding discriminative target-aware features among plenty seen (base) and few unseen (novel) labeled samples. Most feature embedding modules in recent FSL methods are…

Computer Vision and Pattern Recognition · Computer Science 2024-07-18 Jinxiang Lai , Siqian Yang , Wenlong Liu , Yi Zeng , Zhongyi Huang , Wenlong Wu , Jun Liu , Bin-Bin Gao , Chengjie Wang

MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping

Few-shot Semantic Segmentation addresses the challenge of segmenting objects in query images with only a handful of annotated examples. However, many previous state-of-the-art methods either have to discard intricate local semantic features…

Computer Vision and Pattern Recognition · Computer Science 2025-11-06 Amirreza Fateh , Mohammad Reza Mohammadi , Mohammad Reza Jahed Motlagh

Improving Few-shot Learning with Weakly-supervised Object Localization

Few-shot learning often involves metric learning-based classifiers, which predict the image label by comparing the distance between the extracted feature vector and class representations. However, applying global pooling in the backend of…

Computer Vision and Pattern Recognition · Computer Science 2021-05-26 Inyong Koo , Minki Jeong , Changick Kim

CrossTransformers: spatially-aware few-shot transfer

Given new tasks with very little data$-$such as new classes in a classification problem or a domain shift in the input$-$performance of modern vision systems degrades remarkably quickly. In this work, we illustrate how the neural network…

Computer Vision and Pattern Recognition · Computer Science 2021-02-18 Carl Doersch , Ankush Gupta , Andrew Zisserman

Sparse-Aware Neural Networks for Nonlinear Functionals: Mitigating the Exponential Dependence on Dimension

Deep neural networks have emerged as powerful tools for learning operators defined over infinite-dimensional function spaces. However, existing theories frequently encounter difficulties related to dimensionality and limited…

Machine Learning · Computer Science 2026-05-12 Jianfei Li , Shuo Huang , Han Feng , Ding-Xuan Zhou , Gitta Kutyniok

AutoFocusFormer: Image Segmentation off the Grid

Real world images often have highly imbalanced content density. Some areas are very uniform, e.g., large patches of blue sky, while other areas are scattered with many small objects. Yet, the commonly used successive grid downsampling…

Computer Vision and Pattern Recognition · Computer Science 2023-10-27 Chen Ziwen , Kaushik Patnaik , Shuangfei Zhai , Alvin Wan , Zhile Ren , Alex Schwing , Alex Colburn , Li Fuxin

Improving Few-shot Learning by Spatially-aware Matching and CrossTransformer

Current few-shot learning models capture visual object relations in the so-called meta-learning setting under a fixed-resolution input. However, such models have a limited generalization ability under the scale and location mismatch between…

Computer Vision and Pattern Recognition · Computer Science 2022-10-11 Hongguang Zhang , Philip H. S. Torr , Piotr Koniusz

Rethinking Generalization in Few-Shot Classification

Single image-level annotations only correctly describe an often small subset of an image's content, particularly when complex real-world scenes are depicted. While this might be acceptable in many classification scenarios, it poses a…

Computer Vision and Pattern Recognition · Computer Science 2022-10-18 Markus Hiller , Rongkai Ma , Mehrtash Harandi , Tom Drummond

FieldFormer: Locality-Aware Transformers for Spatio-Temporal Modeling on Sparse Sensor Networks

Spatio-temporal sensor data in real-world systems is often sparse, noisy, and irregular, making latent field reconstruction fundamentally underconstrained. Under extreme sparsity, multiple physically plausible fields may remain consistent…

Machine Learning · Computer Science 2026-05-20 Ankit Bhardwaj , Ananth Balashankar , Lakshminarayanan Subramanian

Shapelet-based Sparse Representation for Landcover Classification of Hyperspectral Images

This paper presents a sparse representation-based classification approach with a novel dictionary construction procedure. By using the constructed dictionary sophisticated prior knowledge about the spatial nature of the image can be…

Computer Vision and Pattern Recognition · Computer Science 2017-08-22 Ribana Roscher , Björn Waske

Context-Transformer: Tackling Object Confusion for Few-Shot Detection

Few-shot object detection is a challenging but realistic scenario, where only a few annotated training images are available for training detectors. A popular approach to handle this problem is transfer learning, i.e., fine-tuning a detector…

Computer Vision and Pattern Recognition · Computer Science 2022-12-12 Ze Yang , Yali Wang , Xianyu Chen , Jianzhuang Liu , Yu Qiao