Related papers: PEM: Prototype-based Efficient MaskFormer for Imag…

The revenge of BiSeNet: Efficient Multi-Task Image Segmentation

Recent advancements in image segmentation have focused on enhancing the efficiency of the models to meet the demands of real-time applications, especially on edge devices. However, existing research has primarily concentrated on single-task…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Gabriele Rosi , Claudia Cuttano , Niccolò Cavagnero , Giuseppe Averta , Fabio Cermelli

Masked-attention Mask Transformer for Universal Image Segmentation

Image segmentation is about grouping pixels with different semantics, e.g., category or instance membership, where each choice of semantics defines a task. While only the semantics of each task differ, current research focuses on designing…

Computer Vision and Pattern Recognition · Computer Science 2022-06-17 Bowen Cheng , Ishan Misra , Alexander G. Schwing , Alexander Kirillov , Rohit Girdhar

A Unified Efficient Pyramid Transformer for Semantic Segmentation

Semantic segmentation is a challenging problem due to difficulties in modeling context in complex scenes and class confusions along boundaries. Most literature either focuses on context modeling or boundary refinement, which is less…

Computer Vision and Pattern Recognition · Computer Science 2021-07-30 Fangrui Zhu , Yi Zhu , Li Zhang , Chongruo Wu , Yanwei Fu , Mu Li

IncepFormer: Efficient Inception Transformer with Pyramid Pooling for Semantic Segmentation

Semantic segmentation usually benefits from global contexts, fine localisation information, multi-scale features, etc. To advance Transformer-based segmenters with these aspects, we present a simple yet powerful semantic segmentation…

Computer Vision and Pattern Recognition · Computer Science 2022-12-07 Lihua Fu , Haoyue Tian , Xiangping Bryce Zhai , Pan Gao , Xiaojiang Peng

ContextFormer: Redefining Efficiency in Semantic Segmentation

Semantic segmentation assigns labels to pixels in images, a critical yet challenging task in computer vision. Convolutional methods, although capturing local dependencies well, struggle with long-range relationships. Vision Transformers…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Mian Muhammad Naeem Abid , Nancy Mehta , Zongwei Wu , Radu Timofte

EPSNet: Efficient Panoptic Segmentation Network with Cross-layer Attention Fusion

Panoptic segmentation is a scene parsing task which unifies semantic segmentation and instance segmentation into one single task. However, the current state-of-the-art studies did not take too much concern on inference time. In this work,…

Computer Vision and Pattern Recognition · Computer Science 2021-07-14 Chia-Yuan Chang , Shuo-En Chang , Pei-Yung Hsiao , Li-Chen Fu

Pyramid Fusion Transformer for Semantic Segmentation

The recently proposed MaskFormer gives a refreshed perspective on the task of semantic segmentation: it shifts from the popular pixel-level classification paradigm to a mask-level classification method. In essence, it generates paired…

Computer Vision and Pattern Recognition · Computer Science 2023-05-31 Zipeng Qin , Jianbo Liu , Xiaolin Zhang , Maoqing Tian , Aojun Zhou , Shuai Yi , Hongsheng Li

Image-Specific Adaptation of Transformer Encoders for Compute-Efficient Segmentation

Vision transformer based models bring significant improvements for image segmentation tasks. Although these architectures offer powerful capabilities irrespective of specific segmentation tasks, their use of computational resources can be…

Computer Vision and Pattern Recognition · Computer Science 2026-04-01 Manyi Yao , Abhishek Aich , Yumin Suh , Amit Roy-Chowdhury , Christian Shelton , Manmohan Chandraker

MacFormer: Semantic Segmentation with Fine Object Boundaries

Semantic segmentation involves assigning a specific category to each pixel in an image. While Vision Transformer-based models have made significant progress, current semantic segmentation methods often struggle with precise predictions in…

Computer Vision and Pattern Recognition · Computer Science 2024-08-13 Guoan Xu , Wenfeng Huang , Tao Wu , Ligeng Chen , Wenjing Jia , Guangwei Gao , Xiatian Zhu , Stuart Perry

Efficient MedSAMs: Segment Anything in Medical Images on Laptop

Promptable segmentation foundation models have emerged as a transformative approach to addressing the diverse needs in medical images, but most existing models require expensive computing, posing a big barrier to their adoption in clinical…

Image and Video Processing · Electrical Eng. & Systems 2024-12-23 Jun Ma , Feifei Li , Sumin Kim , Reza Asakereh , Bao-Hiep Le , Dang-Khoa Nguyen-Vu , Alexander Pfefferle , Muxin Wei , Ruochen Gao , Donghang Lyu , Songxiao Yang , Lennart Purucker , Zdravko Marinov , Marius Staring , Haisheng Lu , Thuy Thanh Dao , Xincheng Ye , Zhi Li , Gianluca Brugnara , Philipp Vollmuth , Martha Foltyn-Dumitru , Jaeyoung Cho , Mustafa Ahmed Mahmutoglu , Martin Bendszus , Irada Pflüger , Aditya Rastogi , Dong Ni , Xin Yang , Guang-Quan Zhou , Kaini Wang , Nicholas Heller , Nikolaos Papanikolopoulos , Christopher Weight , Yubing Tong , Jayaram K Udupa , Cahill J. Patrick , Yaqi Wang , Yifan Zhang , Francisco Contijoch , Elliot McVeigh , Xin Ye , Shucheng He , Robert Haase , Thomas Pinetz , Alexander Radbruch , Inga Krause , Erich Kobler , Jian He , Yucheng Tang , Haichun Yang , Yuankai Huo , Gongning Luo , Kaisar Kushibar , Jandos Amankulov , Dias Toleshbayev , Amangeldi Mukhamejan , Jan Egger , Antonio Pepe , Christina Gsaxner , Gijs Luijten , Shohei Fujita , Tomohiro Kikuchi , Benedikt Wiestler , Jan S. Kirschke , Ezequiel de la Rosa , Federico Bolelli , Luca Lumetti , Costantino Grana , Kunpeng Xie , Guomin Wu , Behrus Puladi , Carlos Martín-Isla , Karim Lekadir , Victor M. Campello , Wei Shao , Wayne Brisbane , Hongxu Jiang , Hao Wei , Wu Yuan , Shuangle Li , Yuyin Zhou , Bo Wang

Parameter-Efficient Masking Networks

A deeper network structure generally handles more complicated non-linearity and performs more competitively. Nowadays, advanced network designs often contain a large number of repetitive structures (e.g., Transformer). They empower the…

Machine Learning · Computer Science 2022-10-14 Yue Bai , Huan Wang , Xu Ma , Yitian Zhang , Zhiqiang Tao , Yun Fu

Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers

Panoptic segmentation involves a combination of joint semantic segmentation and instance segmentation, where image contents are divided into two types: things and stuff. We present Panoptic SegFormer, a general framework for panoptic…

Computer Vision and Pattern Recognition · Computer Science 2022-03-21 Zhiqi Li , Wenhai Wang , Enze Xie , Zhiding Yu , Anima Anandkumar , Jose M. Alvarez , Ping Luo , Tong Lu

EPNet: An Efficient Pyramid Network for Enhanced Single-Image Super-Resolution with Reduced Computational Requirements

Single-image super-resolution (SISR) has seen significant advancements through the integration of deep learning. However, the substantial computational and memory requirements of existing methods often limit their practical application.…

Computer Vision and Pattern Recognition · Computer Science 2023-12-22 Xin Xu , Jinman Park , Paul Fieguth

Segmenter: Transformer for Semantic Segmentation

Image segmentation is often ambiguous at the level of individual image patches and requires contextual information to reach label consensus. In this paper we introduce Segmenter, a transformer model for semantic segmentation. In contrast to…

Computer Vision and Pattern Recognition · Computer Science 2021-09-03 Robin Strudel , Ricardo Garcia , Ivan Laptev , Cordelia Schmid

OneFormer: One Transformer to Rule Universal Image Segmentation

Universal Image Segmentation is not a new concept. Past attempts to unify image segmentation in the last decades include scene parsing, panoptic segmentation, and, more recently, new panoptic architectures. However, such panoptic…

Computer Vision and Pattern Recognition · Computer Science 2023-01-02 Jitesh Jain , Jiachen Li , MangTik Chiu , Ali Hassani , Nikita Orlov , Humphrey Shi

MetaFormer-driven Encoding Network for Robust Medical Semantic Segmentation

Semantic segmentation is crucial for medical image analysis, enabling precise disease diagnosis and treatment planning. However, many advanced models employ complex architectures, limiting their use in resource-constrained clinical…

Image and Video Processing · Electrical Eng. & Systems 2026-01-06 Le-Anh Tran , Chung Nguyen Tran , Nhan Cach Dang , Anh Le Van Quoc , Jordi Carrabina , David Castells-Rufas , Minh Son Nguyen

A Deep Learning Framework for Boundary-Aware Semantic Segmentation

As a fundamental task in computer vision, semantic segmentation is widely applied in fields such as autonomous driving, remote sensing image analysis, and medical image processing. In recent years, Transformer-based segmentation methods…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 Tai An , Weiqiang Huang , Da Xu , Qingyuan He , Jiacheng Hu , Yujia Lou

An Enhanced Pyramid Feature Network Based on Long-Range Dependencies for Multi-Organ Medical Image Segmentation

In the field of multi-organ medical image segmentation, recent methods frequently employ Transformers to capture long-range dependencies from image features. However, these methods overlook the high computational cost of Transformers and…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Dayu Tan , Cheng Kong , Yansen Su , Hai Chen , Dongliang Yang , Junfeng Xia , Chunhou Zheng

A Point-Based Approach to Efficient LiDAR Multi-Task Perception

Multi-task networks can potentially improve performance and computational efficiency compared to single-task networks, facilitating online deployment. However, current multi-task architectures in point cloud perception combine multiple…

Computer Vision and Pattern Recognition · Computer Science 2024-04-22 Christopher Lang , Alexander Braun , Lars Schillingmann , Abhinav Valada

EFEM: Equivariant Neural Field Expectation Maximization for 3D Object Segmentation Without Scene Supervision

We introduce Equivariant Neural Field Expectation Maximization (EFEM), a simple, effective, and robust geometric algorithm that can segment objects in 3D scenes without annotations or training on scenes. We achieve such unsupervised…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Jiahui Lei , Congyue Deng , Karl Schmeckpeper , Leonidas Guibas , Kostas Daniilidis