Related papers: High-Quality Entity Segmentation

HCFormer: Unified Image Segmentation with Hierarchical Clustering

Hierarchical clustering is an effective and efficient approach widely used for classical image segmentation methods. However, many existing methods using neural networks generate segmentation masks directly from per-pixel features,…

Computer Vision and Pattern Recognition · Computer Science 2023-03-01 Teppei Suzuki

DFormer: Diffusion-guided Transformer for Universal Image Segmentation

This paper introduces an approach, named DFormer, for universal image segmentation. The proposed DFormer views universal image segmentation task as a denoising process using a diffusion model. DFormer first adds various levels of Gaussian…

Computer Vision and Pattern Recognition · Computer Science 2023-06-09 Hefeng Wang , Jiale Cao , Rao Muhammad Anwer , Jin Xie , Fahad Shahbaz Khan , Yanwei Pang

Masked-attention Mask Transformer for Universal Image Segmentation

Image segmentation is about grouping pixels with different semantics, e.g., category or instance membership, where each choice of semantics defines a task. While only the semantics of each task differ, current research focuses on designing…

Computer Vision and Pattern Recognition · Computer Science 2022-06-17 Bowen Cheng , Ishan Misra , Alexander G. Schwing , Alexander Kirillov , Rohit Girdhar

Open-World Entity Segmentation

We introduce a new image segmentation task, called Entity Segmentation (ES), which aims to segment all visual entities (objects and stuffs) in an image without predicting their semantic labels. By removing the need of class label…

Computer Vision and Pattern Recognition · Computer Science 2022-12-21 Lu Qi , Jason Kuen , Yi Wang , Jiuxiang Gu , Hengshuang Zhao , Zhe Lin , Philip Torr , Jiaya Jia

HGFormer: Hierarchical Grouping Transformer for Domain Generalized Semantic Segmentation

Current semantic segmentation models have achieved great success under the independent and identically distributed (i.i.d.) condition. However, in real-world applications, test data might come from a different domain than training data.…

Computer Vision and Pattern Recognition · Computer Science 2023-05-23 Jian Ding , Nan Xue , Gui-Song Xia , Bernt Schiele , Dengxin Dai

OneFormer: One Transformer to Rule Universal Image Segmentation

Universal Image Segmentation is not a new concept. Past attempts to unify image segmentation in the last decades include scene parsing, panoptic segmentation, and, more recently, new panoptic architectures. However, such panoptic…

Computer Vision and Pattern Recognition · Computer Science 2023-01-02 Jitesh Jain , Jiachen Li , MangTik Chiu , Ali Hassani , Nikita Orlov , Humphrey Shi

Segmenter: Transformer for Semantic Segmentation

Image segmentation is often ambiguous at the level of individual image patches and requires contextual information to reach label consensus. In this paper we introduce Segmenter, a transformer model for semantic segmentation. In contrast to…

Computer Vision and Pattern Recognition · Computer Science 2021-09-03 Robin Strudel , Ricardo Garcia , Ivan Laptev , Cordelia Schmid

PlankFormer: Robust Plankton Instance Segmentation via MAE-Pretrained Vision Transformers and Pseudo Community Image Generation

Plankton monitoring is essential for assessing aquatic ecosystems but is limited by the labor-intensive nature of manual microscopic analysis. Automating the segmentation of plankton from crowded images is crucial, however, it faces two…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Masaharu Miyazaki , Yurie Otake , Koichi Ito , Wataru Makino , Jotaro Urabe , Takafumi Aoki

Semantic Labeling of High Resolution Images Using EfficientUNets and Transformers

Semantic segmentation necessitates approaches that learn high-level characteristics while dealing with enormous amounts of data. Convolutional neural networks (CNNs) can learn unique and adaptive features to achieve this aim. However, due…

Computer Vision and Pattern Recognition · Computer Science 2023-07-19 Hasan AlMarzouqi , Lyes Saad Saoud

DepthCropSeg++: Scaling a Crop Segmentation Foundation Model With Depth-Labeled Data

DepthCropSeg++: a foundation model for crop segmentation, capable of segmenting different crop species under open in-field environment. Crop segmentation is a fundamental task for modern agriculture, which closely relates to many downstream…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Jiafei Zhang , Songliang Cao , Binghui Xu , Yanan Li , Weiwei Jia , Tingting Wu , Hao Lu , Weijuan Hu , Zhiguo Han

The revenge of BiSeNet: Efficient Multi-Task Image Segmentation

Recent advancements in image segmentation have focused on enhancing the efficiency of the models to meet the demands of real-time applications, especially on edge devices. However, existing research has primarily concentrated on single-task…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Gabriele Rosi , Claudia Cuttano , Niccolò Cavagnero , Giuseppe Averta , Fabio Cermelli

Mixed-Query Transformer: A Unified Image Segmentation Architecture

Existing unified image segmentation models either employ a unified architecture across multiple tasks but use separate weights tailored to each dataset, or apply a single set of weights to multiple datasets but are limited to a single task.…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Pei Wang , Zhaowei Cai , Hao Yang , Ashwin Swaminathan , R. Manmatha , Stefano Soatto

Mask Transfiner for High-Quality Instance Segmentation

Two-stage and query-based instance segmentation methods have achieved remarkable results. However, their segmented masks are still very coarse. In this paper, we present Mask Transfiner for high-quality and efficient instance segmentation.…

Computer Vision and Pattern Recognition · Computer Science 2021-11-29 Lei Ke , Martin Danelljan , Xia Li , Yu-Wing Tai , Chi-Keung Tang , Fisher Yu

Multi-modal Uncertainty Robust Tree Cover Segmentation For High-Resolution Remote Sensing Images

Recent advances in semantic segmentation of multi-modal remote sensing images have significantly improved the accuracy of tree cover mapping, supporting applications in urban planning, forest monitoring, and ecological assessment.…

Image and Video Processing · Electrical Eng. & Systems 2025-12-16 Yuanyuan Gui , Wei Li , Yinjian Wang , Xiang-Gen Xia , Mauro Marty , Christian Ginzler , Zuyuan Wang

HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation

Convolutional neural networks (CNNs) have been the consensus for medical image segmentation tasks. However, they suffer from the limitation in modeling long-range dependencies and spatial correlations due to the nature of convolution…

Computer Vision and Pattern Recognition · Computer Science 2023-01-10 Moein Heidari , Amirhossein Kazerouni , Milad Soltany , Reza Azad , Ehsan Khodapanah Aghdam , Julien Cohen-Adad , Dorit Merhof

A Deep Learning Framework for Boundary-Aware Semantic Segmentation

As a fundamental task in computer vision, semantic segmentation is widely applied in fields such as autonomous driving, remote sensing image analysis, and medical image processing. In recent years, Transformer-based segmentation methods…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 Tai An , Weiqiang Huang , Da Xu , Qingyuan He , Jiacheng Hu , Yujia Lou

PU-EdgeFormer: Edge Transformer for Dense Prediction in Point Cloud Upsampling

Despite the recent development of deep learning-based point cloud upsampling, most MLP-based point cloud upsampling methods have limitations in that it is difficult to train the local and global structure of the point cloud at the same…

Computer Vision and Pattern Recognition · Computer Science 2023-05-03 Dohoon Kim , Minwoo Shin , Joonki Paik

Position-Guided Point Cloud Panoptic Segmentation Transformer

DEtection TRansformer (DETR) started a trend that uses a group of learnable queries for unified visual perception. This work begins by applying this appealing paradigm to LiDAR-based point cloud segmentation and obtains a simple yet…

Computer Vision and Pattern Recognition · Computer Science 2023-03-24 Zeqi Xiao , Wenwei Zhang , Tai Wang , Chen Change Loy , Dahua Lin , Jiangmiao Pang

UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery

Semantic segmentation of remotely sensed urban scene images is required in a wide range of practical applications, such as land cover mapping, urban change detection, environmental protection, and economic assessment.Driven by rapid…

Computer Vision and Pattern Recognition · Computer Science 2022-06-28 Libo Wang , Rui Li , Ce Zhang , Shenghui Fang , Chenxi Duan , Xiaoliang Meng , Peter M. Atkinson

Hierarchical Mask2Former: Panoptic Segmentation of Crops, Weeds and Leaves

Advancements in machine vision that enable detailed inferences to be made from images have the potential to transform many sectors including agriculture. Precision agriculture, where data analysis enables interventions to be precisely…

Computer Vision and Pattern Recognition · Computer Science 2023-10-11 Madeleine Darbyshire , Elizabeth Sklar , Simon Parsons