Related papers: PatchDCT: Patch Refinement for High Quality Instan…

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Binary grid mask representation is broadly used in instance segmentation. A representative instantiation is Mask R-CNN which predicts masks on a $28\times 28$ binary grid. Generally, a low-resolution grid is not sufficient to capture the…

Computer Vision and Pattern Recognition · Computer Science 2021-04-28 Xing Shen, Jirui Yang, Chunbo Wei, Bing Deng, Jianqiang Huang, Xiansheng Hua, Xiaoliang Cheng, Kewei Liang

TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform Mask

Arbitrary-shaped scene text detection is a challenging task due to the variety of text changes in font, size, color, and orientation. Most existing regression based methods resort to regress the masks or contour points of text regions to…

Computer Vision and Pattern Recognition · Computer Science 2022-06-28 Yuchen Su, Zhiwen Shao, Yong Zhou, Fanrong Meng, Hancheng Zhu, Bing Liu, Rui Yao

Mask Transfiner for High-Quality Instance Segmentation

Two-stage and query-based instance segmentation methods have achieved remarkable results. However, their segmented masks are still very coarse. In this paper, we present Mask Transfiner for high-quality and efficient instance segmentation.…

Computer Vision and Pattern Recognition · Computer Science 2021-11-29 Lei Ke, Martin Danelljan, Xia Li, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu

DPT: Deformable Patch-based Transformer for Visual Recognition

Transformer has achieved great success in computer vision, while how to split patches in an image remains a problem. Existing methods usually use a fixed-size patch embedding which might destroy the semantics of objects. To address this…

Computer Vision and Pattern Recognition · Computer Science 2021-08-02 Zhiyang Chen, Yousong Zhu, Chaoyang Zhao, Guosheng Hu, Wei Zeng, Jinqiao Wang, Ming Tang

Video Mask Transfiner for High-Quality Video Instance Segmentation

While Video Instance Segmentation (VIS) has seen rapid progress, current approaches struggle to predict high-quality masks with accurate boundary details. Moreover, the predicted segmentations often fluctuate over time, suggesting that…

Computer Vision and Pattern Recognition · Computer Science 2022-07-29 Lei Ke, Henghui Ding, Martin Danelljan, Yu-Wing Tai, Chi-Keung Tang, Fisher Yu

Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation

Tremendous efforts have been made on instance segmentation but the mask quality is still not satisfactory. The boundaries of predicted instance masks are usually imprecise due to the low spatial resolution of feature maps and the imbalance…

Computer Vision and Pattern Recognition · Computer Science 2021-04-13 Chufeng Tang, Hang Chen, Xiao Li, Jianmin Li, Zhaoxiang Zhang, Xiaolin Hu

Variance-insensitive and Target-preserving Mask Refinement for Interactive Image Segmentation

Point-based interactive image segmentation can ease the burden of mask annotation in applications such as semantic segmentation and image editing. However, fully extracting the target mask with limited user inputs remains challenging. We…

Computer Vision and Pattern Recognition · Computer Science 2023-12-25 Chaowei Fang, Ziyin Zhou, Junye Chen, Hanjing Su, Qingyao Wu, Guanbin Li

RefineMask: Towards High-Quality Instance Segmentation with Fine-Grained Features

The two-stage methods for instance segmentation, e.g. Mask R-CNN, have achieved excellent performance recently. However, the segmented masks are still very coarse due to the downsampling operations in both the feature pyramid and the…

Computer Vision and Pattern Recognition · Computer Science 2021-04-20 Gang Zhang, Xin Lu, Jingru Tan, Jianmin Li, Zhaoxiang Zhang, Quanquan Li, Xiaolin Hu

PARTICLE: Part Discovery and Contrastive Learning for Fine-grained Recognition

We develop techniques for refining representations for fine-grained classification and segmentation tasks in a self-supervised manner. We find that fine-tuning methods based on instance-discriminative contrastive learning are not as…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Oindrila Saha, Subhransu Maji

DBAT: Dynamic Backward Attention Transformer for Material Segmentation with Cross-Resolution Patches

The objective of dense material segmentation is to identify the material categories for every image pixel. Recent studies adopt image patches to extract material features. Although the trained networks can improve the segmentation…

Computer Vision and Pattern Recognition · Computer Science 2024-02-29 Yuwen Heng, Srinandan Dasmahapatra, Hansung Kim

DCCVT: Differentiable Clipped Centroidal Voronoi Tessellation

While Marching Cubes (MC) and Marching Tetrahedra (MTet) are widely adopted in 3D reconstruction pipelines due to their simplicity and efficiency, their differentiable variants remain suboptimal for mesh extraction. This often limits the…

Computational Geometry · Computer Science 2026-01-21 Wylliam Cantin Charawi, Adrien Gruson, Jane Wu, Christian Desrosiers, Diego Thomas

The Missing Point in Vision Transformers for Universal Image Segmentation

Image segmentation remains a challenging task in computer vision, demanding robust mask generation and precise classification. Recent mask-based approaches yield high-quality masks by capturing global context. However, accurately…

Computer Vision and Pattern Recognition · Computer Science 2025-12-10 Sajjad Shahabodini, Mobina Mansoori, Farnoush Bayatmakou, Jamshid Abouei, Konstantinos N. Plataniotis, Arash Mohammadi

Cascaded Refinement Network for Point Cloud Completion

Point clouds are often sparse and incomplete. Existing shape completion methods are incapable of generating details of objects or learning the complex point distributions. To this end, we propose a cascaded refinement network together with…

Computer Vision and Pattern Recognition · Computer Science 2020-06-08 Xiaogang Wang, Marcelo H Ang, Gim Hee Lee

ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator

Recently, several Vision Transformer (ViT) based methods have been proposed for Fine-Grained Visual Classification (FGVC). These methods significantly surpass existing CNN-based ones, demonstrating the effectiveness of ViT in FGVC…

Computer Vision and Pattern Recognition · Computer Science 2022-03-25 Zi-Chao Zhang, Zhen-Duo Chen, Yongxin Wang, Xin Luo, Xin-Shun Xu

BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation

Instance segmentation is one of the fundamental vision tasks. Recently, fully convolutional instance segmentation methods have drawn much attention as they are often simpler and more efficient than two-stage approaches like Mask R-CNN. To…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Hao Chen, Kunyang Sun, Zhi Tian, Chunhua Shen, Yongming Huang, Youliang Yan

Deeply Shape-guided Cascade for Instance Segmentation

The key to a successful cascade architecture for precise instance segmentation is to fully leverage the relationship between bounding box detection and mask segmentation across multiple stages. Although modern instance segmentation cascades…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Hao Ding, Siyuan Qiao, Alan Yuille, Wei Shen

Prediction of Transformed (DCT) Video Coding Residual for Video Compression

Video compression has been investigated by means of analysis-synthesis, and more particularly by means of inpainting. The first part of our approach has been to develop the inpainting of DCT coefficients in an image. This has shown good…

Information Theory · Computer Science 2014-04-17 Matthieu Moinard, Isabelle Amonou, Pierre Duhamel, Patrice Brault

Point Cloud Upsampling via Disentangled Refinement

Point clouds produced by 3D scanning are often sparse, non-uniform, and noisy. Recent upsampling approaches aim to generate a dense point set, while achieving both distribution uniformity and proximity-to-surface, and possibly amending…

Computer Vision and Pattern Recognition · Computer Science 2021-06-10 Ruihui Li, Xianzhi Li, Pheng-Ann Heng, Chi-Wing Fu

Exploring Semantic Segmentation on the DCT Representation

Typical convolutional networks are trained and conducted on RGB images. However, images are often compressed for memory savings and efficient transmission in real-world applications. In this paper, we explore methods for performing semantic…

Computer Vision and Pattern Recognition · Computer Science 2020-01-01 Shao-Yuan Lo, Hsueh-Ming Hang

Generating Superpixels for High-resolution Images with Decoupled Patch Calibration

Superpixel segmentation has recently seen important progress benefiting from the advances in differentiable deep learning. However, the very high-resolution superpixel segmentation still remains challenging due to the expensive memory and…

Computer Vision and Pattern Recognition · Computer Science 2021-08-24 Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang