Related papers: High-level Feature Guided Decoding for Semantic Se…

A Holistically-Guided Decoder for Deep Representation Learning with Applications to Semantic Segmentation and Object Detection

Both high-level and high-resolution feature representations are of great importance in various visual understanding tasks. To acquire high-resolution feature maps with high-level semantic information, one common strategy is to adopt dilated…

Computer Vision and Pattern Recognition · Computer Science 2020-12-21 Jianbo Liu , Sijie Ren , Yuanjie Zheng , Xiaogang Wang , Hongsheng Li

EfficientFCN: Holistically-guided Decoding for Semantic Segmentation

Both performance and efficiency are important to semantic segmentation. State-of-the-art semantic segmentation algorithms are mostly based on dilated Fully Convolutional Networks (dilatedFCN), which adopt dilated convolutions in the…

Computer Vision and Pattern Recognition · Computer Science 2020-08-28 Jianbo Liu , Junjun He , Jiawei Zhang , Jimmy S. Ren , Hongsheng Li

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Unsupervised semantic segmentation aims to obtain high-level semantic representation on low-level visual features without manual annotations. Most existing methods are bottom-up approaches that try to group pixels into regions based on…

Computer Vision and Pattern Recognition · Computer Science 2022-07-26 Zhaoyuan Yin , Pichao Wang , Fan Wang , Xianzhe Xu , Hanling Zhang , Hao Li , Rong Jin

HiGFA: Hierarchical Guidance for Fine-grained Data Augmentation with Diffusion Models

Generative diffusion models show promise for data augmentation. However, applying them to fine-grained tasks presents a significant challenge: ensuring synthetic images accurately capture the subtle, category-defining features critical for…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Zhiguang Lu , Qianqian Xu , Peisong Wen , Siran Dai , Qingming Huang

Attention guided global enhancement and local refinement network for semantic segmentation

The encoder-decoder architecture is widely used as a lightweight semantic segmentation network. However, it struggles with a limited performance compared to a well-designed Dilated-FCN model for two major problems. First, commonly used…

Computer Vision and Pattern Recognition · Computer Science 2022-05-11 Jiangyun Li , Sen Zha , Chen Chen , Meng Ding , Tianxiang Zhang , Hong Yu

Learning High-level Prior with Convolutional Neural Networks for Semantic Segmentation

This paper proposes a convolutional neural network that can fuse high-level prior for semantic image segmentation. Motivated by humans' vision recognition system, our key design is a three-layer generative structure consisting of high-level…

Computer Vision and Pattern Recognition · Computer Science 2015-11-24 Haitian Zheng , Yebin Liu , Mengqi Ji , Feng Wu , Lu Fang

DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression

Scalable coding, which can adapt to channel bandwidth variation, performs well in today's complex network environment. However, the existing scalable compression methods face two challenges: reduced compression performance and insufficient…

Image and Video Processing · Electrical Eng. & Systems 2022-01-05 Yi Ma , Yongqi Zhai , Ronggang Wang

Learning Semantically Enhanced Feature for Fine-Grained Image Classification

We aim to provide a computationally cheap yet effective approach for fine-grained image classification (FGIC) in this letter. Unlike previous methods that rely on complex part localization modules, our approach learns fine-grained features…

Computer Vision and Pattern Recognition · Computer Science 2023-07-19 Wei Luo , Hengmin Zhang , Jun Li , Xiu-Shen Wei

Fine-grained Recognition with Learnable Semantic Data Augmentation

Fine-grained image recognition is a longstanding computer vision challenge that focuses on differentiating objects belonging to multiple subordinate categories within the same meta-category. Since images belonging to the same meta-category…

Computer Vision and Pattern Recognition · Computer Science 2023-09-04 Yifan Pu , Yizeng Han , Yulin Wang , Junlan Feng , Chao Deng , Gao Huang

Learning Multi-scale Features for Foreground Segmentation

Foreground segmentation algorithms aim segmenting moving objects from the background in a robust way under various challenging scenarios. Encoder-decoder type deep neural networks that are used in this domain recently perform impressive…

Computer Vision and Pattern Recognition · Computer Science 2019-09-04 Long Ang Lim , Hacer Yalim Keles

GUIDED: Granular Understanding via Identification, Detection, and Discrimination for Fine-Grained Open-Vocabulary Object Detection

Fine-grained open-vocabulary object detection (FG-OVD) aims to detect novel object categories described by attribute-rich texts. While existing open-vocabulary detectors show promise at the base-category level, they underperform in…

Computer Vision and Pattern Recognition · Computer Science 2026-03-31 Jiaming Li , Zhijia Liang , Weikai Chen , Lin Ma , Guanbin Li

A Higher-Order Semantic Dependency Parser

Higher-order features bring significant accuracy gains in semantic dependency parsing. However, modeling higher-order features with exact inference is NP-hard. Graph neural networks (GNNs) have been demonstrated to be an effective tool for…

Computation and Language · Computer Science 2022-01-28 Bin Li , Yunlong Fan , Yikemaiti Sataer , Zhiqiang Gao

Deep Saliency with Encoded Low level Distance Map and High Level Features

Recent advances in saliency detection have utilized deep learning to obtain high level features to detect salient regions in a scene. These advances have demonstrated superior results over previous works that utilize hand-crafted low level…

Computer Vision and Pattern Recognition · Computer Science 2016-04-20 Gayoung Lee , Yu-Wing Tai , Junmo Kim

Guided Upsampling Network for Real-Time Semantic Segmentation

Semantic segmentation architectures are mainly built upon an encoder-decoder structure. These models perform subsequent downsampling operations in the encoder. Since operations on high-resolution activation maps are computationally…

Computer Vision and Pattern Recognition · Computer Science 2018-07-20 Davide Mazzini

DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression

Scalable coding, which can adapt to channel bandwidth variation, performs well in today's complex network environment. However, most existing scalable compression methods face two challenges: reduced compression performance and insufficient…

Image and Video Processing · Electrical Eng. & Systems 2024-12-03 Yongqi Zhai , Yi Ma , Luyang Tang , Wei Jiang , Ronggang Wang

Cascaded Partial Decoder for Fast and Accurate Salient Object Detection

Existing state-of-the-art salient object detection networks rely on aggregating multi-level features of pre-trained convolutional neural networks (CNNs). Compared to high-level features, low-level features contribute less to performance but…

Computer Vision and Pattern Recognition · Computer Science 2019-04-19 Zhe Wu , Li Su , Qingming Huang

Cross-layer Navigation Convolutional Neural Network for Fine-grained Visual Classification

Fine-grained visual classification (FGVC) aims to classify sub-classes of objects in the same super-class (e.g., species of birds, models of cars). For the FGVC tasks, the essential solution is to find discriminative subtle information of…

Computer Vision and Pattern Recognition · Computer Science 2021-06-22 Chenyu Guo , Jiyang Xie , Kongming Liang , Xian Sun , Zhanyu Ma

Guidance in the Frequency Domain Enables High-Fidelity Sampling at Low CFG Scales

Classifier-free guidance (CFG) has become an essential component of modern conditional diffusion models. Although highly effective in practice, the underlying mechanisms by which CFG enhances quality, detail, and prompt alignment are not…

Machine Learning · Computer Science 2025-06-25 Seyedmorteza Sadat , Tobias Vontobel , Farnood Salehi , Romann M. Weber

Hidden State Guidance: Improving Image Captioning using An Image Conditioned Autoencoder

Most RNN-based image captioning models receive supervision on the output words to mimic human captions. Therefore, the hidden states can only receive noisy gradient signals via layers of back-propagation through time, leading to less…

Computer Vision and Pattern Recognition · Computer Science 2020-01-16 Jialin Wu , Raymond J. Mooney

GFF: Gated Fully Fusion for Semantic Segmentation

Semantic segmentation generates comprehensive understanding of scenes through densely predicting the category for each pixel. High-level features from Deep Convolutional Neural Networks already demonstrate their effectiveness in semantic…

Computer Vision and Pattern Recognition · Computer Science 2020-02-25 Xiangtai Li , Houlong Zhao , Lei Han , Yunhai Tong , Kuiyuan Yang