English
Related papers

Related papers: Adaptive Context Encoding Module for Semantic Segm…

200 papers

Objects at different spatial positions in an image exhibit different scales. Adaptive receptive fields are expected to capture suitable ranges of context for accurate pixel level semantic prediction. Recently, atrous convolution with…

Computer Vision and Pattern Recognition · Computer Science 2020-10-12 Xin Jin , Cuiling Lan , Wenjun Zeng , Zhizheng Zhang , Zhibo Chen

Accurate polyp segmentation is of great significance for the diagnosis and treatment of colorectal cancer. However, it has always been very challenging due to the diverse shape and size of polyp. In recent years, state-of-the-art methods…

Computer Vision and Pattern Recognition · Computer Science 2023-01-13 Ruifei Zhang , Guanbin Li , Zhen Li , Shuguang Cui , Dahong Qian , Yizhou Yu

Recent work has made significant progress in improving spatial resolution for pixelwise labeling with Fully Convolutional Network (FCN) framework by employing Dilated/Atrous convolution, utilizing multi-scale features and refining…

Computer Vision and Pattern Recognition · Computer Science 2018-03-26 Hang Zhang , Kristin Dana , Jianping Shi , Zhongyue Zhang , Xiaogang Wang , Ambrish Tyagi , Amit Agrawal

State-of-the-art models for semantic segmentation are based on adaptations of convolutional networks that had originally been designed for image classification. However, dense prediction and image classification are structurally different.…

Computer Vision and Pattern Recognition · Computer Science 2016-05-03 Fisher Yu , Vladlen Koltun

Spatial pyramid pooling module or encode-decoder structure are used in deep neural networks for semantic segmentation task. The former networks are able to encode multi-scale contextual information by probing the incoming features with…

Computer Vision and Pattern Recognition · Computer Science 2018-08-24 Liang-Chieh Chen , Yukun Zhu , George Papandreou , Florian Schroff , Hartwig Adam

Context is essential for semantic segmentation. Due to the diverse shapes of objects and their complex layout in various scene images, the spatial scales and shapes of contexts for different objects have very large variation. It is thus…

Computer Vision and Pattern Recognition · Computer Science 2019-09-09 Henghui Ding , Xudong Jiang , Bing Shuai , Ai Qun Liu , Gang Wang

Context-aware compression techniques have gained increasing attention as model sizes continue to grow, introducing computational bottlenecks that hinder efficient deployment. A structured encoding approach was proposed to selectively…

Computation and Language · Computer Science 2025-02-13 Barnaby Schmitt , Alistair Grosvenor , Matthias Cunningham , Clementine Walsh , Julius Pembrokeshire , Jonathan Teel

In this work, we revisit atrous convolution, a powerful tool to explicitly adjust filter's field-of-view as well as control the resolution of feature responses computed by Deep Convolutional Neural Networks, in the application of semantic…

Computer Vision and Pattern Recognition · Computer Science 2017-12-06 Liang-Chieh Chen , George Papandreou , Florian Schroff , Hartwig Adam

Medical image segmentation is an important step in medical image analysis. With the rapid development of convolutional neural network in image processing, deep learning has been used for medical image segmentation, such as optic disc…

Computer Vision and Pattern Recognition · Computer Science 2019-03-08 Zaiwang Gu , Jun Cheng , Huazhu Fu , Kang Zhou , Huaying Hao , Yitian Zhao , Tianyang Zhang , Shenghua Gao , Jiang Liu

Irregular scene text recognition has attracted much attention from the research community, mainly due to the complexity of shapes of text in natural scene. However, recent methods either rely on shape-sensitive modules such as bounding box…

Computer Vision and Pattern Recognition · Computer Science 2020-02-11 Shangbang Long , Yushuo Guan , Kaigui Bian , Cong Yao

Extending large language models (LLMs) to process longer inputs is crucial for a wide range of applications. However, the substantial computational cost of transformers and limited generalization of positional encoding restrict the size of…

Computation and Language · Computer Science 2025-06-11 Howard Yen , Tianyu Gao , Danqi Chen

Recent works attempt to improve scene parsing performance by exploring different levels of contexts, and typically train a well-designed convolutional network to exploit useful contexts across all pixels equally. However, in this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2019-11-06 Jun Fu , Jing Liu , Yuhang Wang , Yong Li , Yongjun Bao , Jinhui Tang , Hanqing Lu

The contextual information, presented in abdominal CT scan, is relative consistent. In order to make full use of the overall 3D context, we develop a whole-volume-based coarse-to-fine framework for efficient and effective abdominal…

Image and Video Processing · Electrical Eng. & Systems 2021-11-01 Fan Zhang , Yu Wang , Hua Yang

Learned Image Compression (LIC) has shown remarkable progress in recent years. Existing works commonly employ CNN-based or self-attention-based modules as transform methods for compression. However, there is no prior research on neural…

Computer Vision and Pattern Recognition · Computer Science 2024-09-25 Yuxi Liu , Wenhan Yang , Huihui Bai , Yunchao Wei , Yao Zhao

Semantic segmentation, which refers to pixel-wise classification of an image, is a fundamental topic in computer vision owing to its growing importance in robot vision and autonomous driving industries. It provides rich information about…

Computer Vision and Pattern Recognition · Computer Science 2021-03-23 Khwaja Monib Sediqi , Hyo Jong Lee

Deploying useful Long-Context Transformer Models (LCTMs) requires addressing two key challenges: (1) A growing memory footprint due to quadratic self-attention and linear KV-cache scaling in memory as sequence length increases; (2) the…

Computation and Language · Computer Science 2025-10-15 Baisub Lee , Sanghyun Byun , Mohanad Odema , Jung Guack , Jacob Song , Woo Seong Chung

Deep convolutional neural networks (CNNs) have shown a strong ability in mining discriminative object pose and parts information for image recognition. For fine-grained recognition, context-aware rich feature representation of object/scene…

Computer Vision and Pattern Recognition · Computer Science 2021-01-19 Ardhendu Behera , Zachary Wharton , Pradeep Hewage , Asish Bera

This work is an improved system that we submitted to task 1 of DCASE2023 challenge. We propose a method of low-complexity acoustic scene classification by a parallel attention-convolution network which consists of four modules, including…

Audio and Speech Processing · Electrical Eng. & Systems 2024-06-13 Yanxiong Li , Jiaxin Tan , Guoqing Chen , Jialong Li , Yongjie Si , Qianhua He

Convolutional Pose Machine is a popular neural network architecture for articulated pose estimation. In this work we explore its empirical receptive field and realize, that it can be enhanced with integration of a global context. To do so…

Computer Vision and Pattern Recognition · Computer Science 2019-06-11 Daniil Osokin

In this work we address the task of semantic image segmentation with Deep Learning and make three main contributions that are experimentally shown to have substantial practical merit. First, we highlight convolution with upsampled filters,…

Computer Vision and Pattern Recognition · Computer Science 2017-05-15 Liang-Chieh Chen , George Papandreou , Iasonas Kokkinos , Kevin Murphy , Alan L. Yuille
‹ Prev 1 2 3 10 Next ›