Related papers: Multi-layer Feature Aggregation for Deep Scene Par…

Scene Parsing with Multiscale Feature Learning, Purity Trees, and Optimal Covers

Scene parsing, or semantic segmentation, consists in labeling each pixel in an image with the category of the object it belongs to. It is a challenging task that involves the simultaneous detection, segmentation and recognition of all the…

Computer Vision and Pattern Recognition · Computer Science 2015-06-09 Clément Farabet , Camille Couprie , Laurent Najman , Yann LeCun

Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and Sampling

Traditionally, training neural networks to perform semantic segmentation required expensive human-made annotations. But more recently, advances in the field of unsupervised learning have made significant progress on this issue and towards…

Computer Vision and Pattern Recognition · Computer Science 2024-03-27 Leon Sick , Dominik Engel , Pedro Hermosilla , Timo Ropinski

Deep Layer Aggregation

Visual recognition requires rich representations that span levels from low to high, scales from small to large, and resolutions from fine to coarse. Even with the depth of features in a convolutional network, a layer in isolation is not…

Computer Vision and Pattern Recognition · Computer Science 2019-01-07 Fisher Yu , Dequan Wang , Evan Shelhamer , Trevor Darrell

Feature boosting with efficient attention for scene parsing

The complexity of scene parsing grows with the number of object and scene classes, which is higher in unrestricted open scenes. The biggest challenge is to model the spatial relation between scene elements while succeeding in identifying…

Computer Vision and Pattern Recognition · Computer Science 2024-07-26 Vivek Singh , Shailza Sharma , Fabio Cuzzolin

HDF: Hybrid Deep Features for Scene Image Representation

Nowadays it is prevalent to take features extracted from pre-trained deep learning models as image representations which have achieved promising classification performance. Existing methods usually consider either object-based features or…

Computer Vision and Pattern Recognition · Computer Science 2020-10-13 Chiranjibi Sitaula , Yong Xiang , Anish Basnet , Sunil Aryal , Xuequan Lu

Unsupervised Feature Learning for Dense Correspondences across Scenes

We propose a fast, accurate matching method for estimating dense pixel correspondences across scenes. It is a challenging problem to estimate dense pixel correspondences between images depicting different scenes or instances of the same…

Computer Vision and Pattern Recognition · Computer Science 2015-04-24 Chao Zhang , Chunhua Shen , Tingzhi Shen

Global and Local Features through Gaussian Mixture Models on Image Semantic Segmentation

The semantic segmentation task aims at dense classification at the pixel-wise level. Deep models exhibited progress in tackling this task. However, one remaining problem with these approaches is the loss of spatial precision, often produced…

Computer Vision and Pattern Recognition · Computer Science 2022-07-20 Darwin Saire , Adín Ramírez Rivera

MoE-SPNet: A Mixture-of-Experts Scene Parsing Network

Scene parsing is an indispensable component in understanding the semantics within a scene. Traditional methods rely on handcrafted local features and probabilistic graphical models to incorporate local and global cues. Recently, methods…

Computer Vision and Pattern Recognition · Computer Science 2018-06-20 Huan Fu , Mingming Gong , Chaohui Wang , Dacheng Tao

Deep Object Co-segmentation via Spatial-Semantic Network Modulation

Object co-segmentation is to segment the shared objects in multiple relevant images, which has numerous applications in computer vision. This paper presents a spatial and semantic modulated deep network framework for object co-segmentation.…

Computer Vision and Pattern Recognition · Computer Science 2019-12-02 Kaihua Zhang , Jin Chen , Bo Liu , Qingshan Liu

Learning Multi-level Region Consistency with Dense Multi-label Networks for Semantic Segmentation

Semantic image segmentation is a fundamental task in image understanding. Per-pixel semantic labelling of an image benefits greatly from the ability to consider region consistency both locally and globally. However, many Fully Convolutional…

Computer Vision and Pattern Recognition · Computer Science 2017-01-26 Tong Shen , Guosheng Lin , Chunhua Shen , Ian Reid

Learning Deep Representations for Scene Labeling with Semantic Context Guided Supervision

Scene labeling is a challenging classification problem where each input image requires a pixel-level prediction map. Recently, deep-learning-based methods have shown their effectiveness on solving this problem. However, we argue that the…

Computer Vision and Pattern Recognition · Computer Science 2017-06-12 Zhe Wang , Hongsheng Li , Wanli Ouyang , Xiaogang Wang

Deep Learning for Scene Classification: A Survey

Scene classification, aiming at classifying a scene image to one of the predefined scene categories by comprehending the entire image, is a longstanding, fundamental and challenging problem in computer vision. The rise of large-scale…

Computer Vision and Pattern Recognition · Computer Science 2021-02-23 Delu Zeng , Minyu Liao , Mohammad Tavakolian , Yulan Guo , Bolei Zhou , Dewen Hu , Matti Pietikäinen , Li Liu

Unified Perceptual Parsing for Scene Understanding

Humans recognize the visual world at multiple levels: we effortlessly categorize scenes and detect objects inside, while also identifying the textures and surfaces of the objects along with their different compositional parts. In this…

Computer Vision and Pattern Recognition · Computer Science 2018-07-27 Tete Xiao , Yingcheng Liu , Bolei Zhou , Yuning Jiang , Jian Sun

Unsupervised Feature Learning by Deep Sparse Coding

In this paper, we propose a new unsupervised feature learning framework, namely Deep Sparse Coding (DeepSC), that extends sparse coding to a multi-layer architecture for visual object recognition tasks. The main innovation of the framework…

Machine Learning · Computer Science 2013-12-23 Yunlong He , Koray Kavukcuoglu , Yun Wang , Arthur Szlam , Yanjun Qi

Multi-Modal Multi-Scale Deep Learning for Large-Scale Image Annotation

Image annotation aims to annotate a given image with a variable number of class labels corresponding to diverse visual concepts. In this paper, we address two main issues in large-scale image annotation: 1) how to learn a rich feature…

Computer Vision and Pattern Recognition · Computer Science 2018-10-22 Yulei Niu , Zhiwu Lu , Ji-Rong Wen , Tao Xiang , Shih-Fu Chang

Investigating the feature collection for semantic segmentation via single skip connection

Since the study of deep convolutional neural network became prevalent, one of the important discoveries is that a feature map from a convolutional network can be extracted before going into the fully connected layer and can be used as a…

Computer Vision and Pattern Recognition · Computer Science 2017-10-24 Jonghwa Yim , Kyung-Ah Sohn

Boosting Real-Time Driving Scene Parsing with Shared Semantics

Real-time scene parsing is a fundamental feature for autonomous driving vehicles with multiple cameras. In this letter we demonstrate that sharing semantics between cameras with different perspectives and overlapped views can boost the…

Computer Vision and Pattern Recognition · Computer Science 2020-01-14 Zhenzhen Xiang , Anbo Bao , Jie Li , Jianbo Su

Self-attention on Multi-Shifted Windows for Scene Segmentation

Scene segmentation in images is a fundamental yet challenging problem in visual content understanding, which is to learn a model to assign every image pixel to a categorical label. One of the challenges for this learning task is to consider…

Computer Vision and Pattern Recognition · Computer Science 2022-07-12 Litao Yu , Zhibin Li , Jian Zhang , Qiang Wu

Learning Local Features with Context Aggregation for Visual Localization

Keypoint detection and description is fundamental yet important in many vision applications. Most existing methods use detect-then-describe or detect-and-describe strategy to learn local features without considering their context…

Computer Vision and Pattern Recognition · Computer Science 2020-06-02 Siyu Hong , Kunhong Li , Yongcong Zhang , Zhiheng Fu , Mengyi Liu , Yulan Guo

Attention to Scale: Scale-aware Semantic Image Segmentation

Incorporating multi-scale features in fully convolutional neural networks (FCNs) has been a key element to achieving state-of-the-art performance on semantic image segmentation. One common way to extract multi-scale features is to feed…

Computer Vision and Pattern Recognition · Computer Science 2016-06-03 Liang-Chieh Chen , Yi Yang , Jiang Wang , Wei Xu , Alan L. Yuille