Related papers: Spatial-context-aware deep neural network for mult…

Recurrent Attentional Reinforcement Learning for Multi-label Image Recognition

Recognizing multiple labels of images is a fundamental but challenging task in computer vision, and remarkable progress has been attained by localizing semantic-aware image regions and predicting their labels with deep convolutional neural…

Computer Vision and Pattern Recognition · Computer Science 2017-12-21 Tianshui Chen , Zhouxia Wang , Guanbin Li , Liang Lin

Multi-label Classification using Deep Multi-order Context-aware Kernel Networks

Multi-label classification is a challenging task in pattern recognition. Many deep learning methods have been proposed and largely enhanced classification performance. However, most of the existing sophisticated methods ignore context in…

Computer Vision and Pattern Recognition · Computer Science 2024-12-30 Mingyuan Jiu , Hailong Zhu , Hichem Sahbi

Multi-label Image Recognition by Recurrently Discovering Attentional Regions

This paper proposes a novel deep architecture to address multi-label image recognition, a fundamental and practical task towards general visual understanding. Current solutions for this task usually rely on an extra step of extracting…

Computer Vision and Pattern Recognition · Computer Science 2017-11-09 Zhouxia Wang , Tianshui Chen , Guanbin Li , Ruijia Xu , Liang Lin

Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification

Multi-label image classification is a fundamental but challenging task in computer vision. Great progress has been achieved by exploiting semantic relations between labels in recent years. However, conventional approaches are unable to…

Computer Vision and Pattern Recognition · Computer Science 2017-04-03 Feng Zhu , Hongsheng Li , Wanli Ouyang , Nenghai Yu , Xiaogang Wang

Semantic-Aware Representation Learning via Conditional Transport for Multi-Label Image Classification

Multi-label image classification is a critical task in machine learning that aims to accurately assign multiple labels to a single image. While existing methods often utilize attention mechanisms or graph convolutional networks to model…

Computer Vision and Pattern Recognition · Computer Science 2025-11-04 Ren-Dong Xie , Zhi-Fen He , Bo Li , Bin Liu , Jin-Yan Hu

Semantic Embedded Deep Neural Network: A Generic Approach to Boost Multi-Label Image Classification Performance

Fine-grained multi-label classification models have broad applications in e-commerce, such as visual based label predictions ranging from fashion attribute detection to brand recognition. One challenge to achieve satisfactory performance…

Computer Vision and Pattern Recognition · Computer Science 2023-06-07 Xin Shen , Xiaonan Zhao , Rui Luo

Learning Structured Inference Neural Networks with Label Relations

Images of scenes have various objects as well as abundant attributes, and diverse levels of visual categorization are possible. A natural image could be assigned with fine-grained labels that describe major components, coarse-grained labels…

Computer Vision and Pattern Recognition · Computer Science 2016-10-25 Hexiang Hu , Guang-Tong Zhou , Zhiwei Deng , Zicheng Liao , Greg Mori

Learning Deep Representations for Scene Labeling with Semantic Context Guided Supervision

Scene labeling is a challenging classification problem where each input image requires a pixel-level prediction map. Recently, deep-learning-based methods have shown their effectiveness on solving this problem. However, we argue that the…

Computer Vision and Pattern Recognition · Computer Science 2017-06-12 Zhe Wang , Hongsheng Li , Wanli Ouyang , Xiaogang Wang

Learning Discriminative Representations for Multi-Label Image Recognition

Multi-label recognition is a fundamental, and yet is a challenging task in computer vision. Recently, deep learning models have achieved great progress towards learning discriminative features from input images. However, conventional…

Computer Vision and Pattern Recognition · Computer Science 2021-07-26 Mohammed Hassanin , Ibrahim Radwan , Salman Khan , Murat Tahtali

Multi-layered Semantic Representation Network for Multi-label Image Classification

Multi-label image classification (MLIC) is a fundamental and practical task, which aims to assign multiple possible labels to an image. In recent years, many deep convolutional neural network (CNN) based approaches have been proposed which…

Computer Vision and Pattern Recognition · Computer Science 2023-10-10 Xiwen Qu , Hao Che , Jun Huang , Linchuan Xu , Xiao Zheng

Supervised Multilabel Image Classification Using Residual Networks with Probabilistic Reasoning

Multilabel image categorization has drawn interest recently because of its numerous computer vision applications. The proposed work introduces a novel method for classifying multilabel images using the COCO-2014 dataset and a modified…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Lokender Singh , Saksham Kumar , Chandan Kumar

Multi-label Classification with Panoptic Context Aggregation Networks

Context modeling is crucial for visual recognition, enabling highly discriminative image representations by integrating both intrinsic and extrinsic relationships between objects and labels in images. A limitation in current approaches is…

Computer Vision and Pattern Recognition · Computer Science 2025-12-30 Mingyuan Jiu , Hailong Zhu , Wenchuan Wei , Hichem Sahbi , Rongrong Ji , Mingliang Xu

A Baseline for Multi-Label Image Classification Using An Ensemble of Deep Convolutional Neural Networks

Recent studies on multi-label image classification have focused on designing more complex architectures of deep neural networks such as the use of attention mechanisms and region proposal networks. Although performance gains have been…

Computer Vision and Pattern Recognition · Computer Science 2019-05-10 Qian Wang , Ning Jia , Toby P. Breckon

Learning Semantic Segmentation with Diverse Supervision

Models based on deep convolutional neural networks (CNN) have significantly improved the performance of semantic segmentation. However, learning these models requires a large amount of training images with pixel-level labels, which are very…

Computer Vision and Pattern Recognition · Computer Science 2018-02-05 Linwei Ye , Zhi Liu , Yang Wang

ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization

We aim to localize objects in images using image-level supervision only. Previous approaches to this problem mainly focus on discriminative object regions and often fail to locate precise object boundaries. We address this problem by…

Computer Vision and Pattern Recognition · Computer Science 2016-09-15 Vadim Kantorov , Maxime Oquab , Minsu Cho , Ivan Laptev

Context-Based Semantic-Aware Alignment for Semi-Supervised Multi-Label Learning

Due to the lack of extensive precisely-annotated multi-label data in real word, semi-supervised multi-label learning (SSMLL) has gradually gained attention. Abundant knowledge embedded in vision-language models (VLMs) pre-trained on…

Computer Vision and Pattern Recognition · Computer Science 2024-12-30 Heng-Bo Fan , Ming-Kun Xie , Jia-Hao Xiao , Sheng-Jun Huang

DualCoOp: Fast Adaptation to Multi-Label Recognition with Limited Annotations

Solving multi-label recognition (MLR) for images in the low-label regime is a challenging task with many real-world applications. Recent work learns an alignment between textual and visual spaces to compensate for insufficient image labels,…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Ximeng Sun , Ping Hu , Kate Saenko

Deep Embedding for Spatial Role Labeling

This paper introduces the visually informed embedding of word (VIEW), a continuous vector representation for a word extracted from a deep neural model trained using the Microsoft COCO data set to forecast the spatial arrangements between…

Computation and Language · Computer Science 2016-03-29 Oswaldo Ludwig , Xiao Liu , Parisa Kordjamshidi , Marie-Francine Moens

Semantic-Aware Dual Contrastive Learning for Multi-label Image Classification

Extracting image semantics effectively and assigning corresponding labels to multiple objects or attributes for natural images is challenging due to the complex scene contents and confusing label dependencies. Recent works have focused on…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Leilei Ma , Dengdi Sun , Lei Wang , Haifeng Zhao , Bin Luo

Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification

Multi-label image and video classification are fundamental yet challenging tasks in computer vision. The main challenges lie in capturing spatial or temporal dependencies between labels and discovering the locations of discriminative…

Computer Vision and Pattern Recognition · Computer Science 2020-03-30 Renchun You , Zhiyao Guo , Lei Cui , Xiang Long , Yingze Bao , Shilei Wen