Related papers: Exploit Bounding Box Annotations for Multi-label O…

CNN-RNN: A Unified Framework for Multi-label Image Classification

While deep convolutional neural networks (CNNs) have shown a great success in single-label image classification, it is important to note that real world images generally contain multiple labels, which could correspond to different objects,…

Computer Vision and Pattern Recognition · Computer Science 2016-04-18 Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, Wei Xu

Multiple Instance Learning Convolutional Neural Networks for Object Recognition

Convolutional Neural Networks (CNN) have demonstrated its successful applications in computer vision, speech recognition, and natural language processing. For object recognition, CNNs might be limited by its strict label requirement and…

Computer Vision and Pattern Recognition · Computer Science 2016-10-12 Miao Sun, Tony X. Han, Ming-Chang Liu, Ahmad Khodayari-Rostamabad

Multi-Label Classification with Label Graph Superimposing

Images or videos always contain multiple objects or actions. Multi-label recognition has been witnessed to achieve pretty performance attribute to the rapid development of deep learning technologies. Recently, graph convolution network…

Computer Vision and Pattern Recognition · Computer Science 2019-11-22 Ya Wang, Dongliang He, Fu Li, Xiang Long, Zhichao Zhou, Jinwen Ma, Shilei Wen

Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering

Object detection plays an important role in current solutions to vision and language tasks like image captioning and visual question answering. However, popular models like Faster R-CNN rely on a costly process of annotating ground-truths…

Computation and Language · Computer Science 2019-09-06 Soravit Changpinyo, Bo Pang, Piyush Sharma, Radu Soricut

Diverse Instance Discovery: Vision-Transformer for Instance-Aware Multi-Label Image Recognition

Previous works on multi-label image recognition (MLIR) usually use CNNs as a starting point for research. In this paper, we take pure Vision Transformer (ViT) as the research base and make full use of the advantages of Transformer with…

Computer Vision and Pattern Recognition · Computer Science 2022-04-25 Yunqing Hu, Xuan Jin, Yin Zhang, Haiwen Hong, Jingfeng Zhang, Feihu Yan, Yuan He, Hui Xue

Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection

Multi-label image classification is a fundamental but challenging task towards general visual understanding. Existing methods found the region-level cues (e.g., features from RoIs) can facilitate multi-label classification. Nevertheless,…

Computer Vision and Pattern Recognition · Computer Science 2019-02-22 Yongcheng Liu, Lu Sheng, Jing Shao, Junjie Yan, Shiming Xiang, Chunhong Pan

Multi-Modal Multi-Scale Deep Learning for Large-Scale Image Annotation

Image annotation aims to annotate a given image with a variable number of class labels corresponding to diverse visual concepts. In this paper, we address two main issues in large-scale image annotation: 1) how to learn a rich feature…

Computer Vision and Pattern Recognition · Computer Science 2018-10-22 Yulei Niu, Zhiwu Lu, Ji-Rong Wen, Tao Xiang, Shih-Fu Chang

Multi-Label Image Recognition with Graph Convolutional Networks

The task of multi-label image recognition is to predict a set of object labels that present in an image. As objects normally co-occur in an image, it is desirable to model the label dependencies to improve the recognition performance. To…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Zhao-Min Chen, Xiu-Shen Wei, Peng Wang, Yanwen Guo

Multi-Label Image Classification with Regional Latent Semantic Dependencies

Deep convolution neural networks (CNN) have demonstrated advanced performance on single-label image classification, and various progress also have been made to apply CNN methods on multi-label image classification, which requires to…

Computer Vision and Pattern Recognition · Computer Science 2017-03-14 Junjie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu

MlTr: Multi-label Classification with Transformer

The task of multi-label image classification is to recognize all the object labels presented in an image. Though advancing for years, small objects, similar objects and objects with high conditional probability are still the main…

Computer Vision and Pattern Recognition · Computer Science 2021-06-14 Xing Cheng, Hezheng Lin, Xiangyu Wu, Fan Yang, Dong Shen, Zhongyuan Wang, Nian Shi, Honglin Liu

Multi-label Pixelwise Classification for Reconstruction of Large-scale Urban Areas

Object classification is one of the many holy grails in computer vision and as such has resulted in a very large number of algorithms being proposed already. Specifically in recent years there has been considerable progress in this area…

Computer Vision and Pattern Recognition · Computer Science 2018-01-25 Yuanlie He, Sudhir Mudur, Charalambos Poullis

Bounding Boxes Are All We Need: Street View Image Classification via Context Encoding of Detected Buildings

Street view images classification aiming at urban land use analysis is difficult because the class labels (e.g., commercial area), are concepts with higher abstract level compared to the ones of general visual tasks (e.g., persons and…

Computer Vision and Pattern Recognition · Computer Science 2021-03-22 Kun Zhao, Yongkun Liu, Siyuan Hao, Shaoxing Lu, Hongbin Liu, Lijian Zhou

Learning Image Conditioned Label Space for Multilabel Classification

This work addresses the task of multilabel image classification. Inspired by the great success from deep convolutional neural networks (CNNs) for single-label visual-semantic embedding, we exploit extending these models for multilabel…

Computer Vision and Pattern Recognition · Computer Science 2021-01-28 Yi-Nan Li, Mei-Chen Yeh

Multi-label Annotation for Visual Multi-Task Learning Models

Deep learning requires large amounts of data, and a well-defined pipeline for labeling and augmentation. Current solutions support numerous computer vision tasks with dedicated annotation types and formats, such as bounding boxes, polygons,…

Robotics · Computer Science 2023-12-01 G. Sharma, A. Angleraud, R. Pieters

A CNN-RNN Architecture for Multi-Label Weather Recognition

Weather Recognition plays an important role in our daily lives and many computer vision applications. However, recognizing the weather conditions from a single image remains challenging and has not been studied thoroughly. Generally, most…

Computer Vision and Pattern Recognition · Computer Science 2019-04-25 Bin Zhao, Xuelong Li, Xiaoqiang Lu, Zhigang Wang

A Scalable Multilabel Classification to Deploy Deep Learning Architectures For Edge Devices

Convolution Neural Networks (CNN) have performed well in many applications such as object detection, pattern recognition, video surveillance and so on. CNN carryout feature extraction on labelled data to perform classification. Multi-label…

Machine Learning · Computer Science 2021-01-28 Tolulope A. Odetola, Ogheneuriri Oderhohwo, Syed Rafay Hasan

Unlocking ImageNet's Multi-Object Nature: Automated Large-Scale Multilabel Annotation

The original ImageNet benchmark enforces a single-label assumption, despite many images depicting multiple objects. This leads to label noise and limits the richness of the learning signal. Multi-label annotations more accurately reflect…

Computer Vision and Pattern Recognition · Computer Science 2026-03-09 Junyu Chen, Md Yousuf Harun, Christopher Kanan

Deep Attributes from Context-Aware Regional Neural Codes

Recently, many researches employ middle-layer output of convolutional neural network models (CNN) as features for different visual recognition tasks. Although promising results have been achieved in some empirical studies, such type of…

Computer Vision and Pattern Recognition · Computer Science 2015-09-09 Jianwei Luo, Jianguo Li, Jun Wang, Zhiguo Jiang, Yurong Chen

Deep Convolutional Ranking for Multilabel Image Annotation

Multilabel image annotation is one of the most important challenges in computer vision with many real-world applications. While existing work usually use conventional visual features for multilabel annotation, features based on Deep Neural…

Computer Vision and Pattern Recognition · Computer Science 2014-04-15 Yunchao Gong, Yangqing Jia, Thomas Leung, Alexander Toshev, Sergey Ioffe

Embedding Label Structures for Fine-Grained Feature Representation

Recent algorithms in convolutional neural networks (CNN) considerably advance the fine-grained image classification, which aims to differentiate subtle differences among subordinate classes. However, previous studies have rarely focused on…

Computer Vision and Pattern Recognition · Computer Science 2016-03-14 Xiaofan Zhang, Feng Zhou, Yuanqing Lin, Shaoting Zhang