Related papers: ML-Decoder: Scalable and Versatile Classification …

CLIP-Decoder : ZeroShot Multilabel Classification using Multimodal CLIP Aligned Representation

Multi-label classification is an essential task utilized in a wide variety of real-world applications. Multi-label zero-shot learning is a method for classifying images into multiple unseen categories for which no training data is…

Computer Vision and Pattern Recognition · Computer Science 2024-06-24 Muhammad Ali , Salman Khan

Query2Label: A Simple Transformer Way to Multi-Label Classification

This paper presents a simple and effective approach to solving the multi-label classification problem. The proposed approach leverages Transformer decoders to query the existence of a class label. The use of Transformer is rooted in the…

Computer Vision and Pattern Recognition · Computer Science 2021-07-23 Shilong Liu , Lei Zhang , Xiao Yang , Hang Su , Jun Zhu

Scalable Label Distribution Learning for Multi-Label Classification

Multi-label classification (MLC) refers to the problem of tagging a given instance with a set of relevant labels. Most existing MLC methods are based on the assumption that the correlation of two labels in each label pair is symmetric,…

Machine Learning · Computer Science 2024-10-04 Xingyu Zhao , Yuexuan An , Lei Qi , Xin Geng

Integration of Autoencoder and Functional Link Artificial Neural Network for Multi-label Classification

Multi-label (ML) classification is an actively researched topic currently, which deals with convoluted and overlapping boundaries that arise due to several labels being active for a particular data instance. We propose a classifier capable…

Machine Learning · Computer Science 2021-07-22 Anwesha Law , Ashish Ghosh

DenseMLLM: Standard Multimodal LLMs are Intrinsic Dense Predictors

Multimodal Large Language Models (MLLMs) have demonstrated exceptional capabilities in high-level visual understanding. However, extending these models to fine-grained dense prediction tasks, such as semantic segmentation and depth…

Computer Vision and Pattern Recognition · Computer Science 2026-02-17 Yi Li , Hongze Shen , Lexiang Tang , Xin Li , Xinpeng Ding , Yinsong Liu , Deqiang Jiang , Xing Sun , Xiaomeng Li

Learning Semantic-Aware Representation in Visual-Language Models for Multi-Label Recognition with Partial Labels

Multi-label recognition with partial labels (MLR-PL), in which only some labels are known while others are unknown for each image, is a practical task in computer vision, since collecting large-scale and complete multi-label datasets is…

Computer Vision and Pattern Recognition · Computer Science 2024-12-17 Haoxian Ruan , Zhihua Xu , Zhijing Yang , Yongyi Lu , Jinghui Qin , Tianshui Chen

MLMC: Interactive multi-label multi-classifier evaluation without confusion matrices

Machine learning-based classifiers are commonly evaluated by metrics like accuracy, but deeper analysis is required to understand their strengths and weaknesses. MLMC is a visual exploration tool that tackles the challenge of multi-label…

Machine Learning · Computer Science 2025-01-27 Aleksandar Doknic , Torsten Möller

Semantic-Unit-Based Dilated Convolution for Multi-Label Text Classification

We propose a novel model for multi-label text classification, which is based on sequence-to-sequence learning. The model generates higher-level semantic unit representations with multi-level dilated convolution as well as a corresponding…

Computation and Language · Computer Science 2018-11-13 Junyang Lin , Qi Su , Pengcheng Yang , Shuming Ma , Xu Sun

Open Vocabulary Multi-Label Classification with Dual-Modal Decoder on Aligned Visual-Textual Features

In computer vision, multi-label recognition are important tasks with many real-world applications, but classifying previously unseen labels remains a significant challenge. In this paper, we propose a novel algorithm, Aligned Dual moDality…

Computer Vision and Pattern Recognition · Computer Science 2023-10-10 Shichao Xu , Yikang Li , Jenhao Hsiao , Chiuman Ho , Zhu Qi

ML-Net: multi-label classification of biomedical texts with deep neural networks

In multi-label text classification, each textual document can be assigned with one or more labels. Due to this nature, the multi-label text classification task is often considered to be more challenging compared to the binary or multi-class…

Information Retrieval · Computer Science 2019-07-02 Jingcheng Du , Qingyu Chen , Yifan Peng , Yang Xiang , Cui Tao , Zhiyong Lu

Similarity-based Multi-label Learning

Multi-label classification is an important learning problem with many applications. In this work, we propose a principled similarity-based approach for multi-label learning called SML. We also introduce a similarity-based approach for…

Machine Learning · Statistics 2017-10-31 Ryan A. Rossi , Nesreen K. Ahmed , Hoda Eldardiry , Rong Zhou

Multi-label Cluster Discrimination for Visual Representation Learning

Contrastive Language Image Pre-training (CLIP) has recently demonstrated success across various tasks due to superior feature representation empowered by image-text contrastive learning. However, the instance discrimination method used by…

Computer Vision and Pattern Recognition · Computer Science 2024-11-07 Xiang An , Kaicheng Yang , Xiangzi Dai , Ziyong Feng , Jiankang Deng

From Drop-off to Recovery: A Mechanistic Analysis of Segmentation in MLLMs

Multimodal Large Language Models (MLLMs) are increasingly applied to pixel-level vision tasks, yet their intrinsic capacity for spatial understanding remains poorly understood. We investigate segmentation capacity through a layerwise linear…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Boyong Wu , Sanghwan Kim , Zeynep Akata

Combining Metric Learning and Attention Heads For Accurate and Efficient Multilabel Image Classification

Multi-label image classification allows predicting a set of labels from a given image. Unlike multiclass classification, where only one label per image is assigned, such a setup is applicable for a broader range of applications. In this…

Computer Vision and Pattern Recognition · Computer Science 2022-12-21 Kirill Prokofiev , Vladislav Sovrasov

Enhancing Visual Continual Learning with Language-Guided Supervision

Continual learning (CL) aims to empower models to learn new tasks without forgetting previously acquired knowledge. Most prior works concentrate on the techniques of architectures, replay data, regularization, \etc. However, the category…

Computer Vision and Pattern Recognition · Computer Science 2024-03-26 Bolin Ni , Hongbo Zhao , Chenghao Zhang , Ke Hu , Gaofeng Meng , Zhaoxiang Zhang , Shiming Xiang

An Efficient Large-scale Semi-supervised Multi-label Classifier Capable of Handling Missing labels

Multi-label classification has received considerable interest in recent years. Multi-label classifiers have to address many problems including: handling large-scale datasets with many instances and a large set of labels, compensating…

Machine Learning · Computer Science 2016-06-21 Amirhossein Akbarnejad , Mahdieh Soleymani Baghshah

YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection

We aim at providing the object detection community with an efficient and performant object detector, termed YOLO-MS. The core design is based on a series of investigations on how multi-branch features of the basic block and convolutions…

Computer Vision and Pattern Recognition · Computer Science 2025-02-21 Yuming Chen , Xinbin Yuan , Jiabao Wang , Ruiqi Wu , Xiang Li , Qibin Hou , Ming-Ming Cheng

A Multi-Model Adaptation of Speculative Decoding for Classification

The current study introduces a novel adaptation of speculative decoding, repurposed from generation to classification tasks. We propose a multi-model framework employing up to three lightweight worker models and a single, more robust judge…

Computation and Language · Computer Science 2025-03-25 Somnath Roy , Padharthi Sreekar , Srivatsa Narasimha , Anubhav Anand

Multi-label Image Recognition by Recurrently Discovering Attentional Regions

This paper proposes a novel deep architecture to address multi-label image recognition, a fundamental and practical task towards general visual understanding. Current solutions for this task usually rely on an extra step of extracting…

Computer Vision and Pattern Recognition · Computer Science 2017-11-09 Zhouxia Wang , Tianshui Chen , Guanbin Li , Ruijia Xu , Liang Lin

Multi-Head Encoding for Extreme Label Classification

The number of categories of instances in the real world is normally huge, and each instance may contain multiple labels. To distinguish these massive labels utilizing machine learning, eXtreme Label Classification (XLC) has been…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Daojun Liang , Haixia Zhang , Dongfeng Yuan , Minggao Zhang