Related papers: Learning to recognize occluded and small objects w…

Multi-Label Self-Supervised Learning with Scene Images

Self-supervised learning (SSL) methods targeting scene images have seen a rapid growth recently, and they mostly rely on either a dedicated dense matching mechanism or a costly unsupervised object discovery module. This paper shows that…

Computer Vision and Pattern Recognition · Computer Science 2023-10-02 Ke Zhu , Minghao Fu , Jianxin Wu

Masked Supervised Learning for Semantic Segmentation

Self-attention is of vital importance in semantic segmentation as it enables modeling of long-range context, which translates into improved performance. We argue that it is equally important to model short-range context, especially to…

Computer Vision and Pattern Recognition · Computer Science 2022-12-29 Hasib Zunair , A. Ben Hamza

Co-Learning: Towards Semi-Supervised Object Detection with Road-side Cameras

Recently, deep learning has experienced rapid expansion, contributing significantly to the progress of supervised learning methodologies. However, acquiring labeled data in real-world settings can be costly, labor-intensive, and sometimes…

Computer Vision and Pattern Recognition · Computer Science 2024-12-02 Jicheng Yuan , Anh Le-Tuan , Ali Ganbarov , Manfred Hauswirth , Danh Le-Phuoc

Epsilon: Exploring Comprehensive Visual-Semantic Projection for Multi-Label Zero-Shot Learning

This paper investigates a challenging problem of zero-shot learning in the multi-label scenario (MLZSL), wherein the model is trained to recognize multiple unseen classes within a sample (e.g., an image) based on seen classes and auxiliary…

Computer Vision and Pattern Recognition · Computer Science 2024-08-27 Ziming Liu , Jingcai Guo , Song Guo , Xiaocheng Lu

OCC-MLLM:Empowering Multimodal Large Language Model For the Understanding of Occluded Objects

There is a gap in the understanding of occluded objects in existing large-scale visual language multi-modal models. Current state-of-the-art multimodal models fail to provide satisfactory results in describing occluded objects for…

Computer Vision and Pattern Recognition · Computer Science 2024-10-03 Wenmo Qiu , Xinhan Di

Semantic Representation and Dependency Learning for Multi-Label Image Recognition

Recently many multi-label image recognition (MLR) works have made significant progress by introducing pre-trained object detection models to generate lots of proposals or utilizing statistical label co-occurrence enhance the correlation…

Computer Vision and Pattern Recognition · Computer Science 2023-01-10 Tao Pu , Mingzhan Sun , Hefeng Wu , Tianshui Chen , Ling Tian , Liang Lin

Self-Supervised Learning for Image Segmentation: A Comprehensive Survey

Supervised learning demands large amounts of precisely annotated data to achieve promising results. Such data curation is labor-intensive and imposes significant overhead regarding time and costs. Self-supervised learning (SSL) partially…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Thangarajah Akilan , Nusrat Jahan , Wandong Zhang

Context-Aware Zero-Shot Learning for Object Recognition

Zero-Shot Learning (ZSL) aims at classifying unlabeled objects by leveraging auxiliary knowledge, such as semantic representations. A limitation of previous approaches is that only intrinsic properties of objects, e.g. their visual…

Computer Vision and Pattern Recognition · Computer Science 2019-05-01 Eloi Zablocki , Patrick Bordes , Benjamin Piwowarski , Laure Soulier , Patrick Gallinari

Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning

Masked Image Modeling (MIM) has emerged as a promising method for deriving visual representations from unlabeled image data by predicting missing pixels from masked portions of images. It excels in region-aware learning and provides strong…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Yibing Wei , Abhinav Gupta , Pedro Morgado

Learning Semantic-Aware Representation in Visual-Language Models for Multi-Label Recognition with Partial Labels

Multi-label recognition with partial labels (MLR-PL), in which only some labels are known while others are unknown for each image, is a practical task in computer vision, since collecting large-scale and complete multi-label datasets is…

Computer Vision and Pattern Recognition · Computer Science 2024-12-17 Haoxian Ruan , Zhihua Xu , Zhijing Yang , Yongyi Lu , Jinghui Qin , Tianshui Chen

MLSL: Multi-Level Self-Supervised Learning for Domain Adaptation with Spatially Independent and Semantically Consistent Labeling

Most of the recent Deep Semantic Segmentation algorithms suffer from large generalization errors, even when powerful hierarchical representation models based on convolutional neural networks have been employed. This could be attributed to…

Computer Vision and Pattern Recognition · Computer Science 2019-10-01 Javed Iqbal , Mohsen Ali

Sequential Amodal Segmentation via Cumulative Occlusion Learning

To fully understand the 3D context of a single image, a visual system must be able to segment both the visible and occluded regions of objects, while discerning their occlusion order. Ideally, the system should be able to handle any object…

Computer Vision and Pattern Recognition · Computer Science 2024-05-10 Jiayang Ao , Qiuhong Ke , Krista A. Ehinger

Semi-TCL: Semi-Supervised Track Contrastive Representation Learning

Online tracking of multiple objects in videos requires strong capacity of modeling and matching object appearances. Previous methods for learning appearance embedding mostly rely on instance-level matching without considering the temporal…

Computer Vision and Pattern Recognition · Computer Science 2021-07-07 Wei Li , Yuanjun Xiong , Shuo Yang , Mingze Xu , Yongxin Wang , Wei Xia

Towards Human-Machine Cooperation: Self-supervised Sample Mining for Object Detection

Though quite challenging, leveraging large-scale unlabeled or partially labeled images in a cost-effective way has increasingly attracted interests for its great importance to computer vision. To tackle this problem, many Active Learning…

Computer Vision and Pattern Recognition · Computer Science 2018-05-25 Keze Wang , Xiaopeng Yan , Dongyu Zhang , Lei Zhang , Liang Lin

Masked Cross-image Encoding for Few-shot Segmentation

Few-shot segmentation (FSS) is a dense prediction task that aims to infer the pixel-wise labels of unseen classes using only a limited number of annotated images. The key challenge in FSS is to classify the labels of query pixels using…

Computer Vision and Pattern Recognition · Computer Science 2023-08-23 Wenbo Xu , Huaxi Huang , Ming Cheng , Litao Yu , Qiang Wu , Jian Zhang

Incomplete Multi-Label Image Recognition by Co-learning Semantic-Aware Features and Label Recovery

Multi-label image recognition with incomplete labels is a challenging yet vital task in computer vision, which faces two fundamental challenges: learning semantic-aware features and recovering missing labels. In this paper, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2026-03-03 Zhi-Fen He , Ren-Dong Xie , Bo Li , Bin Liu , Jin-Yan Hu

Matching Multiple Perspectives for Efficient Representation Learning

Representation learning approaches typically rely on images of objects captured from a single perspective that are transformed using affine transformations. Additionally, self-supervised learning, a successful paradigm of representation…

Computer Vision and Pattern Recognition · Computer Science 2022-08-17 Omiros Pantazis , Mathew Salvaris

OCC-MLLM-Alpha:Empowering Multi-modal Large Language Model for the Understanding of Occluded Objects with Self-Supervised Test-Time Learning

There is a gap in the understanding of occluded objects in existing large-scale visual language multi-modal models. Current state-of-the-art multi-modal models fail to provide satisfactory results in describing occluded objects through…

Computer Vision and Pattern Recognition · Computer Science 2024-10-04 Shuxin Yang , Xinhan Di

Detecting Semantic Parts on Partially Occluded Objects

In this paper, we address the task of detecting semantic parts on partially occluded objects. We consider a scenario where the model is trained using non-occluded images but tested on occluded images. The motivation is that there are…

Computer Vision and Pattern Recognition · Computer Science 2017-07-26 Jianyu Wang , Cihang Xie , Zhishuai Zhang , Jun Zhu , Lingxi Xie , Alan Yuille

Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based on Weakly Supervised Learning

Supervised object detection and semantic segmentation require object or even pixel level annotations. When there exist image level labels only, it is challenging for weakly supervised algorithms to achieve accurate predictions. The accuracy…

Computer Vision and Pattern Recognition · Computer Science 2018-03-06 Weifeng Ge , Sibei Yang , Yizhou Yu