Related papers: Semantically Consistent Multi-view Representation …

SLRL: Structured Latent Representation Learning for Multi-view Clustering

In recent years, Multi-View Clustering (MVC) has attracted increasing attention for its potential to reduce the annotation burden associated with large datasets. The aim of MVC is to exploit the inherent consistency and complementarity…

Machine Learning · Computer Science 2024-07-12 Zhangci Xiong , Meng Cao

URRL-IMVC: Unified and Robust Representation Learning for Incomplete Multi-View Clustering

Incomplete multi-view clustering (IMVC) aims to cluster multi-view data that are only partially available. This poses two main challenges: effectively leveraging multi-view information and mitigating the impact of missing views. Prevailing…

Machine Learning · Computer Science 2024-07-15 Ge Teng , Ting Mao , Chen Shen , Xiang Tian , Xuesong Liu , Yaowu Chen , Jieping Ye

Spatially Consistent Representation Learning

Self-supervised learning has been widely used to obtain transferrable representations from unlabeled images. Especially, recent contrastive learning methods have shown impressive performances on downstream image classification tasks. While…

Computer Vision and Pattern Recognition · Computer Science 2021-04-29 Byungseok Roh , Wuhyun Shin , Ildoo Kim , Sungwoong Kim

UMCL: Unimodal-generated Multimodal Contrastive Learning for Cross-compression-rate Deepfake Detection

In deepfake detection, the varying degrees of compression employed by social media platforms pose significant challenges for model generalization and reliability. Although existing methods have progressed from single-modal to multimodal…

Computer Vision and Pattern Recognition · Computer Science 2025-11-25 Ching-Yi Lai , Chih-Yu Jian , Pei-Cheng Chuang , Chia-Ming Lee , Chih-Chung Hsu , Chiou-Ting Hsu , Chia-Wen Lin

MMRL: Multi-Modal Representation Learning for Vision-Language Models

Large-scale pre-trained Vision-Language Models (VLMs) have become essential for transfer learning across diverse tasks. However, adapting these models with limited few-shot data often leads to overfitting, diminishing their performance on…

Machine Learning · Computer Science 2025-03-27 Yuncheng Guo , Xiaodong Gu

Multiview Self-Representation Learning across Heterogeneous Views

Features of the same sample generated by different pretrained models often exhibit inherently distinct feature distributions because of discrepancies in the model pretraining objectives or architectures. Learning invariant representations…

Computer Vision and Pattern Recognition · Computer Science 2026-02-05 Jie Chen , Zhu Wang , Chuanbin Liu , Xi Peng

Semantic-guided Representation Learning for Multi-Label Recognition

Multi-label Recognition (MLR) involves assigning multiple labels to each data instance in an image, offering advantages over single-label classification in complex scenarios. However, it faces the challenge of annotating all relevant…

Machine Learning · Computer Science 2025-06-03 Ruhui Zhang , Hezhe Qiao , Pengcheng Xu , Mingsheng Shang , Lin Chen

Unified Multi-modal Unsupervised Representation Learning for Skeleton-based Action Understanding

Unsupervised pre-training has shown great success in skeleton-based action understanding recently. Existing works typically train separate modality-specific models, then integrate the multi-modal information for action understanding by a…

Computer Vision and Pattern Recognition · Computer Science 2023-11-07 Shengkai Sun , Daizong Liu , Jianfeng Dong , Xiaoye Qu , Junyu Gao , Xun Yang , Xun Wang , Meng Wang

Multi-level Feature Learning for Contrastive Multi-view Clustering

Multi-view clustering can explore common semantics from multiple views and has attracted increasing attention. However, existing works punish multiple objectives in the same feature space, where they ignore the conflict between learning…

Machine Learning · Computer Science 2022-03-28 Jie Xu , Huayi Tang , Yazhou Ren , Liang Peng , Xiaofeng Zhu , Lifang He

Self-Supervised Representation Learning with Meta Comprehensive Regularization

Self-Supervised Learning (SSL) methods harness the concept of semantic invariance by utilizing data augmentation strategies to produce similar representations for different deformations of the same input. Essentially, the model captures the…

Computer Vision and Pattern Recognition · Computer Science 2024-03-05 Huijie Guo , Ying Ba , Jie Hu , Lingyu Si , Wenwen Qiang , Lei Shi

Principled Multimodal Representation Learning

Multimodal representation learning seeks to create a unified representation space by integrating diverse data modalities to improve multimodal understanding. Traditional methods often depend on pairwise contrastive learning, which relies on…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Xiaohao Liu , Xiaobo Xia , See-Kiong Ng , Tat-Seng Chua

Robust Multi-View Learning via Representation Fusion of Sample-Level Attention and Alignment of Simulated Perturbation

Recently, multi-view learning (MVL) has garnered significant attention due to its ability to fuse discriminative information from multiple views. However, real-world multi-view datasets are often heterogeneous and imperfect, which usually…

Computer Vision and Pattern Recognition · Computer Science 2025-07-25 Jie Xu , Na Zhao , Gang Niu , Masashi Sugiyama , Xiaofeng Zhu

Self-supervised Consensus Representation Learning for Attributed Graph

Attempting to fully exploit the rich information of topological structure and node features for attributed graph, we introduce self-supervised learning mechanism to graph representation learning and propose a novel Self-supervised Consensus…

Social and Information Networks · Computer Science 2021-08-12 Changshu Liu , Liangjian Wen , Zhao Kang , Guangchun Luo , Ling Tian

Semantics-Consistent Representation Learning for Remote Sensing Image-Voice Retrieval

With the development of earth observation technology, massive amounts of remote sensing (RS) images are acquired. To find useful information from these images, cross-modal RS image-voice retrieval provides a new insight. This paper aims to…

Multimedia · Computer Science 2022-01-05 Hailong Ning , Bin Zhao , Yuan Yuan

Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning

Self-Supervised Contrastive Learning has proven effective in deriving high-quality representations from unlabeled data. However, a major challenge that hinders both unimodal and multimodal contrastive learning is feature suppression, a…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Jihai Zhang , Xiang Lan , Xiaoye Qu , Yu Cheng , Mengling Feng , Bryan Hooi

Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition

Recognizing multiple labels of images is a practical and challenging task, and significant progress has been made by searching semantic-aware regions and modeling label dependency. However, current methods cannot locate the semantic regions…

Computer Vision and Pattern Recognition · Computer Science 2019-08-21 Tianshui Chen , Muxin Xu , Xiaolu Hui , Hefeng Wu , Liang Lin

Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning

In contrastive self-supervised learning, the common way to learn discriminative representation is to pull different augmented "views" of the same image closer while pushing all other images further apart, which has been proven to be…

Computer Vision and Pattern Recognition · Computer Science 2022-12-14 Kaiyou Song , Shan Zhang , Zihao An , Zimeng Luo , Tong Wang , Jin Xie

Conditional Representation Learning for Customized Tasks

Conventional representation learning methods learn a universal representation that primarily captures dominant semantics, which may not always align with customized downstream tasks. For instance, in animal habitat analysis, researchers…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 Honglin Liu , Chao Sun , Peng Hu , Yunfan Li , Xi Peng

CURL: Co-trained Unsupervised Representation Learning for Image Classification

In this paper we propose a strategy for semi-supervised image classification that leverages unsupervised representation learning and co-training. The strategy, that is called CURL from Co-trained Unsupervised Representation Learning,…

Machine Learning · Computer Science 2015-09-14 Simone Bianco , Gianluigi Ciocca , Claudio Cusano

MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language Models

Large-scale pre-trained Vision-Language Models (VLMs) have significantly advanced transfer learning across diverse tasks. However, adapting these models with limited few-shot data often leads to overfitting, undermining their ability to…

Computer Vision and Pattern Recognition · Computer Science 2025-05-16 Yuncheng Guo , Xiaodong Gu