Related papers: Hyperbolic Contrastive Learning for Visual Represe…

Understanding Hyperbolic Metric Learning through Hard Negative Sampling

In recent years, there has been a growing trend of incorporating hyperbolic geometry methods into computer vision. While these methods have achieved state-of-the-art performance on various metric learning tasks using hyperbolic distance…

Computer Vision and Pattern Recognition · Computer Science · 2024-05-06 · Yun Yue, Fangzhou Lin, Guanyi Mou, Ziming Zhang

Learning Visual Hierarchies in Hyperbolic Space for Image Retrieval

Structuring latent representations in a hierarchical manner enables models to learn patterns at multiple levels of abstraction. However, most prevalent image understanding models focus on visual similarity, and learning visual hierarchies…

Computer Vision and Pattern Recognition · Computer Science · 2026-01-07 · Ziwei Wang, Sameera Ramasinghe, Chenchen Xu, Julien Monteil, Loris Bazzani, Thalaiyasingam Ajanthan

Learning Structured Representations with Hyperbolic Embeddings

Most real-world datasets consist of a natural hierarchy between classes or an inherent label structure that is either already available or can be constructed cheaply. However, most existing representation learning methods ignore this…

Machine Learning · Computer Science · 2024-12-03 · Aditya Sinha, Siqi Zeng, Makoto Yamada, Han Zhao

Temporally Consistent Object-Centric Learning by Contrasting Slots

Unsupervised object-centric learning from videos is a promising approach to extract structured representations from large, unlabeled collections of videos. To support downstream tasks like autonomous control, these representations must be…

Computer Vision and Pattern Recognition · Computer Science · 2025-03-19 · Anna Manasyan, Maximilian Seitzer, Filip Radovic, Georg Martius, Andrii Zadaianchuk

Metapath-based Hyperbolic Contrastive Learning for Heterogeneous Graph Embedding

The hyperbolic space, characterized by a constant negative curvature and exponentially expanding space, aligns well with the structural properties of heterogeneous graphs. However, although heterogeneous graphs inherently possess diverse…

Machine Learning · Computer Science · 2025-06-23 · Jongmin Park, Seunghoon Han, Won-Yong Shin, Sungsu Lim

Unified Contrastive Learning in Image-Text-Label Space

Visual recognition is recently learned via either supervised learning on human-annotated image-label data or language-image contrastive learning with webly-crawled image-text pairs. While supervised learning may result in a more…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-08 · Jianwei Yang, Chunyuan Li, Pengchuan Zhang, Bin Xiao, Ce Liu, Lu Yuan, Jianfeng Gao

Aligning Pretraining for Detection via Object-Level Contrastive Learning

Image-level contrastive representation learning has proven to be highly effective as a generic model for transfer learning. Such generality for transfer learning, however, sacrifices specificity if we are interested in a certain downstream…

Computer Vision and Pattern Recognition · Computer Science · 2021-10-26 · Fangyun Wei, Yue Gao, Zhirong Wu, Han Hu, Stephen Lin

Object-aware Contrastive Learning for Debiased Scene Representation

Contrastive self-supervised learning has shown impressive results in learning visual representations from unlabeled images by enforcing invariance against different data augmentations. However, the learned representations are often…

Computer Vision and Pattern Recognition · Computer Science · 2021-10-28 · Sangwoo Mo, Hyunwoo Kang, Kihyuk Sohn, Chun-Liang Li, Jinwoo Shin

Physically Disentangled Representations

State-of-the-art methods in generative representation learning yield semantic disentanglement, but typically do not consider physical scene parameters, such as geometry, albedo, lighting, or camera. We posit that inverse rendering, a way to…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-12 · Tzofi Klinghoffer, Kushagra Tiwary, Arkadiusz Balata, Vivek Sharma, Ramesh Raskar

LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations

Contrastive instance discrimination methods outperform supervised learning in downstream tasks such as image classification and object detection. However, these methods rely heavily on data augmentation during representation learning, which…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-21 · Mohammad Alkhalefi, Georgios Leontidis, Mingjun Zhong

Object Pursuit: Building a Space of Objects via Discriminative Weight Generation

We propose a framework to continuously learn object-centric representations for visual learning and understanding. Existing object-centric representations either rely on supervisions that individualize objects in the scene, or perform…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-05 · Chuanyu Pan, Yanchao Yang, Kaichun Mo, Yueqi Duan, Leonidas Guibas

Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations

Contrastive self-supervised learning has outperformed supervised pretraining on many downstream tasks like segmentation and object detection. However, current methods are still primarily applied to curated datasets like ImageNet. In this…

Computer Vision and Pattern Recognition · Computer Science · 2021-12-15 · Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc Van Gool

Hyperbolic Deep Learning in Computer Vision: A Survey

Deep representation learning is a ubiquitous part of modern computer vision. While Euclidean space has been the de facto standard manifold for learning visual representations, hyperbolic space has recently gained rapid traction for learning…

Computer Vision and Pattern Recognition · Computer Science · 2023-05-12 · Pascal Mettes, Mina Ghadimi Atigh, Martin Keller-Ressel, Jeffrey Gu, Serena Yeung

Grasp2Vec: Learning Object Representations from Self-Supervised Grasping

Well structured visual representations can make robot learning faster and can improve generalization. In this paper, we study how we can acquire effective object-centric representations for robotic manipulation tasks without human labeling…

Robotics · Computer Science · 2018-11-20 · Eric Jang, Coline Devin, Vincent Vanhoucke, Sergey Levine

Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality

Contrastively trained vision-language models have achieved remarkable progress in vision and language representation learning, leading to state-of-the-art models for various downstream multimodal tasks. However, recent research has…

Computation and Language · Computer Science · 2023-10-26 · Harman Singh, Pengchuan Zhang, Qifan Wang, Mengjiao Wang, Wenhan Xiong, Jingfei Du, Yu Chen

Unsupervised Object-Level Representation Learning from Scene Images

Contrastive self-supervised learning has largely narrowed the gap to supervised pre-training on ImageNet. However, its success highly relies on the object-centric priors of ImageNet, i.e., different augmented views of the same image…

Computer Vision and Pattern Recognition · Computer Science · 2021-12-06 · Jiahao Xie, Xiaohang Zhan, Ziwei Liu, Yew Soon Ong, Chen Change Loy

Look-Ahead Selective Plasticity for Continual Learning of Visual Tasks

Contrastive representation learning has emerged as a promising technique for continual learning as it can learn representations that are robust to catastrophic forgetting and generalize well to unseen future tasks. Previous work in…

Computer Vision and Pattern Recognition · Computer Science · 2023-11-06 · Rouzbeh Meshkinnejad, Jie Mei, Daniel Lizotte, Yalda Mohsenzadeh

Learning Visual Representations via Language-Guided Sampling

Although an object may appear in numerous contexts, we often describe it in a limited number of ways. Language allows us to abstract away visual variation to represent and communicate concepts. Building on this intuition, we propose an…

Computer Vision and Pattern Recognition · Computer Science · 2023-03-30 · Mohamed El Banani, Karan Desai, Justin Johnson

Self-Supervised Video Representation Learning in a Heuristic Decoupled Perspective

Video contrastive learning (V-CL) has emerged as a popular framework for unsupervised video representation learning, demonstrating strong results in tasks such as action classification and detection. Yet, to harness these benefits, it is…

Computer Vision and Pattern Recognition · Computer Science · 2026-02-09 · Zeen Song, Wenwen Qiang, Changwen Zheng, Hui Xiong, Gang Hua

Hyperbolic Graph Representation Learning: A Tutorial

Graph-structured data are widespread in real-world applications, such as social networks, recommender systems, knowledge graphs, chemical molecules etc. Despite the success of Euclidean space for graph-related learning tasks, its ability to…

Machine Learning · Computer Science · 2022-11-09 · Min Zhou, Menglin Yang, Lujia Pan, Irwin King