Related papers: Self-Supervised Multi-View Learning via Auto-Encod…

Matching Multiple Perspectives for Efficient Representation Learning

Representation learning approaches typically rely on images of objects captured from a single perspective that are transformed using affine transformations. Additionally, self-supervised learning, a successful paradigm of representation…

Computer Vision and Pattern Recognition · Computer Science 2022-08-17 Omiros Pantazis, Mathew Salvaris

Unsupervised Learning of View-invariant Action Representations

Recent successes in human action recognition with deep learning methods mostly adopt the supervised learning paradigm, which requires a significant amount of manually labeled data to achieve good performance. However, label collection is an…

Computer Vision and Pattern Recognition · Computer Science 2018-09-07 Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli

Auto-Encoder based Co-Training Multi-View Representation Learning

Multi-view learning is a learning problem that utilizes the various representations of an object to mine valuable knowledge and improve the performance of the learning algorithm, and one of the significant directions of multi-view learning is…

Machine Learning · Computer Science 2022-01-11 Run-kun Lu, Jian-wei Liu, Yuan-fang Wang, Hao-jie Xie, Xin Zuo

Self-Supervised Learning of Object Parts for Semantic Segmentation

Progress in self-supervised learning has brought strong general image representation learning methods. Yet so far, it has mostly focused on image-level learning. In turn, tasks such as unsupervised image segmentation have not benefited from…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Adrian Ziegler, Yuki M. Asano

AVT: Unsupervised Learning of Transformation Equivariant Representations by Autoencoding Variational Transformations

The learning of Transformation-Equivariant Representations (TERs), which was introduced by Hinton et al., has been considered a principle to reveal visual structures under various transformations. It contains…

Computer Vision and Pattern Recognition · Computer Science 2019-07-24 Guo-Jun Qi, Liheng Zhang, Chang Wen Chen, Qi Tian

Transitive Invariance for Self-supervised Visual Representation Learning

Learning visual representations with self-supervised learning has become popular in computer vision. The idea is to design auxiliary tasks where labels are free to obtain. Most of these tasks end up providing data to learn specific kinds of…

Computer Vision and Pattern Recognition · Computer Science 2017-08-16 Xiaolong Wang, Kaiming He, Abhinav Gupta

Exploit Clues from Views: Self-Supervised and Regularized Learning for Multiview Object Recognition

Multiview recognition has been well studied in the literature and achieves decent performance in object recognition and retrieval tasks. However, most previous works rely on supervised learning and some impractical underlying assumptions,…

Computer Vision and Pattern Recognition · Computer Science 2020-03-31 Chih-Hui Ho, Bo Liu, Tz-Ying Wu, Nuno Vasconcelos

Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations

Perceptual understanding of the scene and the relationship between its different components is important for successful completion of robotic tasks. Representation learning has been shown to be a powerful technique for this, but most of the…

Robotics · Computer Science 2023-03-14 Negin Heravi, Ayzaan Wahid, Corey Lynch, Pete Florence, Travis Armstrong, Jonathan Tompson, Pierre Sermanet, Jeannette Bohg, Debidatta Dwibedi

Deep Models for Multi-View 3D Object Recognition: A Review

Human decision-making often relies on visual information from multiple perspectives or views. In contrast, machine learning-based object recognition utilizes information from a single image of the object. However, the information conveyed…

Computer Vision and Pattern Recognition · Computer Science 2025-10-01 Mona Alzahrani, Muhammad Usman, Salma Kammoun, Saeed Anwar, Tarek Helmy

Enhancing 2D Representation Learning with a 3D Prior

Learning robust and effective representations of visual data is a fundamental task in computer vision. Traditionally, this is achieved by training models with labeled data which can be expensive to obtain. Self-supervised learning attempts…

Computer Vision and Pattern Recognition · Computer Science 2024-06-05 Mehmet Aygün, Prithviraj Dhar, Zhicheng Yan, Oisin Mac Aodha, Rakesh Ranjan

Self-Supervised Learning via multi-Transformation Classification for Action Recognition

Self-supervised tasks have been utilized to build useful representations that can be used in downstream tasks when the annotation is unavailable. In this paper, we introduce a self-supervised video representation learning method based on…

Computer Vision and Pattern Recognition · Computer Science 2021-02-23 Duc Quang Vu, Ngan T. H. Le, Jia-Ching Wang

Scaling and Benchmarking Self-Supervised Visual Representation Learning

Self-supervised learning aims to learn representations from the data itself without explicit manual supervision. Existing efforts ignore a crucial aspect of self-supervised learning - the ability to scale to large amounts of data because…

Computer Vision and Pattern Recognition · Computer Science 2019-06-07 Priya Goyal, Dhruv Mahajan, Abhinav Gupta, Ishan Misra

3D-MVP: 3D Multiview Pretraining for Robotic Manipulation

Recent works have shown that visual pretraining on egocentric datasets using masked autoencoders (MAE) can improve generalization for downstream robotics tasks. However, these approaches pretrain only on 2D images, while many robotics…

Robotics · Computer Science 2025-03-25 Shengyi Qian, Kaichun Mo, Valts Blukis, David F. Fouhey, Dieter Fox, Ankit Goyal

Self-Supervised Viewpoint Learning From Image Collections

Training deep neural networks to estimate the viewpoint of objects requires large labeled training datasets. However, manually labeling viewpoints is notoriously hard, error-prone, and time-consuming. On the other hand, it is relatively…

Computer Vision and Pattern Recognition · Computer Science 2020-04-07 Siva Karthik Mustikovela, Varun Jampani, Shalini De Mello, Sifei Liu, Umar Iqbal, Carsten Rother, Jan Kautz

Self-Supervised Contrastive Pre-Training for Multivariate Point Processes

Self-supervision is one of the hallmarks of representation learning in the increasingly popular suite of foundation models including large language models such as BERT and GPT-3, but it has not been pursued in the context of multivariate…

Machine Learning · Computer Science 2024-02-05 Xiao Shou, Dharmashankar Subramanian, Debarun Bhattacharjya, Tian Gao, Kristin P. Bennet

Self-supervised Transformation Learning for Equivariant Representations

Unsupervised representation learning has significantly advanced various machine learning tasks. In the computer vision domain, state-of-the-art approaches utilize transformations like random crop and color jitter to achieve invariant…

Computer Vision and Pattern Recognition · Computer Science 2025-01-16 Jaemyung Yu, Jaehyun Choi, Dong-Jae Lee, HyeongGwon Hong, Junmo Kim

ShapeCodes: Self-Supervised Feature Learning by Lifting Views to Viewgrids

We introduce an unsupervised feature learning approach that embeds 3D shape information into a single-view image representation. The main idea is a self-supervised training objective that, given only a single 2D image, requires all unseen…

Computer Vision and Pattern Recognition · Computer Science 2018-08-01 Dinesh Jayaraman, Ruohan Gao, Kristen Grauman

Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection

Popular representation learning methods encourage feature invariance under transformations applied at the input. However, in 3D perception tasks like object localization and segmentation, outputs are naturally equivariant to some…

Computer Vision and Pattern Recognition · Computer Science 2024-04-19 Deepti Hegde, Suhas Lohit, Kuan-Chuan Peng, Michael J. Jones, Vishal M. Patel

Self-Supervision by Prediction for Object Discovery in Videos

Despite their irresistible success, deep learning algorithms still heavily rely on annotated data. On the other hand, unsupervised settings pose many challenges, especially about determining the right inductive bias in diverse scenarios.…

Computer Vision and Pattern Recognition · Computer Science 2021-03-11 Beril Besbinar, Pascal Frossard

Unsupervised Learning of Dense Visual Representations

Contrastive self-supervised learning has emerged as a promising approach to unsupervised visual representation learning. In general, these methods learn global (image-level) representations that are invariant to different views (i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2020-12-09 Pedro O. Pinheiro, Amjad Almahairi, Ryan Y. Benmalek, Florian Golemo, Aaron Courville