English
Related papers

Related papers: Self-Supervised Multi-View Learning via Auto-Encod…

200 papers

Representation learning approaches typically rely on images of objects captured from a single perspective that are transformed using affine transformations. Additionally, self-supervised learning, a successful paradigm of representation…

Computer Vision and Pattern Recognition · Computer Science 2022-08-17 Omiros Pantazis , Mathew Salvaris

The recent success in human action recognition with deep learning methods mostly adopt the supervised learning paradigm, which requires significant amount of manually labeled data to achieve good performance. However, label collection is an…

Computer Vision and Pattern Recognition · Computer Science 2018-09-07 Junnan Li , Yongkang Wong , Qi Zhao , Mohan S. Kankanhalli

Multi-view learning is a learning problem that utilizes the various representations of an object to mine valuable knowledge and improve the performance of learning algorithm, and one of the significant directions of multi-view learning is…

Machine Learning · Computer Science 2022-01-11 Run-kun Lu , Jian-wei Liu , Yuan-fang Wang , Hao-jie Xie , Xin Zuo

Progress in self-supervised learning has brought strong general image representation learning methods. Yet so far, it has mostly focused on image-level learning. In turn, tasks such as unsupervised image segmentation have not benefited from…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Adrian Ziegler , Yuki M. Asano

The learning of Transformation-Equivariant Representations (TERs), which is introduced by Hinton et al. \cite{hinton2011transforming}, has been considered as a principle to reveal visual structures under various transformations. It contains…

Computer Vision and Pattern Recognition · Computer Science 2019-07-24 Guo-Jun Qi , Liheng Zhang , Chang Wen Chen , Qi Tian

Learning visual representations with self-supervised learning has become popular in computer vision. The idea is to design auxiliary tasks where labels are free to obtain. Most of these tasks end up providing data to learn specific kinds of…

Computer Vision and Pattern Recognition · Computer Science 2017-08-16 Xiaolong Wang , Kaiming He , Abhinav Gupta

Multiview recognition has been well studied in the literature and achieves decent performance in object recognition and retrieval task. However, most previous works rely on supervised learning and some impractical underlying assumptions,…

Computer Vision and Pattern Recognition · Computer Science 2020-03-31 Chih-Hui Ho , Bo Liu , Tz-Ying Wu , Nuno Vasconcelos

Perceptual understanding of the scene and the relationship between its different components is important for successful completion of robotic tasks. Representation learning has been shown to be a powerful technique for this, but most of the…

Human decision-making often relies on visual information from multiple perspectives or views. In contrast, machine learning-based object recognition utilizes information from a single image of the object. However, the information conveyed…

Computer Vision and Pattern Recognition · Computer Science 2025-10-01 Mona Alzahrani , Muhammad Usman , Salma Kammoun , Saeed Anwar , Tarek Helmy

Learning robust and effective representations of visual data is a fundamental task in computer vision. Traditionally, this is achieved by training models with labeled data which can be expensive to obtain. Self-supervised learning attempts…

Computer Vision and Pattern Recognition · Computer Science 2024-06-05 Mehmet Aygün , Prithviraj Dhar , Zhicheng Yan , Oisin Mac Aodha , Rakesh Ranjan

Self-supervised tasks have been utilized to build useful representations that can be used in downstream tasks when the annotation is unavailable. In this paper, we introduce a self-supervised video representation learning method based on…

Computer Vision and Pattern Recognition · Computer Science 2021-02-23 Duc Quang Vu , Ngan T. H. Le , Jia-Ching Wang

Self-supervised learning aims to learn representations from the data itself without explicit manual supervision. Existing efforts ignore a crucial aspect of self-supervised learning - the ability to scale to large amount of data because…

Computer Vision and Pattern Recognition · Computer Science 2019-06-07 Priya Goyal , Dhruv Mahajan , Abhinav Gupta , Ishan Misra

Recent works have shown that visual pretraining on egocentric datasets using masked autoencoders (MAE) can improve generalization for downstream robotics tasks. However, these approaches pretrain only on 2D images, while many robotics…

Robotics · Computer Science 2025-03-25 Shengyi Qian , Kaichun Mo , Valts Blukis , David F. Fouhey , Dieter Fox , Ankit Goyal

Training deep neural networks to estimate the viewpoint of objects requires large labeled training datasets. However, manually labeling viewpoints is notoriously hard, error-prone, and time-consuming. On the other hand, it is relatively…

Computer Vision and Pattern Recognition · Computer Science 2020-04-07 Siva Karthik Mustikovela , Varun Jampani , Shalini De Mello , Sifei Liu , Umar Iqbal , Carsten Rother , Jan Kautz

Self-supervision is one of the hallmarks of representation learning in the increasingly popular suite of foundation models including large language models such as BERT and GPT-3, but it has not been pursued in the context of multivariate…

Machine Learning · Computer Science 2024-02-05 Xiao Shou , Dharmashankar Subramanian , Debarun Bhattacharjya , Tian Gao , Kristin P. Bennet

Unsupervised representation learning has significantly advanced various machine learning tasks. In the computer vision domain, state-of-the-art approaches utilize transformations like random crop and color jitter to achieve invariant…

Computer Vision and Pattern Recognition · Computer Science 2025-01-16 Jaemyung Yu , Jaehyun Choi , Dong-Jae Lee , HyeongGwon Hong , Junmo Kim

We introduce an unsupervised feature learning approach that embeds 3D shape information into a single-view image representation. The main idea is a self-supervised training objective that, given only a single 2D image, requires all unseen…

Computer Vision and Pattern Recognition · Computer Science 2018-08-01 Dinesh Jayaraman , Ruohan Gao , Kristen Grauman

Popular representation learning methods encourage feature invariance under transformations applied at the input. However, in 3D perception tasks like object localization and segmentation, outputs are naturally equivariant to some…

Computer Vision and Pattern Recognition · Computer Science 2024-04-19 Deepti Hegde , Suhas Lohit , Kuan-Chuan Peng , Michael J. Jones , Vishal M. Patel

Despite their irresistible success, deep learning algorithms still heavily rely on annotated data. On the other hand, unsupervised settings pose many challenges, especially about determining the right inductive bias in diverse scenarios.…

Computer Vision and Pattern Recognition · Computer Science 2021-03-11 Beril Besbinar , Pascal Frossard

Contrastive self-supervised learning has emerged as a promising approach to unsupervised visual representation learning. In general, these methods learn global (image-level) representations that are invariant to different views (i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2020-12-09 Pedro O. Pinheiro , Amjad Almahairi , Ryan Y. Benmalek , Florian Golemo , Aaron Courville
‹ Prev 1 2 3 10 Next ›