Related papers: From Pixels to Components: Eigenvector Masking for…

Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning

Masked Image Modeling (MIM) has emerged as a promising method for deriving visual representations from unlabeled image data by predicting missing pixels from masked portions of images. It excels in region-aware learning and provides strong…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Yibing Wei , Abhinav Gupta , Pedro Morgado

Unsupervised Part Discovery via Dual Representation Alignment

Object parts serve as crucial intermediate representations in various downstream tasks, but part-level representation learning still has not received as much attention as other vision tasks. Previous research has established that Vision…

Computer Vision and Pattern Recognition · Computer Science 2024-08-16 Jiahao Xia , Wenjian Huang , Min Xu , Jianguo Zhang , Haimin Zhang , Ziyu Sheng , Dong Xu

Masking Strategies for Image Manifolds

We consider the problem of selecting an optimal mask for an image manifold, i.e., choosing a subset of the pixels of the image that preserves the manifold's geometric structure present in the original data. Such masking implements a form of…

Machine Learning · Statistics 2016-06-16 Hamid Dadkhahi , Marco F. Duarte

In Pursuit of Pixel Supervision for Visual Pre-training

At the most basic level, pixels are the source of the visual information through which we perceive the world. Pixels contain information at all levels, ranging from low-level attributes to high-level concepts. Autoencoders represent a…

Computer Vision and Pattern Recognition · Computer Science 2025-12-18 Lihe Yang , Shang-Wen Li , Yang Li , Xinjie Lei , Dong Wang , Abdelrahman Mohamed , Hengshuang Zhao , Hu Xu

Towards Learning Structure via Consensus for Face Segmentation and Parsing

Face segmentation is the task of densely labeling pixels on the face according to their semantics. While current methods place an emphasis on developing sophisticated architectures, use conditional random fields for smoothness, or rather…

Computer Vision and Pattern Recognition · Computer Science 2020-03-31 Iacopo Masi , Joe Mathai , Wael AbdAlmageed

Understanding Self-Supervised Pretraining with Part-Aware Representation Learning

In this paper, we are interested in understanding self-supervised pretraining through studying the capability that self-supervised representation pretraining methods learn part-aware representations. The study is mainly motivated by that…

Computer Vision and Pattern Recognition · Computer Science 2024-01-24 Jie Zhu , Jiyang Qi , Mingyu Ding , Xiaokang Chen , Ping Luo , Xinggang Wang , Wenyu Liu , Leye Wang , Jingdong Wang

Unsupervised Part Discovery from Contrastive Reconstruction

The goal of self-supervised visual representation learning is to learn strong, transferable image representations, with the majority of research focusing on object or scene level. On the other hand, representation learning at part level has…

Computer Vision and Pattern Recognition · Computer Science 2022-03-22 Subhabrata Choudhury , Iro Laina , Christian Rupprecht , Andrea Vedaldi

Autoregressive Unsupervised Image Segmentation

In this work, we propose a new unsupervised image segmentation approach based on mutual information maximization between different constructed views of the inputs. Taking inspiration from autoregressive generative models that predict the…

Computer Vision and Pattern Recognition · Computer Science 2020-07-17 Yassine Ouali , Céline Hudelot , Myriam Tami

SimMIM: A Simple Framework for Masked Image Modeling

This paper presents SimMIM, a simple framework for masked image modeling. We simplify recently proposed related approaches without special designs such as block-wise masking and tokenization via discrete VAE or clustering. To study what let…

Computer Vision and Pattern Recognition · Computer Science 2022-04-19 Zhenda Xie , Zheng Zhang , Yue Cao , Yutong Lin , Jianmin Bao , Zhuliang Yao , Qi Dai , Han Hu

Improving Masked Autoencoders by Learning Where to Mask

Masked image modeling is a promising self-supervised learning method for visual data. It is typically built upon image patches with random masks, which largely ignores the variation of information density between them. The question is: Is…

Computer Vision and Pattern Recognition · Computer Science 2024-01-09 Haijian Chen , Wendong Zhang , Yunbo Wang , Xiaokang Yang

From Pixels to Perception: Interpretable Predictions via Instance-wise Grouped Feature Selection

Understanding the decision-making process of machine learning models provides valuable insights into the task, the data, and the reasons behind a model's failures. In this work, we propose a method that performs inherently interpretable…

Computer Vision and Pattern Recognition · Computer Science 2025-05-19 Moritz Vandenhirtz , Julia E. Vogt

Masked Visual Pre-training for Motor Control

This paper shows that self-supervised visual pre-training from real-world images is effective for learning motor control tasks from pixels. We first train the visual representations by masked modeling of natural images. We then freeze the…

Computer Vision and Pattern Recognition · Computer Science 2022-03-14 Tete Xiao , Ilija Radosavovic , Trevor Darrell , Jitendra Malik

Pixel Objectness: Learning to Segment Generic Objects Automatically in Images and Videos

We propose an end-to-end learning framework for segmenting generic objects in both images and videos. Given a novel image or video, our approach produces a pixel-level mask for all "object-like" regions---even for object categories never…

Computer Vision and Pattern Recognition · Computer Science 2018-12-19 Bo Xiong , Suyog Dutt Jain , Kristen Grauman

Masked Transformer for image Anomaly Localization

Image anomaly detection consists in detecting images or image portions that are visually different from the majority of the samples in a dataset. The task is of practical importance for various real-life applications like biomedical image…

Computer Vision and Pattern Recognition · Computer Science 2022-10-28 Axel De Nardin , Pankaj Mishra , Gian Luca Foresti , Claudio Piciarelli

AGRNet: Adaptive Graph Representation Learning and Reasoning for Face Parsing

Face parsing infers a pixel-wise label to each facial component, which has drawn much attention recently. Previous methods have shown their success in face parsing, which however overlook the correlation among facial components. As a matter…

Computer Vision and Pattern Recognition · Computer Science 2021-10-13 Gusi Te , Wei Hu , Yinglu Liu , Hailin Shi , Tao Mei

Improving self-supervised representation learning via sequential adversarial masking

Recent methods in self-supervised learning have demonstrated that masking-based pretext tasks extend beyond NLP, serving as useful pretraining objectives in computer vision. However, existing approaches apply random or ad hoc masking…

Computer Vision and Pattern Recognition · Computer Science 2022-12-19 Dylan Sam , Min Bai , Tristan McKinney , Li Erran Li

Selective Masking based Self-Supervised Learning for Image Semantic Segmentation

This paper proposes a novel self-supervised learning method for semantic segmentation using selective masking image reconstruction as the pretraining task. Our proposed method replaces the random masking augmentation used in most masked…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Yuemin Wang , Ian Stavness

Unsupervised part learning for visual recognition

Part-based image classification aims at representing categories by small sets of learned discriminative parts, upon which an image representation is built. Considered as a promising avenue a decade ago, this direction has been neglected…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Ronan Sicre , Yannis Avrithis , Ewa Kijak , Frederic Jurie

The Surprising Effectiveness of Representation Learning for Visual Imitation

While visual imitation learning offers one of the most effective ways of learning from visual demonstrations, generalizing from them requires either hundreds of diverse demonstrations, task specific priors, or large, hard-to-train…

Robotics · Computer Science 2021-12-07 Jyothish Pari , Nur Muhammad Shafiullah , Sridhar Pandian Arunachalam , Lerrel Pinto

Masked Modeling for Self-supervised Representation Learning on Vision and Beyond

As the deep learning revolution marches on, self-supervised learning has garnered increasing attention in recent years thanks to its remarkable representation learning ability and the low dependence on labeled data. Among these varied…

Computer Vision and Pattern Recognition · Computer Science 2024-01-10 Siyuan Li , Luyuan Zhang , Zedong Wang , Di Wu , Lirong Wu , Zicheng Liu , Jun Xia , Cheng Tan , Yang Liu , Baigui Sun , Stan Z. Li