Related papers: Object-Aware Cropping for Self-Supervised Learning

Coarse Is Better? A New Pipeline Towards Self-Supervised Learning with Uncurated Images

Most self-supervised learning (SSL) methods often work on curated datasets where the object-centric assumption holds. This assumption breaks down in uncurated images. Existing scene image SSL methods try to find the two views from original…

Computer Vision and Pattern Recognition · Computer Science 2023-10-02 Ke Zhu , Yin-Yin He , Jianxin Wu

On the use of Cortical Magnification and Saccades as Biological Proxies for Data Augmentation

Self-supervised learning is a powerful way to learn useful representations from natural data. It has also been suggested as one possible means of building visual representation in humans, but the specific objective and algorithm are…

Computer Vision and Pattern Recognition · Computer Science 2021-12-15 Binxu Wang , David Mayo , Arturo Deza , Andrei Barbu , Colin Conwell

Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations

Contrastive self-supervised learning has outperformed supervised pretraining on many downstream tasks like segmentation and object detection. However, current methods are still primarily applied to curated datasets like ImageNet. In this…

Computer Vision and Pattern Recognition · Computer Science 2021-12-15 Wouter Van Gansbeke , Simon Vandenhende , Stamatios Georgoulis , Luc Van Gool

Learning Subject-Aware Cropping by Outpainting Professional Photos

How to frame (or crop) a photo often depends on the image subject and its context; e.g., a human portrait. Recent works have defined the subject-aware image cropping task as a nuanced and practical version of image cropping. We propose a…

Computer Vision and Pattern Recognition · Computer Science 2024-04-05 James Hong , Lu Yuan , Michaël Gharbi , Matthew Fisher , Kayvon Fatahalian

Self-supervised Training of Proposal-based Segmentation via Background Prediction

While supervised object detection methods achieve impressive accuracy, they generalize poorly to images whose appearance significantly differs from the data they have been trained on. To address this in scenarios where annotating data is…

Computer Vision and Pattern Recognition · Computer Science 2019-07-19 Isinsu Katircioglu , Helge Rhodin , Victor Constantin , Jörg Spörri , Mathieu Salzmann , Pascal Fua

Self-Supervised Learning of Object Parts for Semantic Segmentation

Progress in self-supervised learning has brought strong general image representation learning methods. Yet so far, it has mostly focused on image-level learning. In turn, tasks such as unsupervised image segmentation have not benefited from…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Adrian Ziegler , Yuki M. Asano

Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations

Perceptual understanding of the scene and the relationship between its different components is important for successful completion of robotic tasks. Representation learning has been shown to be a powerful technique for this, but most of the…

Robotics · Computer Science 2023-03-14 Negin Heravi , Ayzaan Wahid , Corey Lynch , Pete Florence , Travis Armstrong , Jonathan Tompson , Pierre Sermanet , Jeannette Bohg , Debidatta Dwibedi

Unsupervised Object-Level Representation Learning from Scene Images

Contrastive self-supervised learning has largely narrowed the gap to supervised pre-training on ImageNet. However, its success highly relies on the object-centric priors of ImageNet, i.e., different augmented views of the same image…

Computer Vision and Pattern Recognition · Computer Science 2021-12-06 Jiahao Xie , Xiaohang Zhan , Ziwei Liu , Yew Soon Ong , Chen Change Loy

Improving Panoptic Segmentation at All Scales

Crop-based training strategies decouple training resolution from GPU memory consumption, allowing the use of large-capacity panoptic segmentation networks on multi-megapixel images. Using crops, however, can introduce a bias towards…

Computer Vision and Pattern Recognition · Computer Science 2021-03-24 Lorenzo Porzi , Samuel Rota Bulò , Peter Kontschieder

Self-supervised structured object representation learning

Self-supervised learning (SSL) has emerged as a powerful technique for learning visual representations. While recent SSL approaches achieve strong results in global image understanding, they are limited in capturing the structured…

Computer Vision and Pattern Recognition · Computer Science 2025-08-28 Oussama Hadjerci , Antoine Letienne , Mohamed Abbas Hedjazi , Adel Hafiane

A Study on Self-Supervised Object Detection Pretraining

In this work, we study different approaches to self-supervised pretraining of object detection models. We first design a general framework to learn a spatially consistent dense representation from an image, by randomly sampling and…

Computer Vision and Pattern Recognition · Computer Science 2022-08-12 Trung Dang , Simon Kornblith , Huy Thong Nguyen , Peter Chin , Maryam Khademi

Demystifying Contrastive Self-Supervised Learning: Invariances, Augmentations and Dataset Biases

Self-supervised representation learning approaches have recently surpassed their supervised learning counterparts on downstream tasks like object detection and image classification. Somewhat mysteriously the recent gains in performance come…

Computer Vision and Pattern Recognition · Computer Science 2020-07-30 Senthil Purushwalkam , Abhinav Gupta

Grasp2Vec: Learning Object Representations from Self-Supervised Grasping

Well structured visual representations can make robot learning faster and can improve generalization. In this paper, we study how we can acquire effective object-centric representations for robotic manipulation tasks without human labeling…

Robotics · Computer Science 2018-11-20 Eric Jang , Coline Devin , Vincent Vanhoucke , Sergey Levine

Seeing the Whole in the Parts in Self-Supervised Representation Learning

Recent successes in self-supervised learning (SSL) model spatial co-occurrences of visual features either by masking portions of an image or by aggressively cropping it. Here, we propose a new way to model spatial co-occurrences by aligning…

Machine Learning · Computer Science 2025-01-07 Arthur Aubret , Céline Teulière , Jochen Triesch

Cropper: Vision-Language Model for Image Cropping through In-Context Learning

The goal of image cropping is to identify visually appealing crops in an image. Conventional methods are trained on specific datasets and fail to adapt to new requirements. Recent breakthroughs in large vision-language models (VLMs) enable…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Seung Hyun Lee , Jijun Jiang , Yiran Xu , Zhuofang Li , Junjie Ke , Yinxiao Li , Junfeng He , Steven Hickson , Katie Datsenko , Sangpil Kim , Ming-Hsuan Yang , Irfan Essa , Feng Yang

Density Crop-guided Semi-supervised Object Detection in Aerial Images

One of the important bottlenecks in training modern object detectors is the need for labeled images where bounding box annotations have to be produced for each object present in the image. This bottleneck is further exacerbated in aerial…

Computer Vision and Pattern Recognition · Computer Science 2023-08-10 Akhil Meethal , Eric Granger , Marco Pedersoli

Unsupervised Part Discovery from Contrastive Reconstruction

The goal of self-supervised visual representation learning is to learn strong, transferable image representations, with the majority of research focusing on object or scene level. On the other hand, representation learning at part level has…

Computer Vision and Pattern Recognition · Computer Science 2022-03-22 Subhabrata Choudhury , Iro Laina , Christian Rupprecht , Andrea Vedaldi

Self-Supervised Learning from Non-Object Centric Images with a Geometric Transformation Sensitive Architecture

Most invariance-based self-supervised methods rely on single object-centric images (e.g., ImageNet images) for pretraining, learning features that invariant to geometric transformation. However, when images are not object-centric, the…

Computer Vision and Pattern Recognition · Computer Science 2023-05-18 Taeho Kim , Jong-Min Lee

A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping

Image cropping aims at improving the aesthetic quality of images by adjusting their composition. Most weakly supervised cropping methods (without bounding box supervision) rely on the sliding window mechanism. The sliding window mechanism…

Computer Vision and Pattern Recognition · Computer Science 2018-03-13 Debang Li , Huikai Wu , Junge Zhang , Kaiqi Huang

Towards Self-Supervised Learning of Global and Object-Centric Representations

Self-supervision allows learning meaningful representations of natural images, which usually contain one central object. How well does it transfer to multi-entity scenes? We discuss key aspects of learning structured object-centric…

Computer Vision and Pattern Recognition · Computer Science 2022-04-15 Federico Baldassarre , Hossein Azizpour