English
Related papers

Related papers: Unsupervised Keypoints from Pretrained Diffusion M…

200 papers

Understanding and representing the structure of 3D objects in an unsupervised manner remains a core challenge in computer vision and graphics. Most existing unsupervised keypoint methods are not designed for unconditional generative…

Computer Vision and Pattern Recognition · Computer Science 2025-12-04 Rhys Newbury , Juyan Zhang , Tin Tran , Hanna Kurniawati , Dana Kulić

This paper introduces a new algorithm for unsupervised learning of keypoint detectors and descriptors, which demonstrates fast convergence and good performance across different datasets. The training procedure uses homographic…

Computer Vision and Pattern Recognition · Computer Science 2020-06-02 Anatoly Belikov , Alexey Potapov

Unsupervised visual object tracking is a challenging task that requires following arbitrary targets in videos without training on ground-truth annotations. Despite considerable progress, existing state-of-the-art unsupervised trackers often…

Computer Vision and Pattern Recognition · Computer Science 2026-05-27 Zhengbo Zhang , Zhigang Tu , Junsong Yuan , De Wen Soh , Bo Du

Text-to-image diffusion models are now capable of generating images that are often indistinguishable from real images. To generate such images, these models must understand the semantics of the objects they are asked to generate. In this…

Computer Vision and Pattern Recognition · Computer Science 2023-12-29 Eric Hedlin , Gopal Sharma , Shweta Mahajan , Hossam Isack , Abhishek Kar , Andrea Tagliasacchi , Kwang Moo Yi

This paper proposes a novel paradigm for the unsupervised learning of object landmark detectors. Contrary to existing methods that build on auxiliary tasks such as image generation or equivariance, we propose a self-training approach where,…

Computer Vision and Pattern Recognition · Computer Science 2023-02-28 Dimitrios Mallis , Enrique Sanchez , Matt Bell , Georgios Tzimiropoulos

Diffusion models, such as Stable Diffusion, have shown incredible performance on text-to-image generation. Since text-to-image generation often requires models to generate visual concepts with fine-grained details and attributes specified…

Computer Vision and Pattern Recognition · Computer Science 2024-04-26 Xuehai He , Weixi Feng , Tsu-Jui Fu , Varun Jampani , Arjun Akula , Pradyumna Narayana , Sugato Basu , William Yang Wang , Xin Eric Wang

Despite the rapid advancement of unsupervised learning in visual representation, it requires training on large-scale datasets that demand costly data collection, and pose additional challenges due to concerns regarding data privacy.…

Computer Vision and Pattern Recognition · Computer Science 2023-08-15 David Junhao Zhang , Mutian Xu , Chuhui Xue , Wenqing Zhang , Xiaoguang Han , Song Bai , Mike Zheng Shou

Self-supervision can dramatically cut back the amount of manually-labelled data required to train deep neural networks. While self-supervision has usually been considered for tasks such as image classification, in this paper we aim at…

Computer Vision and Pattern Recognition · Computer Science 2018-04-06 David Novotny , Samuel Albanie , Diane Larlus , Andrea Vedaldi

Many of the existing methods for learning joint embedding of images and text use only supervised information from paired images and its textual attributes. Taking advantage of the recent success of unsupervised learning in deep neural…

Computer Vision and Pattern Recognition · Computer Science 2017-03-21 Yao-Hung Hubert Tsai , Liang-Kang Huang , Ruslan Salakhutdinov

Text-to-image (T2I) diffusion models have achieved widespread success due to their ability to generate high-resolution, photorealistic images. These models are trained on large-scale datasets, like LAION-5B, often scraped from the internet.…

Computer Vision and Pattern Recognition · Computer Science 2025-12-04 Korada Sri Vardhana , Shrikrishna Lolla , Soma Biswas

Deep clustering against self-supervised learning is a very important and promising direction for unsupervised visual representation learning since it requires little domain knowledge to design pretext tasks. However, the key component,…

Computer Vision and Pattern Recognition · Computer Science 2020-08-21 Weijie Chen , Shiliang Pu , Di Xie , Shicai Yang , Yilu Guo , Luojun Lin

Diffusion-based models, widely used in text-to-image generation, have proven effective in 2D representation learning. Recently, this framework has been extended to 3D self-supervised learning by constructing a conditional point generator…

Computer Vision and Pattern Recognition · Computer Science 2025-07-15 Yiyang Chen , Shanshan Zhao , Lunhao Duan , Changxing Ding , Dacheng Tao

Subject-driven text-to-image diffusion models empower users to tailor the model to new concepts absent in the pre-training dataset using a few sample images. However, prevalent subject-driven models primarily rely on single-concept input…

Computer Vision and Pattern Recognition · Computer Science 2024-02-16 Junjie Shentu , Matthew Watson , Noura Al Moubayed

Unsupervised learning of feature representations is a challenging yet important problem for analyzing a large collection of multimedia data that do not have semantic labels. Recently proposed neural network-based unsupervised learning…

Computer Vision and Pattern Recognition · Computer Science 2022-11-15 Takahiko Furuya , Ryutarou Ohbuchi

Learning sensorimotor control policies from high-dimensional images crucially relies on the quality of the underlying visual representations. Prior works show that structured latent space such as visual keypoints often outperforms…

Machine Learning · Computer Science 2021-06-15 Boyuan Chen , Pieter Abbeel , Deepak Pathak

A central goal of unsupervised learning is to acquire representations from unlabeled data or experience that can be used for more effective learning of downstream tasks from modest amounts of labeled data. Many prior unsupervised learning…

Machine Learning · Computer Science 2019-03-25 Kyle Hsu , Sergey Levine , Chelsea Finn

Unsupervised landmarks discovery (ULD) for an object category is a challenging computer vision problem. In pursuit of developing a robust ULD framework, we explore the potential of a recent paradigm of self-supervised learning algorithms,…

Computer Vision and Pattern Recognition · Computer Science 2024-03-26 Siddharth Tourani , Ahmed Alwheibi , Arif Mahmood , Muhammad Haris Khan

Existing popular unsupervised embedding learning methods focus on enhancing the instance-level local discrimination of the given unlabeled images by exploring various negative data. However, the existed sample outliers which exhibit large…

Computer Vision and Pattern Recognition · Computer Science 2021-07-20 Jiahuan Zhou , Yansong Tang , Bing Su , Ying Wu

Deep convolutional neural networks (CNNs) have demonstrated remarkable success in computer vision by supervisedly learning strong visual feature representations. However, training CNNs relies heavily on the availability of exhaustive…

Computer Vision and Pattern Recognition · Computer Science 2019-05-31 Jiabo Huang , Qi Dong , Shaogang Gong , Xiatian Zhu

Text-to-image diffusion models have emerged as powerful tools for high-quality image generation and editing. Many existing approaches rely on text prompts as editing guidance. However, these methods are constrained by the need for manual…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Yuanyuan Chang , Yinghua Yao , Tao Qin , Mengmeng Wang , Ivor Tsang , Guang Dai
‹ Prev 1 2 3 10 Next ›