Related papers: Physically Disentangled Representations

Choose What You Need: Disentangled Representation Learning for Scene Text Recognition, Removal and Editing

Scene text images contain not only style information (font, background) but also content information (character, texture). Different scene text tasks need different information, but previous representation learning methods use tightly…

Computer Vision and Pattern Recognition · Computer Science 2024-05-08 Boqiang Zhang, Hongtao Xie, Zuan Gao, Yuxin Wang

LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations

Contrastive instance discrimination methods outperform supervised learning in downstream tasks such as image classification and object detection. However, these methods rely heavily on data augmentation during representation learning, which…

Computer Vision and Pattern Recognition · Computer Science 2025-04-21 Mohammad Alkhalefi, Georgios Leontidis, Mingjun Zhong

Disentangled Representations in Neural Models

Representation learning is the foundation for the recent success of neural network models. However, the distributed representations generated by neural networks are far from ideal. Due to their highly entangled nature, they are difficult to…

Machine Learning · Computer Science 2016-02-09 William Whitney

Language-Guided Visual Perception Disentanglement for Image Quality Assessment and Conditional Image Generation

Contrastive vision-language models, such as CLIP, have demonstrated excellent zero-shot capability across semantic recognition tasks, mainly attributed to the training on a large-scale I&1T (one Image with one Text) dataset. This kind of…

Computer Vision and Pattern Recognition · Computer Science 2025-03-05 Zhichao Yang, Leida Li, Pengfei Chen, Jinjian Wu, Giuseppe Valenzise

Neural Inverse Rendering of an Indoor Scene from a Single Image

Inverse rendering aims to estimate physical attributes of a scene, e.g., reflectance, geometry, and lighting, from image(s). Inverse rendering has been studied primarily for single objects or with methods that solve for only one of the…

Computer Vision and Pattern Recognition · Computer Science 2019-09-17 Soumyadip Sengupta, Jinwei Gu, Kihwan Kim, Guilin Liu, David W. Jacobs, Jan Kautz

Explicitly Disentangled Representations in Object-Centric Learning

Extracting structured representations from raw visual data is an important and long-standing challenge in machine learning. Recently, techniques for unsupervised learning of object-centric representations have raised growing interest. In…

Computer Vision and Pattern Recognition · Computer Science 2025-01-24 Riccardo Majellaro, Jonathan Collu, Aske Plaat, Thomas M. Moerland

Learning Disentangled Representation in Object-Centric Models for Visual Dynamics Prediction via Transformers

Recent work has shown that object-centric representations can greatly help improve the accuracy of learning dynamics while also bringing interpretability. In this work, we take this idea one step further and ask the following question: "can…

Computer Vision and Pattern Recognition · Computer Science 2024-07-04 Sanket Gandhi, Atul, Samanyu Mahajan, Vishal Sharma, Rushil Gupta, Arnab Kumar Mondal, Parag Singla

On the Transfer of Disentangled Representations in Realistic Settings

Learning meaningful representations that disentangle the underlying structure of the data generating process is considered to be of key importance in machine learning. While disentangled representations were found to be useful for diverse…

Machine Learning · Computer Science 2021-03-12 Andrea Dittadi, Frederik Träuble, Francesco Locatello, Manuel Wüthrich, Vaibhav Agrawal, Ole Winther, Stefan Bauer, Bernhard Schölkopf

Learning Disentangled Representations in the Imaging Domain

Disentangled representation learning has been proposed as an approach to learning general representations even in the absence of, or with limited, supervision. A good general representation can be fine-tuned for new target tasks using…

Computer Vision and Pattern Recognition · Computer Science 2022-08-01 Xiao Liu, Pedro Sanchez, Spyridon Thermos, Alison Q. O'Neil, Sotirios A. Tsaftaris

Disentangled 3D Scene Generation with Layout Learning

We introduce a method to generate 3D scenes that are disentangled into their component objects. This disentanglement is unsupervised, relying only on the knowledge of a large pretrained text-to-image model. Our key insight is that objects…

Computer Vision and Pattern Recognition · Computer Science 2024-02-28 Dave Epstein, Ben Poole, Ben Mildenhall, Alexei A. Efros, Aleksander Holynski

Hyperbolic Contrastive Learning for Visual Representations beyond Objects

Although self-/un-supervised methods have led to rapid progress in visual representation learning, these methods generally treat objects and scenes using the same lens. In this paper, we focus on learning representations for objects and…

Computer Vision and Pattern Recognition · Computer Science 2022-12-02 Songwei Ge, Shlok Mishra, Simon Kornblith, Chun-Liang Li, David Jacobs

Learning Disentangled Representations via Independent Subspaces

Image generating neural networks are mostly viewed as black boxes, where any change in the input can have a number of globally effective changes on the output. In this work, we propose a method for learning disentangled representations to…

Computer Vision and Pattern Recognition · Computer Science 2019-08-27 Maren Awiszus, Hanno Ackermann, Bodo Rosenhahn

Learning Disentangled Representation by Exploiting Pretrained Generative Models: A Contrastive Learning View

From the intuitive notion of disentanglement, the image variations corresponding to different factors should be distinct from each other, and the disentangled representation should reflect those variations with separate dimensions. To…

Computer Vision and Pattern Recognition · Computer Science 2022-02-15 Xuanchi Ren, Tao Yang, Yuwang Wang, Wenjun Zeng

Are Disentangled Representations Helpful for Abstract Visual Reasoning?

A disentangled representation encodes information about the salient factors of variation in the data independently. Although it is often argued that this representational format is useful in learning to solve many real-world down-stream…

Machine Learning · Computer Science 2020-01-08 Sjoerd van Steenkiste, Francesco Locatello, Jürgen Schmidhuber, Olivier Bachem

Inverse Transport Networks

We introduce inverse transport networks as a learning architecture for inverse rendering problems where, given input image measurements, we seek to infer physical scene parameters such as shape, material, and illumination. During training,…

Computer Vision and Pattern Recognition · Computer Science 2018-10-01 Chengqian Che, Fujun Luan, Shuang Zhao, Kavita Bala, Ioannis Gkioulekas

Learning-based Inverse Rendering of Complex Indoor Scenes with Differentiable Monte Carlo Raytracing

Indoor scenes typically exhibit complex, spatially-varying appearance from global illumination, making inverse rendering a challenging ill-posed problem. This work presents an end-to-end, learning-based inverse rendering framework…

Computer Vision and Pattern Recognition · Computer Science 2022-11-24 Jingsen Zhu, Fujun Luan, Yuchi Huo, Zihao Lin, Zhihua Zhong, Dianbing Xi, Jiaxiang Zheng, Rui Tang, Hujun Bao, Rui Wang

Contrastive Representation Disentanglement for Clustering

Clustering continues to be a significant and challenging task. Recent studies have demonstrated impressive results by applying clustering to feature representations acquired through self-supervised learning, particularly on small datasets.…

Machine Learning · Computer Science 2023-07-19 Fei Ding, Dan Zhang, Yin Yang, Venkat Krovi, Feng Luo

Inverse Rendering of Translucent Objects using Physical and Neural Renderers

In this work, we propose an inverse rendering model that estimates 3D shape, spatially-varying reflectance, homogeneous subsurface scattering parameters, and an environment illumination jointly from only a pair of captured images of a…

Computer Vision and Pattern Recognition · Computer Science 2023-05-16 Chenhao Li, Trung Thanh Ngo, Hajime Nagahara

Language-Mediated, Object-Centric Representation Learning

We present Language-mediated, Object-centric Representation Learning (LORL), a paradigm for learning disentangled, object-centric scene representations from vision and language. LORL builds upon recent advances in unsupervised object…

Machine Learning · Computer Science 2021-06-09 Ruocheng Wang, Jiayuan Mao, Samuel J. Gershman, Jiajun Wu

Scene Graph Embeddings Using Relative Similarity Supervision

Scene graphs are a powerful structured representation of the underlying content of images, and embeddings derived from them have been shown to be useful in multiple downstream tasks. In this work, we employ a graph convolutional network to…

Computer Vision and Pattern Recognition · Computer Science 2021-04-07 Paridhi Maheshwari, Ritwick Chaudhry, Vishwa Vinay