English
Related papers

Related papers: Physically Disentangled Representations

200 papers

Scene text images contain not only style information (font, background) but also content information (character, texture). Different scene text tasks need different information, but previous representation learning methods use tightly…

Computer Vision and Pattern Recognition · Computer Science 2024-05-08 Boqiang Zhang , Hongtao Xie , Zuan Gao , Yuxin Wang

Contrastive instance discrimination methods outperform supervised learning in downstream tasks such as image classification and object detection. However, these methods rely heavily on data augmentation during representation learning, which…

Computer Vision and Pattern Recognition · Computer Science 2025-04-21 Mohammad Alkhalefi , Georgios Leontidis , Mingjun Zhong

Representation learning is the foundation for the recent success of neural network models. However, the distributed representations generated by neural networks are far from ideal. Due to their highly entangled nature, they are di cult to…

Machine Learning · Computer Science 2016-02-09 William Whitney

Contrastive vision-language models, such as CLIP, have demonstrated excellent zero-shot capability across semantic recognition tasks, mainly attributed to the training on a large-scale I&1T (one Image with one Text) dataset. This kind of…

Computer Vision and Pattern Recognition · Computer Science 2025-03-05 Zhichao Yang , Leida Li , Pengfei Chen , Jinjian Wu , Giuseppe Valenzise

Inverse rendering aims to estimate physical attributes of a scene, e.g., reflectance, geometry, and lighting, from image(s). Inverse rendering has been studied primarily for single objects or with methods that solve for only one of the…

Computer Vision and Pattern Recognition · Computer Science 2019-09-17 Soumyadip Sengupta , Jinwei Gu , Kihwan Kim , Guilin Liu , David W. Jacobs , Jan Kautz

Extracting structured representations from raw visual data is an important and long-standing challenge in machine learning. Recently, techniques for unsupervised learning of object-centric representations have raised growing interest. In…

Computer Vision and Pattern Recognition · Computer Science 2025-01-24 Riccardo Majellaro , Jonathan Collu , Aske Plaat , Thomas M. Moerland

Recent work has shown that object-centric representations can greatly help improve the accuracy of learning dynamics while also bringing interpretability. In this work, we take this idea one step further, ask the following question: "can…

Computer Vision and Pattern Recognition · Computer Science 2024-07-04 Sanket Gandhi , Atul , Samanyu Mahajan , Vishal Sharma , Rushil Gupta , Arnab Kumar Mondal , Parag Singla

Learning meaningful representations that disentangle the underlying structure of the data generating process is considered to be of key importance in machine learning. While disentangled representations were found to be useful for diverse…

Disentangled representation learning has been proposed as an approach to learning general representations even in the absence of, or with limited, supervision. A good general representation can be fine-tuned for new target tasks using…

Computer Vision and Pattern Recognition · Computer Science 2022-08-01 Xiao Liu , Pedro Sanchez , Spyridon Thermos , Alison Q. O'Neil , Sotirios A. Tsaftaris

We introduce a method to generate 3D scenes that are disentangled into their component objects. This disentanglement is unsupervised, relying only on the knowledge of a large pretrained text-to-image model. Our key insight is that objects…

Computer Vision and Pattern Recognition · Computer Science 2024-02-28 Dave Epstein , Ben Poole , Ben Mildenhall , Alexei A. Efros , Aleksander Holynski

Although self-/un-supervised methods have led to rapid progress in visual representation learning, these methods generally treat objects and scenes using the same lens. In this paper, we focus on learning representations for objects and…

Computer Vision and Pattern Recognition · Computer Science 2022-12-02 Songwei Ge , Shlok Mishra , Simon Kornblith , Chun-Liang Li , David Jacobs

Image generating neural networks are mostly viewed as black boxes, where any change in the input can have a number of globally effective changes on the output. In this work, we propose a method for learning disentangled representations to…

Computer Vision and Pattern Recognition · Computer Science 2019-08-27 Maren Awiszus , Hanno Ackermann , Bodo Rosenhahn

From the intuitive notion of disentanglement, the image variations corresponding to different factors should be distinct from each other, and the disentangled representation should reflect those variations with separate dimensions. To…

Computer Vision and Pattern Recognition · Computer Science 2022-02-15 Xuanchi Ren , Tao Yang , Yuwang Wang , Wenjun Zeng

A disentangled representation encodes information about the salient factors of variation in the data independently. Although it is often argued that this representational format is useful in learning to solve many real-world down-stream…

Machine Learning · Computer Science 2020-01-08 Sjoerd van Steenkiste , Francesco Locatello , Jürgen Schmidhuber , Olivier Bachem

We introduce inverse transport networks as a learning architecture for inverse rendering problems where, given input image measurements, we seek to infer physical scene parameters such as shape, material, and illumination. During training,…

Computer Vision and Pattern Recognition · Computer Science 2018-10-01 Chengqian Che , Fujun Luan , Shuang Zhao , Kavita Bala , Ioannis Gkioulekas

Indoor scenes typically exhibit complex, spatially-varying appearance from global illumination, making inverse rendering a challenging ill-posed problem. This work presents an end-to-end, learning-based inverse rendering framework…

Computer Vision and Pattern Recognition · Computer Science 2022-11-24 Jingsen Zhu , Fujun Luan , Yuchi Huo , Zihao Lin , Zhihua Zhong , Dianbing Xi , Jiaxiang Zheng , Rui Tang , Hujun Bao , Rui Wang

Clustering continues to be a significant and challenging task. Recent studies have demonstrated impressive results by applying clustering to feature representations acquired through self-supervised learning, particularly on small datasets.…

Machine Learning · Computer Science 2023-07-19 Fei Ding , Dan Zhang , Yin Yang , Venkat Krovi , Feng Luo

In this work, we propose an inverse rendering model that estimates 3D shape, spatially-varying reflectance, homogeneous subsurface scattering parameters, and an environment illumination jointly from only a pair of captured images of a…

Computer Vision and Pattern Recognition · Computer Science 2023-05-16 Chenhao Li , Trung Thanh Ngo , Hajime Nagahara

We present Language-mediated, Object-centric Representation Learning (LORL), a paradigm for learning disentangled, object-centric scene representations from vision and language. LORL builds upon recent advances in unsupervised object…

Machine Learning · Computer Science 2021-06-09 Ruocheng Wang , Jiayuan Mao , Samuel J. Gershman , Jiajun Wu

Scene graphs are a powerful structured representation of the underlying content of images, and embeddings derived from them have been shown to be useful in multiple downstream tasks. In this work, we employ a graph convolutional network to…

Computer Vision and Pattern Recognition · Computer Science 2021-04-07 Paridhi Maheshwari , Ritwick Chaudhry , Vishwa Vinay
‹ Prev 1 2 3 10 Next ›