Related papers: Learning to Place Objects with Programs and Iterat…

Learning Object Arrangements in 3D Scenes using Human Context

We consider the problem of learning object arrangements in a 3D scene. The key idea here is to learn how objects relate to human poses based on their affordances, ease of use and reachability. In contrast to modeling object-object…

Machine Learning · Computer Science 2012-07-03 Yun Jiang , Marcus Lim , Ashutosh Saxena

Learning to Place New Objects in a Scene

Placing is a necessary skill for a personal robot to have in order to perform tasks such as arranging objects in a disorganized room. The object placements should not only be stable but also be in their semantically preferred placing areas…

Robotics · Computer Science 2012-02-09 Yun Jiang , Marcus Lim , Changxi Zheng , Ashutosh Saxena

Self-learning Scene-specific Pedestrian Detectors using a Progressive Latent Model

In this paper, a self-learning approach is proposed towards solving scene-specific pedestrian detection problem without any human' annotation involved. The self-learning approach is deployed as progressive steps of object discovery, object…

Computer Vision and Pattern Recognition · Computer Science 2016-11-24 Qixiang Ye , Tianliang Zhang , Qiang Qiu , Baochang Zhang , Jie Chen , Guillermo Sapiro

Learning Object Localization and 6D Pose Estimation from Simulation and Weakly Labeled Real Images

This work proposes a process for efficiently training a point-wise object detector that enables localizing objects and computing their 6D poses in cluttered and occluded scenes. Accurate pose estimation is typically a requirement for robust…

Computer Vision and Pattern Recognition · Computer Science 2019-02-22 Jean-Philippe Mercier , Chaitanya Mitash , Philippe Giguère , Abdeslam Boularias

Scene Classification in Indoor Environments for Robots using Context Based Word Embeddings

Scene Classification has been addressed with numerous techniques in computer vision literature. However, with the increasing number of scene classes in datasets in the field, it has become difficult to achieve high accuracy in the context…

Robotics · Computer Science 2019-08-29 Bao Xin Chen , Raghavender Sahdev , Dekun Wu , Xing Zhao , Manos Papagelis , John K. Tsotsos

BOOTPLACE: Bootstrapped Object Placement with Detection Transformers

In this paper, we tackle the copy-paste image-to-image composition problem with a focus on object placement learning. Prior methods have leveraged generative models to reduce the reliance for dense supervision. However, this often limits…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 Hang Zhou , Xinxin Zuo , Rui Ma , Li Cheng

Predicting Stable Configurations for Semantic Placement of Novel Objects

Human environments contain numerous objects configured in a variety of arrangements. Our goal is to enable robots to repose previously unseen objects according to learned semantic relationships in novel environments. We break this problem…

Robotics · Computer Science 2021-08-30 Chris Paxton , Chris Xie , Tucker Hermans , Dieter Fox

Learning Less is More - 6D Camera Localization via 3D Surface Regression

Popular research areas like autonomous driving and augmented reality have renewed the interest in image-based camera localization. In this work, we address the task of predicting the 6D camera pose from a single RGB image in a given 3D…

Computer Vision and Pattern Recognition · Computer Science 2018-03-28 Eric Brachmann , Carsten Rother

Indoor Scene Recognition in 3D

Recognising in what type of environment one is located is an important perception task. For instance, for a robot operating in indoors it is helpful to be aware whether it is in a kitchen, a hallway or a bedroom. Existing approaches attempt…

Computer Vision and Pattern Recognition · Computer Science 2020-07-06 Shengyu Huang , Mikhail Usvyatsov , Konrad Schindler

SceneGraphNet: Neural Message Passing for 3D Indoor Scene Augmentation

In this paper we propose a neural message passing approach to augment an input 3D indoor scene with new objects matching their surroundings. Given an input, potentially incomplete, 3D scene and a query location, our method predicts a…

Computer Vision and Pattern Recognition · Computer Science 2019-07-29 Yang Zhou , Zachary While , Evangelos Kalogerakis

A Self-supervised Learning System for Object Detection using Physics Simulation and Multi-view Pose Estimation

Progress has been achieved recently in object detection given advancements in deep learning. Nevertheless, such tools typically require a large amount of training data and significant manual effort to label objects. This limits their…

Robotics · Computer Science 2017-08-04 Chaitanya Mitash , Kostas E. Bekris , Abdeslam Boularias

PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes

We introduce the novel task of Language-Guided Object Placement in Real 3D Scenes. Our model is given a 3D scene's point cloud, a 3D asset, and a textual prompt broadly describing where the 3D asset should be placed. The task here is to…

Computer Vision and Pattern Recognition · Computer Science 2025-10-03 Ahmed Abdelreheem , Filippo Aleotti , Jamie Watson , Zawar Qureshi , Abdelrahman Eldesokey , Peter Wonka , Gabriel Brostow , Sara Vicente , Guillermo Garcia-Hernando

Imagining the Unseen: Generative Location Modeling for Object Placement

Location modeling, or determining where non-existing objects could feasibly appear in a scene, has the potential to benefit numerous computer vision tasks, from automatic object insertion to scene creation in virtual reality. Yet, this…

Computer Vision and Pattern Recognition · Computer Science 2025-10-08 Jooyeol Yun , Davide Abati , Mohamed Omran , Jaegul Choo , Amirhossein Habibian , Auke Wiggers

Self-supervised Learning of 3D Object Understanding by Data Association and Landmark Estimation for Image Sequence

In this paper, we propose a self-supervised learningmethod for multi-object pose estimation. 3D object under-standing from 2D image is a challenging task that infers ad-ditional dimension from reduced-dimensional information.In particular,…

Computer Vision and Pattern Recognition · Computer Science 2021-04-16 Hyeonwoo Yu , Jean Oh

Efficient Learning of Object Placement with Intra-Category Transfer

Efficient learning from demonstration for long-horizon tasks remains an open challenge in robotics. While significant effort has been directed toward learning trajectories, a recent resurgence of object-centric approaches has demonstrated…

Robotics · Computer Science 2025-12-01 Adrian Röfer , Russell Buchanan , Max Argus , Sethu Vijayakumar , Abhinav Valada

Self-taught Object Localization with Deep Networks

This paper introduces self-taught object localization, a novel approach that leverages deep convolutional networks trained for whole-image recognition to localize objects in images without additional human supervision, i.e., without using…

Computer Vision and Pattern Recognition · Computer Science 2016-02-03 Loris Bazzani , Alessandro Bergamo , Dragomir Anguelov , Lorenzo Torresani

From Geometry to Culture: An Iterative VLM Layout Framework for Placing Objects in Complex 3D Scene Contexts

3D layout tasks have traditionally concentrated on geometric constraints, but many practical applications demand richer contextual understanding that spans social interactions, cultural traditions, and usage conventions. Existing methods…

Graphics · Computer Science 2025-04-01 Yuto Asano , Naruya Kondo , Tatsuki Fushimi , Yoichi Ochiai

RandomRooms: Unsupervised Pre-training from Synthetic Shapes and Randomized Layouts for 3D Object Detection

3D point cloud understanding has made great progress in recent years. However, one major bottleneck is the scarcity of annotated real datasets, especially compared to 2D object detection tasks, since a large amount of labor is involved in…

Computer Vision and Pattern Recognition · Computer Science 2021-08-18 Yongming Rao , Benlin Liu , Yi Wei , Jiwen Lu , Cho-Jui Hsieh , Jie Zhou

Scene-level Pose Estimation for Multiple Instances of Densely Packed Objects

This paper introduces key machine learning operations that allow the realization of robust, joint 6D pose estimation of multiple instances of objects either densely packed or in unstructured piles from RGB-D data. The first objective is to…

Robotics · Computer Science 2019-10-14 Chaitanya Mitash , Bowen Wen , Kostas Bekris , Abdeslam Boularias

Controllable 3D Placement of Objects with Scene-Aware Diffusion Models

Image editing approaches have become more powerful and flexible with the advent of powerful text-conditioned generative models. However, placing objects in an environment with a precise location and orientation still remains a challenge, as…

Computer Vision and Pattern Recognition · Computer Science 2025-06-27 Mohamed Omran , Dimitris Kalatzis , Jens Petersen , Amirhossein Habibian , Auke Wiggers