Related papers: Incremental Object Database: Building 3D Models fr…

Kinematically-Informed Interactive Perception: Robot-Generated 3D Models for Classification

To be useful in everyday environments, robots must be able to observe and learn about objects. Recent datasets enable progress for classifying data into known object categories; however, it is unclear how to collect reliable object data…

Robotics · Computer Science 2019-01-18 Abhishek Venkataraman , Brent Griffin , Jason J. Corso

Has Anything Changed? 3D Change Detection by 2D Segmentation Masks

As capturing devices become common, 3D scans of interior spaces are acquired on a daily basis. Through scene comparison over time, information about objects in the scene and their changes is inferred. This information is important for…

Computer Vision and Pattern Recognition · Computer Science 2023-12-05 Aikaterini Adam , Konstantinos Karantzalos , Lazaros Grammatikopoulos , Torsten Sattler

Understanding 3D Object Interaction from a Single Image

Humans can easily understand a single image as depicting multiple potential objects permitting interaction. We use this skill to plan our interactions with the world and accelerate understanding new objects without engaging in interaction.…

Computer Vision and Pattern Recognition · Computer Science 2023-08-09 Shengyi Qian , David F. Fouhey

Tracking by 3D Model Estimation of Unknown Objects in Videos

Most model-free visual object tracking methods formulate the tracking task as object location estimation given by a 2D segmentation or a bounding box in each video frame. We argue that this representation is limited and instead propose to…

Computer Vision and Pattern Recognition · Computer Science 2023-04-14 Denys Rozumnyi , Jiri Matas , Marc Pollefeys , Vittorio Ferrari , Martin R. Oswald

Augmented Reality Meets Computer Vision : Efficient Data Generation for Urban Driving Scenes

The success of deep learning in computer vision is based on availability of large annotated datasets. To lower the need for hand labeled images, virtually rendered 3D worlds have recently gained popularity. Creating realistic 3D content is…

Computer Vision and Pattern Recognition · Computer Science 2017-08-07 Hassan Abu Alhaija , Siva Karthik Mustikovela , Lars Mescheder , Andreas Geiger , Carsten Rother

Learning to Complete Object Shapes for Object-level Mapping in Dynamic Scenes

In this paper, we propose a novel object-level mapping system that can simultaneously segment, track, and reconstruct objects in dynamic scenes. It can further predict and complete their full geometries by conditioning on reconstructions…

Computer Vision and Pattern Recognition · Computer Science 2022-08-11 Binbin Xu , Andrew J. Davison , Stefan Leutenegger

Incremental Scene Synthesis

We present a method to incrementally generate complete 2D or 3D scenes with the following properties: (a) it is globally consistent at each step according to a learned scene prior, (b) real observations of a scene can be incorporated while…

Computer Vision and Pattern Recognition · Computer Science 2019-11-15 Benjamin Planche , Xuejian Rong , Ziyan Wu , Srikrishna Karanam , Harald Kosch , YingLi Tian , Jan Ernst , Andreas Hutter

Volumetric Instance-Aware Semantic Mapping and 3D Object Discovery

To autonomously navigate and plan interactions in real-world environments, robots require the ability to robustly perceive and map complex, unstructured surrounding scenes. Besides building an internal representation of the observed scene…

Robotics · Computer Science 2021-05-18 Margarita Grinvald , Fadri Furrer , Tonci Novkovic , Jen Jen Chung , Cesar Cadena , Roland Siegwart , Juan Nieto

Template-based Object Detection Using a Foundation Model

Most currently used object detection methods are learning-based, and can detect objects under varying appearances. Those models require training and a training dataset. We focus on use cases with less data variation, but the requirement of…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Valentin Braeutigam , Matthias Stock , Bernhard Egger

InteractMove: Text-Controlled Human-Object Interaction Generation in 3D Scenes with Movable Objects

We propose a novel task of text-controlled human object interaction generation in 3D scenes with movable objects. Existing human-scene interaction datasets suffer from insufficient interaction categories and typically only consider…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Xinhao Cai , Minghang Zheng , Xin Jin , Yang Liu

Unsupervised object-centric video generation and decomposition in 3D

A natural approach to generative modeling of videos is to represent them as a composition of moving objects. Recent works model a set of 2D sprites over a slowly-varying background, but without considering the underlying 3D scene that gives…

Computer Vision and Pattern Recognition · Computer Science 2021-03-26 Paul Henderson , Christoph H. Lampert

Automatic dataset generation for specific object detection

In the past decade, object detection tasks are defined mostly by large public datasets. However, building object detection datasets is not scalable due to inefficient image collecting and labeling. Furthermore, most labels are still in the…

Computer Vision and Pattern Recognition · Computer Science 2022-07-19 Xiaotian Lin , Leiyang Xu , Qiang Wang

NeuralMeshing: Complete Object Mesh Extraction from Casual Captures

How can we extract complete geometric models of objects that we encounter in our daily life, without having access to commercial 3D scanners? In this paper we present an automated system for generating geometric models of objects from two…

Computer Vision and Pattern Recognition · Computer Science 2025-08-25 Floris Erich , Naoya Chiba , Abdullah Mustafa , Ryo Hanai , Noriaki Ando , Yusuke Yoshiyasu , Yukiyasu Domae

Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection

Generative image models are increasingly being used for training data augmentation in vision tasks. In the context of automotive object detection, methods usually focus on producing augmented frames that look as realistic as possible, for…

Computer Vision and Pattern Recognition · Computer Science 2025-04-25 Jens Petersen , Davide Abati , Amirhossein Habibian , Auke Wiggers

A Spatial Data Model for Moving Object Databases

Moving Object Databases will have significant role in Geospatial Information Systems as they allow users to model continuous movements of entities in the databases and perform spatio-temporal analysis. For representing and querying moving…

Databases · Computer Science 2014-03-14 Hadi Hajari , Farshad Hakimpour

Emergence of Object Segmentation in Perturbed Generative Models

We introduce a novel framework to build a model that can learn how to segment objects from a collection of images without any human annotation. Our method builds on the observation that the location of object segments can be perturbed…

Computer Vision and Pattern Recognition · Computer Science 2019-11-05 Adam Bielski , Paolo Favaro

SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects

We address the challenge of creating 3D assets for household articulated objects from a single image. Prior work on articulated object creation either requires multi-view multi-state input, or only allows coarse control over the generation…

Computer Vision and Pattern Recognition · Computer Science 2025-03-21 Jiayi Liu , Denys Iliash , Angel X. Chang , Manolis Savva , Ali Mahdavi-Amiri

Symmetry Aware Evaluation of 3D Object Detection and Pose Estimation in Scenes of Many Parts in Bulk

While 3D object detection and pose estimation has been studied for a long time, its evaluation is not yet completely satisfactory. Indeed, existing datasets typically consist in numerous acquisitions of only a few scenes because of the…

Computer Vision and Pattern Recognition · Computer Science 2018-06-22 Romain Brégier , Frédéric Devernay , Laetitia Leyrit , James Crowley

Every Dataset Counts: Scaling up Monocular 3D Object Detection with Joint Datasets Training

Monocular 3D object detection plays a crucial role in autonomous driving. However, existing monocular 3D detection algorithms depend on 3D labels derived from LiDAR measurements, which are costly to acquire for new datasets and challenging…

Computer Vision and Pattern Recognition · Computer Science 2024-09-25 Fulong Ma , Xiaoyang Yan , Guoyang Zhao , Xiaojie Xu , Yuxuan Liu , Jun Ma , Ming Liu

Street Gaussians without 3D Object Tracker

Realistic scene reconstruction in driving scenarios poses significant challenges due to fast-moving objects. Most existing methods rely on labor-intensive manual labeling of object poses to reconstruct dynamic objects in canonical space and…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Ruida Zhang , Chengxi Li , Chenyangguang Zhang , Xingyu Liu , Haili Yuan , Yanyan Li , Xiangyang Ji , Gim Hee Lee