Related papers: Low-Cost Scene Modeling using a Density Function I…

Neural Free-Viewpoint Performance Rendering under Complex Human-object Interactions

4D reconstruction of human-object interaction is critical for immersive VR/AR experience and human activity understanding. Recent advances still fail to recover fine geometry and texture results from sparse RGB inputs, especially under…

Computer Vision and Pattern Recognition · Computer Science 2021-08-04 Guoxing Sun , Xin Chen , Yizhang Chen , Anqi Pang , Pei Lin , Yuheng Jiang , Lan Xu , Jingya Wang , Jingyi Yu

Efficient RGB-D Scene Understanding via Multi-task Adaptive Learning and Cross-dimensional Feature Guidance

Scene understanding plays a critical role in enabling intelligence and autonomy in robotic systems. Traditional approaches often face challenges, including occlusions, ambiguous boundaries, and the inability to adapt attention based on…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Guodong Sun , Junjie Liu , Gaoyang Zhang , Bo Wu , Yang Zhang

Motion-based Object Segmentation based on Dense RGB-D Scene Flow

Given two consecutive RGB-D images, we propose a model that estimates a dense 3D motion field, also known as scene flow. We take advantage of the fact that in robot manipulation scenarios, scenes often consist of a set of rigidly moving…

Robotics · Computer Science 2018-07-25 Lin Shao , Parth Shah , Vikranth Dwaracherla , Jeannette Bohg

Spatial Semantic Embedding Network: Fast 3D Instance Segmentation with Deep Metric Learning

We propose spatial semantic embedding network (SSEN), a simple, yet efficient algorithm for 3D instance segmentation using deep metric learning. The raw 3D reconstruction of an indoor environment suffers from occlusions, noise, and is…

Computer Vision and Pattern Recognition · Computer Science 2020-07-08 Dongsu Zhang , Junha Chun , Sang Kyun Cha , Young Min Kim

Urban Scene Segmentation with Laser-Constrained CRFs

Robots typically possess sensors of different modalities, such as colour cameras, inertial measurement units, and 3D laser scanners. Often, solving a particular problem becomes easier when more than one modality is used. However, while…

Computer Vision and Pattern Recognition · Computer Science 2017-01-10 Charika De Alvis , Lionel Ott , Fabio Ramos

Learning Object Arrangements in 3D Scenes using Human Context

We consider the problem of learning object arrangements in a 3D scene. The key idea here is to learn how objects relate to human poses based on their affordances, ease of use and reachability. In contrast to modeling object-object…

Machine Learning · Computer Science 2012-07-03 Yun Jiang , Marcus Lim , Ashutosh Saxena

Volumetric Instance-Aware Semantic Mapping and 3D Object Discovery

To autonomously navigate and plan interactions in real-world environments, robots require the ability to robustly perceive and map complex, unstructured surrounding scenes. Besides building an internal representation of the observed scene…

Robotics · Computer Science 2021-05-18 Margarita Grinvald , Fadri Furrer , Tonci Novkovic , Jen Jen Chung , Cesar Cadena , Roland Siegwart , Juan Nieto

Augmented Reality Meets Computer Vision : Efficient Data Generation for Urban Driving Scenes

The success of deep learning in computer vision is based on availability of large annotated datasets. To lower the need for hand labeled images, virtually rendered 3D worlds have recently gained popularity. Creating realistic 3D content is…

Computer Vision and Pattern Recognition · Computer Science 2017-08-07 Hassan Abu Alhaija , Siva Karthik Mustikovela , Lars Mescheder , Andreas Geiger , Carsten Rother

Attend and Interact: Higher-Order Object Interactions for Video Understanding

Human actions often involve complex interactions across several inter-related objects in the scene. However, existing approaches to fine-grained video understanding or visual relationship detection often rely on single object representation…

Computer Vision and Pattern Recognition · Computer Science 2018-03-22 Chih-Yao Ma , Asim Kadav , Iain Melvin , Zsolt Kira , Ghassan AlRegib , Hans Peter Graf

3D Segmentation of Humans in Point Clouds with Synthetic Data

Segmenting humans in 3D indoor scenes has become increasingly important with the rise of human-centered robotics and AR/VR applications. To this end, we propose the task of joint 3D human semantic segmentation, instance segmentation and…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Ayça Takmaz , Jonas Schult , Irem Kaftan , Mertcan Akçay , Bastian Leibe , Robert Sumner , Francis Engelmann , Siyu Tang

FunGraph: Functionality Aware 3D Scene Graphs for Language-Prompted Scene Interaction

The concept of 3D scene graphs is increasingly recognized as a powerful semantic and hierarchical representation of the environment. Current approaches often address this at a coarse, object-level resolution. In contrast, our goal is to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-12 Dennis Rotondi , Fabio Scaparro , Hermann Blum , Kai O. Arras

Understanding Human Context in 3D Scenes by Learning Spatial Affordances with Virtual Skeleton Models

Robots are often required to operate in environments where humans are not present, but yet require the human context information for better human-robot interaction. Even when humans are present in the environment, detecting their presence…

Computer Vision and Pattern Recognition · Computer Science 2019-06-14 Lasitha Piyathilaka , Sarath Kodagoda

Learning event representation: As sparse as possible, but not sparser

Selecting an optimal event representation is essential for event classification in real world contexts. In this paper, we investigate the application of qualitative spatial reasoning (QSR) frameworks for classification of human-object…

Computer Vision and Pattern Recognition · Computer Science 2017-10-03 Tuan Do , James Pustejovsky

Labeling 3D scenes for Personal Assistant Robots

Inexpensive RGB-D cameras that give an RGB image together with depth data have become widely available. We use this data to build 3D point clouds of a full scene. In this paper, we address the task of labeling objects in this 3D point cloud…

Robotics · Computer Science 2011-06-29 Hema Swetha Koppula , Abhishek Anand , Thorsten Joachims , Ashutosh Saxena

Using Depth for Improving Referring Expression Comprehension in Real-World Environments

In a human-robot collaborative task where a robot helps its partner by finding described objects, the depth dimension plays a critical role in successful task completion. Existing studies have mostly focused on comprehending the object…

Robotics · Computer Science 2021-07-13 Fethiye Irmak Dogan , Iolanda Leite

Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation

Semantic segmentation of point clouds is an essential task for understanding the environment in autonomous driving and robotics. Recent range-based works achieve real-time efficiency, while point- and voxel-based methods produce better…

Computer Vision and Pattern Recognition · Computer Science 2024-10-15 Daniel Fusaro , Simone Mosco , Emanuele Menegatti , Alberto Pretto

Visual Mesh: Real-time Object Detection Using Constant Sample Density

This paper proposes an enhancement of convolutional neural networks for object detection in resource-constrained robotics through a geometric input transformation called Visual Mesh. It uses object geometry to create a graph in vision…

Computer Vision and Pattern Recognition · Computer Science 2018-07-24 Trent Houliston , Stephan K. Chalup

3D Model Assisted Image Segmentation

The problem of segmenting a given image into coherent regions is important in Computer Vision and many industrial applications require segmenting a known object into its components. Examples include identifying individual parts of a…

Computer Vision and Pattern Recognition · Computer Science 2013-05-17 Srimal Jayawardena , Di Yang , Marcus Hutter

NeuralHOFusion: Neural Volumetric Rendering under Human-object Interactions

4D modeling of human-object interactions is critical for numerous applications. However, efficient volumetric capture and rendering of complex interaction scenarios, especially from sparse inputs, remain challenging. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Yuheng Jiang , Suyi Jiang , Guoxing Sun , Zhuo Su , Kaiwen Guo , Minye Wu , Jingyi Yu , Lan Xu

Video Object Segmentation-based Visual Servo Control and Object Depth Estimation on a Mobile Robot

To be useful in everyday environments, robots must be able to identify and locate real-world objects. In recent years, video object segmentation has made significant progress on densely separating such objects from background in real and…

Robotics · Computer Science 2020-01-13 Brent A. Griffin , Victoria Florence , Jason J. Corso