English
Related papers

Related papers: Learning Generalizable Feature Fields for Mobile M…

200 papers

Comprehensive visual, geometric, and semantic understanding of a 3D scene is crucial for successful execution of robotic tasks, especially in unstructured and complex environments. Additionally, to make robust decisions, it is necessary for…

Robotics · Computer Science 2026-03-13 Christian Maurer , Snehal Jauhri , Sophie Lueth , Georgia Chalvatzaki

A robot operating in a household environment will see a wide range of unique and unfamiliar objects. While a system could train on many of these, it is infeasible to predict all the objects a robot will see. In this paper, we present a…

Robotics · Computer Science 2023-03-08 Ethan Chun , Yilun Du , Anthony Simeonov , Tomas Lozano-Perez , Leslie Kaelbling

Safe manipulation-oriented navigation for humanoid robots requires scene memory that remains reliable under locomotion-induced perceptual distortion, environmental changes, and interaction-level geometric safety constraints. Existing…

Robotics · Computer Science 2026-05-22 Peifeng Jiang , Hong Liu , Jin Jin , Wenshuai Wang , Xia Li

This paper presents a unified approach to realize versatile distributed maneuvering with generalized formations. Specifically, we decompose the robots' maneuvers into two independent components, i.e., interception and enclosing, which are…

Robotics · Computer Science 2025-05-12 Yang Lu , Sha Luo , Pengming Zhu , Weijia Yao , Hector Garcia de Marina , Xinglong Zhang , Xin Xu

Recent advancements in vision-language foundation models have significantly enhanced open-vocabulary 3D scene understanding. However, the generalizability of existing methods is constrained due to their framework designs and their reliance…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Yunsong Wang , Hanlin Chen , Gim Hee Lee

Combining accurate geometry with rich semantics has been proven to be highly effective for language-guided robotic manipulation. Existing methods for dynamic scenes either fail to update in real-time or rely on additional depth sensors for…

Robotics · Computer Science 2024-10-22 Yu Sheng , Runfeng Lin , Lidian Wang , Quecheng Qiu , YanYong Zhang , Yu Zhang , Bei Hua , Jianmin Ji

Self-supervised and language-supervised image models contain rich knowledge of the world that is important for generalization. Many robotic tasks, however, require a detailed understanding of 3D geometry, which is often lacking in 2D image…

Computer Vision and Pattern Recognition · Computer Science 2024-01-02 William Shen , Ge Yang , Alan Yu , Jansen Wong , Leslie Pack Kaelbling , Phillip Isola

Articulated object manipulation is essential for various real-world robotic tasks, yet generalizing across diverse objects remains a major challenge. A key to generalization lies in understanding functional parts (e.g., door handles and…

Robotics · Computer Science 2026-02-17 Yue Chen , Muqing Jiang , Kaifeng Zheng , Jiaqi Liang , Chenrui Tie , Haoran Lu , Ruihai Wu , Hao Dong

General scene understanding for robotics requires flexible semantic representation, so that novel objects and structures which may not have been known at training time can be identified, segmented and grouped. We present an algorithm which…

Computer Vision and Pattern Recognition · Computer Science 2022-10-07 Kirill Mazur , Edgar Sucar , Andrew J. Davison

This paper addresses the challenge of Neural Field (NeF) generalization, where models must efficiently adapt to new signals given only a few observations. To tackle this, we propose Geometric Neural Process Fields (G-NPF), a probabilistic…

Computer Vision and Pattern Recognition · Computer Science 2025-02-05 Wenzhe Yin , Zehao Xiao , Jiayi Shen , Yunlu Chen , Cees G. M. Snoek , Jan-Jakob Sonke , Efstratios Gavves

We introduce Generalizable 3D-Language Feature Fields (g3D-LF), a 3D representation model pre-trained on large-scale 3D-language dataset for embodied tasks. Our g3D-LF processes posed RGB-D images from agents to encode feature fields for:…

Computer Vision and Pattern Recognition · Computer Science 2024-11-27 Zihan Wang , Gim Hee Lee

We present an implicit neural representation to learn the spatio-temporal space of kinematic motions. Unlike previous work that represents motion as discrete sequential samples, we propose to express the vast motion space as a continuous…

Computer Vision and Pattern Recognition · Computer Science 2022-10-11 Chengan He , Jun Saito , James Zachary , Holly Rushmeier , Yi Zhou

This paper introduces a novel paradigm for the generalizable neural radiance field (NeRF). Previous generic NeRF methods combine multiview stereo techniques with image-based neural rendering for generalization, yielding impressive results,…

Computer Vision and Pattern Recognition · Computer Science 2024-01-26 Jiaxu Wang , Ziyi Zhang , Renjing Xu

We present Neural Descriptor Fields (NDFs), an object representation that encodes both points and relative poses between an object and a target (such as a robot gripper or a rack used for hanging) via category-level descriptors. We employ…

The advent of generative radiance fields has significantly promoted the development of 3D-aware image synthesis. The cumulative rendering process in radiance fields makes training these generative models much easier since gradients are…

Computer Vision and Pattern Recognition · Computer Science 2021-11-02 Xudong Xu , Xingang Pan , Dahua Lin , Bo Dai

This paper presents a novel Learning from Demonstration (LfD) method that uses neural fields to learn new skills efficiently and accurately. It achieves this by utilizing a shared embedding to learn both scene and motion representations in…

Robotics · Computer Science 2023-08-16 Ahmet Tekden , Marc Peter Deisenroth , Yasemin Bekiroglu

Traditional approaches for manipulation planning rely on an explicit geometric model of the environment to formulate a given task as an optimization problem. However, inferring an accurate model from raw sensor input is a hard problem in…

Robotics · Computer Science 2023-09-15 Phillip Grote , Joaquim Ortiz-Haro , Marc Toussaint , Ozgur S. Oguz

Accurate scene perception is critical for vision-based robotic manipulation. Existing approaches typically follow either a Vision-to-Action (V-A) paradigm, predicting actions directly from visual inputs, or a Vision-to-3D-to-Action (V-3D-A)…

Robotics · Computer Science 2026-05-25 Ying Chai , Litao Deng , Ruizhi Shao , Jiajun Zhang , Kangchen Lv , Liangjun Xing , Xiang Li , Hongwen Zhang , Yebin Liu

It is a long-standing problem in robotics to develop agents capable of executing diverse manipulation tasks from visual observations in unstructured real-world environments. To achieve this goal, the robot needs to have a comprehensive…

High-fidelity 3D scene reconstruction has been substantially advanced by recent progress in neural fields. However, most existing methods train a separate network from scratch for each individual scene. This is not scalable, inefficient,…

Computer Vision and Pattern Recognition · Computer Science 2023-10-02 Yang Fu , Shalini De Mello , Xueting Li , Amey Kulkarni , Jan Kautz , Xiaolong Wang , Sifei Liu
‹ Prev 1 2 3 10 Next ›