Related papers: Visual-Geometry GP-based Navigable Space for Auton…

Autonomous Mapless Navigation on Uneven Terrains

We propose a new method for autonomous navigation in uneven terrains by utilizing a sparse Gaussian Process (SGP) based local perception model. The SGP local perception model is trained on local ranging observation (pointcloud) to learn the…

Robotics · Computer Science 2024-02-22 Hassan Jardali , Mahmoud Ali , Lantao Liu

VGD: Visual Geometry Gaussian Splatting for Feed-Forward Surround-view Driving Reconstruction

Feed-forward surround-view autonomous driving scene reconstruction offers fast, generalizable inference ability, which faces the core challenge of ensuring generalization while elevating novel view quality. Due to the surround-view with…

Computer Vision and Pattern Recognition · Computer Science 2025-10-23 Junhong Lin , Kangli Wang , Shunzhou Wang , Songlin Fan , Ge Li , Wei Gao

Gaussian Process-based Traversability Analysis for Terrain Mapless Navigation

Efficient navigation through uneven terrain remains a challenging endeavor for autonomous robots. We propose a new geometric-based uneven terrain mapless navigation framework combining a Sparse Gaussian Process (SGP) local map with a…

Robotics · Computer Science 2024-03-29 Abe Leininger , Mahmoud Ali , Hassan Jardali , Lantao Liu

MG-Nav: Dual-Scale Visual Navigation via Sparse Spatial Memory

We present MG-Nav (Memory-Guided Navigation), a dual-scale framework for zero-shot visual navigation that unifies global memory-guided planning with local geometry-enhanced control. At its core is the Sparse Spatial Memory Graph (SMG), a…

Computer Vision and Pattern Recognition · Computer Science 2025-12-01 Bo Wang , Jiehong Lin , Chenzhi Liu , Xinting Hu , Yifei Yu , Tianjia Liu , Zhongrui Wang , Xiaojuan Qi

Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian Process

Terrain analysis is critical for the practical ap- plication of ground mobile robots in real-world tasks, espe- cially in outdoor unstructured environments. In this paper, we propose a novel spatial-temporal traversability assessment…

Robotics · Computer Science 2025-10-21 Zhenyu Hou , Senming Tan , Zhihao Zhang , Long Xu , Mengke Zhang , Zhaoqi He , Chao Xu , Fei Gao , Yanjun Cao

Multi-Scale Gaussian-Language Map for Zero-shot Embodied Navigation and Reasoning

Understanding the geometric and semantic structure of environments is essential for embodied navigation and reasoning. Existing semantic mapping methods trade off between explicit geometry and multi-scale semantics, and lack a native…

Computer Vision and Pattern Recognition · Computer Science 2026-05-05 Sixian Zhang , Yiyao Wang , Xinhang Song , Keming Zhang , Zijian Xu , Shuqiang Jiang

Uncertainty-Aware Gaussian Map for Vision-Language Navigation

Vision-Language Navigation (VLN) requires an agent to navigate 3D environments following natural language instructions. During navigation, existing agents commonly encounter perceptual uncertainty, such as insufficient evidence for reliable…

Computer Vision and Pattern Recognition · Computer Science 2026-05-27 Jianzhe Gao , Rui Liu , Yuxuan Xu , Tongtong Cao , Yingxue Zhang , Zhanguang Zhang , Sida Peng , Yi Yang , Wenguan Wang

Towards Physically Executable 3D Gaussian for Embodied Navigation

3D Gaussian Splatting (3DGS), a 3D representation method with photorealistic real-time rendering capabilities, is regarded as an effective tool for narrowing the sim-to-real gap. However, it lacks fine-grained semantics and physical…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 Bingchen Miao , Rong Wei , Zhiqi Ge , Xiaoquan sun , Shiqi Gao , Jingzhe Zhu , Renhan Wang , Siliang Tang , Jun Xiao , Rui Tang , Juncheng Li

Polyline Generative Navigable Space Segmentation for Autonomous Visual Navigation

Detecting navigable space is a fundamental capability for mobile robots navigating in unknown or unmapped environments. In this work, we treat visual navigable space segmentation as a scene decomposition problem and propose Polyline…

Computer Vision and Pattern Recognition · Computer Science 2023-03-07 Zheng Chen , Zhengming Ding , David Crandall , Lantao Liu

3D Gaussian Map with Open-Set Semantic Grouping for Vision-Language Navigation

Vision-language navigation (VLN) requires an agent to traverse complex 3D environments based on natural language instructions, necessitating a thorough scene understanding. While existing works equip agents with various scene…

Computer Vision and Pattern Recognition · Computer Science 2026-05-27 Jianzhe Gao , Rui Liu , Wenguan Wang

Sparse Gaussian Process Variational Autoencoders

Large, multi-dimensional spatio-temporal datasets are omnipresent in modern science and engineering. An effective framework for handling such data are Gaussian process deep generative models (GP-DGMs), which employ GP priors over the latent…

Machine Learning · Statistics 2020-10-26 Matthew Ashman , Jonathan So , Will Tebbutt , Vincent Fortuin , Michael Pearce , Richard E. Turner

Co-NavGPT: Multi-Robot Cooperative Visual Semantic Navigation Using Vision Language Models

Visual target navigation is a critical capability for autonomous robots operating in unknown environments, particularly in human-robot interaction scenarios. While classical and learning-based methods have shown promise, most existing…

Robotics · Computer Science 2025-05-07 Bangguo Yu , Qihao Yuan , Kailai Li , Hamidreza Kasaei , Ming Cao

GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving

Self-supervised pre-training based on next-token prediction has enabled large language models to capture the underlying structure of text, and has led to unprecedented performance on a large array of tasks when applied at scale. Similarly,…

Computer Vision and Pattern Recognition · Computer Science 2025-03-21 William Ljungbergh , Adam Lilja , Adam Tonderski. Arvid Laveno Ling , Carl Lindström , Willem Verbeke , Junsheng Fu , Christoffer Petersson , Lars Hammarstrand , Michael Felsberg

Long-Term Autonomous Ocean Monitoring with Streaming Samples

In the autonomous ocean monitoring task, the sampling robot moves in the environment and accumulates data continuously. The widely adopted spatial modeling method - standard Gaussian process (GP) regression - becomes inadequate in…

Robotics · Computer Science 2023-06-13 Weizhe Chen , Lantao Liu

ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting

3D occupancy prediction is critical for comprehensive scene understanding in vision-centric autonomous driving. Recent advances have explored utilizing 3D semantic Gaussians to model occupancy while reducing computational overhead, but they…

Computer Vision and Pattern Recognition · Computer Science 2026-02-27 Xiaoyang Yan , Muleilan Pei , Shaojie Shen

Safe and Efficient Navigation in Extreme Environments using Semantic Belief Graphs

To achieve autonomy in unknown and unstructured environments, we propose a method for semantic-based planning under perceptual uncertainty. This capability is crucial for safe and efficient robot navigation in environment with…

Robotics · Computer Science 2023-04-04 Muhammad Fadhil Ginting , Sung-Kyun Kim , Oriana Peltzer , Joshua Ott , Sunggoo Jung , Mykel J. Kochenderfer , Ali-akbar Agha-mohammadi

GaussNav: Gaussian Splatting for Visual Navigation

In embodied vision, Instance ImageGoal Navigation (IIN) requires an agent to locate a specific object depicted in a goal image within an unexplored environment. The primary challenge of IIN arises from the need to recognize the target…

Computer Vision and Pattern Recognition · Computer Science 2025-02-05 Xiaohan Lei , Min Wang , Wengang Zhou , Houqiang Li

Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation

The ability to perform effective planning is crucial for building an instruction-following agent. When navigating through a new environment, an agent is challenged with (1) connecting the natural language instructions with its progressively…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Zhiwei Deng , Karthik Narasimhan , Olga Russakovsky

Online Gaussian Process State-Space Model: Learning and Planning for Partially Observable Dynamical Systems

This paper proposes an online learning method of Gaussian process state-space model (GP-SSM). GP-SSM is a probabilistic representation learning scheme that represents unknown state transition and/or measurement models as Gaussian processes…

Robotics · Computer Science 2024-10-30 Soon-Seo Park , Young-Jin Park , Youngjae Min , Han-Lim Choi

Camera-based 3D Semantic Scene Completion with Sparse Guidance Network

Semantic scene completion (SSC) aims to predict the semantic occupancy of each voxel in the entire 3D scene from limited observations, which is an emerging and critical task for autonomous driving. Recently, many studies have turned to…

Computer Vision and Pattern Recognition · Computer Science 2024-10-01 Jianbiao Mei , Yu Yang , Mengmeng Wang , Junyu Zhu , Jongwon Ra , Yukai Ma , Laijian Li , Yong Liu