Robotics · Computer Science
SpatialPoint: Spatial-aware Point Prediction for Embodied Localization
Qiming Zhu, Zhirui Fang, Tianming Zhang, Chuanxiu Liu +2
2026-03-31
Computer Vision and Pattern Recognition · Computer Science
Spatial-ViLT: Enhancing Visual Spatial Reasoning through Multi-Task Learning
Chashi Mahiul Islam, Oteo Mamo, Samuel Jacob Chacko, Xiuwen Liu +1
2025-10-07
Computer Vision and Pattern Recognition · Computer Science
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities
Boyuan Chen, Zhuo Xu, Sean Kirmani, Brian Ichter +5
2024-01-23
Artificial Intelligence · Computer Science
EmbSpatial-Bench: Benchmarking Spatial Understanding for Embodied Tasks with Large Vision-Language Models
Mengfei Du, Binhao Wu, Zejun Li, Xuanjing Huang +1
2024-06-11
Computer Vision and Pattern Recognition · Computer Science
SD-VLM: Spatial Measuring and Understanding with Depth-Encoded Vision-Language Models
Pingyi Chen, Yujing Lou, Shen Cao, Jinhui Guo +5
2025-09-23
Computer Vision and Pattern Recognition · Computer Science
SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models
An-Chieh Cheng, Hongxu Yin, Yang Fu, Qiushan Guo +4
2024-10-16
Computer Vision and Pattern Recognition · Computer Science
Seeing Across Views: Benchmarking Spatial Reasoning of Vision-Language Models in Robotic Scenes
Zhiyuan Feng, Zhaolu Kang, Qijie Wang, Zhiying Du +15
2026-03-03
Computer Vision and Pattern Recognition · Computer Science
AirSpatialBot: A Spatially-Aware Aerial Agent for Fine-Grained Vehicle Attribute Recognization and Retrieval
Yue Zhou, Ran Ding, Xue Yang, Xue Jiang +1
2026-01-06
Robotics · Computer Science
SpatialCoT: Advancing Spatial Reasoning through Coordinate Alignment and Chain-of-Thought for Embodied Task Planning
Yuecheng Liu, Dafeng Chi, Shiguang Wu, Zhanguang Zhang +12
2025-01-24
Computer Vision and Pattern Recognition · Computer Science
Embodied Scene Understanding for Vision Language Models via MetaVQA
Weizhen Wang, Chenda Duan, Zhenghao Peng, Yuxin Liu +1
2025-01-17
Computer Vision and Pattern Recognition · Computer Science
The Spatial Blindspot of Vision-Language Models
Nahid Alam, Leema Krishna Murali, Siddhant Bharadwaj, Patrick Liu +6
2026-01-26
Computer Vision and Pattern Recognition · Computer Science
ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models
Dingming Li, Hongxing Li, Zixuan Wang, Yuchen Yan +8
2025-10-01
Computer Vision and Pattern Recognition · Computer Science
Spatial Understanding from Videos: Structured Prompts Meet Simulation Data
Haoyu Zhang, Meng Liu, Zaijing Li, Haokun Wen +3
2025-09-22
Artificial Intelligence · Computer Science
Towards Embodied Cognition in Robots via Spatially Grounded Synthetic Worlds
Joel Currie, Gioele Migno, Enrico Piacenti, Maria Elena Giannaccini +3
2025-05-21
Computer Vision and Pattern Recognition · Computer Science
Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
Diankun Wu, Fangfu Liu, Yi-Hsin Hung, Yueqi Duan
2026-05-20
Computer Vision and Pattern Recognition · Computer Science
Embodied3DBench: Benchmarking Low-Level Embodied Spatial Intelligence of Vision Language Models
Jiyao Zhang, Mingxu Zhang, Yitong Peng, Haoxuan Liu +7
2026-05-29
Computer Vision and Pattern Recognition · Computer Science
LRR-Bench: Left, Right or Rotate? Vision-Language models Still Struggle With Spatial Understanding Tasks
Fei Kong, Jinhao Duan, Kaidi Xu, Zhenhua Guo +2
2026-02-24
Machine Learning · Computer Science
SpatialMath: Spatial Comprehension-Infused Symbolic Reasoning for Mathematical Problem-Solving
Ashutosh Bajpai, Akshat Bhandari, Akshay Nambi, Tanmoy Chakraborty
2026-01-27
Computer Vision and Pattern Recognition · Computer Science
STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?
Yun Li, Yiming Zhang, Tao Lin, Xiangrui Liu +3
2025-07-18
Computer Vision and Pattern Recognition · Computer Science
Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Jiayu Wang, Yifei Ming, Zhenmei Shi, Vibhav Vineet +3
2024-11-06
Computer Vision and Pattern Recognition · Computer Science
Beyond Medical Diagnostics: How Medical Multimodal Large Language Models Think in Space
Quoc-Huy Trinh, Xi Ding, Yang Liu, Zhenyue Qin +6
2026-03-17
Artificial Intelligence · Computer Science
A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science
Jie Feng, Jinwei Zeng, Qingyue Long, Hongyi Chen +14
2025-04-15
Computer Vision and Pattern Recognition · Computer Science
SpatialMosaic: A Multiview VLM Dataset for Partial Visibility
Kanghee Lee, Injae Lee, Minseok Kwak, Jungi Hong +2
2026-04-10