Related papers: LiLMaps: Learnable Implicit Language Maps

LAMP: Implicit Language Map for Robot Navigation

Recent advances in vision-language models have made zero-shot navigation feasible, enabling robots to follow natural language instructions without requiring labeling. However, existing methods that explicitly store language vectors in grid…

Robotics · Computer Science 2026-02-13 Sibaek Lee , Hyeonwoo Yu , Giseop Kim , Sunwook Choi

Tag Map: A Text-Based Map for Spatial Reasoning and Navigation with Large Language Models

Large Language Models (LLM) have emerged as a tool for robots to generate task plans using common sense reasoning. For the LLM to generate actionable plans, scene context must be provided, often through a map. Recent works have shifted from…

Robotics · Computer Science 2024-09-25 Mike Zhang , Kaixian Qu , Vaishakh Patil , Cesar Cadena , Marco Hutter

Open-vocabulary Queryable Scene Representations for Real World Planning

Large language models (LLMs) have unlocked new capabilities of task planning from human instructions. However, prior attempts to apply LLMs to real-world robotic tasks are limited by the lack of grounding in the surrounding scene. In this…

Robotics · Computer Science 2022-10-18 Boyuan Chen , Fei Xia , Brian Ichter , Kanishka Rao , Keerthana Gopalakrishnan , Michael S. Ryoo , Austin Stone , Daniel Kappler

ReplanVLM: Replanning Robotic Tasks with Visual Language Models

Large language models (LLMs) have gained increasing popularity in robotic task planning due to their exceptional abilities in text analytics and generation, as well as their broad knowledge of the world. However, they fall short in decoding…

Robotics · Computer Science 2024-08-01 Aoran Mei , Guo-Niu Zhu , Huaxiang Zhang , Zhongxue Gan

TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models

TalkWithMachines aims to enhance human-robot interaction by contributing to interpretable industrial robotic systems, especially for safety-critical applications. The presented paper investigates recent advancements in Large Language Models…

Robotics · Computer Science 2024-12-23 Ammar N. Abbas , Csaba Beleznai

TR-LLM: Integrating Trajectory Data for Scene-Aware LLM-Based Human Action Prediction

Accurate prediction of human behavior is crucial for AI systems to effectively support real-world applications, such as autonomous robots anticipating and assisting with human tasks. Real-world scenarios frequently present challenges such…

Human-Computer Interaction · Computer Science 2025-07-21 Kojiro Takeyama , Yimeng Liu , Misha Sra

IVLMap: Instance-Aware Visual Language Grounding for Consumer Robot Navigation

Vision-and-Language Navigation (VLN) is a challenging task that requires a robot to navigate in photo-realistic environments with human natural language promptings. Recent studies aim to handle this task by constructing the semantic spatial…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Jiacui Huang , Hongtao Zhang , Mingbo Zhao , Zhou Wu

Expanding Frozen Vision-Language Models without Retraining: Towards Improved Robot Perception

Vision-language models (VLMs) have shown powerful capabilities in visual question answering and reasoning tasks by combining visual representations with the abstract skill set large language models (LLMs) learn during pretraining. Vision,…

Artificial Intelligence · Computer Science 2023-09-01 Riley Tavassoli , Mani Amani , Reza Akhavian

ImagineNav: Prompting Vision-Language Models as Embodied Navigator through Scene Imagination

Visual navigation is an essential skill for home-assistance robots, providing the object-searching ability to accomplish long-horizon daily tasks. Many recent approaches use Large Language Models (LLMs) for commonsense inference to improve…

Robotics · Computer Science 2024-10-15 Xinxin Zhao , Wenzhe Cai , Likun Tang , Teng Wang

Language-Augmented Symbolic Planner for Open-World Task Planning

Enabling robotic agents to perform complex long-horizon tasks has been a long-standing goal in robotics and artificial intelligence (AI). Despite the potential shown by large language models (LLMs), their planning capabilities remain…

Robotics · Computer Science 2024-07-16 Guanqi Chen , Lei Yang , Ruixing Jia , Zhe Hu , Yizhou Chen , Wei Zhang , Wenping Wang , Jia Pan

A Survey on Integration of Large Language Models with Intelligent Robots

In recent years, the integration of large language models (LLMs) has revolutionized the field of robotics, enabling robots to communicate, understand, and reason with human-like proficiency. This paper explores the multifaceted impact of…

Robotics · Computer Science 2024-08-16 Yeseung Kim , Dohyun Kim , Jieun Choi , Jisang Park , Nayoung Oh , Daehyung Park

Towards Human Awareness in Robot Task Planning with Large Language Models

The recent breakthroughs in the research on Large Language Models (LLMs) have triggered a transformation across several research domains. Notably, the integration of LLMs has greatly enhanced performance in robot Task And Motion Planning…

Robotics · Computer Science 2024-06-12 Yuchen Liu , Luigi Palmieri , Sebastian Koch , Ilche Georgievski , Marco Aiello

Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks

Designing robotic agents to perform open vocabulary tasks has been the long-standing goal in robotics and AI. Recently, Large Language Models (LLMs) have achieved impressive results in creating robotic agents for performing open vocabulary…

Robotics · Computer Science 2023-12-13 Lingfeng Sun , Devesh K. Jha , Chiori Hori , Siddarth Jain , Radu Corcodel , Xinghao Zhu , Masayoshi Tomizuka , Diego Romeres

Instance-Level Semantic Maps for Vision Language Navigation

Humans have a natural ability to perform semantic associations with the surrounding objects in the environment. This allows them to create a mental map of the environment, allowing them to navigate on-demand when given linguistic…

Robotics · Computer Science 2023-11-21 Laksh Nanwani , Anmol Agarwal , Kanishk Jain , Raghav Prabhakar , Aaron Monis , Aditya Mathur , Krishna Murthy , Abdul Hafez , Vineet Gandhi , K. Madhava Krishna

Vision and Language: Novel Representations and Artificial intelligence for Driving Scene Safety Assessment and Autonomous Vehicle Planning

Vision-language models (VLMs) have recently emerged as powerful representation learning systems that align visual observations with natural language concepts, offering new opportunities for semantic reasoning in safety-critical autonomous…

Computer Vision and Pattern Recognition · Computer Science 2026-02-19 Ross Greer , Maitrayee Keskar , Angel Martinez-Sanchez , Parthib Roy , Shashank Shriram , Mohan Trivedi

Large Language Models for Robotics: Opportunities, Challenges, and Perspectives

Large language models (LLMs) have undergone significant expansion and have been increasingly integrated across various domains. Notably, in the realm of robot task planning, LLMs harness their advanced reasoning and language comprehension…

Robotics · Computer Science 2024-01-10 Jiaqi Wang , Zihao Wu , Yiwei Li , Hanqi Jiang , Peng Shu , Enze Shi , Huawen Hu , Chong Ma , Yiheng Liu , Xuhui Wang , Yincheng Yao , Xuan Liu , Huaqin Zhao , Zhengliang Liu , Haixing Dai , Lin Zhao , Bao Ge , Xiang Li , Tianming Liu , Shu Zhang

Visual Language Maps for Robot Navigation

Grounding language to the visual observations of a navigating agent can be performed using off-the-shelf visual-language models pretrained on Internet-scale data (e.g., image captions). While this is useful for matching images to natural…

Robotics · Computer Science 2023-03-09 Chenguang Huang , Oier Mees , Andy Zeng , Wolfram Burgard

LLM-Enhanced Path Planning: Safe and Efficient Autonomous Navigation with Instructional Inputs

Autonomous navigation guided by natural language instructions is essential for improving human-robot interaction and enabling complex operations in dynamic environments. While large language models (LLMs) are not inherently designed for…

Robotics · Computer Science 2024-12-04 Pranav Doma , Aliasghar Arab , Xuesu Xiao

LIEREx: Language-Image Embeddings for Robotic Exploration

Semantic maps allow a robot to reason about its surroundings to fulfill tasks such as navigating known environments, finding specific objects, and exploring unmapped areas. Traditional mapping approaches provide accurate geometric…

Robotics · Computer Science 2026-02-03 Felix Igelbrink , Lennart Niecksch , Marian Renz , Martin Günther , Martin Atzmueller

LLM-Based Human-Robot Collaboration Framework for Manipulation Tasks

This paper presents a novel approach to enhance autonomous robotic manipulation using the Large Language Model (LLM) for logical inference, converting high-level language commands into sequences of executable motion functions. The proposed…

Robotics · Computer Science 2023-08-30 Haokun Liu , Yaonan Zhu , Kenji Kato , Izumi Kondo , Tadayoshi Aoyama , Yasuhisa Hasegawa