English
Related papers

Related papers: ProCompNav: Proactive Instance Navigation with Com…

200 papers

Text-goal instance navigation (TGIN) asks an agent to resolve a single, free-form description into actions that reach the correct object instance among same-category distractors. We present \textit{Context-Nav}, which elevates long,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Won Shik Jang , Ue-Hwan Kim

We propose Question-Asking Navigation (QAsk-Nav), the first reproducible benchmark for Collaborative Instance Object Navigation (CoIN) that enables an explicit, separate assessment of embodied navigation and collaborative question asking.…

Computer Vision and Pattern Recognition · Computer Science 2026-04-02 Edoardo Zorzi , Francesco Taioli , Yiming Wang , Marco Cristani , Alessandro Farinelli , Alberto Castellini , Loris Bazzani

Language-driven instance object navigation assumes that human users initiate the task by providing a detailed description of the target instance to the embodied agent. While this description is crucial for distinguishing the target from…

Artificial Intelligence · Computer Science 2025-03-19 Francesco Taioli , Edoardo Zorzi , Gianni Franchi , Alberto Castellini , Alessandro Farinelli , Marco Cristani , Yiming Wang

The ability to predict a user's information need would have wide-ranging implications, from saving time and effort to mitigating vocabulary gaps. We study how to interactively predict a user's information need by letting them select a…

Information Retrieval · Computer Science 2025-01-07 Kevin Ros , Dhyey Pandya , ChengXiang Zhai

The query suggestion or auto-completion mechanisms help users to type less while interacting with a search engine. A basic approach that ranks suggestions according to their frequency in the query logs is suboptimal. Firstly, many candidate…

Information Retrieval · Computer Science 2013-12-06 Eugene Kharitonov , Craig Macdonald , Pavel Serdyukov , Iadh Ounis

Candidate object proposals generated by object detectors based on convolutional neural network (CNN) encounter easy-hard samples imbalance problem, which can affect overall performance. In this study, we propose a Proposal-balanced Network…

Computer Vision and Pattern Recognition · Computer Science 2020-05-28 Jing Wu , Xiang Zhang , Mingyi Zhou , Ce Zhu

Improving instance-specific image goal navigation (InstanceImageNav), which locates the identical object in a real-world environment from a query image, is essential for robotic systems to assist users in finding desired objects. The…

While AI agents demonstrate remarkable capabilities in reasoning and tool use, they remain fundamentally reactive: they compute responses only after explicit user prompts. This paradigm ignores a critical opportunity: the idle time between…

Computation and Language · Computer Science 2026-05-27 Haoyi Hu , Qirong Lyu , Xianghan Kong , Weiwen Liu , Jianghao Lin , Zixuan Guo , Yan Xu , Yasheng Wang , Weinan Zhang , Yong Yu

Language agents have demonstrated remarkable potential in web search and information retrieval. However, these search agents assume user queries are complete and unambiguous, an assumption that diverges from reality where users begin with…

Providing user-understandable explanations to justify recommendations could help users better understand the recommended items, increase the system's ease of use, and gain users' trust. A typical approach to realize it is natural language…

Information Retrieval · Computer Science 2023-01-16 Lei Li , Yongfeng Zhang , Li Chen

Large language models (LLMs) have become integral to modern Human-AI collaboration workflows, where accurately understanding user intent serves as a crucial step for generating satisfactory responses. Context-aware intent understanding,…

Computation and Language · Computer Science 2026-03-05 Guanming Liu , Meng Wu , Peng Zhang , Yu Zhang , Yubo Shu , Xianliang Huang , Kainan Tu , Ning Gu , Liuxin Zhang , Qianying Wang , Tun Lu

Voice Assistants aim to fulfill user requests by choosing the best intent from multiple options generated by its Automated Speech Recognition and Natural Language Understanding sub-systems. However, voice assistants do not always produce…

Machine Learning · Computer Science 2020-05-05 Raviteja Anantha , Srinivas Chappidi , William Dawoodi

Object goal navigation (ObjectNav) is a fundamental task in embodied AI, requiring an agent to locate a target object in previously unseen environments. This task is particularly challenging because it requires both perceptual and cognitive…

Computer Vision and Pattern Recognition · Computer Science 2025-08-29 Yihan Cao , Jiazhao Zhang , Zhinan Yu , Shuzhen Liu , Zheng Qin , Qin Zou , Bo Du , Kai Xu

The rise of personal assistant agents, e.g., OpenClaw, highlights the growing potential of large language models to support users across everyday life and work. A core challenge in these settings is proactive assistance, since users often…

Artificial Intelligence · Computer Science 2026-05-20 Haoran Zhang , Luxin Xu , Zhilin Wang , Runquan Gui , Shunkai Zhang , Haodi Lei , Zihao He , Bingsu He , Chicheng Qin , Tong Zhu , Xiaoye Qu , Yang Yang , Yu Cheng , Yafu Li

Driven by the increasing demand for applications in augmented and virtual reality, egocentric action recognition has emerged as a prominent research area. It is typically divided into two subtasks: recognizing the performed behavior (i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2025-08-08 Huaihai Lyu , Chaofan Chen , Yuheng Ji , Changsheng Xu

The field of conversational information seeking, which is rapidly gaining interest in both academia and industry, is changing how we interact with search engines through natural language interactions. Existing datasets and methods are…

Information Retrieval · Computer Science 2024-05-13 Chris Samarinas , Hamed Zamani

A proactive dialogue system has the ability to proactively lead the conversation. Different from the general chatbots which only react to the user, proactive dialogue systems can be used to achieve some goals, e.g., to recommend some items…

Computation and Language · Computer Science 2021-07-20 Yutao Zhu , Jian-Yun Nie , Kun Zhou , Pan Du , Hao Jiang , Zhicheng Dou

Conversational assistants are being progressively adopted by the general population. However, they are not capable of handling complicated information-seeking tasks that involve multiple turns of information exchange. Due to the limited…

Information Retrieval · Computer Science 2019-01-14 Chen Qu , Liu Yang , Bruce Croft , Yongfeng Zhang , Johanne R. Trippas , Minghui Qiu

We deal with the scenario of conversational search, where user queries are under-specified or ambiguous. This calls for a mixed-initiative setup. User-asks (queries) and system-answers, as well as system-asks (clarification questions) and…

Computation and Language · Computer Science 2022-05-24 Yosi Mass , Doron Cohen , Asaf Yehudai , David Konopnicki

Vision-and-Language Navigation (VLN) requires agents to accurately perceive complex visual environments and reason over navigation instructions and histories. However, existing methods passively process redundant visual inputs and treat all…

Robotics · Computer Science 2026-03-17 Wei Xue , Mingcheng Li , Xuecheng Wu , Jingqun Tang , Dingkang Yang , Lihua Zhang
‹ Prev 1 2 3 10 Next ›