Related papers: ProCompNav: Proactive Instance Navigation with Com…

Context-Nav: Context-Driven Exploration and Viewpoint-Aware 3D Spatial Reasoning for Instance Navigation

Text-goal instance navigation (TGIN) asks an agent to resolve a single, free-form description into actions that reach the correct object instance among same-category distractors. We present \textit{Context-Nav}, which elevates long,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Won Shik Jang , Ue-Hwan Kim

Benchmarking Interaction, Beyond Policy: a Reproducible Benchmark for Collaborative Instance Object Navigation

We propose Question-Asking Navigation (QAsk-Nav), the first reproducible benchmark for Collaborative Instance Object Navigation (CoIN) that enables an explicit, separate assessment of embodied navigation and collaborative question asking.…

Computer Vision and Pattern Recognition · Computer Science 2026-04-02 Edoardo Zorzi , Francesco Taioli , Yiming Wang , Marco Cristani , Alessandro Farinelli , Alberto Castellini , Loris Bazzani

Collaborative Instance Object Navigation: Leveraging Uncertainty-Awareness to Minimize Human-Agent Dialogues

Language-driven instance object navigation assumes that human users initiate the task by providing a detailed description of the target instance to the embodied agent. While this description is crucial for distinguishing the target from…

Artificial Intelligence · Computer Science 2025-03-19 Francesco Taioli , Edoardo Zorzi , Gianni Franchi , Alberto Castellini , Alessandro Farinelli , Marco Cristani , Yiming Wang

Interactive Information Need Prediction with Intent and Context

The ability to predict a user's information need would have wide-ranging implications, from saving time and effort to mitigating vocabulary gaps. We study how to interactively predict a user's information need by letting them select a…

Information Retrieval · Computer Science 2025-01-07 Kevin Ros , Dhyey Pandya , ChengXiang Zhai

Intent Models for Contextualising and Diversifying Query Suggestions

The query suggestion or auto-completion mechanisms help users to type less while interacting with a search engine. A basic approach that ranks suggestions according to their frequency in the query logs is suboptimal. Firstly, many candidate…

Information Retrieval · Computer Science 2013-12-06 Eugene Kharitonov , Craig Macdonald , Pavel Serdyukov , Iadh Ounis

ProbaNet: Proposal-balanced Network for Object Detection

Candidate object proposals generated by object detectors based on convolutional neural network (CNN) encounter easy-hard samples imbalance problem, which can affect overall performance. In this study, we propose a Proposal-balanced Network…

Computer Vision and Pattern Recognition · Computer Science 2020-05-28 Jing Wu , Xiang Zhang , Mingyi Zhou , Ce Zhu

Real-world Instance-specific Image Goal Navigation: Bridging Domain Gaps via Contrastive Learning

Improving instance-specific image goal navigation (InstanceImageNav), which locates the identical object in a real-world environment from a query image, is essential for robotic systems to assist users in finding desired objects. The…

Robotics · Computer Science 2025-09-08 Taichi Sakaguchi , Akira Taniguchi , Yoshinobu Hagiwara , Lotfi El Hafi , Shoichi Hasegawa , Tadahiro Taniguchi

Anticipate and Learn: Unleashing Idle-Time Compute in Proactive Agents

While AI agents demonstrate remarkable capabilities in reasoning and tool use, they remain fundamentally reactive: they compute responses only after explicit user prompts. This paradigm ignores a critical opportunity: the idle time between…

Computation and Language · Computer Science 2026-05-27 Haoyi Hu , Qirong Lyu , Xianghan Kong , Weiwen Liu , Jianghao Lin , Zixuan Guo , Yan Xu , Yasheng Wang , Weinan Zhang , Yong Yu

InteractComp: Evaluating Search Agents With Ambiguous Queries

Language agents have demonstrated remarkable potential in web search and information retrieval. However, these search agents assume user queries are complete and unambiguous, an assumption that diverges from reality where users begin with…

Computation and Language · Computer Science 2025-10-29 Mingyi Deng , Lijun Huang , Yani Fan , Jiayi Zhang , Fashen Ren , Jinyi Bai , Fuzhen Yang , Dayi Miao , Zhaoyang Yu , Yifan Wu , Yanfei Zhang , Fengwei Teng , Yingjia Wan , Song Hu , Yude Li , Xin Jin , Conghao Hu , Haoyu Li , Qirui Fu , Tai Zhong , Xinyu Wang , Xiangru Tang , Nan Tang , Chenglin Wu , Yuyu Luo

Personalized Prompt Learning for Explainable Recommendation

Providing user-understandable explanations to justify recommendations could help users better understand the recommended items, increase the system's ease of use, and gain users' trust. A typical approach to realize it is natural language…

Information Retrieval · Computer Science 2023-01-16 Lei Li , Yongfeng Zhang , Li Chen

IntPro: A Proxy Agent for Context-Aware Intent Understanding via Retrieval-conditioned Inference

Large language models (LLMs) have become integral to modern Human-AI collaboration workflows, where accurately understanding user intent serves as a crucial step for generating satisfactory responses. Context-aware intent understanding,…

Computation and Language · Computer Science 2026-03-05 Guanming Liu , Meng Wu , Peng Zhang , Yu Zhang , Yubo Shu , Xianliang Huang , Kainan Tu , Ning Gu , Liuxin Zhang , Qianying Wang , Tun Lu

Learning to Rank Intents in Voice Assistants

Voice Assistants aim to fulfill user requests by choosing the best intent from multiple options generated by its Automated Speech Recognition and Natural Language Understanding sub-systems. However, voice assistants do not always produce…

Machine Learning · Computer Science 2020-05-05 Raviteja Anantha , Srinivas Chappidi , William Dawoodi

CogNav: Cognitive Process Modeling for Object Goal Navigation with LLMs

Object goal navigation (ObjectNav) is a fundamental task in embodied AI, requiring an agent to locate a target object in previously unseen environments. This task is particularly challenging because it requires both perceptual and cognitive…

Computer Vision and Pattern Recognition · Computer Science 2025-08-29 Yihan Cao , Jiazhao Zhang , Zhinan Yu , Shuzhen Liu , Zheng Qin , Qin Zou , Bo Du , Kai Xu

$\pi$-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

The rise of personal assistant agents, e.g., OpenClaw, highlights the growing potential of large language models to support users across everyday life and work. A core challenge in these settings is proactive assistance, since users often…

Artificial Intelligence · Computer Science 2026-05-20 Haoran Zhang , Luxin Xu , Zhilin Wang , Runquan Gui , Shunkai Zhang , Haodi Lei , Zihao He , Bingsu He , Chicheng Qin , Tong Zhu , Xiaoye Qu , Yang Yang , Yu Cheng , Yafu Li

EgoPrompt: Prompt Learning for Egocentric Action Recognition

Driven by the increasing demand for applications in augmented and virtual reality, egocentric action recognition has emerged as a prominent research area. It is typically divided into two subtasks: recognizing the performed behavior (i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2025-08-08 Huaihai Lyu , Chaofan Chen , Yuheng Ji , Changsheng Xu

ProCIS: A Benchmark for Proactive Retrieval in Conversations

The field of conversational information seeking, which is rapidly gaining interest in both academia and industry, is changing how we interact with search engines through natural language interactions. Existing datasets and methods are…

Information Retrieval · Computer Science 2024-05-13 Chris Samarinas , Hamed Zamani

Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals

A proactive dialogue system has the ability to proactively lead the conversation. Different from the general chatbots which only react to the user, proactive dialogue systems can be used to achieve some goals, e.g., to recommend some items…

Computation and Language · Computer Science 2021-07-20 Yutao Zhu , Jian-Yun Nie , Kun Zhou , Pan Du , Hao Jiang , Zhicheng Dou

User Intent Prediction in Information-seeking Conversations

Conversational assistants are being progressively adopted by the general population. However, they are not capable of handling complicated information-seeking tasks that involve multiple turns of information exchange. Due to the limited…

Information Retrieval · Computer Science 2019-01-14 Chen Qu , Liu Yang , Bruce Croft , Yongfeng Zhang , Johanne R. Trippas , Minghui Qiu

Conversational Search with Mixed-Initiative -- Asking Good Clarification Questions backed-up by Passage Retrieval

We deal with the scenario of conversational search, where user queries are under-specified or ambiguous. This calls for a mixed-initiative setup. User-asks (queries) and system-answers, as well as system-asks (clarification questions) and…

Computation and Language · Computer Science 2022-05-24 Yosi Mass , Doron Cohen , Asaf Yehudai , David Konopnicki

ProFocus: Proactive Perception and Focused Reasoning in Vision-and-Language Navigation

Vision-and-Language Navigation (VLN) requires agents to accurately perceive complex visual environments and reason over navigation instructions and histories. However, existing methods passively process redundant visual inputs and treat all…

Robotics · Computer Science 2026-03-17 Wei Xue , Mingcheng Li , Xuecheng Wu , Jingqun Tang , Dingkang Yang , Lihua Zhang