English
Related papers

Related papers: Evaluating Human-Language Model Interaction

200 papers

The outstanding performance capabilities of large language model have driven the evolution of current AI system interaction patterns. This has led to considerable discussion within the Human-AI Interaction (HAII) community. Numerous studies…

Human-Computer Interaction · Computer Science 2026-04-02 Jiayang Li , Jiale Li

Conversational human-likeness plays a central role in human-AI interaction, yet it has remained difficult to define, measure, and optimize. As a result, improvements in human-like behavior are largely driven by scale or broad supervised…

Artificial Intelligence · Computer Science 2026-01-08 Masum Hasan , Junjie Zhao , Ehsan Hoque

While large language models (LLMs) are increasingly used to assist users in various tasks through natural language interactions, these interactions often fall short due to LLMs' limited ability to infer contextual nuances and user…

Human-Computer Interaction · Computer Science 2025-03-04 Yoonsu Kim , Brandon Chin , Kihoon Son , Seoyoung Kim , Juho Kim

Large Language Models (LLMs) have made progress in various real-world tasks, which stimulates requirements for the evaluation of LLMs. Existing LLM evaluation methods are mainly supervised signal-based which depends on static datasets and…

Computation and Language · Computer Science 2023-09-11 Jiatong Li , Rui Li , Qi Liu

The ability to communicate uncertainty, risk, and limitation is crucial for the safety of large language models. However, current evaluations of these abilities rely on simple calibration, asking whether the language generated by the model…

Computation and Language · Computer Science 2024-10-04 Kaitlyn Zhou , Jena D. Hwang , Xiang Ren , Nouha Dziri , Dan Jurafsky , Maarten Sap

Large language models (LLMs) have seen increasing popularity in enterprise applications where AI agents and humans engage in objective-driven interactions. However, these systems are difficult to evaluate: data may be complex and unlabeled;…

Machine Learning · Computer Science 2025-11-06 Emi Soroka , Tanmay Chopra , Krish Desai , Sanjay Lall

To evaluate Large Language Models (LLMs) for question answering (QA), traditional methods typically focus on assessing single-turn responses to given questions. However, this approach doesn't capture the dynamic nature of human-AI…

Computation and Language · Computer Science 2024-11-19 Ruosen Li , Ruochen Li , Barry Wang , Xinya Du

Deployed artificial intelligence (AI) often impacts humans, and there is no one-size-fits-all metric to evaluate these tools. Human-centered evaluation of AI-based systems combines quantitative and qualitative analysis and human input. It…

Human-Computer Interaction · Computer Science 2023-03-14 Teresa Datta , John P. Dickerson

The evaluation of large language models faces significant challenges. Technical benchmarks often lack real-world relevance, while existing human preference evaluations suffer from unrepresentative sampling, superficial assessment depth, and…

Computation and Language · Computer Science 2026-03-06 Nora Petrova , Andrew Gordon , Enzo Blindow

As Large Language Models (LLMs) are increasingly adopted in software engineering, recently in the form of conversational assistants, ensuring these technologies align with developers' needs is essential. The limitations of traditional…

Software Engineering · Computer Science 2025-02-13 Jonan Richards , Mairieli Wessel

Improving the Theory of Mind (ToM) capability of Large Language Models (LLMs) is crucial for effective social interactions between these AI models and humans. However, the existing benchmarks often measure ToM capability improvement through…

Artificial Intelligence · Computer Science 2026-05-18 Nanxu Gong , Zixin Chen , Haotian Li , Zishu Zhao , Jianxun Lian , Huamin Qu , Yanjie Fu , Xing Xie

Recent progress in large language model (LLM) technology has significantly enhanced the interaction experience between humans and voice assistants (VAs). This project aims to explore a user's continuous interaction with LLM-based VA…

Human-Computer Interaction · Computer Science 2024-09-04 Szeyi Chan , Shihan Fu , Jiachen Li , Bingsheng Yao , Smit Desai , Mirjana Prpa , Dakuo Wang

Standard single-turn, static benchmarks fall short in evaluating the nuanced capabilities of Large Language Models (LLMs) on complex tasks such as software engineering. In this work, we propose a novel interactive evaluation framework that…

Artificial Intelligence · Computer Science 2025-08-27 Dimitrios Rontogiannis , Maxime Peyrard , Nicolas Baldwin , Martin Josifoski , Robert West , Dimitrios Gunopulos

The rise of large language models (LLMs) has revolutionized the way that we interact with artificial intelligence systems through natural language. However, LLMs often misinterpret user queries because of their uncertain intention, leading…

Computation and Language · Computer Science 2024-02-07 Jing-Cheng Pang , Heng-Bo Fan , Pengyuan Wang , Jia-Hao Xiao , Nan Tang , Si-Hang Yang , Chengxing Jia , Sheng-Jun Huang , Yang Yu

Human-robot interaction (HRI) has long studied how agents and people coordinate to achieve shared goals. In this work, we formalize and benchmark the non-intrusive assistance as an independent paradigm of HRI, where a robot proactively…

Robotics · Computer Science 2026-05-05 Yuedi Zhang , Shuanghao Bai , Wanqi Zhou , Haoran Zhang , Qi Zhang , Zhirong Luan , Badong Chen

To achieve natural and intuitive interaction with people, HRI frameworks combine a wide array of methods for human perception, intention communication, human-aware navigation and collaborative action. In practice, when encountering…

As synthetic data becomes increasingly prevalent in training language models, particularly through generated dialogue, concerns have emerged that these models may deviate from authentic human language patterns, potentially losing the…

Computation and Language · Computer Science 2024-09-25 Xufeng Duan , Bei Xiao , Xuemei Tang , Zhenguang G. Cai

Existing benchmarks do not test Large Multimodal Models (LMMs) on their interactive intelligence with human users, which is vital for developing general-purpose AI assistants. We design InterFeedback, an interactive framework, which can be…

Computation and Language · Computer Science 2025-11-10 Henry Hengyuan Zhao , Wenqi Pei , Yifei Tao , Haiyang Mei , Mike Zheng Shou

Human-AI interactions are increasingly part of everyday life, yet the interpersonal dynamics that unfold during such exchanges remain underexplored. This study investigates how emotional alignment, semantic exploration, and linguistic…

Human-Computer Interaction · Computer Science 2025-12-22 Halfdan Nordahl Fundal , Johannes Eide Rambøll , Karsten Olsen

Human-Object Interaction (HOI) detection is a longstanding computer vision problem concerned with predicting the interaction between humans and objects. Current HOI models rely on a vocabulary of interactions at training and inference time,…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Francesco Tonini , Alessandro Conti , Lorenzo Vaquero , Cigdem Beyan , Elisa Ricci
‹ Prev 1 2 3 10 Next ›