English
Related papers

Related papers: UXAgent: An LLM Agent-Based Usability Testing Fram…

200 papers

Usability testing is a fundamental research method that user experience (UX) researchers use to evaluate and iterate their new designs. But what about evaluating and iterating the usability testing study design itself? Recent advances in…

Computation and Language · Computer Science 2025-09-22 Yuxuan Lu , Bingsheng Yao , Hansu Gu , Jing Huang , Jessie Wang , Yang Li , Jiri Gesi , Qi He , Toby Jia-Jun Li , Dakuo Wang

Usability evaluation is critical to the impact and adoption of open source software (OSS), yet traditional methods relying on human evaluators suffer from high costs and limited scalability. To address these limitations, we introduce…

Software Engineering · Computer Science 2025-05-30 Lingkai Meng , Yu Shao , Long Yuan , Longbin Lai , Peng Cheng , Wenyuan Yu , Wenjie Zhang , Xuemin Lin , Jingren Zhou

Simulated user agents are increasingly used in usability testing to support fast, iterative UX workflows, as they generate rich data such as action logs and think-aloud reasoning, but the unstructured nature of this output often obscures…

Human-Computer Interaction · Computer Science 2026-01-23 Steffen Holter , Eunyee Koh , Mustafa Doga Dogan , Gromit Yeuk-Yin Chan

Simulating nuanced user experiences within complex interactive search systems poses distinct challenge for traditional methodologies, which often rely on static user proxies or, more recently, on standalone large language model (LLM) agents…

Information Retrieval · Computer Science 2026-03-02 Saber Zerhoudi , Michael Granitzer

Due to the advantages in the cost-efficiency and reproducibility, user simulation has become a promising solution to the user-centric evaluation of information retrieval systems. Nonetheless, accurately simulating user search behaviors has…

Information Retrieval · Computer Science 2024-10-30 Erhan Zhang , Xingzhu Wang , Peiyuan Gong , Yankai Lin , Jiaxin Mao

The diversification of information access systems, from RAG to autonomous agents, creates a critical need for comparative user studies. However, the technical overhead to deploy and manage these distinct systems is a major barrier. We…

Human-Computer Interaction · Computer Science 2026-01-16 Saber Zerhoudi , Michael Granitzer

The automation of functional testing in software has allowed developers to continuously check for negative impacts on functionality throughout the iterative phases of development. This is not the case for User eXperience (UX), which has…

Artificial Intelligence · Computer Science 2021-04-14 Pedro M. Fernandes , Manuel Lopes , Rui Prada

The rapid appearance of large language models (LLMs) has led to systems that turn natural-language intent into real user interfaces (UIs). Free-form code generation maximizes expressiveness but often hurts reliability, security, and…

Human-Computer Interaction · Computer Science 2025-11-04 Xinsong Li , Ning Jiang , Jay Selvaraj

Accurately assessing internal human states is key to understanding preferences, offering personalized services, and identifying challenges in real-world applications. Originating from psychometrics, adaptive testing has become the…

Artificial Intelligence · Computer Science 2025-06-04 Junhao Yu , Yan Zhuang , YuXuan Sun , Weibo Gao , Qi Liu , Mingyue Cheng , Zhenya Huang , Enhong Chen

A/B testing experiment is a widely adopted method for evaluating UI/UX design decisions in modern web applications. Yet, traditional A/B testing remains constrained by its dependence on the large-scale and live traffic of human…

Human-Computer Interaction · Computer Science 2026-03-12 Yuxuan Lu , Ting-Yao Hsu , Hansu Gu , Limeng Cui , Yaochen Xie , William Headden , Bingsheng Yao , Akash Veeragouni , Jiapeng Liu , Sreyashi Nag , Jessie Wang , Dakuo Wang

LLM-powered agents are both a promising new technology and a source of complexity, where choices about models, tools, and prompting can affect their usefulness. While numerous benchmarks measure agent accuracy across domains, they mostly…

Evaluating web usability typically requires time-consuming user studies and expert reviews, which often limits iteration speed during product development, especially for small teams and agile workflows. We present Avenir-UX, a…

Artificial Intelligence · Computer Science 2026-04-16 Wee Joe Tan , Zi Rui Lucas Lim , Shashank Durgad , Karim Obegi , Aiden Yiliu Li

Evaluating UX in the context of AI's complexity, unpredictability, and generative nature presents unique challenges. How can we support HCI researchers to create comprehensive UX evaluation plans? In this paper, we introduce EvAlignUX, a…

Human-Computer Interaction · Computer Science 2025-07-09 Qingxiao Zheng , Minrui Chen , Pranav Sharma , Yiliu Tang , Mehul Oswal , Yiren Liu , Yun Huang

As multi-agent systems powered by Large Language Models (LLMs) are increasingly adopted in real-world workflows, users with diverse technical backgrounds are now building and refining their own agentic processes. However, these systems can…

Human-Computer Interaction · Computer Science 2026-03-05 Xinru Wang , Ming Yin , Eunyee Koh , Mustafa Doga Dogan

Large language model (LLM)-based computer use agents execute user commands by interacting with available UI elements, but little is known about how users want to interact with these agents or what design factors matter for their user…

Human-Computer Interaction · Computer Science 2026-02-10 Ruijia Cheng , Jenny T. Liang , Eldon Schoop , Jeffrey Nichols

Graphical User Interface (GUI) Agents have emerged as a transformative paradigm in human-computer interaction, evolving from rule-based automation scripts to sophisticated AI-driven systems capable of understanding and executing complex…

Human-Computer Interaction · Computer Science 2025-06-05 Fei Tang , Haolei Xu , Hang Zhang , Siqi Chen , Xingyu Wu , Yongliang Shen , Wenqi Zhang , Guiyang Hou , Zeqi Tan , Yuchen Yan , Kaitao Song , Jian Shao , Weiming Lu , Jun Xiao , Yueting Zhuang

Usability testing with experts and potential users can assess the effectiveness, efficiency, and user satisfaction of graphical user interfaces (GUIs) but doing so remains a costly and time-intensive process. Prior work has used computer…

Computation and Language · Computer Science 2026-04-30 Alice Gao , Weixi Tong , Rishab Vempati , Katharina Reinecke , R. Benjamin Shapiro , Tianyi Zhang , Jason Wu

Simulations, although powerful in accurately replicating real-world systems, often remain inaccessible to non-technical users due to their complexity. Conversely, large language models (LLMs) provide intuitive, language-based interactions…

Computation and Language · Computer Science 2025-05-22 Jacob Kleiman , Kevin Frank , Joseph Voyles , Sindy Campagna

In recommender systems, online A/B testing is a crucial method for evaluating the performance of different models. However, conducting online A/B testing often presents significant challenges, including substantial economic costs, user…

Training AI models has always been challenging, especially when there is a need for custom models to provide personalized services. Algorithm engineers often face a lengthy process to iteratively develop models tailored to specific business…

Artificial Intelligence · Computer Science 2023-11-27 Haoyuan Li , Hao Jiang , Tianke Zhang , Zhelun Yu , Aoxiong Yin , Hao Cheng , Siming Fu , Yuhao Zhang , Wanggui He
‹ Prev 1 2 3 10 Next ›