Related papers: UXAgent: An LLM Agent-Based Usability Testing Fram…

UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents

Usability testing is a fundamental research method that user experience (UX) researchers use to evaluate and iterate their new designs. But what about evaluating and iterating the usability testing study design itself? Recent advances in…

Computation and Language · Computer Science 2025-09-22 Yuxuan Lu , Bingsheng Yao , Hansu Gu , Jing Huang , Jessie Wang , Yang Li , Jiri Gesi , Qi He , Toby Jia-Jun Li , Dakuo Wang

OSS-UAgent: An Agent-based Usability Evaluation Framework for Open Source Software

Usability evaluation is critical to the impact and adoption of open source software (OSS), yet traditional methods relying on human evaluators suffer from high costs and limited scalability. To address these limitations, we introduce…

Software Engineering · Computer Science 2025-05-30 Lingkai Meng , Yu Shao , Long Yuan , Longbin Lai , Peng Cheng , Wenyuan Yu , Wenjie Zhang , Xuemin Lin , Jingren Zhou

UXCascade: Scalable Usability Testing with Simulated User Agents

Simulated user agents are increasingly used in usability testing to support fast, iterative UX workflows, as they generate rich data such as action logs and think-aloud reasoning, but the unstructured nature of this output often obscures…

Human-Computer Interaction · Computer Science 2026-01-23 Steffen Holter , Eunyee Koh , Mustafa Doga Dogan , Gromit Yeuk-Yin Chan

UXSim: Towards a Hybrid User Search Simulation

Simulating nuanced user experiences within complex interactive search systems poses distinct challenge for traditional methodologies, which often rely on static user proxies or, more recently, on standalone large language model (LLM) agents…

Information Retrieval · Computer Science 2026-03-02 Saber Zerhoudi , Michael Granitzer

USimAgent: Large Language Models for Simulating Search Users

Due to the advantages in the cost-efficiency and reproducibility, user simulation has become a promising solution to the user-centric evaluation of information retrieval systems. Nonetheless, accurately simulating user search behaviors has…

Information Retrieval · Computer Science 2024-10-30 Erhan Zhang , Xingzhu Wang , Peiyuan Gong , Yankai Lin , Jiaxin Mao

From SERPs to Agents: A Platform for Comparative Studies of Information Interaction

The diversification of information access systems, from RAG to autonomous agents, creates a critical need for comparative user studies. However, the technical overhead to deploy and manage these distinct systems is a major barrier. We…

Human-Computer Interaction · Computer Science 2026-01-16 Saber Zerhoudi , Michael Granitzer

Agents for Automated User Experience Testing

The automation of functional testing in software has allowed developers to continuously check for negative impacts on functionality throughout the iterative phases of development. This is not the case for User eXperience (UX), which has…

Artificial Intelligence · Computer Science 2021-04-14 Pedro M. Fernandes , Manuel Lopes , Rui Prada

Portal UX Agent -- A Plug-and-Play Engine for Rendering UIs from Natural Language Specifications

The rapid appearance of large language models (LLMs) has led to systems that turn natural-language intent into real user interfaces (UIs). Free-form code generation maximizes expressiveness but often hurts reliability, security, and…

Human-Computer Interaction · Computer Science 2025-11-04 Xinsong Li , Ning Jiang , Jay Selvaraj

TestAgent: An Adaptive and Intelligent Expert for Human Assessment

Accurately assessing internal human states is key to understanding preferences, offering personalized services, and identifying challenges in real-world applications. Originating from psychometrics, adaptive testing has become the…

Artificial Intelligence · Computer Science 2025-06-04 Junhao Yu , Yan Zhuang , YuXuan Sun , Weibo Gao , Qi Liu , Mingyue Cheng , Zhenya Huang , Enhong Chen

AgentA/B: Automated and Scalable Web A/BTesting with Interactive LLM Agents

A/B testing experiment is a widely adopted method for evaluating UI/UX design decisions in modern web applications. Yet, traditional A/B testing remains constrained by its dependence on the large-scale and live traffic of human…

Human-Computer Interaction · Computer Science 2026-03-12 Yuxuan Lu , Ting-Yao Hsu , Hansu Gu , Limeng Cui , Yaochen Xie , William Headden , Bingsheng Yao , Akash Veeragouni , Jiapeng Liu , Sreyashi Nag , Jessie Wang , Dakuo Wang

How can we assess human-agent interactions? Case studies in software agent design

LLM-powered agents are both a promising new technology and a source of complexity, where choices about models, tools, and prompting can affect their usefulness. While numerous benchmarks measure agent accuracy across domains, they mostly…

Artificial Intelligence · Computer Science 2025-11-05 Valerie Chen , Rohit Malhotra , Xingyao Wang , Juan Michelini , Xuhui Zhou , Aditya Bharat Soni , Hoang H. Tran , Calvin Smith , Ameet Talwalkar , Graham Neubig

Avenir-UX: Automated UX Evaluation via Simulated Human Web Interaction with GUI Grounding

Evaluating web usability typically requires time-consuming user studies and expert reviews, which often limits iteration speed during product development, especially for small teams and agile workflows. We present Avenir-UX, a…

Artificial Intelligence · Computer Science 2026-04-16 Wee Joe Tan , Zi Rui Lucas Lim , Shashank Durgad , Karim Obegi , Aiden Yiliu Li

EvAlignUX: Advancing UX Evaluation through LLM-Supported Metrics Exploration

Evaluating UX in the context of AI's complexity, unpredictability, and generative nature presents unique challenges. How can we support HCI researchers to create comprehensive UX evaluation plans? In this paper, we introduce EvAlignUX, a…

Human-Computer Interaction · Computer Science 2025-07-09 Qingxiao Zheng , Minrui Chen , Pranav Sharma , Yiliu Tang , Mehul Oswal , Yiren Liu , Yun Huang

XAgen: An Explainability Tool for Identifying and Correcting Failures in Multi-Agent Workflows

As multi-agent systems powered by Large Language Models (LLMs) are increasingly adopted in real-world workflows, users with diverse technical backgrounds are now building and refining their own agentic processes. However, these systems can…

Human-Computer Interaction · Computer Science 2026-03-05 Xinru Wang , Ming Yin , Eunyee Koh , Mustafa Doga Dogan

Mapping the Design Space of User Experience for Computer Use Agents

Large language model (LLM)-based computer use agents execute user commands by interacting with available UI elements, but little is known about how users want to interact with these agents or what design factors matter for their user…

Human-Computer Interaction · Computer Science 2026-02-10 Ruijia Cheng , Jenny T. Liang , Eldon Schoop , Jeffrey Nichols

A Survey on (M)LLM-Based GUI Agents

Graphical User Interface (GUI) Agents have emerged as a transformative paradigm in human-computer interaction, evolving from rule-based automation scripts to sophisticated AI-driven systems capable of understanding and executing complex…

Human-Computer Interaction · Computer Science 2025-06-05 Fei Tang , Haolei Xu , Hang Zhang , Siqi Chen , Xingyu Wu , Yongliang Shen , Wenqi Zhang , Guiyang Hou , Zeqi Tan , Yuchen Yan , Kaitao Song , Jian Shao , Weiming Lu , Jun Xiao , Yueting Zhuang

Training Computer Use Agents to Assess the Usability of Graphical User Interfaces

Usability testing with experts and potential users can assess the effectiveness, efficiency, and user satisfaction of graphical user interfaces (GUIs) but doing so remains a costly and time-intensive process. Prior work has used computer…

Computation and Language · Computer Science 2026-04-30 Alice Gao , Weixi Tong , Rishab Vempati , Katharina Reinecke , R. Benjamin Shapiro , Tianyi Zhang , Jason Wu

Simulation Agent: A Framework for Integrating Simulation and Large Language Models for Enhanced Decision-Making

Simulations, although powerful in accurately replicating real-world systems, often remain inaccessible to non-technical users due to their complexity. Conversely, large language models (LLMs) provide intuitive, language-based interactions…

Computation and Language · Computer Science 2025-05-22 Jacob Kleiman , Kevin Frank , Joseph Voyles , Sindy Campagna

Exploring Recommender System Evaluation: A Multi-Modal User Agent Framework for A/B Testing

In recommender systems, online A/B testing is a crucial method for evaluating the performance of different models. However, conducting online A/B testing often presents significant challenges, including substantial economic costs, user…

Information Retrieval · Computer Science 2026-01-09 Wenlin Zhang , Xiangyang Li , Qiyuan Ge , Kuicai Dong , Pengyue Jia , Xiaopeng Li , Zijian Zhang , Maolin Wang , Yichao Wang , Huifeng Guo , Ruiming Tang , Xiangyu Zhao

TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System

Training AI models has always been challenging, especially when there is a need for custom models to provide personalized services. Algorithm engineers often face a lengthy process to iteratively develop models tailored to specific business…

Artificial Intelligence · Computer Science 2023-11-27 Haoyuan Li , Hao Jiang , Tianke Zhang , Zhelun Yu , Aoxiong Yin , Hao Cheng , Siming Fu , Yuhao Zhang , Wanggui He