Related papers: OSS-UAgent: An Agent-based Usability Evaluation Fr…

UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design

Usability testing is a fundamental yet challenging (e.g., inflexible to iterate the study design flaws and hard to recruit study participants) research method for user experience (UX) researchers to evaluate a web design. Recent advances in…

Human-Computer Interaction · Computer Science 2025-04-08 Yuxuan Lu , Bingsheng Yao , Hansu Gu , Jing Huang , Jessie Wang , Yang Li , Jiri Gesi , Qi He , Toby Jia-Jun Li , Dakuo Wang

UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents

Usability testing is a fundamental research method that user experience (UX) researchers use to evaluate and iterate their new designs. But what about evaluating and iterating the usability testing study design itself? Recent advances in…

Computation and Language · Computer Science 2025-09-22 Yuxuan Lu , Bingsheng Yao , Hansu Gu , Jing Huang , Jessie Wang , Yang Li , Jiri Gesi , Qi He , Toby Jia-Jun Li , Dakuo Wang

BuildBench: Benchmarking LLM Agents on Compiling Real-World Open-Source Software

Automatically compiling open-source software (OSS) projects is a vital, labor-intensive, and complex task, which makes it a good challenge for LLM Agents. Existing methods rely on manually curated rules and workflows, which cannot adapt to…

Software Engineering · Computer Science 2025-10-01 Zehua Zhang , Ati Priya Bajaj , Divij Handa , Siyu Liu , Arvind S Raj , Hongkai Chen , Hulin Wang , Yibo Liu , Zion Leonahenahe Basque , Souradip Nath , Vishal Juneja , Nikhil Chapre , Yan Shoshitaishvili , Adam Doupé , Chitta Baral , Ruoyu Wang

Agent S: An Open Agentic Framework that Uses Computers Like a Human

We present Agent S, an open agentic framework that enables autonomous interaction with computers through a Graphical User Interface (GUI), aimed at transforming human-computer interaction by automating complex, multi-step tasks. Agent S…

Artificial Intelligence · Computer Science 2024-10-11 Saaket Agashe , Jiuzhou Han , Shuyu Gan , Jiachen Yang , Ang Li , Xin Eric Wang

The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents

Agents are now used widely in the process of software development, but building production-ready software engineering agents is a complex task. Deploying software agents effectively requires flexibility in implementation and…

Software Engineering · Computer Science 2026-04-23 Xingyao Wang , Simon Rosenberg , Juan Michelini , Calvin Smith , Hoang Tran , Engel Nyst , Rohit Malhotra , Xuhui Zhou , Valerie Chen , Robert Brennan , Graham Neubig

Agents: An Open-source Framework for Autonomous Language Agents

Recent advances on large language models (LLMs) enable researchers and developers to build autonomous language agents that can automatically solve various tasks and interact with environments, humans, and other agents using natural language…

Computation and Language · Computer Science 2023-12-13 Wangchunshu Zhou , Yuchen Eleanor Jiang , Long Li , Jialong Wu , Tiannan Wang , Shi Qiu , Jintian Zhang , Jing Chen , Ruipu Wu , Shuai Wang , Shiding Zhu , Jiyu Chen , Wentao Zhang , Xiangru Tang , Ningyu Zhang , Huajun Chen , Peng Cui , Mrinmaya Sachan

OAgents: An Empirical Study of Building Effective Agents

Recently, Agentic AI has become an increasingly popular research field. However, we argue that current agent research practices lack standardization and scientific rigor, making it hard to conduct fair comparisons among methods. As a…

Artificial Intelligence · Computer Science 2025-06-24 He Zhu , Tianrui Qin , King Zhu , Heyuan Huang , Yeyi Guan , Jinxiang Xia , Yi Yao , Hanhao Li , Ningning Wang , Pai Liu , Tianhao Peng , Xin Gui , Xiaowan Li , Yuhui Liu , Yuchen Eleanor Jiang , Jun Wang , Changwang Zhang , Xiangru Tang , Ge Zhang , Jian Yang , Minghao Liu , Xitong Gao , Jiaheng Liu , Wangchunshu Zhou

Portal UX Agent -- A Plug-and-Play Engine for Rendering UIs from Natural Language Specifications

The rapid appearance of large language models (LLMs) has led to systems that turn natural-language intent into real user interfaces (UIs). Free-form code generation maximizes expressiveness but often hurts reliability, security, and…

Human-Computer Interaction · Computer Science 2025-11-04 Xinsong Li , Ning Jiang , Jay Selvaraj

UXCascade: Scalable Usability Testing with Simulated User Agents

Simulated user agents are increasingly used in usability testing to support fast, iterative UX workflows, as they generate rich data such as action logs and think-aloud reasoning, but the unstructured nature of this output often obscures…

Human-Computer Interaction · Computer Science 2026-01-23 Steffen Holter , Eunyee Koh , Mustafa Doga Dogan , Gromit Yeuk-Yin Chan

Agents for Automated User Experience Testing

The automation of functional testing in software has allowed developers to continuously check for negative impacts on functionality throughout the iterative phases of development. This is not the case for User eXperience (UX), which has…

Artificial Intelligence · Computer Science 2021-04-14 Pedro M. Fernandes , Manuel Lopes , Rui Prada

How can we assess human-agent interactions? Case studies in software agent design

LLM-powered agents are both a promising new technology and a source of complexity, where choices about models, tools, and prompting can affect their usefulness. While numerous benchmarks measure agent accuracy across domains, they mostly…

Artificial Intelligence · Computer Science 2025-11-05 Valerie Chen , Rohit Malhotra , Xingyao Wang , Juan Michelini , Xuhui Zhou , Aditya Bharat Soni , Hoang H. Tran , Calvin Smith , Ameet Talwalkar , Graham Neubig

Experimenting with Multi-Agent Software Development: Towards a Unified Platform

Large language models are redefining software engineering by implementing AI-powered techniques throughout the whole software development process, including requirement gathering, software architecture, code generation, testing, and…

Software Engineering · Computer Science 2024-06-11 Malik Abdul Sami , Muhammad Waseem , Zeeshan Rasheed , Mika Saari , Kari Systä , Pekka Abrahamsson

Unified Software Engineering Agent as AI Software Engineer

The growth of Large Language Model (LLM) technology has raised expectations for automated coding. However, software engineering is more than coding and is concerned with activities including maintenance and evolution of a project. In this…

Software Engineering · Computer Science 2025-12-09 Leonhard Applis , Yuntong Zhang , Shanchao Liang , Nan Jiang , Lin Tan , Abhik Roychoudhury

A Comprehensive Empirical Evaluation of Agent Frameworks on Code-centric Software Engineering Tasks

Unlike traditional automation tools or static LLM-based systems, agents combine decision-making and tool utilization to accomplish complex tasks, showing great potential in software engineering. However, existing studies largely focus on…

Software Engineering · Computer Science 2025-11-04 Zhuowen Yin , Cuifeng Gao , Chunsong Fan , Wenzhang Yang , Yinxing Xue , Lijun Zhang

AutoAgent: A Fully-Automated and Zero-Code Framework for LLM Agents

Large Language Model (LLM) Agents have demonstrated remarkable capabilities in task automation and intelligent decision-making, driving the widespread adoption of agent development frameworks such as LangChain and AutoGen. However, these…

Artificial Intelligence · Computer Science 2025-10-10 Jiabin Tang , Tianyu Fan , Chao Huang

OpenApps: Simulating Environment Variations to Measure UI-Agent Reliability

Reliability is key to realizing the promise of autonomous UI-Agents, multimodal agents that directly interact with apps in the same manner as humans, as users must be able to trust an agent to complete a given task. Current evaluations rely…

Artificial Intelligence · Computer Science 2025-11-27 Karen Ullrich , Jingtong Su , Claudia Shi , Arjun Subramonian , Amir Bar , Ivan Evtimov , Nikolaos Tsilivis , Randall Balestriero , Julia Kempe , Mark Ibrahim

How Do Open Source Software Contributors Perceive and Address Usability? Valued Factors, Practices, and Challenges

Usability is an increasing concern in open source software (OSS). Given the recent changes in the OSS landscape, it is imperative to examine the OSS contributors' current valued factors, practices, and challenges concerning usability. We…

Software Engineering · Computer Science 2020-07-15 Wenting Wang , Jinghui Cheng , Jin L. C. Guo

AgentSims: An Open-Source Sandbox for Large Language Model Evaluation

With ChatGPT-like large language models (LLM) prevailing in the community, how to evaluate the ability of LLMs is an open question. Existing evaluation methods suffer from following shortcomings: (1) constrained evaluation abilities, (2)…

Artificial Intelligence · Computer Science 2023-08-09 Jiaju Lin , Haoran Zhao , Aochi Zhang , Yiting Wu , Huqiuyue Ping , Qin Chen

ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models

Large language models (LLMs) have recently demonstrated remarkable capabilities to comprehend human intentions, engage in reasoning, and design planning-like behavior. To further unleash the power of LLMs to accomplish complex tasks, there…

Computation and Language · Computer Science 2023-09-06 Chenliang Li , Hehong Chen , Ming Yan , Weizhou Shen , Haiyang Xu , Zhikai Wu , Zhicheng Zhang , Wenmeng Zhou , Yingda Chen , Chen Cheng , Hongzhu Shi , Ji Zhang , Fei Huang , Jingren Zhou

OpenCUA: Open Foundations for Computer-Use Agents

Vision-language models have demonstrated impressive capabilities as computer-use agents (CUAs) capable of automating diverse computer tasks. As their commercial potential grows, critical details of the most capable CUA systems remain…

Artificial Intelligence · Computer Science 2025-10-07 Xinyuan Wang , Bowen Wang , Dunjie Lu , Junlin Yang , Tianbao Xie , Junli Wang , Jiaqi Deng , Xiaole Guo , Yiheng Xu , Chen Henry Wu , Zhennan Shen , Zhuokai Li , Ryan Li , Xiaochuan Li , Junda Chen , Boyuan Zheng , Peihang Li , Fangyu Lei , Ruisheng Cao , Yeqiao Fu , Dongchan Shin , Martin Shin , Jiarui Hu , Yuyan Wang , Jixuan Chen , Yuxiao Ye , Danyang Zhang , Dikang Du , Hao Hu , Huarong Chen , Zaida Zhou , Haotian Yao , Ziwei Chen , Qizheng Gu , Yipu Wang , Heng Wang , Diyi Yang , Victor Zhong , Flood Sung , Y. Charles , Zhilin Yang , Tao Yu