English
Related papers

Related papers: OSS-UAgent: An Agent-based Usability Evaluation Fr…

200 papers

Usability testing is a fundamental yet challenging (e.g., inflexible to iterate the study design flaws and hard to recruit study participants) research method for user experience (UX) researchers to evaluate a web design. Recent advances in…

Human-Computer Interaction · Computer Science 2025-04-08 Yuxuan Lu , Bingsheng Yao , Hansu Gu , Jing Huang , Jessie Wang , Yang Li , Jiri Gesi , Qi He , Toby Jia-Jun Li , Dakuo Wang

Usability testing is a fundamental research method that user experience (UX) researchers use to evaluate and iterate their new designs. But what about evaluating and iterating the usability testing study design itself? Recent advances in…

Computation and Language · Computer Science 2025-09-22 Yuxuan Lu , Bingsheng Yao , Hansu Gu , Jing Huang , Jessie Wang , Yang Li , Jiri Gesi , Qi He , Toby Jia-Jun Li , Dakuo Wang

Automatically compiling open-source software (OSS) projects is a vital, labor-intensive, and complex task, which makes it a good challenge for LLM Agents. Existing methods rely on manually curated rules and workflows, which cannot adapt to…

We present Agent S, an open agentic framework that enables autonomous interaction with computers through a Graphical User Interface (GUI), aimed at transforming human-computer interaction by automating complex, multi-step tasks. Agent S…

Artificial Intelligence · Computer Science 2024-10-11 Saaket Agashe , Jiuzhou Han , Shuyu Gan , Jiachen Yang , Ang Li , Xin Eric Wang

Agents are now used widely in the process of software development, but building production-ready software engineering agents is a complex task. Deploying software agents effectively requires flexibility in implementation and…

Recent advances on large language models (LLMs) enable researchers and developers to build autonomous language agents that can automatically solve various tasks and interact with environments, humans, and other agents using natural language…

Recently, Agentic AI has become an increasingly popular research field. However, we argue that current agent research practices lack standardization and scientific rigor, making it hard to conduct fair comparisons among methods. As a…

The rapid appearance of large language models (LLMs) has led to systems that turn natural-language intent into real user interfaces (UIs). Free-form code generation maximizes expressiveness but often hurts reliability, security, and…

Human-Computer Interaction · Computer Science 2025-11-04 Xinsong Li , Ning Jiang , Jay Selvaraj

Simulated user agents are increasingly used in usability testing to support fast, iterative UX workflows, as they generate rich data such as action logs and think-aloud reasoning, but the unstructured nature of this output often obscures…

Human-Computer Interaction · Computer Science 2026-01-23 Steffen Holter , Eunyee Koh , Mustafa Doga Dogan , Gromit Yeuk-Yin Chan

The automation of functional testing in software has allowed developers to continuously check for negative impacts on functionality throughout the iterative phases of development. This is not the case for User eXperience (UX), which has…

Artificial Intelligence · Computer Science 2021-04-14 Pedro M. Fernandes , Manuel Lopes , Rui Prada

LLM-powered agents are both a promising new technology and a source of complexity, where choices about models, tools, and prompting can affect their usefulness. While numerous benchmarks measure agent accuracy across domains, they mostly…

Large language models are redefining software engineering by implementing AI-powered techniques throughout the whole software development process, including requirement gathering, software architecture, code generation, testing, and…

Software Engineering · Computer Science 2024-06-11 Malik Abdul Sami , Muhammad Waseem , Zeeshan Rasheed , Mika Saari , Kari Systä , Pekka Abrahamsson

The growth of Large Language Model (LLM) technology has raised expectations for automated coding. However, software engineering is more than coding and is concerned with activities including maintenance and evolution of a project. In this…

Software Engineering · Computer Science 2025-12-09 Leonhard Applis , Yuntong Zhang , Shanchao Liang , Nan Jiang , Lin Tan , Abhik Roychoudhury

Unlike traditional automation tools or static LLM-based systems, agents combine decision-making and tool utilization to accomplish complex tasks, showing great potential in software engineering. However, existing studies largely focus on…

Software Engineering · Computer Science 2025-11-04 Zhuowen Yin , Cuifeng Gao , Chunsong Fan , Wenzhang Yang , Yinxing Xue , Lijun Zhang

Large Language Model (LLM) Agents have demonstrated remarkable capabilities in task automation and intelligent decision-making, driving the widespread adoption of agent development frameworks such as LangChain and AutoGen. However, these…

Artificial Intelligence · Computer Science 2025-10-10 Jiabin Tang , Tianyu Fan , Chao Huang

Reliability is key to realizing the promise of autonomous UI-Agents, multimodal agents that directly interact with apps in the same manner as humans, as users must be able to trust an agent to complete a given task. Current evaluations rely…

Usability is an increasing concern in open source software (OSS). Given the recent changes in the OSS landscape, it is imperative to examine the OSS contributors' current valued factors, practices, and challenges concerning usability. We…

Software Engineering · Computer Science 2020-07-15 Wenting Wang , Jinghui Cheng , Jin L. C. Guo

With ChatGPT-like large language models (LLM) prevailing in the community, how to evaluate the ability of LLMs is an open question. Existing evaluation methods suffer from following shortcomings: (1) constrained evaluation abilities, (2)…

Artificial Intelligence · Computer Science 2023-08-09 Jiaju Lin , Haoran Zhao , Aochi Zhang , Yiting Wu , Huqiuyue Ping , Qin Chen

Large language models (LLMs) have recently demonstrated remarkable capabilities to comprehend human intentions, engage in reasoning, and design planning-like behavior. To further unleash the power of LLMs to accomplish complex tasks, there…

Vision-language models have demonstrated impressive capabilities as computer-use agents (CUAs) capable of automating diverse computer tasks. As their commercial potential grows, critical details of the most capable CUA systems remain…

‹ Prev 1 2 3 10 Next ›