English

TaskAudit: Detecting Functiona11ity Errors in Mobile Apps via Agentic Task Execution

Human-Computer Interaction 2026-02-06 v2 Software Engineering

Abstract

Accessibility checkers are tools in support of accessible app development, and their use is encouraged by accessibility best practices. However, most current checkers evaluate static or mechanically-generated contexts, failing to capture common accessibility errors impacting mobile app functionality. In this work, we define functiona11ity errors as accessibility barriers that only manifest through interaction (i.e., named according to a blend of "functionality" and "accessibility"). We introduce TaskAudit, which comprises three components: a Task Generator that constructs interactive tasks from app screens, a Task Executor that uses agents with a screen reader proxy to perform these tasks, and an Accessibility Analyzer that detects and reports accessibility errors by examining interaction traces. Our evaluation on real-world apps shows that TaskAudit detects 48 functiona11ity errors from 54 app screens, compared to between 4 and 20 with existing checkers. Our analysis demonstrates common error patterns that TaskAudit can detect in addition to those from prior work, including label-functionality mismatch, cluttered navigation, and inappropriate feedback.

Keywords

Cite

@article{arxiv.2510.12972,
  title  = {TaskAudit: Detecting Functiona11ity Errors in Mobile Apps via Agentic Task Execution},
  author = {Mingyuan Zhong and Xia Chen and Davin Win Kyi and Chen Li and James Fogarty and Jacob O. Wobbrock},
  journal= {arXiv preprint arXiv:2510.12972},
  year   = {2026}
}

Comments

CHI 2026

R2 v1 2026-07-01T06:37:41.244Z