PyTOD: Programmable Task-Oriented Dialogue with Execution Feedback

Alexandru Coca; Bo-Hsiang Tseng; Pete Boothroyd; Jianpeng Cheng; Mark Gaynor; Zhenxing Zhang; Joe Stacey; Tristan Guigue; Héctor Martinez Alonso; Diarmuid Ó Séaghdha; Anders Johannsen

PyTOD: Programmable Task-Oriented Dialogue with Execution Feedback

Computation and Language 2025-08-22 v1

Authors: Alexandru Coca , Bo-Hsiang Tseng , Pete Boothroyd , Jianpeng Cheng , Mark Gaynor , Zhenxing Zhang , Joe Stacey , Tristan Guigue , Héctor Martinez Alonso , Diarmuid Ó Séaghdha , Anders Johannsen

View on arXiv ↗ PDF ↗

Abstract

Programmable task-oriented dialogue (TOD) agents enable language models to follow structured dialogue policies, but their effectiveness hinges on accurate state tracking. We present PyTOD, an agent that generates executable code to track dialogue state and uses policy and execution feedback for efficient error correction. To this end, PyTOD employs a simple constrained decoding approach, using a language model instead of grammar rules to follow API schemata. This leads to state-of-the-art state tracking performance on the challenging SGD benchmark. Our experiments show that PyTOD surpasses strong baselines in both accuracy and robust user goal estimation as the dialogue progresses, demonstrating the effectiveness of execution-aware state tracking.

Keywords

dialogue systems multi-agent systems programming languages

Cite

@article{arxiv.2508.15456,
  title  = {PyTOD: Programmable Task-Oriented Dialogue with Execution Feedback},
  author = {Alexandru Coca and Bo-Hsiang Tseng and Pete Boothroyd and Jianpeng Cheng and Mark Gaynor and Zhenxing Zhang and Joe Stacey and Tristan Guigue and Héctor Martinez Alonso and Diarmuid Ó Séaghdha and Anders Johannsen},
  journal= {arXiv preprint arXiv:2508.15456},
  year   = {2025}
}

Comments

20 pages, 12 figures. To appear at SIGDIAL 2025

PyTOD: Programmable Task-Oriented Dialogue with Execution Feedback

Abstract

Keywords

Cite

Comments

Related papers