English

AnyTOD: A Programmable Task-Oriented Dialog System

Computation and Language 2023-02-14 v2

Abstract

We propose AnyTOD, an end-to-end, zero-shot task-oriented dialog (TOD) system capable of handling unseen tasks without task-specific training. We view TOD as a program executed by a language model (LM), where program logic and ontology is provided by a designer as a schema. To enable generalization to unseen schemas and programs without prior training, AnyTOD adopts a neuro-symbolic approach. A neural LM keeps track of events occurring during a conversation and a symbolic program implementing the dialog policy is executed to recommend next actions AnyTOD should take. This approach drastically reduces data annotation and model training requirements, addressing the enduring challenge of rapidly adapting a TOD system to unseen tasks and domains. We demonstrate state-of-the-art results on STAR, ABCD and SGD benchmarks. We also demonstrate strong zero-shot transfer ability in low-resource settings, such as zero-shot on MultiWOZ. In addition, we release STARv2, an updated version of the STAR dataset with richer annotations, for benchmarking zero-shot end-to-end TOD models.

Keywords

Cite

@article{arxiv.2212.09939,
  title  = {AnyTOD: A Programmable Task-Oriented Dialog System},
  author = {Jeffrey Zhao and Yuan Cao and Raghav Gupta and Harrison Lee and Abhinav Rastogi and Mingqiu Wang and Hagen Soltau and Izhak Shafran and Yonghui Wu},
  journal= {arXiv preprint arXiv:2212.09939},
  year   = {2023}
}

Comments

v2, update with Multiwoz, SGD results

R2 v1 2026-06-28T07:43:36.680Z