Related papers: DFEE: Interactive DataFlow Execution and Evaluatio…

Simplifying Dataflow Dialogue Design

In \citep{andreas2020task-oriented}, a dataflow (DF) based dialogue system was introduced, showing clear advantages compared to many commonly used current systems. This was accompanied by the release of SMCalFlow, a practically relevant,…

Computation and Language · Computer Science 2022-06-29 Joram Meron

Task-Oriented Dialogue as Dataflow Synthesis

We describe an approach to task-oriented dialogue in which dialogue state is represented as a dataflow graph. A dialogue agent maps each user utterance to a program that extends this graph. Programs include metacomputation operators for…

Computation and Language · Computer Science 2021-02-12 Semantic Machines, Jacob Andreas, John Bufe, David Burkett, Charles Chen, Josh Clausman, Jean Crawford, Kate Crim, Jordan DeLoach, Leah Dorner, Jason Eisner, Hao Fang, Alan Guo, David Hall, Kristin Hayes, Kellie Hill, Diana Ho, Wendy Iwaszuk, Smriti Jha, Dan Klein, Jayant Krishnamurthy, Theo Lanman, Percy Liang, Christopher H Lin, Ilya Lintsbakh, Andy McGovern, Aleksandr Nisnevich, Adam Pauls, Dmitrij Petters, Brent Read, Dan Roth, Subhro Roy, Jesse Rusak, Beth Short, Div Slomin, Ben Snyder, Stephon Striplin, Yu Su, Zachary Tellman, Sam Thomson, Andrei Vorobev, Izabela Witoszko, Jason Wolfe, Abby Wray, Yuchen Zhang, Alexander Zotov

DFlow: Diverse Dialogue Flow Simulation with Large Language Models

Developing language model-based dialogue agents requires effective data to train models that can follow specific task logic. However, most existing data simulation methods focus on increasing diversity in language, topics, or dialogue acts…

Computation and Language · Computer Science 2025-03-04 Wanyu Du, Song Feng, James Gung, Lijia Sun, Yi Zhang, Saab Mansour, Yanjun Qi

A Low-Cost, Controllable and Interpretable Task-Oriented Chatbot: With Real-World After-Sale Services as Example

Though widely used in industry, traditional task-oriented dialogue systems suffer from three bottlenecks: (i) difficult ontology construction (e.g., intents and slots); (ii) poor controllability and interpretability; (iii)…

Computation and Language · Computer Science 2022-05-16 Xiangyu Xi, Chenxu Lv, Yuncheng Hua, Wei Ye, Chaobo Sun, Shuaipeng Liu, Fan Yang, Guanglu Wan

Dialog-based Automation of Decision Making in Processes

The use of chatbots has spread, generating great interest in the industry for the possibility of automating tasks within the execution of their processes. The implementation of chatbots, however simple, is a complex endeavor that involves…

Software Engineering · Computer Science 2021-09-03 Bedilia Estrada-Torres, Adela del-Río-Ortega, Manuel Resinas

Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances

Nowadays, open-domain dialogue models can generate acceptable responses according to the historical context based on the large-scale pre-trained language models. However, they generally concatenate the dialogue history directly as the model…

Computation and Language · Computer Science 2021-06-07 Zekang Li, Jinchao Zhang, Zhengcong Fei, Yang Feng, Jie Zhou

DeepEye: A Steerable Self-driving Data Agent System

Large Language Models (LLMs) have revolutionized natural language interaction with data. The "holy grail" of data analytics is to build autonomous Data Agents that can self-drive complex data analysis workflows. However, current…

Databases · Computer Science 2026-04-01 Boyan Li, Yiran Peng, Yupeng Xie, Sirong Lu, Yizhang Zhu, Xing Mu, Xinyu Liu, Yuyu Luo

Towards Efficient Data-flow Test Data Generation

Data-flow testing (DFT) aims to detect potential data interaction anomalies by focusing on the points at which variables receive values and the points at which these values are used. Such test objectives are referred to as def-use…

Software Engineering · Computer Science 2019-04-02 Ting Su, Chengyu Zhang, Yichen Yan, Lingling Fan, Geguang Pu, Yang Liu, Zhoulai Fu, Zhendong Su

FlowEval: A Consensus-Based Dialogue Evaluation Framework Using Segment Act Flows

Despite recent progress in open-domain dialogue evaluation, how to develop automatic metrics remains an open problem. We explore the potential of dialogue evaluation featuring dialog act information, which was hardly explicitly modeled in…

Computation and Language · Computer Science 2022-11-04 Jianqiao Zhao, Yanyang Li, Wanyu Du, Yangfeng Ji, Dong Yu, Michael R. Lyu, Liwei Wang

Presentation Proposal: Towards Efficient Data-flow Test Data Generation Using KLEE

Dataflow coverage, one of the white-box testing criteria, focuses on the relations between variable definitions and their uses. Several empirical studies have proved data-flow testing is more effective than control-flow testing. However,…

Software Engineering · Computer Science 2019-03-20 Chengyu Zhang, Ting Su, Yichen Yan, Ke Wu, Geguang Pu

Comprehensive Framework for Evaluating Conversational AI Chatbots

Conversational AI chatbots are transforming industries by streamlining customer service, automating transactions, and enhancing user engagement. However, evaluating these systems remains a challenge, particularly in financial services,…

Computers and Society · Computer Science 2025-02-11 Shailja Gupta, Rajesh Ranjan, Surya Narayan Singh

EmbodiedClaw: Conversational Workflow Execution for Embodied AI Development

Embodied AI research is increasingly moving beyond single-task, single-environment policy learning toward multi-task, multi-scene, and multi-model settings. This shift substantially increases the engineering overhead and development time…

Robotics · Computer Science 2026-04-16 Xueyang Zhou, Yihan Sun, Xijie Gong, Guiyao Tie, Pan Zhou, Lichao Sun, Yongchao Chen

MultiWOZ-DF -- A Dataflow implementation of the MultiWOZ dataset

Semantic Machines (SM) have introduced the use of the dataflow (DF) paradigm to dialogue modelling, using computational graphs to hierarchically represent user requests, data, and the dialogue history [Semantic Machines et al. 2020].…

Computation and Language · Computer Science 2022-11-07 Joram Meron, Victor Guimarães

BotEval: Facilitating Interactive Human Evaluation

Following the rapid progress in natural language processing (NLP) models, language models are applied to increasingly more complex interactive tasks such as negotiations and conversation moderations. Having human evaluators directly…

Computation and Language · Computer Science 2024-07-26 Hyundong Cho, Thamme Gowda, Yuyang Huang, Zixun Lu, Tianli Tong, Jonathan May

Causify DataFlow: A Framework For High-performance Machine Learning Stream Computing

We present DataFlow, a computational framework for building, testing, and deploying high-performance machine learning systems on unbounded time-series data. Traditional data science workflows assume finite datasets and require substantial…

Machine Learning · Computer Science 2026-01-01 Giacinto Paolo Saggese, Paul Smith

DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues

Existing function-calling benchmarks focus on single-turn interactions. However, they overlook the complexity of real-world scenarios. To quantify how existing benchmarks address practical applications, we introduce DICE-SCORE, a metric…

Computation and Language · Computer Science 2025-07-03 Kyochul Jang, Donghyeon Lee, Kyusik Kim, Dongseok Heo, Taewhoo Lee, Woojeong Kim, Bongwon Suh

Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents

Automated service agents require well-structured workflows to provide consistent and accurate responses to customer queries. However, these workflows are often undocumented, and their automatic extraction from conversations remains…

Computation and Language · Computer Science 2025-02-25 Prafulla Kumar Choubey, Xiangyu Peng, Shilpa Bhagavath, Caiming Xiong, Shiva Kumar Pentyala, Chien-Sheng Wu

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

The rapidly growing demand for high-quality data in Large Language Models (LLMs) has intensified the need for scalable, reliable, and semantically rich data preparation pipelines. However, current practices remain dominated by ad-hoc…

Machine Learning · Computer Science 2025-12-19 Hao Liang, Xiaochen Ma, Zhou Liu, Zhen Hao Wong, Zhengyang Zhao, Zimo Meng, Runming He, Chengyu Shen, Qifeng Cai, Zhaoyang Han, Meiyi Qiang, Yalin Feng, Tianyi Bai, Zewei Pan, Ziyi Guo, Yizhen Jiang, Jingwen Deng, Qijie You, Peichao Lai, Tianyu Guo, Chi Hsu Tsai, Hengyi Feng, Rui Hu, Wenkai Yu, Junbo Niu, Bohan Zeng, Ruichuan An, Lu Ma, Jihao Huang, Yaowei Zheng, Conghui He, Linpeng Tang, Bin Cui, Weinan E, Wentao Zhang

UniDial-EvalKit: A Unified Toolkit for Evaluating Multi-Faceted Conversational Abilities

Benchmarking AI systems in multi-turn interactive scenarios is essential for understanding their practical capabilities in real-world applications. However, existing evaluation protocols are highly heterogeneous, differing significantly in…

Computation and Language · Computer Science 2026-03-25 Qi Jia, Haodong Zhao, Dun Pei, Xiujie Song, Shibo Wang, Zijian Chen, Zicheng Zhang, Xiangyang Zhu, Guangtao Zhai

Democratizing Chatbot Debugging: A Computational Framework for Evaluating and Explaining Inappropriate Chatbot Responses

Evaluating and understanding the inappropriateness of chatbot behaviors can be challenging, particularly for chatbot designers without technical backgrounds. To democratize the debugging process of chatbot misbehaviors for non-technical…

Human-Computer Interaction · Computer Science 2023-06-21 Xu Han, Michelle Zhou, Yichen Wang, Wenxi Chen, Tom Yeh