Make LLM Learn to Synthesize from Streaming Experiences through Feedback

Authors: Zhenlin Hu, Yan Wang, Zhen Bi, Zihao Xue, Bingyu Zhu, Longtao Huang, Xiongtao Zhang, Zeyu Yang, Zhixuan Chu, Jungang Lou

Artificial Intelligence2026-05v1license

View on arXiv ↗ PDF ↗

Abstract

Large language models (LLMs) have been widely adopted for synthetic data generation, significantly reducing annotation costs. However, most existing studies treat synthesis as a set of isolated tasks and overlook a more fundamental question: whether a model can learn to synthesize by accumulating experience from past tasks and transferring it to future ones. In this work, we introduce StreamSynth, a new setting in which synthesis tasks arrive sequentially and experience from historical tasks provides informative signals for future synthesis. To address this setting, we propose SynLearner, a general framework that enables synthesis models to acquire reusable synthesis experience over a task stream. Instead of generating data independently for each task, SynLearner encourages the model to explore diverse synthesis patterns, learn from feedback, and balance sample quality with set-level diversity as tasks evolve. Extensive experiments across multiple benchmarks show that SynLearner effectively leverages experience from earlier tasks to improve synthesis performance on later ones, exhibiting consistent cross-task transferability. These findings provide evidence for the feasibility of StreamSynth and highlight synthetic data generation as an experience-driven process that can benefit from task streams.

Cite

@article{arxiv.2605.29940,
  title  = {Make LLM Learn to Synthesize from Streaming Experiences through Feedback},
  author = {Zhenlin Hu and Yan Wang and Zhen Bi and Zihao Xue and Bingyu Zhu and Longtao Huang and Xiongtao Zhang and Zeyu Yang and Zhixuan Chu and Jungang Lou},
  journal= {arXiv preprint arXiv:2605.29940},
  year   = {2026}
}

← Artificial Intelligence · Home