English

Autoregressive, Yet Revisable: In Decoding Revision for Secure Code Generation

Software Engineering 2026-05-07 v2 Artificial Intelligence

Abstract

Large Language Model (LLM) based code generation is predominantly formulated as a strictly monotonic process, appending tokens linearly to an immutable prefix. This formulation contrasts to the cognitive process of programming, which is inherently interleaved with forward generation and on-the-fly revision. While prior works attempt to introduce revision via post-hoc agents or external static tools, they either suffer from high latency or fail to leverage the model's intrinsic semantic reasoning. In this paper, we propose Stream of Revision, a paradigm shift that elevates code generation from a monotonic stream to a dynamic, self-correcting trajectory by leveraging model's intrinsic capabilities. We introduce specific action tokens that enable the model to seamlessly backtrack and edit its own history within a single forward pass. By internalizing the revision loop, our framework Stream of Revision allows the model to activate its latent capabilities just-in-time without external dependencies. Empirical results on secure code generation show that Stream of Revision significantly reduces vulnerabilities with minimal inference overhead.

Keywords

Cite

@article{arxiv.2602.01187,
  title  = {Autoregressive, Yet Revisable: In Decoding Revision for Secure Code Generation},
  author = {Chengran Yang and Zichao Wei and Heminghao Deng and Jinfeng Jiang and Zhensu Sun and Ting Zhang and Tianyi Wu and Ming Wen and David Lo},
  journal= {arXiv preprint arXiv:2602.01187},
  year   = {2026}
}