English
Related papers

Related papers: AnCoder: Anchored Code Generation via Discrete Dif…

200 papers

Code generation is increasingly critical for real-world applications. Still, diffusion-based large language models continue to struggle with this demand. Unlike free-form text, code requires syntactic precision; even minor structural…

Computation and Language · Computer Science 2026-01-07 Yiming Zeng , Jinghan Cao , Zexin Li , Yiming Chen , Tao Ren , Zhuochun Li , Dawei Xiang , Xidong Wu , Shangqian Gao , Tingting Yu

Discrete diffusion language models have emerged as a competitive alternative to auto-regressive language models, but training them efficiently under limited parameter and memory budgets remains challenging. Modern architectures are…

Computation and Language · Computer Science 2026-04-07 Zihao Wu , Haoming Yang , Juncheng Dong , Vahid Tarokh

Large language models generate code one token at a time. Their autoregressive generation process lacks the feedback of observing the program's output. Training LLMs to suggest edits directly can be challenging due to the scarcity of rich…

Artificial Intelligence · Computer Science 2024-06-03 Shreyas Kapur , Erik Jenner , Stuart Russell

Diffusion Language Models (DLMs) promise parallel generation and bidirectional context, yet they underperform autoregressive (AR) models in both likelihood modeling and generated text quality. We identify that this performance gap arises…

Computation and Language · Computer Science 2025-05-27 Litu Rout , Constantine Caramanis , Sanjay Shakkottai

Discrete diffusion models are a powerful, emerging paradigm for code generation. They construct programs through iterative refinement of partially corrupted token sequences and enable parallel token refinement. Importantly, this paradigm…

Computation and Language · Computer Science 2026-05-19 Lize Shao , Michael Cardei , Zichen Xie , Ferdinando Fioretto , Wenxi Wang

LLMs have become the mainstream approaches to code generation. Existing LLMs mainly employ autoregressive generation, i.e. generating code token-by-token from left to right. However, the underlying autoregressive generation has two…

Software Engineering · Computer Science 2025-11-04 Chengze Li , Yitong Zhang , Jia Li , Liyi Cai , Ge Li

Diffusion Language Models (DLMs) present a compelling alternative to autoregressive models, offering flexible, any-order infilling without specialized prompting design. However, their practical utility is blocked by a critical limitation:…

Computation and Language · Computer Science 2026-02-03 Zirui Wu , Lin Zheng , Zhihui Xie , Jiacheng Ye , Jiahui Gao , Shansan Gong , Yansong Feng , Zhenguo Li , Wei Bi , Guorui Zhou , Lingpeng Kong

Past work on story generation has demonstrated the usefulness of conditioning on a generation plan to generate coherent stories. However, these approaches have used heuristics or off-the-shelf models to first tag training stories with the…

Computation and Language · Computer Science 2020-10-08 Harsh Jhamtani , Taylor Berg-Kirkpatrick

Diffusion models have achieved remarkable performance in generative modeling, yet their theoretical foundations are often intricate, and the gap between mathematical formulations in papers and practical open-source implementations can be…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Cheng Yu

Diffusion-based language models (DLLMs) offer non-sequential, block-wise generation and richer data reuse compared to autoregressive (AR) models, but existing code DLLMs still lag behind strong AR baselines under comparable budgets. We…

Computation and Language · Computer Science 2026-01-26 Chenghao Fan , Wen Heng , Bo Li , Sichen Liu , Yuxuan Song , Jing Su , Xiaoye Qu , Kai Shen , Wei Wei

Diffusion models have achieved great success in modeling continuous data modalities such as images, audio, and video, but have seen limited use in discrete domains such as language. Recent attempts to adapt diffusion to language have…

Computation and Language · Computer Science 2023-11-08 Justin Lovelace , Varsha Kishore , Chao Wan , Eliot Shekhtman , Kilian Q. Weinberger

End-to-end multi-modal planning has become a transformative paradigm in autonomous driving, effectively addressing behavioral multi-modality and the generalization challenge in long-tail scenarios. We propose AnchDrive, a framework for…

Robotics · Computer Science 2025-09-29 Jinhao Chai , Anqing Jiang , Hao Jiang , Shiyi Mu , Zichong Gu , Hao Sun , Shugong Xu

The collection of large-scale and diverse robot demonstrations remains a major bottleneck for imitation learning, as real-world data acquisition is costly and simulators offer limited diversity and fidelity with pronounced sim-to-real gaps.…

Radiology report generation (RRG) aims to automatically produce clinically accurate textual reports from medical images. Existing methods predominantly rely on autoregressive (AR) language models, whose causal dependency structure restricts…

Artificial Intelligence · Computer Science 2026-05-19 Shiying Yu , Jielei Wang , Guoming Lu

Can continuous diffusion models bring the same performance breakthrough on natural language they did for image generation? To circumvent the discrete nature of text data, we can simply project tokens in a continuous space of embeddings, as…

Existing methods for code generation use code snippets as seed data, restricting the complexity and diversity of the synthesized data. In this paper, we introduce a novel feature tree-based synthesis framework, which revolves around…

Computation and Language · Computer Science 2025-10-10 Yaoxiang Wang , Haoling Li , Xin Zhang , Jie Wu , Xiao Liu , Wenxiang Hu , Zhongxin Guo , Yangyu Huang , Ying Xin , Yujiu Yang , Jinsong Su , Qi Chen , Scarlett Li

Diffusion language models theoretically allow for efficient parallel generation but are practically hindered by the "factorization barrier": the assumption that simultaneously predicted tokens are independent. This limitation forces a…

Machine Learning · Computer Science 2026-03-11 Ian Li , Zilei Shao , Benjie Wang , Rose Yu , Guy Van den Broeck , Anji Liu

We present Dream-Coder 7B, an open-source discrete diffusion language model for code generation that exhibits emergent any-order generation capabilities. Unlike traditional autoregressive (AR) models that decode strictly left-to-right,…

Computation and Language · Computer Science 2025-09-03 Zhihui Xie , Jiacheng Ye , Lin Zheng , Jiahui Gao , Jingwei Dong , Zirui Wu , Xueliang Zhao , Shansan Gong , Xin Jiang , Zhenguo Li , Lingpeng Kong

There has been a recent surge of interest in automating software engineering tasks using deep learning. This paper addresses the problem of code generation, where the goal is to generate target code given source code in a different language…

Machine Learning · Computer Science 2024-02-01 Sindhu Tipirneni , Ming Zhu , Chandan K. Reddy

Diffusion language models enable any-order generation and bidirectional conditioning, offering appealing flexibility for tasks such as infilling, rewriting, and self-correction. However, their formulation-predicting one part of a sequence…

Computation and Language · Computer Science 2026-01-21 Tianqi Du , Lizhe Fang , Weijie Yang , Chenheng Zhang , Zeming Wei , Yifei Wang , Yisen Wang
‹ Prev 1 2 3 10 Next ›