Related papers: Diffuse Thinking: Exploring Diffusion Language Mod…

d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

Recent large language models (LLMs) have demonstrated strong reasoning capabilities that benefits from online reinforcement learning (RL). These capabilities have primarily been demonstrated within the left-to-right autoregressive (AR)…

Computation and Language · Computer Science 2025-06-04 Siyan Zhao , Devaansh Gupta , Qinqing Zheng , Aditya Grover

DiffuRank: Effective Document Reranking with Diffusion Language Models

Recent advances in large language models (LLMs) have inspired new paradigms for document reranking. While this paradigm better exploits the reasoning and contextual understanding capabilities of LLMs, most existing LLM-based rerankers rely…

Information Retrieval · Computer Science 2026-02-16 Qi Liu , Kun Ai , Jiaxin Mao , Yanzhao Zhang , Mingxin Li , Dingkun Long , Pengjun Xie , Fengbin Zhu , Ji-Rong Wen

LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning

Large Language Models (LLMs) demonstrate their reasoning ability through chain-of-thought (CoT) generation. However, LLM's autoregressive decoding may limit the ability to revisit and refine earlier tokens in a holistic manner, which can…

Machine Learning · Computer Science 2026-04-24 Haoqiang Kang , Yizhe Zhang , Nikki Lijing Kuang , Nicklas Majamaki , Navdeep Jaitly , Yi-An Ma , Lianhui Qin

Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models

Recently, diffusion models have garnered significant interest in the field of text processing due to their many potential advantages compared to conventional autoregressive models. In this work, we propose Diffusion-of-Thought (DoT), a…

Computation and Language · Computer Science 2024-12-06 Jiacheng Ye , Shansan Gong , Liheng Chen , Lin Zheng , Jiahui Gao , Han Shi , Chuan Wu , Xin Jiang , Zhenguo Li , Wei Bi , Lingpeng Kong

Reasoning with Autoregressive-Diffusion Collaborative Thoughts

Autoregressive and diffusion models represent two complementary generative paradigms. Autoregressive models excel at sequential planning and constraint composition, yet struggle with tasks that require explicit spatial or physical…

Artificial Intelligence · Computer Science 2026-02-03 Mu Yuan , Liekang Zeng , Guoliang Xing , Lan Zhang , Yunhao Liu

Refining Answer Distributions for Improved Large Language Model Reasoning

Large Language Models (LLMs) have exhibited an impressive capability to perform reasoning tasks, especially if they are encouraged to generate a sequence of intermediate steps. Reasoning performance can be improved by suitably combining…

Computation and Language · Computer Science 2025-04-11 Soumyasundar Pal , Didier Chételat , Yingxue Zhang , Mark Coates

A Survey on Diffusion Language Models

Diffusion Language Models (DLMs) are rapidly emerging as a powerful and promising alternative to the dominant autoregressive (AR) paradigm. By generating tokens in parallel through an iterative denoising process, DLMs possess inherent…

Computation and Language · Computer Science 2025-12-08 Tianyi Li , Mingda Chen , Bowei Guo , Zhiqiang Shen

Diffusion Language Models Generation Can Be Halted Early

Diffusion Language models (DLMs) are a promising avenue for text generation due to their practical properties on tractable controllable generation. They also have the advantage of not having to predict text autoregressively. However,…

Machine Learning · Computer Science 2024-02-13 Sofia Maria Lo Cicero Vaina , Nikita Balagansky , Daniil Gavrilov

Discrete Diffusion in Large Language and Multimodal Models: A Survey

In this work, we provide a systematic survey of Discrete Diffusion Language Models (dLLMs) and Discrete Diffusion Multimodal Language Models (dMLLMs). Unlike autoregressive (AR) models, dLLMs and dMLLMs adopt a multi-token, parallel…

Machine Learning · Computer Science 2025-09-22 Runpeng Yu , Qi Li , Xinchao Wang

Beyond Autoregression: An Empirical Study of Diffusion Large Language Models for Code Generation

LLMs have become the mainstream approaches to code generation. Existing LLMs mainly employ autoregressive generation, i.e. generating code token-by-token from left to right. However, the underlying autoregressive generation has two…

Software Engineering · Computer Science 2025-11-04 Chengze Li , Yitong Zhang , Jia Li , Liyi Cai , Ge Li

Thinking Diffusion: Penalize and Guide Visual-Grounded Reasoning in Diffusion Multimodal Language Models

Diffusion large language models (dLLMs) are emerging as promising alternatives to autoregressive (AR) LLMs. Recently, this paradigm has been extended to multimodal tasks, leading to the development of diffusion multimodal large language…

Artificial Intelligence · Computer Science 2026-04-08 Keuntae Kim , Mingyu Kang , Yong Suk Choi

Planner and Executor: Collaboration between Discrete Diffusion And Autoregressive Models in Reasoning

Current autoregressive language models (ARMs) achieve high accuracy but require long token sequences, making them costly. Discrete diffusion language models (DDLMs) enable parallel and flexible generation within a fixed number of steps and…

Computation and Language · Computer Science 2025-10-21 Lina Berrayana , Ahmed Heakl , Muhammad Abdullah Sohail , Thomas Hofmann , Salman Khan , Wei Chen

Advancing Reasoning in Large Language Models: Promising Methods and Approaches

Large Language Models (LLMs) have succeeded remarkably in various natural language processing (NLP) tasks, yet their reasoning capabilities remain a fundamental challenge. While LLMs exhibit impressive fluency and factual recall, their…

Computation and Language · Computer Science 2025-05-29 Avinash Patil , Aryan Jadon

Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models

Recently, Large Reasoning Models (LRMs) have gradually become a research hotspot due to their outstanding performance in handling complex tasks. Among them, DeepSeek R1 has garnered significant attention for its exceptional performance and…

Artificial Intelligence · Computer Science 2025-08-05 Linan Yue , Yichao Du , Yizhi Wang , Weibo Gao , Fangzhou Yao , Li Wang , Ye Liu , Ziyu Xu , Qi Liu , Shimin Di , Min-Ling Zhang

Active Confusion Expression in Large Language Models: Leveraging World Models toward Better Social Reasoning

While large language models (LLMs) excel in mathematical and code reasoning, we observe they struggle with social reasoning tasks, exhibiting cognitive confusion, logical inconsistencies, and conflation between objective world states and…

Computation and Language · Computer Science 2025-10-14 Jialu Du , Guiyang Hou , Yihui Fu , Chen Wu , Wenqi Zhang , Yongliang Shen , Weiming Lu

DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

While recent Multimodal Large Language Models (MLLMs) have attained significant strides in multimodal reasoning, their reasoning processes remain predominantly text-centric, leading to suboptimal performance in complex long-horizon,…

Computer Vision and Pattern Recognition · Computer Science 2026-01-01 Zefeng He , Xiaoye Qu , Yafu Li , Tong Zhu , Siyuan Huang , Yu Cheng

Test-Time Scaling with Diffusion Language Models via Reward-Guided Stitching

Reasoning with large language models often benefits from generating multiple chains-of-thought, but existing aggregation strategies are typically trajectory-level (e.g., selecting the best trace or voting on the final answer), discarding…

Computation and Language · Computer Science 2026-02-27 Roy Miles , Aysim Toker , Andreea-Maria Oncescu , Songcen Xu , Jiankang Deng , Ismail Elezi

Reasoning in Diffusion Large Language Models is Concentrated in Dynamic Confusion Zones

Diffusion Large Language Models (dLLMs) are rapidly emerging alongside autoregressive models as a powerful paradigm for complex reasoning, with reinforcement learning increasingly used for downstream alignment. Existing trajectory-based RL…

Machine Learning · Computer Science 2025-11-20 Ranfei Chen , Ming Chen , Kaifei Wang

Beyond Surface Reasoning: Unveiling the True Long Chain-of-Thought Capacity of Diffusion Large Language Models

Recently, Diffusion Large Language Models (DLLMs) have offered high throughput and effective sequential reasoning, making them a competitive alternative to autoregressive LLMs (ALLMs). However, parallel decoding, which enables simultaneous…

Computation and Language · Computer Science 2025-10-13 Qiguang Chen , Hanjing Li , Libo Qin , Dengyun Peng , Jinhao Liu , Jiangyi Wang , Chengyue Wu , Xie Chen , Yantao Du , Wanxiang Che

Thinker-DDM: Modeling Deliberation for Machine Translation with a Drift-Diffusion Process

Large language models (LLMs) have demonstrated promising potential in various downstream tasks, including machine translation. However, prior work on LLM-based machine translation has mainly focused on better utilizing training data,…

Computation and Language · Computer Science 2025-08-05 Hongbin Na , Zimu Wang , Mieradilijiang Maimaiti , Tong Chen , Wei Wang , Tao Shen , Ling Chen