Related papers: STaRR: Spatial-Temporal Token-Dynamics-Aware Respo…

LADR: Locality-Aware Dynamic Rescue for Efficient Text-to-Image Generation with Diffusion Large Language Models

Discrete Diffusion Language Models have emerged as a compelling paradigm for unified multimodal generation, yet their deployment is hindered by high inference latency arising from iterative decoding. Existing acceleration strategies often…

Computer Vision and Pattern Recognition · Computer Science 2026-04-13 Chenglin Wang , Yucheng Zhou , Shawn Chen , Tao Wang , Kai Zhang

Decoding Strategies for Diffusion-Based ASR: A Systematic Evaluation of Confidence-Based Thresholding

While LLM-based Automatic Speech Recognition (ASR) achieves high accuracy, its speed is limited by sequential autoregressive decoding. Diffusion Language Models (DLMs) offer a parallel alternative, yet their decoding strategies remain…

Audio and Speech Processing · Electrical Eng. & Systems 2026-05-29 Jeong Hun Yeo , Minsu Kim , Hyeongseop Rha , Yong Man Ro

STaR: Towards Effective and Stable Table Reasoning via Slow-Thinking Large Language Models

Table reasoning with large language models (LLMs) plays a critical role in building intelligent systems capable of understanding and analyzing tabular data. Despite recent progress, existing methods still face key limitations: their…

Artificial Intelligence · Computer Science 2026-01-27 Huajian Zhang , Mingyue Cheng , Yucong Luo , Xiaoyu Tao

Stability-Weighted Decoding for Diffusion Language Models

Diffusion large language models (dLLMs) enable parallel text generation by iteratively denoising a fully masked sequence, unmasking a subset of masked tokens at each step. Existing decoding strategies rely on static confidence metrics…

Computation and Language · Computer Science 2026-04-21 Yue Wu , Jian Huang

Saber: An Efficient Sampling with Adaptive Acceleration and Backtracking Enhanced Remasking for Diffusion Language Model

Diffusion language models (DLMs) are emerging as a compelling alternative to the dominant autoregressive paradigm, offering inherent advantages in parallel generation and bidirectional context modeling. However, for the tasks with strict…

Artificial Intelligence · Computer Science 2026-04-30 Yihong Dong , Zhaoyu Ma , Xue Jiang , Zhiyuan Fan , Jiaru Qian , Yongmin Li , Jianha Xiao , Zhi Jin , Rongyu Cao , Binhua Li , Fei Huang , Yongbin Li , Ge Li

Adaptive Steering and Remasking for Safe Generation in Diffusion Language Models

Diffusion Language Models (DLMs) provide a promising alternative to autoregressive language models by generating text through iterative denoising and bidirectional refinement. However, this iterative generation paradigm also introduces…

Computation and Language · Computer Science 2026-05-14 Yejin Lee , Yo-Sub Han

Efficient and Stable Reinforcement Learning for Diffusion Language Models

Reinforcement Learning (RL) is crucial for unlocking the complex reasoning capabilities of Diffusion-based Large Language Models (dLLMs). However, applying RL to dLLMs faces unique challenges in efficiency and stability. To address these…

Artificial Intelligence · Computer Science 2026-02-10 Jiawei Liu , Xiting Wang , Yuanyuan Zhong , Defu Lian , Yu Yang

dLLM-ASR: A Faster Diffusion LLM-based Framework for Speech Recognition

Automatic speech recognition (ASR) systems based on large language models (LLMs) achieve superior performance by leveraging pretrained LLMs as decoders, but their token-by-token generation mechanism leads to inference latency that grows…

Sound · Computer Science 2026-01-27 Wenjie Tian , Bingshen Mu , Guobin Ma , Xuelong Geng , Zhixian Zhao , Lei Xie

STDec: Spatio-Temporal Stability Guided Decoding for dLLMs

Diffusion Large Language Models (dLLMs) have achieved rapid progress, viewed as a promising alternative to the autoregressive paradigm. However, most dLLM decoders still adopt a global confidence threshold, and do not explicitly model local…

Computation and Language · Computer Science 2026-04-09 Yuzhe Chen , Jiale Cao , Xuyang Liu , Jin Xie , Aiping Yang , Yanwei Pang

ILRR: Inference-Time Steering Method for Masked Diffusion Language Models

Discrete Diffusion Language Models (DLMs) offer a promising non-autoregressive alternative for text generation, yet effective mechanisms for inference-time control remain relatively underexplored. Existing approaches include sampling-level…

Computation and Language · Computer Science 2026-01-30 Eden Avrahami , Eliya Nachmani

Latent Refinement Decoding: Enhancing Diffusion-Based Language Models by Refining Belief States

Autoregressive (AR) models remain the standard for natural language generation but still suffer from high latency due to strictly sequential decoding. Recent diffusion-inspired approaches, such as LlaDA and Dream, mitigate this by…

Computation and Language · Computer Science 2025-10-16 Qinglin Zhu , Yizhen Yao , Runcong Zhao , Yanzheng Xiang , Amrutha Saseendran , Chen Jin , Philip Teare , Bin Liang , Yulan He , Lin Gui

DLM-SWAI: Steering Diffusion Language Models Before They Unmask

Steering language model generation toward desired textual properties is essential for practical deployment, and inference-time methods are particularly appealing because they enable controllable generation without retraining. Recent work…

Computation and Language · Computer Science 2026-05-29 Hyeseon An , Yo-Sub Han

$R^2$-dLLM: Accelerating Diffusion Large Language Models via Spatio-Temporal Redundancy Reduction

Diffusion Large Language Models (dLLMs) have emerged as a promising alternative to autoregressive generation by enabling parallel token prediction. However, practical dLLM decoding still suffers from high inference latency, which limits…

Computation and Language · Computer Science 2026-04-22 Zhenbang Du , Kejing Xia , Xinrui Zhong , Yonggan Fu , Nicolai Oswald , Binfei Ji , Brucek Khailany , Pavlo Molchanov , Yingyan Lin

STEER: Inference-Time Risk Control via Constrained Quality-Diversity Search

Large Language Models (LLMs) trained for average correctness often exhibit mode collapse, producing narrow decision behaviors on tasks where multiple responses may be reasonable. This limitation is particularly problematic in ordinal…

Artificial Intelligence · Computer Science 2026-02-04 Eric Yang , Jong Ha Lee , Jonathan Amar , Elissa Ye , Yugang Jia

STARS: Synchronous Token Alignment for Robust Supervision in Large Language Models

Aligning large language models (LLMs) with human values is crucial for safe deployment. Inference-time techniques offer granular control over generation; however, they rely on model uncertainty, meaning an internal estimate of how likely…

Computation and Language · Computer Science 2026-03-04 Mohammad Atif Quamar , Mohammad Areeb , Mikhail Kuznetsov , Muslum Ozgur Ozmen , Z. Berkay Celik

TAD: Temporal-Aware Trajectory Self-Distillation for Fast and Accurate Diffusion LLM

Diffusion large language models (dLLMs) offer a promising paradigm for parallel text generation, but in practice they face an accuracy-parallelism trade-off, where increasing tokens per forward (TPF) often degrades generation quality.…

Computation and Language · Computer Science 2026-05-12 Haoyang Zhou , Li Kong , Shijie Ren , Xiting Wang , Shuang Liang , Guowei Wang , Zhenxuan Pan

STEAD: Robust Provably Secure Linguistic Steganography with Diffusion Language Model

Recent provably secure linguistic steganography (PSLS) methods rely on mainstream autoregressive language models (ARMs) to address historically challenging tasks, that is, to disguise covert communication as ``innocuous'' natural language…

Cryptography and Security · Computer Science 2026-01-22 Yuang Qi , Na Zhao , Qiyi Yao , Benlong Wu , Weiming Zhang , Nenghai Yu , Kejiang Chen

Search or Accelerate: Confidence-Switched Position Beam Search for Diffusion Language Models

Diffusion Language Models (DLMs) generate text by iteratively denoising a masked sequence, repeatedly deciding which positions to commit at each step. Standard decoding follows a greedy rule: unmask the most confident positions, yet this…

Computation and Language · Computer Science 2026-02-26 Mingyu Cao , Alvaro H. C. Correia , Christos Louizos , Shiwei Liu , Lu Yin

STEAR: Layer-Aware Spatiotemporal Evidence Intervention for Hallucination Mitigation in Video Large Language Models

Video Large Language Models (Video-LLMs) remain prone to spatiotemporal hallucinations, often generating visually unsupported details or incorrect temporal relations. Existing mitigation methods typically treat hallucination as a uniform…

Computer Vision and Pattern Recognition · Computer Science 2026-04-06 Linfeng Fan , Yuan Tian , Ziwei Li , Zhiwu Lu

FlashDLM: Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion

Diffusion language models offer parallel token generation and inherent bidirectionality, promising more efficient and powerful sequence modeling compared to autoregressive approaches. However, state-of-the-art diffusion models (e.g., Dream…

Computation and Language · Computer Science 2025-10-10 Zhanqiu Hu , Jian Meng , Yash Akhauri , Mohamed S. Abdelfattah , Jae-sun Seo , Zhiru Zhang , Udit Gupta