English
Related papers

Related papers: STaRR: Spatial-Temporal Token-Dynamics-Aware Respo…

200 papers

Discrete Diffusion Language Models have emerged as a compelling paradigm for unified multimodal generation, yet their deployment is hindered by high inference latency arising from iterative decoding. Existing acceleration strategies often…

Computer Vision and Pattern Recognition · Computer Science 2026-04-13 Chenglin Wang , Yucheng Zhou , Shawn Chen , Tao Wang , Kai Zhang

While LLM-based Automatic Speech Recognition (ASR) achieves high accuracy, its speed is limited by sequential autoregressive decoding. Diffusion Language Models (DLMs) offer a parallel alternative, yet their decoding strategies remain…

Audio and Speech Processing · Electrical Eng. & Systems 2026-05-29 Jeong Hun Yeo , Minsu Kim , Hyeongseop Rha , Yong Man Ro

Table reasoning with large language models (LLMs) plays a critical role in building intelligent systems capable of understanding and analyzing tabular data. Despite recent progress, existing methods still face key limitations: their…

Artificial Intelligence · Computer Science 2026-01-27 Huajian Zhang , Mingyue Cheng , Yucong Luo , Xiaoyu Tao

Diffusion large language models (dLLMs) enable parallel text generation by iteratively denoising a fully masked sequence, unmasking a subset of masked tokens at each step. Existing decoding strategies rely on static confidence metrics…

Computation and Language · Computer Science 2026-04-21 Yue Wu , Jian Huang

Diffusion language models (DLMs) are emerging as a compelling alternative to the dominant autoregressive paradigm, offering inherent advantages in parallel generation and bidirectional context modeling. However, for the tasks with strict…

Artificial Intelligence · Computer Science 2026-04-30 Yihong Dong , Zhaoyu Ma , Xue Jiang , Zhiyuan Fan , Jiaru Qian , Yongmin Li , Jianha Xiao , Zhi Jin , Rongyu Cao , Binhua Li , Fei Huang , Yongbin Li , Ge Li

Diffusion Language Models (DLMs) provide a promising alternative to autoregressive language models by generating text through iterative denoising and bidirectional refinement. However, this iterative generation paradigm also introduces…

Computation and Language · Computer Science 2026-05-14 Yejin Lee , Yo-Sub Han

Reinforcement Learning (RL) is crucial for unlocking the complex reasoning capabilities of Diffusion-based Large Language Models (dLLMs). However, applying RL to dLLMs faces unique challenges in efficiency and stability. To address these…

Artificial Intelligence · Computer Science 2026-02-10 Jiawei Liu , Xiting Wang , Yuanyuan Zhong , Defu Lian , Yu Yang

Automatic speech recognition (ASR) systems based on large language models (LLMs) achieve superior performance by leveraging pretrained LLMs as decoders, but their token-by-token generation mechanism leads to inference latency that grows…

Sound · Computer Science 2026-01-27 Wenjie Tian , Bingshen Mu , Guobin Ma , Xuelong Geng , Zhixian Zhao , Lei Xie

Diffusion Large Language Models (dLLMs) have achieved rapid progress, viewed as a promising alternative to the autoregressive paradigm. However, most dLLM decoders still adopt a global confidence threshold, and do not explicitly model local…

Computation and Language · Computer Science 2026-04-09 Yuzhe Chen , Jiale Cao , Xuyang Liu , Jin Xie , Aiping Yang , Yanwei Pang

Discrete Diffusion Language Models (DLMs) offer a promising non-autoregressive alternative for text generation, yet effective mechanisms for inference-time control remain relatively underexplored. Existing approaches include sampling-level…

Computation and Language · Computer Science 2026-01-30 Eden Avrahami , Eliya Nachmani

Autoregressive (AR) models remain the standard for natural language generation but still suffer from high latency due to strictly sequential decoding. Recent diffusion-inspired approaches, such as LlaDA and Dream, mitigate this by…

Computation and Language · Computer Science 2025-10-16 Qinglin Zhu , Yizhen Yao , Runcong Zhao , Yanzheng Xiang , Amrutha Saseendran , Chen Jin , Philip Teare , Bin Liang , Yulan He , Lin Gui

Steering language model generation toward desired textual properties is essential for practical deployment, and inference-time methods are particularly appealing because they enable controllable generation without retraining. Recent work…

Computation and Language · Computer Science 2026-05-29 Hyeseon An , Yo-Sub Han

Diffusion Large Language Models (dLLMs) have emerged as a promising alternative to autoregressive generation by enabling parallel token prediction. However, practical dLLM decoding still suffers from high inference latency, which limits…

Computation and Language · Computer Science 2026-04-22 Zhenbang Du , Kejing Xia , Xinrui Zhong , Yonggan Fu , Nicolai Oswald , Binfei Ji , Brucek Khailany , Pavlo Molchanov , Yingyan Lin

Large Language Models (LLMs) trained for average correctness often exhibit mode collapse, producing narrow decision behaviors on tasks where multiple responses may be reasonable. This limitation is particularly problematic in ordinal…

Artificial Intelligence · Computer Science 2026-02-04 Eric Yang , Jong Ha Lee , Jonathan Amar , Elissa Ye , Yugang Jia

Aligning large language models (LLMs) with human values is crucial for safe deployment. Inference-time techniques offer granular control over generation; however, they rely on model uncertainty, meaning an internal estimate of how likely…

Computation and Language · Computer Science 2026-03-04 Mohammad Atif Quamar , Mohammad Areeb , Mikhail Kuznetsov , Muslum Ozgur Ozmen , Z. Berkay Celik

Diffusion large language models (dLLMs) offer a promising paradigm for parallel text generation, but in practice they face an accuracy-parallelism trade-off, where increasing tokens per forward (TPF) often degrades generation quality.…

Computation and Language · Computer Science 2026-05-12 Haoyang Zhou , Li Kong , Shijie Ren , Xiting Wang , Shuang Liang , Guowei Wang , Zhenxuan Pan

Recent provably secure linguistic steganography (PSLS) methods rely on mainstream autoregressive language models (ARMs) to address historically challenging tasks, that is, to disguise covert communication as ``innocuous'' natural language…

Cryptography and Security · Computer Science 2026-01-22 Yuang Qi , Na Zhao , Qiyi Yao , Benlong Wu , Weiming Zhang , Nenghai Yu , Kejiang Chen

Diffusion Language Models (DLMs) generate text by iteratively denoising a masked sequence, repeatedly deciding which positions to commit at each step. Standard decoding follows a greedy rule: unmask the most confident positions, yet this…

Computation and Language · Computer Science 2026-02-26 Mingyu Cao , Alvaro H. C. Correia , Christos Louizos , Shiwei Liu , Lu Yin

Video Large Language Models (Video-LLMs) remain prone to spatiotemporal hallucinations, often generating visually unsupported details or incorrect temporal relations. Existing mitigation methods typically treat hallucination as a uniform…

Computer Vision and Pattern Recognition · Computer Science 2026-04-06 Linfeng Fan , Yuan Tian , Ziwei Li , Zhiwu Lu

Diffusion language models offer parallel token generation and inherent bidirectionality, promising more efficient and powerful sequence modeling compared to autoregressive approaches. However, state-of-the-art diffusion models (e.g., Dream…

Computation and Language · Computer Science 2025-10-10 Zhanqiu Hu , Jian Meng , Yash Akhauri , Mohamed S. Abdelfattah , Jae-sun Seo , Zhiru Zhang , Udit Gupta
‹ Prev 1 2 3 10 Next ›