Related papers: Quantized Embedding Vectors for Controllable Diffu…

CDLM: Consistency Diffusion Language Models For Faster Sampling

Diffusion Language Models (DLMs) offer a promising parallel generation paradigm but suffer from slow inference due to numerous refinement steps and the inability to use standard KV caching. We introduce CDLM (Consistency Diffusion Language…

Machine Learning · Computer Science 2026-02-23 Minseo Kim , Chenfeng Xu , Coleman Hooper , Harman Singh , Ben Athiwaratkun , Ce Zhang , Kurt Keutzer , Amir Gholami

Memory-Efficient Fine-Tuning for Quantized Diffusion Model

The emergence of billion-parameter diffusion models such as Stable Diffusion XL, Imagen, and DALL-E 3 has significantly propelled the domain of generative AI. However, their large-scale architecture presents challenges in fine-tuning and…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Hyogon Ryu , Seohyun Lim , Hyunjung Shim

DLLMQuant: Quantizing Diffusion-based Large Language Models

Diffusion-based large language models (DLLMs) have shown promise for non-autoregressive text generation, but their deployment is constrained by large model sizes and heavy computational costs. Post-training quantization (PTQ), a widely used…

Computation and Language · Computer Science 2025-08-27 Chen Xu , Dawei Yang

Diffusion-LM Improves Controllable Text Generation

Controlling the behavior of language models (LMs) without re-training is a major open problem in natural language generation. While recent works have demonstrated successes on controlling simple sentence attributes (e.g., sentiment), there…

Computation and Language · Computer Science 2022-05-31 Xiang Lisa Li , John Thickstun , Ishaan Gulrajani , Percy Liang , Tatsunori B. Hashimoto

Efficient Quantization Strategies for Latent Diffusion Models

Latent Diffusion Models (LDMs) capture the dynamic evolution of latent variables over time, blending patterns and multimodality in a generative system. Despite the proficiency of LDM in various applications, such as text-to-image…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Yuewei Yang , Xiaoliang Dai , Jialiang Wang , Peizhao Zhang , Hongbo Zhang

Quant-dLLM: Post-Training Extreme Low-Bit Quantization for Diffusion Large Language Models

Diffusion large language models (dLLMs), which offer bidirectional context and flexible masked-denoising generation, are emerging as a compelling alternative to autoregressive (AR) LLMs. However, like AR LLMs, their model sizes continue to…

Machine Learning · Computer Science 2025-10-07 Tianao Zhang , Zhiteng Li , Xianglong Yan , Haotong Qin , Yong Guo , Yulun Zhang

Conditional Latent Diffusion-Based Speech Enhancement Via Dual Context Learning

Recently, the application of diffusion probabilistic models has advanced speech enhancement through generative approaches. However, existing diffusion-based methods have focused on the generation process in high-dimensional waveform or…

Sound · Computer Science 2025-01-20 Shengkui Zhao , Zexu Pan , Kun Zhou , Yukun Ma , Chong Zhang , Bin Ma

Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs

Recent advances in diffusion large language models (dLLMs) have introduced a promising alternative to autoregressive (AR) LLMs for natural language generation tasks, leveraging full attention and denoising-based decoding strategies.…

Computation and Language · Computer Science 2026-03-17 Haokun Lin , Haobo Xu , Yichen Wu , Ziyu Guo , Renrui Zhang , Zhichao Lu , Ying Wei , Qingfu Zhang , Zhenan Sun

Cross-Modality Controlled Molecule Generation with Diffusion Language Model

Current SMILES-based diffusion models for molecule generation typically support only unimodal constraint. They inject conditioning signals at the start of the training process and require retraining a new model from scratch whenever the…

Machine Learning · Computer Science 2025-08-21 Yunzhe Zhang , Yifei Wang , Khanh Vinh Nguyen , Pengyu Hong

Improving Uncertainty Quantification in Large Language Models via Semantic Embeddings

Accurately quantifying uncertainty in large language models (LLMs) is crucial for their reliable deployment, especially in high-stakes applications. Current state-of-the-art methods for measuring semantic uncertainty in LLMs rely on strict…

Machine Learning · Computer Science 2024-10-31 Yashvir S. Grewal , Edwin V. Bonilla , Thang D. Bui

QEFT: Quantization for Efficient Fine-Tuning of LLMs

With the rapid growth in the use of fine-tuning for large language models (LLMs), optimizing fine-tuning while keeping inference efficient has become highly important. However, this is a challenging task as it requires improvements in all…

Computation and Language · Computer Science 2024-10-14 Changhun Lee , Jun-gyu Jin , Younghyun Cho , Eunhyeok Park

LLM-QFL: Distilling Large Language Model for Quantum Federated Learning

Inspired by the power of large language models (LLMs), our research adapts them to quantum federated learning (QFL) to boost efficiency and performance. We propose a federated fine-tuning method that distills an LLM within QFL, allowing…

Machine Learning · Computer Science 2025-05-27 Dev Gurung , Shiva Raj Pokhrel

XDLM: Cross-lingual Diffusion Language Model for Machine Translation

Recently, diffusion models have excelled in image generation tasks and have also been applied to neural language processing (NLP) for controllable text generation. However, the application of diffusion models in a cross-lingual setting is…

Computation and Language · Computer Science 2023-08-01 Linyao Chen , Aosong Feng , Boming Yang , Zihui Li

On the Compressibility of Quantized Large Language Models

Deploying Large Language Models (LLMs) on edge or mobile devices offers significant benefits, such as enhanced data privacy and real-time processing capabilities. However, it also faces critical challenges due to the substantial memory…

Machine Learning · Computer Science 2024-05-07 Yu Mao , Weilan Wang , Hongchao Du , Nan Guan , Chun Jason Xue

Diffusion Language Models Generation Can Be Halted Early

Diffusion Language models (DLMs) are a promising avenue for text generation due to their practical properties on tractable controllable generation. They also have the advantage of not having to predict text autoregressively. However,…

Machine Learning · Computer Science 2024-02-13 Sofia Maria Lo Cicero Vaina , Nikita Balagansky , Daniil Gavrilov

Diffusion LLM with Native Variable Generation Lengths: Let [EOS] Lead the Way

Diffusion-based large language models (dLLMs) have exhibited substantial potential for parallel text generation, which may enable more efficient generation compared to autoregressive models. However, current dLLMs suffer from fixed…

Computation and Language · Computer Science 2025-10-29 Yicun Yang , Cong Wang , Shaobo Wang , Zichen Wen , Biqing Qi , Hanlin Xu , Linfeng Zhang

Parameter-Efficient Neuroevolution for Diverse LLM Generation: Quality-Diversity Optimization via Prompt Embedding Evolution

Large Language Models exhibit mode collapse, producing homogeneous outputs that fail to explore valid solution spaces. We present QD-LLM, a framework for parameter-efficient neuroevolution that evolves prompt embeddings, compact neural…

Neural and Evolutionary Computing · Computer Science 2026-05-12 Dongxin Guo , Jikun Wu , Siu Ming Yiu

On the Quantization Robustness of Diffusion Language Models in Coding Benchmarks

Auto-regressive Large Language Models (LLMs) achieve strong performance on coding tasks, but incur high memory and inference costs. Diffusion-based language models (d-LLMs) offer bounded inference cost via iterative denoising, but their…

Machine Learning · Computer Science 2026-04-23 Aarav Gupta , Gururaj Deshpande , Chandreyi Chakraborty

Analyzing Diffusion and Autoregressive Vision Language Models in Multimodal Embedding Space

Embedding models are a fundamental component of modern AI systems such as semantic search and retrieval-augmented generation. Recent advances in large foundation models have substantially accelerated the development of embedding models,…

Multimedia · Computer Science 2026-02-09 Zihang Wang , Siyue Zhang , Yilun Zhao , Jingyi Yang , Tingyu Song , Anh Tuan Luu , Chen Zhao

Scalable Text-Embedding-informed Cognitive Diagnosis of Large Language Models

Large language models (LLMs) have achieved remarkable performance on diverse benchmarks, yet existing evaluation practices largely rely on coarse summary metrics that obscure underlying reasoning abilities. In this work, we propose novel…

Methodology · Statistics 2026-03-17 Jia Liu , Zhiyu Xu , Yuqi Gu