English
Related papers

Related papers: Prompt Optimization Via Diffusion Language Models

200 papers

Large Language Models (LLMs) have demonstrated remarkable capabilities in code generation. However, the quality of the generated code is heavily dependent on the structure and composition of the prompts used. Crafting high-quality prompts…

Software Engineering · Computer Science 2025-04-08 Jinyang Li , Sangwon Hyun , M. Ali Babar

Offline reinforcement learning (RL) methods harness previous experiences to derive an optimal policy, forming the foundation for pre-trained large-scale models (PLMs). When encountering tasks not seen before, PLMs often utilize several…

Machine Learning · Computer Science 2024-11-05 Shengchao Hu , Wanru Zhao , Weixiong Lin , Li Shen , Ya Zhang , Dacheng Tao

In-Context Learning and Chain-of-Thought prompting improve reasoning in large language models (LLMs). These typically come at the cost of longer, more expensive prompts that may contain redundant information. Prompt compression based on…

Computation and Language · Computer Science 2026-04-09 Caleb Zheng , Jyotika Singh , Fang Tu , Weiyi Sun , Sujeeth Bharadwaj , Yassine Benajiba , Sujith Ravi , Eli Shlizerman , Dan Roth

Diffusion models that are based on iterative denoising have been recently proposed and leveraged in various generation tasks like image generation. Whereas, as a way inherently built for continuous data, existing diffusion models still have…

Computation and Language · Computer Science 2023-04-11 Jiaao Chen , Aston Zhang , Mu Li , Alex Smola , Diyi Yang

Automatic prompt optimization is an important approach to improving the performance of large language models (LLMs). Recent research demonstrates the potential of using LLMs as prompt optimizers, which can generate improved task prompts via…

Computation and Language · Computer Science 2025-01-28 Xinyu Tang , Xiaolei Wang , Wayne Xin Zhao , Siyuan Lu , Yaliang Li , Ji-Rong Wen

Recent advances have shown that optimizing prompts for Large Language Models (LLMs) can significantly improve task performance, yet many optimization techniques rely on heuristics or manual exploration. We present LatentPrompt, a…

Computation and Language · Computer Science 2025-08-05 Mateusz Bystroński , Grzegorz Piotrowski , Nitesh V. Chawla , Tomasz Kajdanowicz

Prompt learning has demonstrated promising results in fine-tuning pre-trained multimodal models. However, the performance improvement is limited when applied to more complex and fine-grained tasks. The reason is that most existing methods…

Computer Vision and Pattern Recognition · Computer Science 2025-05-01 Weicai Yan , Wang Lin , Zirun Guo , Ye Wang , Fangming Feng , Xiaoda Yang , Zehan Wang , Tao Jin

Despite recent progress, reinforcement learning (RL)-based fine-tuning of diffusion models often struggles with generalization, composability, and robustness against reward hacking. Recent studies have explored prompt refinement as a…

Machine Learning · Computer Science 2026-03-26 Suhyeon Lee , Jong Chul Ye

Preference optimization for diffusion models aims to align them with human preferences for images. Previous methods typically use Vision-Language Models (VLMs) as pixel-level reward models to approximate human preferences. However, when…

Computer Vision and Pattern Recognition · Computer Science 2025-10-03 Tao Zhang , Cheng Da , Kun Ding , Huan Yang , Kun Jin , Yan Li , Tingting Gao , Di Zhang , Shiming Xiang , Chunhong Pan

We propose a new finetuning method to provide pre-trained large language models (LMs) the ability to scale test-time compute through the diffusion framework. By increasing the number of diffusion steps, we show our finetuned models achieve…

Computation and Language · Computer Science 2025-06-04 Edoardo Cetin , Tianyu Zhao , Yujin Tang

This paper introduces the first gradient-based framework for prompt optimization in text-to-image diffusion models. We formulate prompt engineering as a discrete optimization problem over the language space. Two major challenges arise in…

Machine Learning · Computer Science 2024-07-03 Ruochen Wang , Ting Liu , Cho-Jui Hsieh , Boqing Gong

Discrete diffusion models have recently emerged as strong alternatives to autoregressive language models, matching their performance through large-scale training. However, inference-time control remains relatively underexplored. In this…

Machine Learning · Computer Science 2026-04-09 Meihua Dang , Jiaqi Han , Minkai Xu , Kai Xu , Akash Srivastava , Stefano Ermon

Masked diffusion language models enable parallel token generation and offer improved decoding efficiency over autoregressive models. However, their performance degrades significantly when generating multiple tokens simultaneously, due to a…

Computation and Language · Computer Science 2026-05-12 Houxing Ren , Mingjie Zhan , Zimu Lu , Ke Wang , Yunqiao Yang , Haotian Hou , Junting Pan , Hongsheng Li

Large Language Model (LLM) has demonstrated significant ability in various Natural Language Processing tasks. However, their effectiveness is highly dependent on the phrasing of the task prompt, leading to research on automatic prompt…

Computation and Language · Computer Science 2024-02-06 Moxin Li , Wenjie Wang , Fuli Feng , Yixin Cao , Jizhi Zhang , Tat-Seng Chua

Unlike autoregressive language models, which terminate variable-length generation upon predicting an End-of-Sequence (EoS) token, Diffusion Language Models (DLMs) operate over a fixed maximum-length context window for a predetermined number…

Computation and Language · Computer Science 2026-03-09 Vittorio Rossi , Giacomo Cirò , Davide Beltrame , Luca Gandolfi , Paul Röttger , Dirk Hovy

Inference-time alignment provides an efficient alternative for aligning LLMs with humans. However, these approaches still face challenges, such as limited scalability due to policy-specific value functions and latency during the inference…

Computation and Language · Computer Science 2025-05-27 Ruizhe Chen , Wenhao Chai , Zhifei Yang , Xiaotian Zhang , Joey Tianyi Zhou , Tony Quek , Soujanya Poria , Zuozhu Liu

Text-to-video models have made remarkable advancements through optimization on high-quality text-video pairs, where the textual prompts play a pivotal role in determining quality of output videos. However, achieving the desired output often…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Yatai Ji , Jiacheng Zhang , Jie Wu , Shilong Zhang , Shoufa Chen , Chongjian GE , Peize Sun , Weifeng Chen , Wenqi Shao , Xuefeng Xiao , Weilin Huang , Ping Luo

Finding effective prompts for language models (LMs) is critical yet notoriously difficult: the prompt space is combinatorially large, rewards are sparse due to expensive target-LM evaluation. Yet, existing RL-based prompt optimizers often…

Artificial Intelligence · Computer Science 2026-02-04 Junmo Cho , Suhan Kim , Sangjune An , Minsu Kim , Dong Bok Lee , Heejun Lee , Sung Ju Hwang , Hae Beom Lee

We introduce a novel framework that transforms the resource-intensive (adversarial) prompt optimization problem into an \emph{efficient, amortized inference task}. Our core insight is that pretrained, non-autoregressive generative LLMs,…

Machine Learning · Computer Science 2025-11-04 David Lüdke , Tom Wollschläger , Paul Ungermann , Stephan Günnemann , Leo Schwinn

Prompt-based pre-trained language models (PLMs) paradigm have succeeded substantially in few-shot natural language processing (NLP) tasks. However, prior discrete prompt optimization methods require expert knowledge to design the base…

Machine Learning · Computer Science 2024-01-17 Chengzhengxu Li , Xiaoming Liu , Yichen Wang , Duyi Li , Yu Lan , Chao Shen
‹ Prev 1 2 3 10 Next ›