Related papers: Preference-Guided Diffusion for Multi-Objective Of…

Gradient Guidance for Diffusion Models: An Optimization Perspective

Diffusion models have demonstrated empirical successes in various applications and can be adapted to task-specific needs via guidance. This paper studies a form of gradient guidance for adapting a pre-trained diffusion model towards…

Machine Learning · Statistics 2024-10-17 Yingqing Guo, Hui Yuan, Yukang Yang, Minshuo Chen, Mengdi Wang

ParetoFlow: Guided Flows in Multi-Objective Optimization

In offline multi-objective optimization (MOO), we leverage an offline dataset of designs and their associated labels to simultaneously minimize multiple objectives. This setting more closely mirrors complex real-world problems compared to…

Computational Engineering, Finance, and Science · Computer Science 2025-02-21 Ye Yuan, Can Chen, Christopher Pal, Xue Liu

Direct Preference Optimization-Enhanced Multi-Guided Diffusion Model for Traffic Scenario Generation

Diffusion-based models are recognized for their effectiveness in using real-world driving data to generate realistic and diverse traffic scenarios. These models employ guided sampling to incorporate specific traffic preferences and enhance…

Machine Learning · Computer Science 2025-02-19 Seungjun Yu, Kisung Kim, Daejung Kim, Haewook Han, Jinhan Lee

Pareto-Conditioned Diffusion Models for Offline Multi-Objective Optimization

Multi-objective optimization (MOO) arises in many real-world applications where trade-offs between competing objectives must be carefully balanced. In the offline setting, where only a static dataset is available, the main challenge is…

Machine Learning · Computer Science 2026-02-16 Jatan Shrestha, Santeri Heiskanen, Kari Hepola, Severi Rissanen, Pekka Jääskeläinen, Joni Pajarinen

Diffusion-NPO: Negative Preference Optimization for Better Preference Aligned Generation of Diffusion Models

Diffusion models have made substantial advances in image generation, yet models trained on large, unfiltered datasets often yield outputs misaligned with human preferences. Numerous methods have been proposed to fine-tune pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2025-05-19 Fu-Yun Wang, Yunhao Shui, Jingtan Piao, Keqiang Sun, Hongsheng Li

PC-Diffusion: Aligning Diffusion Models with Human Preferences via Preference Classifier

Diffusion models have achieved remarkable success in conditional image generation, yet their outputs often remain misaligned with human preferences. To address this, recent work has applied Direct Preference Optimization (DPO) to diffusion…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Shaomeng Wang, He Wang, Xiaolu Wei, Longquan Dai, Jinhui Tang

MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning

Multi-objective Reinforcement Learning (MORL) seeks to develop policies that simultaneously optimize multiple conflicting objectives, but it requires extensive online interactions. Offline MORL provides a promising solution by training on…

Machine Learning · Computer Science 2025-05-28 Yifu Yuan, Zhenrui Zheng, Zibin Dong, Jianye Hao

SPREAD: Sampling-based Pareto front Refinement via Efficient Adaptive Diffusion

Developing efficient multi-objective optimization methods to compute the Pareto set of optimal compromises between conflicting objectives remains a key challenge, especially for large-scale and expensive problems. To bridge this gap, we…

Machine Learning · Computer Science 2026-02-05 Sedjro Salomon Hotegni, Sebastian Peitz

The Offline-Frontier Shift: Diagnosing Distributional Limits in Generative Multi-Objective Optimization

Offline multi-objective optimization (MOO) aims to recover Pareto-optimal designs given a finite, static dataset. Recent generative approaches, including diffusion models, show strong performance under hypervolume, yet their behavior under…

Machine Learning · Computer Science 2026-05-13 Stephanie Holly, Alexandru-Ciprian Zăvoianu, Siegfried Silber, Sepp Hochreiter, Werner Zellinger

Refining Alignment Framework for Diffusion Models with Intermediate-Step Preference Ranking

Direct preference optimization (DPO) has shown success in aligning diffusion models with human preference. Previous approaches typically assume a consistent preference label between final generations and noisy samples at intermediate steps,…

Machine Learning · Computer Science 2025-02-05 Jie Ren, Yuhang Zhang, Dongrui Liu, Xiaopeng Zhang, Qi Tian

Preference Diffusion for Recommendation

Recommender systems predict personalized item rankings based on user preference distributions derived from historical behavior data. Recently, diffusion models (DMs) have gained attention in recommendation for their ability to model complex…

Information Retrieval · Computer Science 2025-04-22 Shuo Liu, An Zhang, Guoqing Hu, Hong Qian, Tat-seng Chua

Rethinking Direct Preference Optimization in Diffusion Models

Aligning text-to-image (T2I) diffusion models with human preferences has emerged as a critical research challenge. While recent advances in this area have extended preference optimization techniques from large language models (LLMs) to the…

Computer Vision and Pattern Recognition · Computer Science 2025-12-25 Junyong Kang, Seohyun Lim, Kyungjune Baek, Hyunjung Shim

Learning Design-Score Manifold to Guide Diffusion Models for Offline Optimization

Optimizing complex systems, from discovering therapeutic drugs to designing high-performance materials, remains a fundamental challenge across science and engineering, as the underlying rules are often unknown and costly to evaluate.…

Machine Learning · Computer Science 2026-01-13 Tailin Zhou, Zhilin Chen, Wenlong Lyu, Zhitang Chen, Danny H. K. Tsang, Jun Zhang

Robust Guided Diffusion for Offline Black-Box Optimization

Offline black-box optimization aims to maximize a black-box function using an offline dataset of designs and their measured properties. Two main approaches have emerged: the forward approach, which learns a mapping from input to its value,…

Machine Learning · Computer Science 2025-01-03 Can Sam Chen, Christopher Beckham, Zixuan Liu, Xue Liu, Christopher Pal

Prior-Guided Diffusion Planning for Offline Reinforcement Learning

Diffusion models have recently gained prominence in offline reinforcement learning due to their ability to effectively learn high-performing, generalizable policies from static datasets. Diffusion-based planners facilitate long-horizon…

Machine Learning · Computer Science 2025-10-27 Donghyeon Ki, JunHyeok Oh, Seong-Woong Shim, Byung-Jun Lee

Guided Diffusion from Self-Supervised Diffusion Features

Guidance serves as a key concept in diffusion models, yet its effectiveness is often limited by the need for extra data annotation or classifier pretraining. That is why guidance was harnessed from self-supervised learning backbones, like…

Computer Vision and Pattern Recognition · Computer Science 2023-12-15 Vincent Tao Hu, Yunlu Chen, Mathilde Caron, Yuki M. Asano, Cees G. M. Snoek, Bjorn Ommer

Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback

Direct preference optimization (DPO) methods have shown strong potential in aligning text-to-image diffusion models with human preferences by training on paired comparisons. These methods improve training stability by avoiding the REINFORCE…

Computer Vision and Pattern Recognition · Computer Science 2025-10-22 Yi-Lun Wu, Bo-Kai Ruan, Chiang Tseng, Hong-Han Shuai

What Makes a Good Diffusion Planner for Decision Making?

Diffusion models have recently shown significant potential in solving decision-making problems, particularly in generating behavior plans -- also known as diffusion planning. While numerous studies have demonstrated the impressive…

Machine Learning · Computer Science 2025-03-04 Haofei Lu, Dongqi Han, Yifei Shen, Dongsheng Li

One Step Preference Elicitation in Multi-Objective Bayesian Optimization

We consider a multi-objective optimization problem with objective functions that are expensive to evaluate. The decision maker (DM) has unknown preferences, and so the standard approach is to generate an approximation of the Pareto front…

Machine Learning · Computer Science 2021-05-28 Juan Ungredda, Mariapia Marchi, Teresa Montrone, Juergen Branke

Preference-Based Alignment of Discrete Diffusion Models

Diffusion models have achieved state-of-the-art performance across multiple domains, with recent advancements extending their applicability to discrete data. However, aligning discrete diffusion models with task-specific preferences remains…

Machine Learning · Computer Science 2025-04-10 Umberto Borso, Davide Paglieri, Jude Wells, Tim Rocktäschel