Related papers: Does Preference Always Help? A Holistic Study on P…

Interactive Evolutionary Multi-Objective Optimization via Learning-to-Rank

In practical multi-criterion decision-making, it is cumbersome if a decision maker (DM) is asked to choose among a set of trade-off alternatives covering the whole Pareto-optimal front. This is a paradox in conventional evolutionary…

Neural and Evolutionary Computing · Computer Science 2022-04-07 Ke Li , Guiyu Lai , Xin Yao

Interactive Decomposition Multi-Objective Optimization via Progressively Learned Value Functions

Decomposition has become an increasingly popular technique for evolutionary multi-objective optimization (EMO). A decomposition-based EMO algorithm is usually designed to approximate a whole Pareto-optimal front (PF). However, in practice,…

Neural and Evolutionary Computing · Computer Science 2018-10-02 Ke Li , Renzhi Chen , Dragan Savic , Xin Yao

Dynamic Detection of Relevant Objectives and Adaptation to Preference Drifts in Interactive Evolutionary Multi-Objective Optimization

Evolutionary Multi-Objective Optimization Algorithms (EMOAs) are widely employed to tackle problems with multiple conflicting objectives. Recent research indicates that not all objectives are equally important to the decision-maker (DM). In…

Artificial Intelligence · Computer Science 2024-11-08 Seyed Mahdi Shavarani , Mahmoud Golabi , Richard Allmendinger , Lhassane Idoumghar

Techniques for Highly Multiobjective Optimisation: Some Nondominated Points are Better than Others

The research area of evolutionary multiobjective optimization (EMO) is reaching better understandings of the properties and capabilities of EMO algorithms, and accumulating much evidence of their worth in practical scenarios. An urgent…

Neural and Evolutionary Computing · Computer Science 2009-08-24 David Corne , Joshua Knowles

Multi-Objective Bayesian Optimization with Active Preference Learning

There are a lot of real-world black-box optimization problems that need to optimize multiple criteria simultaneously. However, in a multi-objective optimization (MOO) problem, identifying the whole Pareto front requires the prohibitive…

Machine Learning · Computer Science 2023-11-23 Ryota Ozaki , Kazuki Ishikawa , Youhei Kanzaki , Shinya Suzuki , Shion Takeno , Ichiro Takeuchi , Masayuki Karasuyama

One Step Preference Elicitation in Multi-Objective Bayesian Optimization

We consider a multi-objective optimization problem with objective functions that are expensive to evaluate. The decision maker (DM) has unknown preferences, and so the standard approach is to generate an approximation of the Pareto front…

Machine Learning · Computer Science 2021-05-28 Juan Ungredda , Mariapia Marchi , Teresa Montrone , Juergen Branke

Integration of Preferences in Decomposition Multi-Objective Optimization

Most existing studies on evolutionary multi-objective optimization focus on approximating the whole Pareto-optimal front. Nevertheless, rather than the whole front, which demands for too many points (especially in a high-dimensional space),…

Neural and Evolutionary Computing · Computer Science 2017-01-24 Ke Li , Kalyanmoy Deb , Xin Yao

Multi-Objective Preference Optimization: Improving Human Alignment of Generative Models

Post-training of LLMs with RLHF, and subsequently preference optimization algorithms such as DPO, IPO, etc., made a big difference in improving human alignment. However, all such techniques can only work with a single (human) objective. In…

Machine Learning · Computer Science 2025-05-19 Akhil Agnihotri , Rahul Jain , Deepak Ramachandran , Zheng Wen

User preference extraction using dynamic query sliders in conjunction with UPS-EMO algorithm

One drawback of evolutionary multiobjective optimization algorithms (EMOA) has traditionally been high computational cost to create an approximation of the Pareto front: number of required objective function evaluations usually grows high.…

Neural and Evolutionary Computing · Computer Science 2015-03-19 Timo Aittokoski , Suvi Tarkkanen

What Is Preference Optimization Doing, and Why?

Preference optimization (PO) is indispensable for large language models (LLMs), with methods such as direct preference optimization (DPO) and proximal policy optimization (PPO) achieving great success. A common belief is that DPO is…

Machine Learning · Computer Science 2026-05-18 Yue Wang , Qizhou Wang , Zizhuo Zhang , Gang Niu , Bo Han , Masashi Sugiyama

Is On-Policy Data always the Best Choice for Direct Preference Optimization-based LM Alignment?

The alignment of language models~(LMs) with human preferences is critical for building reliable AI systems. The problem is typically framed as optimizing an LM policy to maximize the expected reward that reflects human preferences.…

Artificial Intelligence · Computer Science 2026-01-28 Zetian Sun , Dongfang Li , Xuhui Chen , Baotian Hu , Min Zhang

Quality Indicators for Preference-based Evolutionary Multi-objective Optimization Using a Reference Point: A Review and Analysis

Some quality indicators have been proposed for benchmarking preference-based evolutionary multi-objective optimization algorithms using a reference point. Although a systematic review and analysis of the quality indicators are helpful for…

Neural and Evolutionary Computing · Computer Science 2023-09-27 Ryoji Tanabe , Ke Li

Multi-Objective Archiving

Most multi-objective optimisation algorithms maintain an archive explicitly or implicitly during their search. Such an archive can be solely used to store high-quality solutions presented to the decision maker, but in many cases may…

Neural and Evolutionary Computing · Computer Science 2023-09-15 Miqing Li , Manuel López-Ibáñez , Xin Yao

Investigating Normalization in Preference-based Evolutionary Multi-objective Optimization Using a Reference Point

Normalization of objectives plays a crucial role in evolutionary multi-objective optimization (EMO) to handle objective functions with different scales, which can be found in real-world problems. Although the effect of normalization methods…

Neural and Evolutionary Computing · Computer Science 2023-07-14 Ryoji Tanabe

Multilevel evolutionary developmental optimization (MEDO): A theoretical framework for understanding preferences and selection dynamics

What is motivation and how does it work? Where do goals come from and how do they vary within and between species and individuals? Why do we prefer some things over others? MEDO is a theoretical framework for understanding these questions…

Neurons and Cognition · Quantitative Biology 2019-11-12 Adam Safron

What Matters in Data for DPO?

Direct Preference Optimization (DPO) has emerged as a simple and effective approach for aligning large language models (LLMs) with human preferences, bypassing the need for a learned reward model. Despite its growing adoption, a fundamental…

Machine Learning · Computer Science 2025-11-10 Yu Pan , Zhongze Cai , Guanting Chen , Huaiyang Zhong , Chonghuan Wang

Self-Improvement Towards Pareto Optimality: Mitigating Preference Conflicts in Multi-Objective Alignment

Multi-Objective Alignment (MOA) aims to align LLMs' responses with multiple human preference objectives, with Direct Preference Optimization (DPO) emerging as a prominent approach. However, we find that DPO-based MOA approaches suffer from…

Machine Learning · Computer Science 2025-12-09 Moxin Li , Yuantao Zhang , Wenjie Wang , Wentao Shi , Zhuo Liu , Fuli Feng , Tat-Seng Chua

Optimizing LLMs with Direct Preferences: A Data Efficiency Perspective

Aligning the output of Large Language Models (LLMs) with human preferences (e.g., by means of reinforcement learning with human feedback, or RLHF) is essential for ensuring their effectiveness in real-world scenarios. Despite significant…

Artificial Intelligence · Computer Science 2024-10-23 Pietro Bernardelle , Gianluca Demartini

Multi-Attribute Bayesian Optimization With Interactive Preference Learning

We consider black-box global optimization of time-consuming-to-evaluate functions on behalf of a decision-maker (DM) whose preferences must be learned. Each feasible design is associated with a time-consuming-to-evaluate vector of…

Machine Learning · Statistics 2020-03-05 Raul Astudillo , Peter I. Frazier

Direct Preference-Based Evolutionary Multi-Objective Optimization with Dueling Bandit

Optimization problems find widespread use in both single-objective and multi-objective scenarios. In practical applications, users aspire for solutions that converge to the region of interest (ROI) along the Pareto front (PF). While the…

Artificial Intelligence · Computer Science 2025-03-10 Tian Huang , Shengbo Wang , Ke Li