Related papers: Multi-Attribute Utility Preference Robust Optimiza…

Distributionally Preference Robust Optimization in Multi-Attribute Decision Making

Utility preference robust optimization (PRO) has recently been proposed to deal with optimal decision making problems where the decision maker's (DM) preference over gains and losses is ambiguous. In this paper, we take a step further to…

Optimization and Control · Mathematics 2024-03-11 Jian Hu , Dali Zhang , Huifu Xu , Sainan Zhang

Multistage Utility Preference Robust Optimization

In this paper, we consider a multistage expected utility maximization problem where the decision maker's utility function at each stage depends on historical data and the information on the true utility function is incomplete. To mitigate…

Optimization and Control · Mathematics 2023-02-22 Jia Liu , Zhiping Chen , Huifu Xu

Preference Elicitation and Robust Optimization with Multi-Attribute Quasi-Concave Choice Functions

Decision maker's preferences are often captured by some choice functions which are used to rank prospects. In this paper, we consider ambiguity in choice functions over a multi-attribute prospect space. Our main result is a robust…

Risk Management · Quantitative Finance 2018-05-21 William B. Haskell , Wenjie Huang , Huifu Xu

Robust Utility Maximization with Intractable Claims under Distributional Ambiguity: A Random Distributionally Robust Optimization Approach

This paper studies a robust utility maximization problem for intractable claims under distributional ambiguity, where the distribution of the claim cannot be inferred from market information and its dependence with tradable assets is…

Optimization and Control · Mathematics 2026-04-17 Guohui Guan , Zongxia Liang , Xingjian Ma

Model Selection in Utility-Maximizing Binary Prediction

The maximum utility estimation proposed by Elliott and Lieli (2013) can be viewed as cost-sensitive binary classification; thus, its in-sample overfitting issue is similar to that of perceptron learning. A utility-maximizing prediction rule…

Econometrics · Economics 2021-09-29 Jiun-Hua Su

$\alpha$-robust utility maximization with intractable claims: A quantile optimization approach

This paper studies an $\alpha$-robust utility maximization problem where an investor faces an intractable claim -- an exogenous contingent claim with known marginal distribution but unspecified dependence structure with financial market…

Portfolio Management · Quantitative Finance 2026-04-07 Xinyu Chen , Zuo Quan Xu

Multi-Attribute Bayesian Optimization With Interactive Preference Learning

We consider black-box global optimization of time-consuming-to-evaluate functions on behalf of a decision-maker (DM) whose preferences must be learned. Each feasible design is associated with a time-consuming-to-evaluate vector of…

Machine Learning · Statistics 2020-03-05 Raul Astudillo , Peter I. Frazier

Eliciting Von Neumann-Morgenstern utility from discrete choices with response error

We develop a preference elicitation method for a Von Neumann-Morgenstern (VNM)-type decision-maker from pairwise comparison data in the presence of response errors. We apply the maximum likelihood estimation (MLE) method to jointly elicit…

Optimization and Control · Mathematics 2026-03-30 Bo Chen , Jia Liu

Risk-sensitive Markov Decision Process and Learning under General Utility Functions

Reinforcement Learning (RL) has gained substantial attention across diverse application domains and theoretical investigations. Existing literature on RL theory largely focuses on risk-neutral settings where the decision-maker learns to…

Machine Learning · Computer Science 2024-12-24 Zhengqi Wu , Renyuan Xu

Preference Robustness for DPO with Applications to Public Health

We study an LLM fine-tuning task for designing reward functions for sequential resource allocation problems in public health, guided by human preferences expressed in natural language. This setting presents a challenging testbed for…

Machine Learning · Computer Science 2025-11-19 Cheol Woo Kim , Shresth Verma , Mauricio Tec , Milind Tambe

Robust utility maximization with nonlinear continuous semimartingales

In this paper we study a robust utility maximization problem in continuous time under model uncertainty. The model uncertainty is governed by a continuous semimartingale with uncertain local characteristics. Here, the differential…

Mathematical Finance · Quantitative Finance 2023-08-04 David Criens , Lars Niemann

Adaptive Preference Optimization with Uncertainty-aware Utility Anchor

Offline preference optimization methods are efficient for large language models (LLMs) alignment. Direct Preference optimization (DPO)-like learning, one of the most popular approaches, stands out for its efficiency in reward modeling.…

Machine Learning · Computer Science 2026-05-26 Xiaobo Wang , Zixia Jia , Jiaqi Li , Qi Liu , Zilong Zheng

Uncertainty-Aware Exploratory Direct Preference Optimization for Multimodal Large Language Models

Direct Preference Optimization (DPO) has proven to be an effective solution for mitigating hallucination in Multimodal Large Language Models (MLLMs) by learning from preference pairs. One of its key challenges lies in how to transfer the…

Machine Learning · Computer Science 2026-05-07 Huatian Zhang , Zhendong Mao , Lei Zhang , Yongdong Zhang

Robust utility maximization under model uncertainty via a penalization approach

This paper addresses the problem of utility maximization under uncertain parameters. In contrast with the classical approach, where the parameters of the model evolve freely within a given range, we constrain them via a penalty function. We…

Optimization and Control · Mathematics 2022-03-08 Ivan Guo , Nicolas Langrené , Grégoire Loeper , Wei Ning

Robust Adaptive Submodular Maximization

The goal of a sequential decision making problem is to design an interactive policy that adaptively selects a group of items, each selection is based on the feedback from the past, in order to maximize the expected utility of selected…

Data Structures and Algorithms · Computer Science 2022-09-13 Shaojie Tang

Preference Robust Ordinal Priority Approach with Preference Elicitation under Incomplete Information for Multi-Attribute Robust Ranking and Selection

Ordinal Priority Approach (OPA) has recently been proposed to determine the weights of experts, attributes, and alternatives using ordinal preference without precise information for multi-attribute ranking and selection (MARS). This study…

Optimization and Control · Mathematics 2025-06-10 Renlong Wang

Problem-Focused Incremental Elicitation of Multi-Attribute Utility Models

Decision theory has become widely accepted in the AI community as a useful framework for planning and decision making. Applying the framework typically requires elicitation of some form of probability and utility information. While much…

Artificial Intelligence · Computer Science 2013-02-08 Vu A. Ha , Peter Haddawy

Exponential utility maximization under model uncertainty for unbounded endowments

We consider the robust exponential utility maximization problem in discrete time: An investor maximizes the worst case expected exponential utility with respect to a family of nondominated probabilistic models of her endowment by…

Portfolio Management · Quantitative Finance 2019-02-12 Daniel Bartl

Robust Utility Maximizing Strategies under Model Uncertainty and their Convergence

In this paper we investigate a utility maximization problem with drift uncertainty in a multivariate continuous-time Black-Scholes type financial market which may be incomplete. We impose a constraint on the admissible strategies that…

Portfolio Management · Quantitative Finance 2021-11-04 Jörn Sass , Dorothee Westphal

A Safe Approximation Based on Mixed-Integer Optimization for Non-Convex Distributional Robustness Governed by Univariate Indicator Functions

In this work, we present an algorithmically tractable safe approximation of distributionally robust optimization (DRO) problems that contain univariate indicator functions. The latter appear in different applications, but render the model…

Optimization and Control · Mathematics 2026-01-22 Jana Dienstbier , Frauke Liers , Florian Rösel , Jan Rolfes