Related papers: Multialternative Neural Decision Processes

Algorithmic Decision Processes

We develop a full-fledged analysis of an algorithmic decision process that, in a multialternative choice problem, produces computable choice probabilities and expected decision times.

Theoretical Economics · Economics 2023-05-08 Carlo Baldassi , Fabio Maccheroni , Massimo Marinacci , Marco Pirazzini

Cautious Learning of Multiattribute Preferences

This paper is dedicated to a cautious learning methodology for predicting preferences between alternatives characterized by binary attributes (formally, each alternative is seen as a subset of attributes). By "cautious", we mean that the…

Artificial Intelligence · Computer Science 2022-06-16 Hugo Gilbert , Mohamed Ouaguenouni , Meltem Ozturk , Olivier Spanjaard

Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions

We study the problem of learning Markov decision processes with finite state and action spaces when the transition probability distributions and loss functions are chosen adversarially and are allowed to change with time. We introduce an…

Machine Learning · Computer Science 2013-03-14 Yasin Abbasi-Yadkori , Peter L. Bartlett , Csaba Szepesvari

Learning Algorithms for Verification of Markov Decision Processes

We present a general framework for applying learning algorithms and heuristical guidance to the verification of Markov decision processes (MDPs). The primary goal of our techniques is to improve performance by avoiding an exhaustive…

Systems and Control · Electrical Eng. & Systems 2025-04-02 Tomáš Brázdil , Krishnendu Chatterjee , Martin Chmelik , Vojtěch Forejt , Jan Křetínský , Marta Kwiatkowska , Tobias Meggendorfer , David Parker , Mateusz Ujma

Conformal testing: binary case with Markov alternatives

We continue study of conformal testing in binary model situations. In this note we consider Markov alternatives to the null hypothesis of exchangeability. We propose two new classes of conformal test martingales; one class is statistically…

Statistics Theory · Mathematics 2021-11-04 Vladimir Vovk , Ilia Nouretdinov , Alex Gammerman

PCMC-Net: Feature-based Pairwise Choice Markov Chains

Pairwise Choice Markov Chains (PCMC) have been recently introduced to overcome limitations of choice models based on traditional axioms unable to express empirical observations from modern behavior economics like context effects occurring…

Machine Learning · Computer Science 2020-02-03 Alix Lhéritier

Efficient and Safe Exploration in Deterministic Markov Decision Processes with Unknown Transition Models

We propose a safe exploration algorithm for deterministic Markov Decision Processes with unknown transition models. Our algorithm guarantees safety by leveraging Lipschitz-continuity to ensure that no unsafe states are visited during…

Robotics · Computer Science 2020-06-05 Erdem Bıyık , Jonathan Margoliash , Shahrouz Ryan Alimo , Dorsa Sadigh

Preference Exploration for Efficient Bayesian Optimization with Multiple Outcomes

We consider Bayesian optimization of expensive-to-evaluate experiments that generate vector-valued outcomes over which a decision-maker (DM) has preferences. These preferences are encoded by a utility function that is not known in closed…

Machine Learning · Computer Science 2022-03-23 Zhiyuan Jerry Lin , Raul Astudillo , Peter I. Frazier , Eytan Bakshy

The Knowledge Gradient with Logistic Belief Models for Binary Classification

We consider sequential decision making problems for binary classification scenario in which the learner takes an active role in repeatedly selecting samples from the action pool and receives the binary label of the selected alternatives.…

Machine Learning · Statistics 2015-10-09 Yingfei Wang , Chu Wang , Warren Powell

Scoring from Pairwise Winning Indices

The pairwise winning indices, computed in the Stochastic Multicriteria Acceptability Analysis, give the probability with which an alternative is preferred to another taking into account all the instances of the assumed preference model…

Optimization and Control · Mathematics 2022-03-29 Sally Giuseppe Arcidiacono , Salvatore Corrente , Salvatore Greco

From Preference-Based to Multiobjective Sequential Decision-Making

In this paper, we present a link between preference-based and multiobjective sequential decision-making. While transforming a multiobjective problem to a preference-based one is quite natural, the other direction is a bit less obvious. We…

Artificial Intelligence · Computer Science 2017-01-04 Paul Weng

Markov Automata with Multiple Objectives

Markov automata combine non-determinism, probabilistic branching, and exponentially distributed delays. This compositional variant of continuous-time Markov decision processes is used in reliability engineering, performance evaluation and…

Logic in Computer Science · Computer Science 2017-05-11 Tim Quatmann , Sebastian Junges , Joost-Pieter Katoen

Bayesian preference elicitation for multiobjective combinatorial optimization

We introduce a new incremental preference elicitation procedure able to deal with noisy responses of a Decision Maker (DM). The originality of the contribution is to propose a Bayesian approach for determining a preferred solution in a…

Artificial Intelligence · Computer Science 2020-07-30 Nadjet Bourdache , Patrice Perny , Olivier Spanjaard

Markov Stochastic Choice

We examine the effect of item arrangement on choices using a novel decision-making model based on the Markovian exploration of choice sets. This model is inspired by experimental evidence suggesting that the decision-making process involves…

Theoretical Economics · Economics 2024-10-30 Kremena Valkanova

Nontransitive Preferences and Stochastic Rationalizability: A Behavioral Equivalence

Nontransitive choices have long been an area of curiosity within economics. However, determining whether nontransitive choices represent an individual's preference is a difficult task since choice data is inherently stochastic. This paper…

Theoretical Economics · Economics 2023-05-01 Mogens Fosgerau , John Rehbeck

Online Markov decision processes with policy iteration

The online Markov decision process (MDP) is a generalization of the classical Markov decision process that incorporates changing reward functions. In this paper, we propose practical online MDP algorithms with policy iteration and…

Machine Learning · Computer Science 2015-10-16 Yao Ma , Hao Zhang , Masashi Sugiyama

Bayesian Active Learning for Classification and Preference Learning

Information theoretic active learning has been widely studied for probabilistic models. For simple regression an optimal myopic policy is easily tractable. However, for other tasks and with more complex models, such as classification with…

Machine Learning · Statistics 2011-12-30 Neil Houlsby , Ferenc Huszár , Zoubin Ghahramani , Máté Lengyel

Active Algorithms For Preference Learning Problems with Multiple Populations

In this paper we model the problem of learning preferences of a population as an active learning problem. We propose an algorithm can adaptively choose pairs of items to show to users coming from a heterogeneous population, and use the…

Machine Learning · Statistics 2016-06-23 Aniruddha Bhargava , Ravi Ganti , Robert Nowak

Efficient Preference-Based Reinforcement Learning: Randomized Exploration Meets Experimental Design

We study reinforcement learning from human feedback in general Markov decision processes, where agents learn from trajectory-level preference comparisons. A central challenge in this setting is to design algorithms that select informative…

Machine Learning · Computer Science 2025-12-05 Andreas Schlaginhaufen , Reda Ouhamma , Maryam Kamgarpour

Reasons and Means to Model Preferences as Incomplete

Literature involving preferences of artificial agents or human beings often assume their preferences can be represented using a complete transitive binary relation. Much has been written however on different models of preferences. We review…

Artificial Intelligence · Computer Science 2018-01-17 Olivier Cailloux , Sébastien Destercke