Related papers: Sequential Selection with Expirations

Globally-Optimal Greedy Experiment Selection for Active Sequential Estimation

Motivated by modern applications such as computerized adaptive testing, sequential rank aggregation, and heterogeneous data source selection, we study the problem of active sequential estimation, which involves adaptively selecting…

Statistics Theory · Mathematics 2024-02-14 Xiaoou Li, Hongru Zhao

Sequential Resource Access: Theory and Algorithm

We formulate and analyze a generic sequential resource access problem arising in a variety of engineering fields, where a user disposes a number of heterogeneous computing, communication, or storage resources, each characterized by the…

Networking and Internet Architecture · Computer Science 2020-12-08 Lin Chen, Anastasios Giovanidis, Wei Wang, Lin Shan

The (Surprising) Sample Optimality of Greedy Procedures for Large-Scale Ranking and Selection

Ranking and selection (R&S) aims to select the best alternative with the largest mean performance from a finite set of alternatives. Recently, considerable attention has turned towards the large-scale R&S problem which involves a large…

Methodology · Statistics 2025-09-09 Zaile Li, Weiwei Fan, L. Jeff Hong

Time-Varying Parameters in Sequential Decision Making Problems

In this paper we address the class of Sequential Decision Making (SDM) problems that are characterized by time-varying parameters. These parameter dynamics are either pre-specified or manipulable. At any given time instant the decision…

Optimization and Control · Mathematics 2022-01-26 Amber Srivastava, S. M. Salapaka

Online Stochastic Allocation of Reusable Resources

We study a multi-objective model on the allocation of reusable resources under model uncertainty. Heterogeneous customers arrive sequentially according to a latent stochastic process, request for certain amounts of resources, and occupy…

Optimization and Control · Mathematics 2023-08-02 Xilin Zhang, Wang Chi Cheung

Opportunistic Qualitative Planning in Stochastic Systems with Incomplete Preferences over Reachability Objectives

Preferences play a key role in determining what goals/constraints to satisfy when not all constraints can be satisfied simultaneously. In this paper, we study how to synthesize preference satisfying plans in stochastic systems, modeled as…

Artificial Intelligence · Computer Science 2022-10-06 Abhishek N. Kulkarni, Jie Fu

Optimal Learning for Sequential Decision Making for Expensive Cost Functions with Stochastic Binary Feedbacks

We consider the problem of sequentially making decisions that are rewarded by "successes" and "failures" which can be predicted through an unknown relationship that depends on a partially controllable vector of attributes for each instance.…

Machine Learning · Statistics 2017-09-18 Yingfei Wang, Chu Wang, Warren Powell

Geometry and Determinism of Optimal Stationary Control in Partially Observable Markov Decision Processes

It is well known that for any finite state Markov decision process (MDP) there is a memoryless deterministic policy that maximizes the expected reward. For partially observable Markov decision processes (POMDPs), optimal memoryless policies…

Optimization and Control · Mathematics 2016-02-16 Guido Montufar, Keyan Ghazi-Zahedi, Nihat Ay

Stochastic Choice and Optimal Sequential Sampling

We model the joint distribution of choice probabilities and decision times in binary choice tasks as the solution to a problem of optimal sequential sampling, where the agent is uncertain of the utility of each action and pays a constant…

Neurons and Cognition · Quantitative Biology 2015-05-14 Drew Fudenberg, Philipp Strack, Tomasz Strzalecki

Distributed Submodular Maximization with Parallel Execution

The submodular maximization problem is widely applicable in many engineering problems where objectives exhibit diminishing returns. While this problem is known to be NP-hard for certain subclasses of objective functions, there is a greedy…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-07-01 Haoyuan Sun, David Grimsman, Jason R Marden

Greedy Selection under Independent Increments: A Toy Model Analysis

We study an iterative selection problem over N i.i.d. discrete-time stochastic processes with independent increments. At each stage, a fixed number of processes are retained based on their observed values. Under this simple model, we prove…

Probability · Mathematics 2025-06-24 Huitao Yang

Non-Parametric Stochastic Sequential Assignment With Random Arrival Times

We consider a problem wherein jobs arrive at random times and assume random values. Upon each job arrival, the decision-maker must decide immediately whether or not to accept the job and gain the value on offer as a reward, with the…

Artificial Intelligence · Computer Science 2022-02-03 Danial Dervovic, Parisa Hassanzadeh, Samuel Assefa, Prashant Reddy

Finite-time Analysis for the Knowledge-Gradient Policy

We consider sequential decision problems in which we adaptively choose one of finitely many alternatives and observe a stochastic reward. We offer a new perspective of interpreting Bayesian ranking and selection problems as adaptive…

Machine Learning · Computer Science 2016-06-16 Yingfei Wang, Warren Powell

Learning From Scenarios for Stochastic Repairable Scheduling

When optimizing problems with uncertain parameter values in a linear objective, decision-focused learning enables end-to-end learning of these values. We are interested in a stochastic scheduling problem, in which processing times are…

Machine Learning · Computer Science 2024-08-16 Kim van den Houten, David M. J. Tax, Esteban Freydell, Mathijs de Weerdt

Efficient Sampling Policy for Selecting a Good Enough Subset

The note studies the problem of selecting a good enough subset out of a finite number of alternatives under a fixed simulation budget. Our work aims to maximize the posterior probability of correctly selecting a good subset. We formulate…

Optimization and Control · Mathematics 2023-05-09 Gongbo Zhang, Bin Chen, Qing-shan Jia, Yijie Peng

Preference-Based Planning in Stochastic Environments: From Partially-Ordered Temporal Goals to Most Preferred Policies

Human preferences are not always represented via complete linear orders: It is natural to employ partially-ordered preferences for expressing incomparable outcomes. In this work, we consider decision-making and probabilistic planning in…

Robotics · Computer Science 2024-10-21 Hazhar Rahmani, Abhishek N. Kulkarni, Jie Fu

A Farewell to Arms: Sequential Reward Maximization on a Budget with a Giving Up Option

We consider a sequential decision-making problem where an agent can take one action at a time and each action has a stochastic temporal extent, i.e., a new action cannot be taken until the previous one is finished. Upon completion, the…

Machine Learning · Computer Science 2020-03-26 P Sharoff, Nishant A. Mehta, Ravi Ganti

Minimizing Completion Times of Stochastic Jobs on Parallel Machines is Hard

This paper considers the scheduling of stochastic jobs on parallel identical machines to minimize the expected total weighted completion time. While this is a classical problem with a significant body of research on approximation algorithms…

Data Structures and Algorithms · Computer Science 2026-01-27 Benjamin Moseley, Kirk Pruhs, Marc Uetz, Rudy Zhou

Optimal Exploitation of a Resource with Stochastic Population Dynamics and Delayed Renewal

In this work, we study the optimization problem of a renewable resource in finite time. The resource is assumed to evolve according to a logistic stochastic differential equation. The manager may harvest partially the resource at any time…

Optimization and Control · Mathematics 2018-07-12 Thomas Lim, Idris Kharroubi, Vathana Ly-Vath

Probably Approximately Correct MDP Learning and Control With Temporal Logic Constraints

We consider synthesis of control policies that maximize the probability of satisfying given temporal logic specifications in unknown, stochastic environments. We model the interaction between the system and its environment as a Markov…

Systems and Control · Computer Science 2014-05-01 Jie Fu, Ufuk Topcu