Related papers: Safe Sequential Optimization for Switching Environ…

An adaptive approach to Bayesian Optimization with switching costs

We investigate modifications to Bayesian Optimization for a resource-constrained setting of sequential experimental design where changes to certain design variables of the search space incur a switching cost. This models the scenario where…

Machine Learning · Computer Science 2024-05-16 Stefan Pricopie , Richard Allmendinger , Manuel Lopez-Ibanez , Clyde Fare , Matt Benatan , Joshua Knowles

Safe Time-Varying Optimization based on Gaussian Processes with Spatio-Temporal Kernel

Ensuring safety is a key aspect in sequential decision making problems, such as robotics or process control. The complexity of the underlying systems often makes finding the optimal decision challenging, especially when the safety-critical…

Machine Learning · Computer Science 2024-09-27 Jialin Li , Marta Zagorowska , Giulia De Pasquale , Alisa Rupenyan , John Lygeros

Change Acceleration and Detection

A novel sequential change detection problem is proposed, in which the goal is to not only detect but also accelerate the change. Specifically, it is assumed that the sequentially collected observations are responses to treatments selected…

Statistics Theory · Mathematics 2024-06-24 Yanglei Song , Georgios Fellouris

Bayesian Persuasion in Sequential Decision-Making

We study a dynamic model of Bayesian persuasion in sequential decision-making settings. An informed principal observes an external parameter of the world and advises an uninformed agent about actions to take over time. The agent takes…

Computer Science and Game Theory · Computer Science 2022-05-25 Jiarui Gan , Rupak Majumdar , Goran Radanovic , Adish Singla

Safe Option-Critic: Learning Safety in the Option-Critic Architecture

Designing hierarchical reinforcement learning algorithms that exhibit safe behaviour is not only vital for practical applications but also, facilitates a better understanding of an agent's decisions. We tackle this problem in the options…

Artificial Intelligence · Computer Science 2021-07-01 Arushi Jain , Khimya Khetarpal , Doina Precup

Adaptive sequential Monte Carlo for multiple changepoint analysis

Process monitoring and control requires detection of structural changes in a data stream in real time. This article introduces an efficient sequential Monte Carlo algorithm designed for learning unknown changepoints in continuous time. The…

Applications · Statistics 2015-09-29 Melissa J. M. Turcotte , Nicholas A. Heard

Quickest Search for a Change Point

This paper considers a sequence of random variables generated according to a common distribution. The distribution might undergo periods of transient changes at an unknown set of time instants, referred to as change-points. The objective is…

Information Theory · Computer Science 2018-04-26 Javad Heydari , Ali Tajer

Adaptive Uncertainty Resolution in Bayesian Combinatorial Optimization Problems

In several applications such as databases, planning, and sensor networks, parameters such as selectivity, load, or sensed values are known only with some associated uncertainty. The performance of such a system (as captured by some…

Data Structures and Algorithms · Computer Science 2010-01-28 Sudipto Guha , Kamesh Munagala

Active Anomaly Detection with Switching Cost

The problem of detecting a single anomalous process among multiple independent processes is considered. Under a constraint on the number of processes that can be probed simultaneously, the decision maker should decide which processes to…

Signal Processing · Electrical Eng. & Systems 2021-01-15 Fengfan Qin , Da Chen , Hui Feng , Qing Zhao , Tao Yang , Bo Hu

Optimal Switching Problems under Partial Information

In this paper we formulate and study an optimal switching problem under partial information. In our model the agent/manager/investor attempts to maximize the expected reward by switching between different states/investments. However, he is…

Optimization and Control · Mathematics 2014-03-10 Kai Li , Kaj Nyström , Marcus Olofsson

Optimal Learning for Sequential Decision Making for Expensive Cost Functions with Stochastic Binary Feedbacks

We consider the problem of sequentially making decisions that are rewarded by "successes" and "failures" which can be predicted through an unknown relationship that depends on a partially controllable vector of attributes for each instance.…

Machine Learning · Statistics 2017-09-18 Yingfei Wang , Chu Wang , Warren Powell

Stochastic Choice and Optimal Sequential Sampling

We model the joint distribution of choice probabilities and decision times in binary choice tasks as the solution to a problem of optimal sequential sampling, where the agent is uncertain of the utility of each action and pays a constant…

Neurons and Cognition · Quantitative Biology 2015-05-14 Drew Fudenberg , Philipp Strack , Tomasz Strzalecki

A Scheme for Dynamic Risk-Sensitive Sequential Decision Making

We present a scheme for sequential decision making with a risk-sensitive objective and constraints in a dynamic environment. A neural network is trained as an approximator of the mapping from parameter space to space of risk and policy with…

Artificial Intelligence · Computer Science 2019-07-10 Shuai Ma , Jia Yuan Yu , Ahmet Satir

A precision of the sequential change point detection

A random sequence having two segments being the homogeneous Markov processes is registered. Each segment has his own transition probability law and the length of the segment is unknown and random. The transition probabilities of each…

Statistics Theory · Mathematics 2020-11-17 A. Ochman-Gozdek , W. Sarnowski , K. J. Szajowski

Towards safe control parameter tuning in distributed multi-agent systems

Many safety-critical real-world problems, such as autonomous driving and collaborative robots, are of a distributed multi-agent nature. To optimize the performance of these systems while ensuring safety, we can cast them as distributed…

Systems and Control · Electrical Eng. & Systems 2025-08-20 Abdullah Tokmak , Thomas B. Schön , Dominik Baumann

Adaptive Decision-Making with Constraints and Dependent Losses: Performance Guarantees and Applications to Online and Nonlinear Identification

We consider adaptive decision-making problems where an agent optimizes a cumulative performance objective by repeatedly choosing among a finite set of options. Compared to the classical prediction-with-expert-advice set-up, we consider…

Machine Learning · Computer Science 2023-04-10 Michael Muehlebach

Variational Policy for Guiding Point Processes

Temporal point processes have been widely applied to model event sequence data generated by online users. In this paper, we consider the problem of how to design the optimal control policy for point processes, such that the stochastic…

Machine Learning · Computer Science 2017-11-13 Yichen Wang , Grady Williams , Evangelos Theodorou , Le Song

An Observer-based Switching Algorithm for Safety under Sensor Denial-of-Service Attacks

The design of safe-critical control algorithms for systems under Denial-of-Service (DoS) attacks on the system output is studied in this work. We aim to address scenarios where attack-mitigation approaches are not feasible, and the system…

Systems and Control · Electrical Eng. & Systems 2023-11-14 Santiago Jimenez Leudo , Kunal Garg , Ricardo G. Sanfelice , Alvaro A. Cardenas

Learning to Trust: Bayesian Adaptation to Varying Suggester Reliability in Sequential Decision Making

Autonomous agents operating in sequential decision-making tasks under uncertainty can benefit from external action suggestions, which provide valuable guidance but inherently vary in reliability. Existing methods for incorporating such…

Artificial Intelligence · Computer Science 2026-05-26 Dylan M. Asmar , Mykel J. Kochenderfer

Stagewise Safe Bayesian Optimization with Gaussian Processes

Enforcing safety is a key aspect of many problems pertaining to sequential decision making under uncertainty, which require the decisions made at every step to be both informative of the optimal decision and also safe. For example, we value…

Machine Learning · Computer Science 2020-01-28 Yanan Sui , Vincent Zhuang , Joel W. Burdick , Yisong Yue