Related papers: Sensor Scheduling for Optimal Observability Using …

What should be observed for optimal reward in POMDPs?

Partially observable Markov Decision Processes (POMDPs) are a standard model for agents making decisions in uncertain environments. Most work on POMDPs focuses on synthesizing strategies based on the available capabilities. However, system…

Artificial Intelligence · Computer Science 2024-07-12 Alyzia-Maria Konsta , Alberto Lluch Lafuente , Christoph Matheja

Unpredictable Planning Under Partial Observability

We study the problem of synthesizing a controller that maximizes the entropy of a partially observable Markov decision process (POMDP) subject to a constraint on the expected total reward. Such a controller minimizes the predictability of a…

Optimization and Control · Mathematics 2019-09-16 Michael Hibbard , Yagiz Savas , Bo Wu , Takashi Tanaka , Ufuk Topcu

Partially Observable Markov Decision Process Framework for Operating Condition Optimization Using Real-Time Degradation Signals

In many engineering systems, proper predictive maintenance and operational control are essential to increase efficiency and reliability while reducing maintenance costs. However, one of the major challenges is that many sensors are used for…

Applications · Statistics 2025-12-09 Boyang Xu , Yunyi Kang , Xinyu Zhao , Hao Yan , Feng Ju

Optimal Scheduling of Multiple Sensors with Packet Length Constraint

This paper considers the problem of sensory data scheduling of multiple processes. There are $n$ independent linear time-invariant processes and a remote estimator monitoring all the processes. Each process is measured by a sensor, which…

Systems and Control · Computer Science 2017-03-28 Shuang Wu , Xiaoqiang Ren , Subhrakanti Dey , Ling Shi

Entropy-Regularized Partially Observed Markov Decision Processes

We investigate partially observed Markov decision processes (POMDPs) with cost functions regularized by entropy terms describing state, observation, and control uncertainty. Standard POMDP techniques are shown to offer bounded-error…

Systems and Control · Electrical Eng. & Systems 2023-05-10 Timothy L. Molloy , Girish N. Nair

MDPs with a State Sensing Cost

In many practical sequential decision-making problems, tracking the state of the environment incurs a sensing/communication/computation cost. In these settings, the agent's interaction with its environment includes the additional component…

Machine Learning · Computer Science 2026-04-16 Vansh Kapoor , Jayakrishnan Nair

Entropy Maximization for Partially Observable Markov Decision Processes

We study the problem of synthesizing a controller that maximizes the entropy of a partially observable Markov decision process (POMDP) subject to a constraint on the expected total reward. Such a controller minimizes the predictability of…

Optimization and Control · Mathematics 2021-05-18 Yagiz Savas , Michael Hibbard , Bo Wu , Takashi Tanaka , Ufuk Topcu

Another Look at Partially Observed Optimal Stochastic Control: Existence, Ergodicity, and Approximations without Belief-Reduction

We present an alternative view for the study of optimal control of partially observed Markov Decision Processes (POMDPs). We first revisit the traditional (and by now standard) separated-design method of reducing the problem to fully…

Optimization and Control · Mathematics 2024-12-20 Serdar Yüksel

Scaling Observation-aware Planning in Uncertain Domains

Deciding which sensing capabilities to deploy on an agent in uncertain domains is a fundamental engineering challenge, in which one balances task achievability against the high costs of hardware and processing. This problem has previously…

Artificial Intelligence · Computer Science 2026-05-22 Adrian Zvizdenco , Arthur Conrado Veiga Bosquetti , Alberto Lluch Lafuente , Christoph Matheja

Optimal Sensor and Actuator Selection for Factored Markov Decision Processes: Complexity, Approximability and Algorithms

Factored Markov Decision Processes (fMDPs) are a class of Markov Decision Processes (MDPs) in which the states (and actions) can be factored into a set of state (and action) variables and can be encoded compactly using a factored…

Systems and Control · Electrical Eng. & Systems 2025-02-26 Jayanth Bhargav , Mahsa Ghasemi , Shreyas Sundaram

Optimal Policies Search for Sensor Management

This paper introduces a new approach to solve sensor management problems. Classically sensor management problems can be well formalized as Partially-Observed Markov Decision Processes (POMPD). The original approach developped here consists…

Machine Learning · Computer Science 2009-03-20 Thomas Bréhard , Emmanuel Duflos , Philippe Vanheeghe , Pierre-Arnaud Coquelin

Sensor Synthesis for POMDPs with Reachability Objectives

Partially observable Markov decision processes (POMDPs) are widely used in probabilistic planning problems in which an agent interacts with an environment using noisy and imprecise sensors. We study a setting in which the sensors are only…

Artificial Intelligence · Computer Science 2017-10-03 Krishnendu Chatterjee , Martin Chmelik , Ufuk Topcu

Planning in Observable POMDPs in Quasipolynomial Time

Partially Observable Markov Decision Processes (POMDPs) are a natural and general model in reinforcement learning that take into account the agent's uncertainty about its current state. In the literature on POMDPs, it is customary to assume…

Machine Learning · Computer Science 2022-03-24 Noah Golowich , Ankur Moitra , Dhruv Rohatgi

OCMDP: Observation-Constrained Markov Decision Process

In many practical applications, decision-making processes must balance the costs of acquiring information with the benefits it provides. Traditional control systems often assume full observability, an unrealistic assumption when…

Artificial Intelligence · Computer Science 2025-01-24 Taiyi Wang , Jianheng Liu , Bryan Lee , Zhihao Wu , Yu Wu

Myopic Policy Bounds for Information Acquisition POMDPs

This paper addresses the problem of optimal control of robotic sensing systems aimed at autonomous information gathering in scenarios such as environmental monitoring, search and rescue, and surveillance and reconnaissance. The information…

Systems and Control · Computer Science 2016-01-28 Mikko Lauri , Nikolay Atanasov , George J. Pappas , Risto Ritala

Structural Estimation of Partially Observable Markov Decision Processes

In many practical settings control decisions must be made under partial/imperfect information about the evolution of a relevant state variable. Partially Observable Markov Decision Processes (POMDPs) is a relatively well-developed framework…

Machine Learning · Computer Science 2021-12-30 Yanling Chang , Alfredo Garcia , Zhide Wang , Lu Sun

Smoother Entropy for Active State Trajectory Estimation and Obfuscation in POMDPs

We study the problem of controlling a partially observed Markov decision process (POMDP) to either aid or hinder the estimation of its state trajectory. We encode the estimation objectives via the smoother entropy, which is the conditional…

Systems and Control · Electrical Eng. & Systems 2023-05-10 Timothy L. Molloy , Girish N. Nair

Sensor Scheduling for Energy-Efficient Target Tracking in Sensor Networks

In this paper we study the problem of tracking an object moving randomly through a network of wireless sensors. Our objective is to devise strategies for scheduling the sensors to optimize the tradeoff between tracking performance and…

Multiagent Systems · Computer Science 2013-02-07 George K. Atia , Venugopal V. Veeravalli , Jason A. Fuemmeler

Integrated Control and Active Perception in POMDPs for Temporal Logic Tasks and Information Acquisition

This paper studies the synthesis of a joint control and active perception policy for a stochastic system modeled as a partially observable Markov decision process (POMDP), subject to temporal logic specifications. The POMDP actions…

Systems and Control · Electrical Eng. & Systems 2025-04-21 Chongyang Shi , Michael R. Dorothy , Jie Fu

Cost-Bounded Active Classification Using Partially Observable Markov Decision Processes

Active classification, i.e., the sequential decision-making process aimed at data acquisition for classification purposes, arises naturally in many applications, including medical diagnosis, intrusion detection, and object tracking. In this…

Systems and Control · Computer Science 2018-10-02 Bo Wu , Mohamadreza Ahmadi , Suda Bharadwaj , Ufuk Topcu