Related papers: State-Centric Decision Process

A method for the online construction of the set of states of a Markov Decision Process using Answer Set Programming

Non-stationary domains, that change in unpredicted ways, are a challenge for agents searching for optimal policies in sequential decision-making problems. This paper presents a combination of Markov Decision Processes (MDP) with Answer Set…

Artificial Intelligence · Computer Science 2017-06-06 Leonardo A. Ferreira , Reinaldo A. C. Bianchi , Paulo E. Santos , Ramon Lopez de Mantaras

MDPs with a State Sensing Cost

In many practical sequential decision-making problems, tracking the state of the environment incurs a sensing/communication/computation cost. In these settings, the agent's interaction with its environment includes the additional component…

Machine Learning · Computer Science 2026-04-16 Vansh Kapoor , Jayakrishnan Nair

A Methodology for Selecting and Composing Runtime Architecture Patterns for Production LLM Agents

Production LLM agents combine stochastic model outputs with deterministic software systems, yet the boundary between the two is rarely treated as a first-class architectural object. This paper names that boundary the…

Artificial Intelligence · Computer Science 2026-05-20 Vasundra Srinivasan

Constructive Decision Theory

In most contemporary approaches to decision making, a decision problem is described by a sets of states and set of outcomes, and a rich set of acts, which are functions from states to outcomes over which the decision maker (DM) has…

Computer Science and Game Theory · Computer Science 2021-09-07 Lawrence Blume , David Easley , Joseph Y. Halpern

Actions Speak Louder Than Words: Rate-Reward Trade-off in Markov Decision Processes

The impact of communication on decision-making systems has been extensively studied under the assumption of dedicated communication channels. We instead consider communicating through actions, where the message is embedded into the actions…

Information Theory · Computer Science 2025-09-05 Haotian Wu , Gongpu Chen , Deniz Gündüz

Learning Algorithms for Verification of Markov Decision Processes

We present a general framework for applying learning algorithms and heuristical guidance to the verification of Markov decision processes (MDPs). The primary goal of our techniques is to improve performance by avoiding an exhaustive…

Systems and Control · Electrical Eng. & Systems 2025-04-02 Tomáš Brázdil , Krishnendu Chatterjee , Martin Chmelik , Vojtěch Forejt , Jan Křetínský , Marta Kwiatkowska , Tobias Meggendorfer , David Parker , Mateusz Ujma

Make Your Decision Convincing! A Unified Two-Stage Framework: Self-Attribution and Decision-Making

Explaining black-box model behavior with natural language has achieved impressive results in various NLP tasks. Recent research has explored the utilization of subsequences from the input text as a rationale, providing users with evidence…

Computation and Language · Computer Science 2023-10-23 Yanrui Du , Sendong Zhao , Haochun Wang , Yuhan Chen , Rui Bai , Zewen Qiang , Muzhen Cai , Bing Qin

Real-Time Reinforcement Learning

Markov Decision Processes (MDPs), the mathematical framework underlying most algorithms in Reinforcement Learning (RL), are often used in a way that wrongfully assumes that the state of an agent's environment does not change during action…

Machine Learning · Computer Science 2019-12-13 Simon Ramstedt , Christopher Pal

The Context Gathering Decision Process: A POMDP Framework for Agentic Search

Large Language Model (LLM) agents are deployed in complex environments -- such as massive codebases, enterprise databases, and conversational histories -- where the relevant state far exceeds their context windows. To navigate these spaces,…

Artificial Intelligence · Computer Science 2026-05-11 Chinmaya Kausik , Adith Swaminathan , Nathan Kallus

Acting in Delayed Environments with Non-Stationary Markov Policies

The standard Markov Decision Process (MDP) formulation hinges on the assumption that an action is executed immediately after it was chosen. However, assuming it is often unrealistic and can lead to catastrophic failures in applications such…

Machine Learning · Computer Science 2023-12-14 Esther Derman , Gal Dalal , Shie Mannor

Learning The Minimum Action Distance

This paper presents a state representation framework for Markov decision processes (MDPs) that can be learned solely from state trajectories, requiring neither reward signals nor the actions executed by the agent. We propose learning the…

Machine Learning · Computer Science 2026-03-25 Lorenzo Steccanella , Joshua B. Evans , Özgür Şimşek , Anders Jonsson

Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection

Non-stationary environments are challenging for reinforcement learning algorithms. If the state transition and/or reward functions change based on latent factors, the agent is effectively tasked with optimizing a behavior that maximizes…

Machine Learning · Computer Science 2021-05-21 Lucas N. Alegre , Ana L. C. Bazzan , Bruno C. da Silva

A Framework of Explanation Generation toward Reliable Autonomous Robots

To realize autonomous collaborative robots, it is important to increase the trust that users have in them. Toward this goal, this paper proposes an algorithm which endows an autonomous agent with the ability to explain the transition from…

Artificial Intelligence · Computer Science 2021-05-07 Tatsuya Sakai , Kazuki Miyazawa , Takato Horii , Takayuki Nagai

State-Inference-Based Prompting for Natural Language Trading with Game NPCs

Large Language Models enable dynamic game interactions but struggle with rule-governed trading systems. Current implementations suffer from rule violations, such as item hallucinations and calculation errors, that erode player trust. Here,…

Artificial Intelligence · Computer Science 2025-07-11 Minkyung Kim , Junsik Kim , Hwidong Bae , Woongcheol Yang , Sangdon Park , Sohee Bae

Planning and Acting under Uncertainty: A New Model for Spoken Dialogue Systems

Uncertainty plays a central role in spoken dialogue systems. Some stochastic models like Markov decision process (MDP) are used to model the dialogue manager. But the partially observable system state and user intention hinder the natural…

Artificial Intelligence · Computer Science 2013-01-14 Bo Zhang , Qingsheng Cai , Jianfeng Mao , Baining Guo

A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP

Despite rapid progress in AI agents for enterprise automation and decision-making, their real-world deployment and further performance gains remain constrained by limited data quality and quantity, complex real-world reasoning demands,…

Artificial Intelligence · Computer Science 2026-03-24 Xi Yang , Aurelie Lozano , Naoki Abe , Bhavya , Saurabh Jha , Noah Zheutlin , Rohan R. Arora , Yu Deng , Daby M. Sow

State Action Separable Reinforcement Learning

Reinforcement Learning (RL) based methods have seen their paramount successes in solving serial decision-making and control problems in recent years. For conventional RL formulations, Markov Decision Process (MDP) and state-action-value…

Machine Learning · Computer Science 2020-06-09 Ziyao Zhang , Liang Ma , Kin K. Leung , Konstantinos Poularakis , Mudhakar Srivatsa

Learning and Reasoning for Robot Sequential Decision Making under Uncertainty

Robots frequently face complex tasks that require more than one action, where sequential decision-making (SDM) capabilities become necessary. The key contribution of this work is a robot SDM framework, called LCORPP, that supports the…

Artificial Intelligence · Computer Science 2019-12-11 Saeid Amiri , Mohammad Shokrolah Shirazi , Shiqi Zhang

Expert-Guided Symmetry Detection in Markov Decision Processes

Learning a Markov Decision Process (MDP) from a fixed batch of trajectories is a non-trivial task whose outcome's quality depends on both the amount and the diversity of the sampled regions of the state-action space. Yet, many MDPs are…

Machine Learning · Computer Science 2022-03-08 Giorgio Angelotti , Nicolas Drougard , Caroline P. C. Chanel

PODDP: Partially Observable Differential Dynamic Programming for Latent Belief Space Planning

Autonomous agents are limited in their ability to observe the world state. Partially observable Markov decision processes (POMDPs) formally model the problem of planning under world state uncertainty, but POMDPs with continuous actions and…

Robotics · Computer Science 2020-07-08 Dicong Qiu , Yibiao Zhao , Chris L. Baker