English
Related papers

Related papers: State-Centric Decision Process

200 papers

Non-stationary domains, that change in unpredicted ways, are a challenge for agents searching for optimal policies in sequential decision-making problems. This paper presents a combination of Markov Decision Processes (MDP) with Answer Set…

Artificial Intelligence · Computer Science 2017-06-06 Leonardo A. Ferreira , Reinaldo A. C. Bianchi , Paulo E. Santos , Ramon Lopez de Mantaras

In many practical sequential decision-making problems, tracking the state of the environment incurs a sensing/communication/computation cost. In these settings, the agent's interaction with its environment includes the additional component…

Machine Learning · Computer Science 2026-04-16 Vansh Kapoor , Jayakrishnan Nair

Production LLM agents combine stochastic model outputs with deterministic software systems, yet the boundary between the two is rarely treated as a first-class architectural object. This paper names that boundary the…

Artificial Intelligence · Computer Science 2026-05-20 Vasundra Srinivasan

In most contemporary approaches to decision making, a decision problem is described by a sets of states and set of outcomes, and a rich set of acts, which are functions from states to outcomes over which the decision maker (DM) has…

Computer Science and Game Theory · Computer Science 2021-09-07 Lawrence Blume , David Easley , Joseph Y. Halpern

The impact of communication on decision-making systems has been extensively studied under the assumption of dedicated communication channels. We instead consider communicating through actions, where the message is embedded into the actions…

Information Theory · Computer Science 2025-09-05 Haotian Wu , Gongpu Chen , Deniz Gündüz

We present a general framework for applying learning algorithms and heuristical guidance to the verification of Markov decision processes (MDPs). The primary goal of our techniques is to improve performance by avoiding an exhaustive…

Explaining black-box model behavior with natural language has achieved impressive results in various NLP tasks. Recent research has explored the utilization of subsequences from the input text as a rationale, providing users with evidence…

Computation and Language · Computer Science 2023-10-23 Yanrui Du , Sendong Zhao , Haochun Wang , Yuhan Chen , Rui Bai , Zewen Qiang , Muzhen Cai , Bing Qin

Markov Decision Processes (MDPs), the mathematical framework underlying most algorithms in Reinforcement Learning (RL), are often used in a way that wrongfully assumes that the state of an agent's environment does not change during action…

Machine Learning · Computer Science 2019-12-13 Simon Ramstedt , Christopher Pal

Large Language Model (LLM) agents are deployed in complex environments -- such as massive codebases, enterprise databases, and conversational histories -- where the relevant state far exceeds their context windows. To navigate these spaces,…

Artificial Intelligence · Computer Science 2026-05-11 Chinmaya Kausik , Adith Swaminathan , Nathan Kallus

The standard Markov Decision Process (MDP) formulation hinges on the assumption that an action is executed immediately after it was chosen. However, assuming it is often unrealistic and can lead to catastrophic failures in applications such…

Machine Learning · Computer Science 2023-12-14 Esther Derman , Gal Dalal , Shie Mannor

This paper presents a state representation framework for Markov decision processes (MDPs) that can be learned solely from state trajectories, requiring neither reward signals nor the actions executed by the agent. We propose learning the…

Machine Learning · Computer Science 2026-03-25 Lorenzo Steccanella , Joshua B. Evans , Özgür Şimşek , Anders Jonsson

Non-stationary environments are challenging for reinforcement learning algorithms. If the state transition and/or reward functions change based on latent factors, the agent is effectively tasked with optimizing a behavior that maximizes…

Machine Learning · Computer Science 2021-05-21 Lucas N. Alegre , Ana L. C. Bazzan , Bruno C. da Silva

To realize autonomous collaborative robots, it is important to increase the trust that users have in them. Toward this goal, this paper proposes an algorithm which endows an autonomous agent with the ability to explain the transition from…

Artificial Intelligence · Computer Science 2021-05-07 Tatsuya Sakai , Kazuki Miyazawa , Takato Horii , Takayuki Nagai

Large Language Models enable dynamic game interactions but struggle with rule-governed trading systems. Current implementations suffer from rule violations, such as item hallucinations and calculation errors, that erode player trust. Here,…

Artificial Intelligence · Computer Science 2025-07-11 Minkyung Kim , Junsik Kim , Hwidong Bae , Woongcheol Yang , Sangdon Park , Sohee Bae

Uncertainty plays a central role in spoken dialogue systems. Some stochastic models like Markov decision process (MDP) are used to model the dialogue manager. But the partially observable system state and user intention hinder the natural…

Artificial Intelligence · Computer Science 2013-01-14 Bo Zhang , Qingsheng Cai , Jianfeng Mao , Baining Guo

Despite rapid progress in AI agents for enterprise automation and decision-making, their real-world deployment and further performance gains remain constrained by limited data quality and quantity, complex real-world reasoning demands,…

Artificial Intelligence · Computer Science 2026-03-24 Xi Yang , Aurelie Lozano , Naoki Abe , Bhavya , Saurabh Jha , Noah Zheutlin , Rohan R. Arora , Yu Deng , Daby M. Sow

Reinforcement Learning (RL) based methods have seen their paramount successes in solving serial decision-making and control problems in recent years. For conventional RL formulations, Markov Decision Process (MDP) and state-action-value…

Machine Learning · Computer Science 2020-06-09 Ziyao Zhang , Liang Ma , Kin K. Leung , Konstantinos Poularakis , Mudhakar Srivatsa

Robots frequently face complex tasks that require more than one action, where sequential decision-making (SDM) capabilities become necessary. The key contribution of this work is a robot SDM framework, called LCORPP, that supports the…

Artificial Intelligence · Computer Science 2019-12-11 Saeid Amiri , Mohammad Shokrolah Shirazi , Shiqi Zhang

Learning a Markov Decision Process (MDP) from a fixed batch of trajectories is a non-trivial task whose outcome's quality depends on both the amount and the diversity of the sampled regions of the state-action space. Yet, many MDPs are…

Machine Learning · Computer Science 2022-03-08 Giorgio Angelotti , Nicolas Drougard , Caroline P. C. Chanel

Autonomous agents are limited in their ability to observe the world state. Partially observable Markov decision processes (POMDPs) formally model the problem of planning under world state uncertainty, but POMDPs with continuous actions and…

Robotics · Computer Science 2020-07-08 Dicong Qiu , Yibiao Zhao , Chris L. Baker
‹ Prev 1 2 3 10 Next ›