Related papers: Inferring Implicit Goals Across Differing Task Mod…

Explanation through Reward Model Reconciliation using POMDP Tree Search

As artificial intelligence (AI) algorithms are increasingly used in mission-critical applications, promoting user-trust of these systems will be essential to their success. Ensuring users understand the models over which algorithms reason…

Artificial Intelligence · Computer Science 2026-04-27 Benjamin D. Kraske, Anshu Saksena, Anna L. Buczak, Zachary N. Sunberg

Inverse Rational Control: Inferring What You Think from How You Forage

Complex behaviors are often driven by an internal model, which integrates sensory information over time and facilitates long-term planning. Inferring an agent's internal model is a crucial ingredient in social interactions (theory of mind),…

Machine Learning · Computer Science 2019-06-13 Zhengwei Wu, Paul Schrater, Xaq Pitkow

Provably Efficient Maximum Entropy Exploration

Suppose an agent is in a (possibly unknown) Markov Decision Process in the absence of a reward signal, what might we hope that an agent can efficiently learn to do? This work studies a broad class of objectives that are defined solely as…

Machine Learning · Computer Science 2019-01-29 Elad Hazan, Sham M. Kakade, Karan Singh, Abby Van Soest

Model-Based Reinforcement Learning Under Confounding

We investigate model-based reinforcement learning in contextual Markov decision processes (C-MDPs) in which the context is unobserved and induces confounding in the offline dataset. In such settings, conventional model-learning methods are…

Machine Learning · Computer Science 2025-12-09 Nishanth Venkatesh, Andreas A. Malikopoulos

Learning Task Automata for Reinforcement Learning using Hidden Markov Models

Training reinforcement learning (RL) agents using scalar reward signals is often infeasible when an environment has sparse and non-Markovian rewards. Moreover, handcrafting these reward functions before training is prone to…

Machine Learning · Computer Science 2023-10-04 Alessandro Abate, Yousif Almulla, James Fox, David Hyland, Michael Wooldridge

Goal-Conditioned Reinforcement Learning with Imagined Subgoals

Goal-conditioned reinforcement learning endows an agent with a large variety of skills, but it often struggles to solve tasks that require more temporally extended reasoning. In this work, we propose to incorporate imagined subgoals into…

Machine Learning · Computer Science 2021-07-02 Elliot Chane-Sane, Cordelia Schmid, Ivan Laptev

Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents

Current language model-driven agents often lack mechanisms for effective user participation, which is crucial given the vagueness commonly found in user instructions. Although adept at devising strategies and performing tasks, these agents…

Computation and Language · Computer Science 2024-02-16 Cheng Qian, Bingxiang He, Zhong Zhuang, Jia Deng, Yujia Qin, Xin Cong, Zhong Zhang, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun

Contextual Markov Decision Processes

We consider a planning problem where the dynamics and rewards of the environment depend on a hidden static parameter referred to as the context. The objective is to learn a strategy that maximizes the accumulated reward across all contexts.…

Machine Learning · Statistics 2015-02-10 Assaf Hallak, Dotan Di Castro, Shie Mannor

Interpretable Reinforcement Learning with Multilevel Subgoal Discovery

We propose a novel Reinforcement Learning model for discrete environments, which is inherently interpretable and supports the discovery of deep subgoal hierarchies. In the model, an agent learns information about environment in the form of…

Artificial Intelligence · Computer Science 2022-02-16 Alexander Demin, Denis Ponomaryov

Explainable Reinforcement Learning via Model Transforms

Understanding emerging behaviors of reinforcement learning (RL) agents may be difficult since such agents are often trained in complex environments using highly complex decision making procedures. This has given rise to a variety of…

Artificial Intelligence · Computer Science 2022-12-02 Mira Finkelstein, Lucy Liu, Nitsan Levy Schlot, Yoav Kolumbus, David C. Parkes, Jeffrey S. Rosenshein, Sarah Keren

Multi-Objective Approaches to Markov Decision Processes with Uncertain Transition Parameters

Markov decision processes (MDPs) are a popular model for performance analysis and optimization of stochastic systems. The parameters of stochastic behavior of MDPs are estimates from empirical observations of a system; their values are not…

Artificial Intelligence · Computer Science 2017-10-26 Dimitri Scheftelowitsch, Peter Buchholz, Vahid Hashemi, Holger Hermanns

Goal Recognition over Imperfect Domain Models

Goal recognition is the problem of recognizing the intended goal of autonomous agents or humans by observing their behavior in an environment. Over the past years, most existing approaches to goal and plan recognition have been ignoring the…

Artificial Intelligence · Computer Science 2020-05-13 Ramon Fraga Pereira

Gaining Insights into Unrecognized User Utterances in Task-Oriented Dialog Systems

The rapidly growing market demand for automatic dialogue agents capable of goal-oriented behavior has caused many tech-industry leaders to invest considerable efforts into task-oriented dialog systems. The success of these systems is highly…

Computation and Language · Computer Science 2022-10-25 Ella Rabinovich, Matan Vetzler, David Boaz, Vineet Kumar, Gaurav Pandey, Ateret Anaby-Tavor

A tale of two goals: leveraging sequentiality in multi-goal scenarios

Several hierarchical reinforcement learning methods leverage planning to create a graph or sequences of intermediate goals, guiding a lower-level goal-conditioned (GC) policy to reach some final goals. The low-level policy is typically…

Machine Learning · Computer Science 2025-03-31 Olivier Serris, Stéphane Doncieux, Olivier Sigaud

Active Inference through Incentive Design in Markov Decision Processes

We present a method for active inference with partial observations in stochastic systems through incentive design, also known as the leader-follower game. Consider a leader agent who aims to infer a follower agent's type given a finite set…

Systems and Control · Electrical Eng. & Systems 2025-02-12 Xinyi Wei, Chongyang Shi, Shuo Han, Ahmed H. Hemida, Charles A. Kamhoua, Jie Fu

Constrained Process Maps for Multi-Agent Generative AI Workflows

Large language model (LLM)-based agents are increasingly used to perform complex, multi-step workflows in regulated settings such as compliance and due diligence. However, many agentic architectures rely primarily on prompt engineering of a…

Artificial Intelligence · Computer Science 2026-02-03 Ananya Joshi, Michael Rudow

Feature Markov Decision Processes

General purpose intelligent learning agents cycle through (complex, non-MDP) sequences of observations, actions, and rewards. On the other hand, reinforcement learning is well-developed for small finite state Markov Decision Processes…

Artificial Intelligence · Computer Science 2009-12-30 Marcus Hutter

Resolving Multiple-Dynamic Model Uncertainty in Hypothesis-Driven Belief-MDPs

When human operators of cyber-physical systems encounter surprising behavior, they often consider multiple hypotheses that might explain it. In some cases, taking information-gathering actions such as additional measurements or control…

Artificial Intelligence · Computer Science 2024-11-22 Ofer Dagan, Tyler Becker, Zachary N. Sunberg

Infinite Ends from Finite Samples: Open-Ended Goal Inference as Top-Down Bayesian Filtering of Bottom-Up Proposals

The space of human goals is tremendously vast; and yet, from just a few moments of watching a scene or reading a story, we seem to spontaneously infer a range of plausible motivations for the people and characters involved. What explains…

Artificial Intelligence · Computer Science 2024-07-25 Tan Zhi-Xuan, Gloria Kang, Vikash Mansinghka, Joshua B. Tenenbaum

Modeling Boundedly Rational Agents with Latent Inference Budgets

We study the problem of modeling a population of agents pursuing unknown goals subject to unknown computational constraints. In standard models of bounded rationality, sub-optimal decision-making is simulated by adding homoscedastic noise…

Artificial Intelligence · Computer Science 2023-12-08 Athul Paul Jacob, Abhishek Gupta, Jacob Andreas