Related papers: Counterexample-guided Planning

Learning Abstract World Model for Value-preserving Planning with Options

General-purpose agents require fine-grained controls and rich sensory inputs to perform a wide range of tasks. However, this complexity often leads to intractable decision-making. Traditionally, agents are provided with task-specific action…

Machine Learning · Computer Science 2024-06-25 Rafael Rodriguez-Sanchez , George Konidaris

Projection Abstractions in Planning Under the Lenses of Abstractions for MDPs

The concept of abstraction has been independently developed both in the context of AI Planning and discounted Markov Decision Processes (MDPs). However, the way abstractions are built and used in the context of Planning and MDPs is…

Artificial Intelligence · Computer Science 2024-12-04 Giuseppe Canonaco , Alberto Pozanco , Daniel Borrajo

Counterfactual Strategies for Markov Decision Processes

Counterfactuals are widely used in AI to explain how minimal changes to a model's input can lead to a different output. However, established methods for computing counterfactuals typically focus on one-step decision-making, and are not…

Artificial Intelligence · Computer Science 2025-05-15 Paul Kobialka , Lina Gerlach , Francesco Leofante , Erika Ábrahám , Silvia Lizeth Tapia Tarifa , Einar Broch Johnsen

Decision-Theoretic Planning: Structural Assumptions and Computational Leverage

Planning under uncertainty is a central problem in the study of automated sequential decision making, and has been addressed by researchers in many different fields, including AI planning, decision analysis, operations research, control…

Artificial Intelligence · Computer Science 2011-05-30 C. Boutilier , T. Dean , S. Hanks

Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstractions

In this paper, we consider planning in stochastic shortest path (SSP) problems, a subclass of Markov Decision Problems (MDP). We focus on medium-size problems whose state space can be fully enumerated. This problem has numerous important…

Artificial Intelligence · Computer Science 2012-06-18 Alejandro Isaza , Csaba Szepesvari , Vadim Bulitko , Russell Greiner

Strategy Synthesis in POMDPs via Game-Based Abstractions

We study synthesis problems with constraints in partially observable Markov decision processes (POMDPs), where the objective is to compute a strategy for an agent that is guaranteed to satisfy certain safety and performance specifications.…

Robotics · Computer Science 2019-05-28 Leonore Winterer , Sebastian Junges , Ralf Wimmer , Nils Jansen , Ufuk Topcu , Joost-Pieter Katoen , Bernd Becker

Abstraction for Offline Goal-Conditioned Reinforcement Learning

Markov Decision Processes (MDPs) often exhibit significant redundancy due to symmetries and shared structure across state-goal pairs in real-world Goal-Conditioned Reinforcement Learning (GCRL). While hierarchical policies have been…

Machine Learning · Computer Science 2026-05-22 Clarisse Wibault , Alexander Goldie , Antonio Villares , Maike Osborne , Jakob Foerster

Learning Markov State Abstractions for Deep Reinforcement Learning

A fundamental assumption of reinforcement learning in Markov decision processes (MDPs) is that the relevant decision process is, in fact, Markov. However, when MDPs have rich observations, agents typically learn by way of an abstract state…

Machine Learning · Computer Science 2024-03-18 Cameron Allen , Neev Parikh , Omer Gottesman , George Konidaris

Probabilistic inverse reinforcement learning in unknown environments

We consider the problem of learning by demonstration from agents acting in unknown stochastic Markov environments or games. Our aim is to estimate agent preferences in order to construct improved policies for the same task that the agents…

Machine Learning · Computer Science 2014-08-12 Aristide Tossou , Christos Dimitrakakis

Probabilistic inverse reinforcement learning in unknown environments

We consider the problem of learning by demonstration from agents acting in unknown stochastic Markov environments or games. Our aim is to estimate agent preferences in order to construct improved policies for the same task that the agents…

Machine Learning · Statistics 2013-07-16 Aristide C. Y. Tossou , Christos Dimitrakakis

Counterexample Explanation by Learning Small Strategies in Markov Decision Processes

While for deterministic systems, a counterexample to a property can simply be an error trace, counterexamples in probabilistic systems are necessarily more complex. For instance, a set of erroneous traces with a sufficient cumulative…

Logic in Computer Science · Computer Science 2015-02-11 Tomáš Brázdil , Krishnendu Chatterjee , Martin Chmelík , Andreas Fellner , Jan Křetínský

CAMPs: Learning Context-Specific Abstractions for Efficient Planning in Factored MDPs

Meta-planning, or learning to guide planning from experience, is a promising approach to improving the computational cost of planning. A general meta-planning strategy is to learn to impose constraints on the states considered and actions…

Machine Learning · Computer Science 2020-11-10 Rohan Chitnis , Tom Silver , Beomjoon Kim , Leslie Pack Kaelbling , Tomas Lozano-Perez

Consistent Opponent Modeling in Imperfect-Information Games

The goal of agents in multi-agent environments is to maximize total reward against the opposing agents that are encountered. Following a game-theoretic solution concept, such as Nash equilibrium, may obtain a strong performance in some…

Computer Science and Game Theory · Computer Science 2026-01-05 Sam Ganzfried

Learning Abstract Models for Strategic Exploration and Fast Reward Transfer

Model-based reinforcement learning (RL) is appealing because (i) it enables planning and thus more strategic exploration, and (ii) by decoupling dynamics from rewards, it enables fast transfer to new reward functions. However, learning an…

Machine Learning · Computer Science 2020-07-14 Evan Zheran Liu , Ramtin Keramati , Sudarshan Seshadri , Kelvin Guu , Panupong Pasupat , Emma Brunskill , Percy Liang

Online Abstraction with MDP Homomorphisms for Deep Learning

Abstraction of Markov Decision Processes is a useful tool for solving complex problems, as it can ignore unimportant aspects of an environment, simplifying the process of learning an optimal policy. In this paper, we propose a new algorithm…

Machine Learning · Computer Science 2021-04-20 Ondrej Biza , Robert Platt

Planning to the Information Horizon of BAMDPs via Epistemic State Abstraction

The Bayes-Adaptive Markov Decision Process (BAMDP) formalism pursues the Bayes-optimal solution to the exploration-exploitation trade-off in reinforcement learning. As the computation of exact solutions to Bayesian reinforcement-learning…

Machine Learning · Computer Science 2022-11-01 Dilip Arumugam , Satinder Singh

Counterexample-guided Abstraction Refinement for POMDPs

Partially Observable Markov Decision Process (POMDP) is widely used to model probabilistic behavior for complex systems. Compared with MDPs, POMDP models a system more accurate but solving a POMDP generally takes exponential time in the…

Logic in Computer Science · Computer Science 2017-03-13 Xiaobin Zhang , Bo Wu , Hai Lin

Learning with Expert Abstractions for Efficient Multi-Task Continuous Control

Decision-making in complex, continuous multi-task environments is often hindered by the difficulty of obtaining accurate models for planning and the inefficiency of learning purely from trial and error. While precise environment dynamics…

Machine Learning · Computer Science 2025-03-20 Jeff Jewett , Sandhya Saisubramanian

Public Information Representation for Adversarial Team Games

The peculiarity of adversarial team games resides in the asymmetric information available to the team members during the play, which makes the equilibrium computation problem hard even with zero-sum payoffs. The algorithms available in the…

Computer Science and Game Theory · Computer Science 2022-01-26 Luca Carminati , Federico Cacciamani , Marco Ciccone , Nicola Gatti

Counterexamples for Robotic Planning Explained in Structured Language

Automated techniques such as model checking have been used to verify models of robotic mission plans based on Markov decision processes (MDPs) and generate counterexamples that may help diagnose requirement violations. However, such…

Robotics · Computer Science 2018-03-28 Lu Feng , Mahsa Ghasemi , Kai-Wei Chang , Ufuk Topcu