English
Related papers

Related papers: Counterexample-guided Planning

200 papers

General-purpose agents require fine-grained controls and rich sensory inputs to perform a wide range of tasks. However, this complexity often leads to intractable decision-making. Traditionally, agents are provided with task-specific action…

Machine Learning · Computer Science 2024-06-25 Rafael Rodriguez-Sanchez , George Konidaris

The concept of abstraction has been independently developed both in the context of AI Planning and discounted Markov Decision Processes (MDPs). However, the way abstractions are built and used in the context of Planning and MDPs is…

Artificial Intelligence · Computer Science 2024-12-04 Giuseppe Canonaco , Alberto Pozanco , Daniel Borrajo

Counterfactuals are widely used in AI to explain how minimal changes to a model's input can lead to a different output. However, established methods for computing counterfactuals typically focus on one-step decision-making, and are not…

Artificial Intelligence · Computer Science 2025-05-15 Paul Kobialka , Lina Gerlach , Francesco Leofante , Erika Ábrahám , Silvia Lizeth Tapia Tarifa , Einar Broch Johnsen

Planning under uncertainty is a central problem in the study of automated sequential decision making, and has been addressed by researchers in many different fields, including AI planning, decision analysis, operations research, control…

Artificial Intelligence · Computer Science 2011-05-30 C. Boutilier , T. Dean , S. Hanks

In this paper, we consider planning in stochastic shortest path (SSP) problems, a subclass of Markov Decision Problems (MDP). We focus on medium-size problems whose state space can be fully enumerated. This problem has numerous important…

Artificial Intelligence · Computer Science 2012-06-18 Alejandro Isaza , Csaba Szepesvari , Vadim Bulitko , Russell Greiner

We study synthesis problems with constraints in partially observable Markov decision processes (POMDPs), where the objective is to compute a strategy for an agent that is guaranteed to satisfy certain safety and performance specifications.…

Markov Decision Processes (MDPs) often exhibit significant redundancy due to symmetries and shared structure across state-goal pairs in real-world Goal-Conditioned Reinforcement Learning (GCRL). While hierarchical policies have been…

Machine Learning · Computer Science 2026-05-22 Clarisse Wibault , Alexander Goldie , Antonio Villares , Maike Osborne , Jakob Foerster

A fundamental assumption of reinforcement learning in Markov decision processes (MDPs) is that the relevant decision process is, in fact, Markov. However, when MDPs have rich observations, agents typically learn by way of an abstract state…

Machine Learning · Computer Science 2024-03-18 Cameron Allen , Neev Parikh , Omer Gottesman , George Konidaris

We consider the problem of learning by demonstration from agents acting in unknown stochastic Markov environments or games. Our aim is to estimate agent preferences in order to construct improved policies for the same task that the agents…

Machine Learning · Computer Science 2014-08-12 Aristide Tossou , Christos Dimitrakakis

We consider the problem of learning by demonstration from agents acting in unknown stochastic Markov environments or games. Our aim is to estimate agent preferences in order to construct improved policies for the same task that the agents…

Machine Learning · Statistics 2013-07-16 Aristide C. Y. Tossou , Christos Dimitrakakis

While for deterministic systems, a counterexample to a property can simply be an error trace, counterexamples in probabilistic systems are necessarily more complex. For instance, a set of erroneous traces with a sufficient cumulative…

Logic in Computer Science · Computer Science 2015-02-11 Tomáš Brázdil , Krishnendu Chatterjee , Martin Chmelík , Andreas Fellner , Jan Křetínský

Meta-planning, or learning to guide planning from experience, is a promising approach to improving the computational cost of planning. A general meta-planning strategy is to learn to impose constraints on the states considered and actions…

Machine Learning · Computer Science 2020-11-10 Rohan Chitnis , Tom Silver , Beomjoon Kim , Leslie Pack Kaelbling , Tomas Lozano-Perez

The goal of agents in multi-agent environments is to maximize total reward against the opposing agents that are encountered. Following a game-theoretic solution concept, such as Nash equilibrium, may obtain a strong performance in some…

Computer Science and Game Theory · Computer Science 2026-01-05 Sam Ganzfried

Model-based reinforcement learning (RL) is appealing because (i) it enables planning and thus more strategic exploration, and (ii) by decoupling dynamics from rewards, it enables fast transfer to new reward functions. However, learning an…

Machine Learning · Computer Science 2020-07-14 Evan Zheran Liu , Ramtin Keramati , Sudarshan Seshadri , Kelvin Guu , Panupong Pasupat , Emma Brunskill , Percy Liang

Abstraction of Markov Decision Processes is a useful tool for solving complex problems, as it can ignore unimportant aspects of an environment, simplifying the process of learning an optimal policy. In this paper, we propose a new algorithm…

Machine Learning · Computer Science 2021-04-20 Ondrej Biza , Robert Platt

The Bayes-Adaptive Markov Decision Process (BAMDP) formalism pursues the Bayes-optimal solution to the exploration-exploitation trade-off in reinforcement learning. As the computation of exact solutions to Bayesian reinforcement-learning…

Machine Learning · Computer Science 2022-11-01 Dilip Arumugam , Satinder Singh

Partially Observable Markov Decision Process (POMDP) is widely used to model probabilistic behavior for complex systems. Compared with MDPs, POMDP models a system more accurate but solving a POMDP generally takes exponential time in the…

Logic in Computer Science · Computer Science 2017-03-13 Xiaobin Zhang , Bo Wu , Hai Lin

Decision-making in complex, continuous multi-task environments is often hindered by the difficulty of obtaining accurate models for planning and the inefficiency of learning purely from trial and error. While precise environment dynamics…

Machine Learning · Computer Science 2025-03-20 Jeff Jewett , Sandhya Saisubramanian

The peculiarity of adversarial team games resides in the asymmetric information available to the team members during the play, which makes the equilibrium computation problem hard even with zero-sum payoffs. The algorithms available in the…

Computer Science and Game Theory · Computer Science 2022-01-26 Luca Carminati , Federico Cacciamani , Marco Ciccone , Nicola Gatti

Automated techniques such as model checking have been used to verify models of robotic mission plans based on Markov decision processes (MDPs) and generate counterexamples that may help diagnose requirement violations. However, such…

Robotics · Computer Science 2018-03-28 Lu Feng , Mahsa Ghasemi , Kai-Wei Chang , Ufuk Topcu
‹ Prev 1 2 3 10 Next ›