Related papers: Learning Partially Observable Deterministic Action…

Modeling the effects of environmental and perceptual uncertainty using deterministic reinforcement learning dynamics with partial observability

Assessing the systemic effects of uncertainty that arises from agents' partial observation of the true states of the world is critical for understanding a wide range of scenarios. Yet, previous modeling work on agent learning and…

Adaptation and Self-Organizing Systems · Physics 2022-04-15 Wolfram Barfuss , Richard P. Mann

Learning to Act and Observe in Partially Observable Domains

We consider a learning agent in a partially observable environment, with which the agent has never interacted before, and about which it learns both what it can observe and how its actions affect the environment. The agent can learn about…

Artificial Intelligence · Computer Science 2021-09-14 Thomas Bolander , Nina Gierasimczuk , Andrés Occhipinti Liberman

Learning Action Models: Qualitative Approach

In dynamic epistemic logic, actions are described using action models. In this paper we introduce a framework for studying learnability of action models from observations. We present first results concerning propositional action models.…

Machine Learning · Computer Science 2015-07-16 Thomas Bolander , Nina Gierasimczuk

Uncertainty Maximization in Partially Observable Domains: A Cognitive Perspective

Faced with an ever-increasing complexity of their domains of application, artificial learning agents are now able to scale up in their ability to process an overwhelming amount of information coming from their interaction with an…

Artificial Intelligence · Computer Science 2022-04-05 Mirza Ramicic , Andrea Bonarini

Deterministic POMDPs Revisited

We study a subclass of POMDPs, called Deterministic POMDPs, that is characterized by deterministic actions and observations. These models do not provide the same generality of POMDPs yet they capture a number of interesting and challenging…

Artificial Intelligence · Computer Science 2012-05-14 Blai Bonet

Causally Correct Partial Models for Reinforcement Learning

In reinforcement learning, we can learn a model of future observations and rewards, and use it to plan the agent's next actions. However, jointly modeling future observations can be computationally expensive or even intractable if the…

Machine Learning · Computer Science 2020-02-10 Danilo J. Rezende , Ivo Danihelka , George Papamakarios , Nan Rosemary Ke , Ray Jiang , Theophane Weber , Karol Gregor , Hamza Merzic , Fabio Viola , Jane Wang , Jovana Mitrovic , Frederic Besse , Ioannis Antonoglou , Lars Buesing

Predictive User Modeling with Actionable Attributes

Different machine learning techniques have been proposed and used for modeling individual and group user needs, interests and preferences. In the traditional predictive modeling instances are described by observable variables, called…

Artificial Intelligence · Computer Science 2013-12-24 Indre Zliobaite , Mykola Pechenizkiy

Learning Predictive Models From Observation and Interaction

Learning predictive models from interaction with the world allows an agent, such as a robot, to learn about how the world works, and then use this learned model to plan coordinated sequences of actions to bring about desired outcomes.…

Machine Learning · Computer Science 2020-01-01 Karl Schmeckpeper , Annie Xie , Oleh Rybkin , Stephen Tian , Kostas Daniilidis , Sergey Levine , Chelsea Finn

Reinforcement Learning from Delayed Observations via World Models

In standard reinforcement learning settings, agents typically assume immediate feedback about the effects of their actions after taking them. However, in practice, this assumption may not hold true due to physical constraints and can…

Machine Learning · Computer Science 2024-06-27 Armin Karamzade , Kyungmin Kim , Montek Kalsi , Roy Fox

Learning Conditional Random Fields with Augmented Observations for Partially Observed Action Recognition

This paper aims at recognizing partially observed human actions in videos. Action videos acquired in uncontrolled environments often contain corrupt frames, which make actions partially observed. Furthermore, these frames can last for…

Computer Vision and Pattern Recognition · Computer Science 2018-12-06 Shih-Yao Lin , Yen-Yu Lin , Chu-Song Chen , Yi-Ping Hung

Safe Learning of PDDL Domains with Conditional Effects -- Extended Version

Powerful domain-independent planners have been developed to solve various types of planning problems. These planners often require a model of the acting agent's actions, given in some planning domain description language. Manually designing…

Artificial Intelligence · Computer Science 2024-03-25 Argaman Mordoch , Enrico Scala , Roni Stern , Brendan Juba

Provable Reinforcement Learning with a Short-Term Memory

Real-world sequential decision making problems commonly involve partial observability, which requires the agent to maintain a memory of history in order to infer the latent states, plan and make good decisions. Coping with partial…

Machine Learning · Computer Science 2022-02-09 Yonathan Efroni , Chi Jin , Akshay Krishnamurthy , Sobhan Miryoosefi

Differentiable Learning of Lifted Action Schemas for Classical Planning

Classical planners can effectively solve very large deterministic MDPs represented in STRIPS or PDDL where states are sets of atoms over objects and relations, and lifted action schemas add or delete these atoms. This compact representation…

Artificial Intelligence · Computer Science 2026-05-26 Jonas Reiter , Jakob Elias Gebler , Hector Geffner

Learning Symbolic Models of Stochastic Domains

In this article, we work towards the goal of developing agents that can learn to act in complex worlds. We develop a probabilistic, relational planning rule representation that compactly models noisy, nondeterministic action effects, and…

Machine Learning · Computer Science 2011-10-12 L. P. Kaelbling , H. M. Pasula , L. S. Zettlemoyer

Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning

In most real-world reinforcement learning applications, state information is only partially observable, which breaks the Markov decision process assumption and leads to inferior performance for algorithms that conflate observations with…

Machine Learning · Computer Science 2024-06-12 Hongming Zhang , Tongzheng Ren , Chenjun Xiao , Dale Schuurmans , Bo Dai

Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems

We study Reinforcement Learning for partially observable dynamical systems using function approximation. We propose a new \textit{Partially Observable Bilinear Actor-Critic framework}, that is general enough to include models such as…

Machine Learning · Computer Science 2022-06-27 Masatoshi Uehara , Ayush Sekhari , Jason D. Lee , Nathan Kallus , Wen Sun

Policy Learning with Hypothesis based Local Action Selection

For robots to be able to manipulate in unknown and unstructured environments the robot should be capable of operating under partial observability of the environment. Object occlusions and unmodeled environments are some of the factors that…

Robotics · Computer Science 2015-05-11 Bharath Sankaran , Jeannette Bohg , Nathan Ratliff , Stefan Schaal

Partial Transportability for Domain Generalization

A fundamental task in AI is providing performance guarantees for predictions made in unseen domains. In practice, there can be substantial uncertainty about the distribution of new data, and corresponding variability in the performance of…

Machine Learning · Computer Science 2025-04-01 Kasra Jalaldoust , Alexis Bellot , Elias Bareinboim

Policy Optimization in Multi-Agent Settings under Partially Observable Environments

This work leverages adaptive social learning to estimate partially observable global states in multi-agent reinforcement learning (MARL) problems. Unlike existing methods, the proposed approach enables the concurrent operation of social…

Multiagent Systems · Computer Science 2025-08-11 Ainur Zhaikhan , Malek Khammassi , Ali H. Sayed

Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning

Learning and planning in partially-observable domains is one of the most difficult problems in reinforcement learning. Traditional methods consider these two problems as independent, resulting in a classical two-stage paradigm: first learn…

Artificial Intelligence · Computer Science 2019-11-25 Tianyu Li , Bogdan Mazoure , Doina Precup , Guillaume Rabusseau