Related papers: Vector Quantized Models for Planning

Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction

Sequential decision-making in high-dimensional continuous action spaces, particularly in stochastic environments, faces significant computational challenges. We explore this challenge in the traditional offline RL setting, where an agent…

Machine Learning · Computer Science 2025-03-04 Baiting Luo , Ava Pettet , Aron Laszka , Abhishek Dubey , Ayan Mukhopadhyay

Combining Offline Models and Online Monte-Carlo Tree Search for Planning from Scratch

Planning in stochastic and partially observable environments is a central issue in artificial intelligence. One commonly used technique for solving such a problem is by constructing an accurate model firstly. Although some recent approaches…

Artificial Intelligence · Computer Science 2019-04-08 Yunlong Liu , Jianyang Zheng

Sequential Planning in Large Partially Observable Environments guided by LLMs

Sequential planning in large state space and action space quickly becomes intractable due to combinatorial explosion of the search space. Heuristic methods, like monte-carlo tree search, though effective for large state space, but struggle…

Artificial Intelligence · Computer Science 2023-12-13 Swarna Kamal Paul

Object-Centric World Models Meet Monte Carlo Tree Search

In this paper, we introduce ObjectZero, a novel reinforcement learning (RL) algorithm that leverages the power of object-level representations to model dynamic environments more effectively. Unlike traditional approaches that process the…

Artificial Intelligence · Computer Science 2026-01-13 Rodion Vakhitov , Leonid Ugadiarov , Aleksandr Panov

Safer Autonomous Driving in a Stochastic, Partially-Observable Environment by Hierarchical Contingency Planning

When learning to act in a stochastic, partially observable environment, an intelligent agent should be prepared to anticipate a change in its belief of the environment state, and be capable of adapting its actions on-the-fly to changing…

Machine Learning · Computer Science 2022-04-14 Ugo Lecerf , Christelle Yemdji-Tchassi , Pietro Michiardi

Optimal Decision-Making in Mixed-Agent Partially Observable Stochastic Environments via Reinforcement Learning

Optimal decision making with limited or no information in stochastic environments where multiple agents interact is a challenging topic in the realm of artificial intelligence. Reinforcement learning (RL) is a popular approach for arriving…

Machine Learning · Computer Science 2019-01-08 Roi Ceren

Learning and Planning in Complex Action Spaces

Many important real-world problems have action spaces that are high-dimensional, continuous or both, making full enumeration of all possible actions infeasible. Instead, only small subsets of actions can be sampled for the purpose of policy…

Machine Learning · Computer Science 2021-04-14 Thomas Hubert , Julian Schrittwieser , Ioannis Antonoglou , Mohammadamin Barekatain , Simon Schmitt , David Silver

Generative Temporal Models with Spatial Memory for Partially Observed Environments

In model-based reinforcement learning, generative and temporal models of environments can be leveraged to boost agent performance, either by tuning the agent's representations during training or via use as part of an explicit planning…

Machine Learning · Statistics 2018-07-20 Marco Fraccaro , Danilo Jimenez Rezende , Yori Zwols , Alexander Pritzel , S. M. Ali Eslami , Fabio Viola

Pathfinding in Random Partially Observable Environments with Vision-Informed Deep Reinforcement Learning

Deep reinforcement learning is a technique for solving problems in a variety of environments, ranging from Atari video games to stock trading. This method leverages deep neural network models to make decisions based on observations of a…

Machine Learning · Computer Science 2022-09-13 Anthony Dowling

Online Learning and Planning in Partially Observable Domains without Prior Knowledge

How an agent can act optimally in stochastic, partially observable domains is a challenge problem, the standard approach to address this issue is to learn the domain model firstly and then based on the learned model to find the (near)…

Artificial Intelligence · Computer Science 2019-06-13 Yunlong Liu , Jianyang Zheng

Planning from Pixels in Environments with Combinatorially Hard Search Spaces

The ability to form complex plans based on raw visual input is a litmus test for current capabilities of artificial intelligence, as it requires a seamless combination of visual processing and abstract algorithmic execution, two…

Machine Learning · Computer Science 2022-03-21 Marco Bagatella , Mirek Olšák , Michal Rolínek , Georg Martius

Efficient Policy Evaluation with Offline Data Informed Behavior Policy Design

Most reinforcement learning practitioners evaluate their policies with online Monte Carlo estimators for either hyperparameter tuning or testing different algorithmic design choices, where the policy is repeatedly executed in the…

Machine Learning · Computer Science 2024-10-03 Shuze Liu , Shangtong Zhang

Variational Recurrent Models for Solving Partially Observable Control Tasks

In partially observable (PO) environments, deep reinforcement learning (RL) agents often suffer from unsatisfactory performance, since two problems need to be tackled together: how to extract information from the raw observations to solve…

Machine Learning · Computer Science 2019-12-25 Dongqi Han , Kenji Doya , Jun Tani

A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

We present an end-to-end, model-based deep reinforcement learning agent which dynamically attends to relevant parts of its state during planning. The agent uses a bottleneck mechanism over a set-based representation to force the number of…

Artificial Intelligence · Computer Science 2021-11-05 Mingde Zhao , Zhen Liu , Sitao Luan , Shuyuan Zhang , Doina Precup , Yoshua Bengio

Stochastic Q-learning for Large Discrete Action Spaces

In complex environments with large discrete action spaces, effective decision-making is critical in reinforcement learning (RL). Despite the widespread use of value-based RL approaches like Q-learning, they come with a computational burden,…

Machine Learning · Computer Science 2024-05-17 Fares Fourati , Vaneet Aggarwal , Mohamed-Slim Alouini

Dream and Search to Control: Latent Space Planning for Continuous Control

Learning and planning with latent space dynamics has been shown to be useful for sample efficiency in model-based reinforcement learning (MBRL) for discrete and continuous control tasks. In particular, recent work, for discrete action…

Machine Learning · Computer Science 2020-10-21 Anurag Koul , Varun V. Kumar , Alan Fern , Somdeb Majumdar

Harnessing Discrete Representations For Continual Reinforcement Learning

Reinforcement learning (RL) agents make decisions using nothing but observations from the environment, and consequently, heavily rely on the representations of those observations. Though some recent breakthroughs have used vector-based…

Machine Learning · Computer Science 2024-07-16 Edan Meyer , Adam White , Marlos C. Machado

On the role of planning in model-based deep reinforcement learning

Model-based planning is often thought to be necessary for deep, careful reasoning and generalization in artificial agents. While recent successes of model-based reinforcement learning (MBRL) with deep function approximation have…

Artificial Intelligence · Computer Science 2021-03-18 Jessica B. Hamrick , Abram L. Friesen , Feryal Behbahani , Arthur Guez , Fabio Viola , Sims Witherspoon , Thomas Anthony , Lars Buesing , Petar Veličković , Théophane Weber

Iterative Formalization and Planning in Partially Observable Environments

Using LLMs not to predict plans but to formalize an environment into the Planning Domain Definition Language (PDDL) has been shown to improve performance and control. While most existing methodology only applies to fully observable…

Artificial Intelligence · Computer Science 2026-04-10 Liancheng Gong , Wang Zhu , Jesse Thomason , Li Zhang

Collaborative creativity with Monte-Carlo Tree Search and Convolutional Neural Networks

We investigate a human-machine collaborative drawing environment in which an autonomous agent sketches images while optionally allowing a user to directly influence the agent's trajectory. We combine Monte Carlo Tree Search with image…

Artificial Intelligence · Computer Science 2016-12-16 Memo Akten , Mick Grierson