English
Related papers

Related papers: Vector Quantized Models for Planning

200 papers

Sequential decision-making in high-dimensional continuous action spaces, particularly in stochastic environments, faces significant computational challenges. We explore this challenge in the traditional offline RL setting, where an agent…

Machine Learning · Computer Science 2025-03-04 Baiting Luo , Ava Pettet , Aron Laszka , Abhishek Dubey , Ayan Mukhopadhyay

Planning in stochastic and partially observable environments is a central issue in artificial intelligence. One commonly used technique for solving such a problem is by constructing an accurate model firstly. Although some recent approaches…

Artificial Intelligence · Computer Science 2019-04-08 Yunlong Liu , Jianyang Zheng

Sequential planning in large state space and action space quickly becomes intractable due to combinatorial explosion of the search space. Heuristic methods, like monte-carlo tree search, though effective for large state space, but struggle…

Artificial Intelligence · Computer Science 2023-12-13 Swarna Kamal Paul

In this paper, we introduce ObjectZero, a novel reinforcement learning (RL) algorithm that leverages the power of object-level representations to model dynamic environments more effectively. Unlike traditional approaches that process the…

Artificial Intelligence · Computer Science 2026-01-13 Rodion Vakhitov , Leonid Ugadiarov , Aleksandr Panov

When learning to act in a stochastic, partially observable environment, an intelligent agent should be prepared to anticipate a change in its belief of the environment state, and be capable of adapting its actions on-the-fly to changing…

Machine Learning · Computer Science 2022-04-14 Ugo Lecerf , Christelle Yemdji-Tchassi , Pietro Michiardi

Optimal decision making with limited or no information in stochastic environments where multiple agents interact is a challenging topic in the realm of artificial intelligence. Reinforcement learning (RL) is a popular approach for arriving…

Machine Learning · Computer Science 2019-01-08 Roi Ceren

Many important real-world problems have action spaces that are high-dimensional, continuous or both, making full enumeration of all possible actions infeasible. Instead, only small subsets of actions can be sampled for the purpose of policy…

In model-based reinforcement learning, generative and temporal models of environments can be leveraged to boost agent performance, either by tuning the agent's representations during training or via use as part of an explicit planning…

Deep reinforcement learning is a technique for solving problems in a variety of environments, ranging from Atari video games to stock trading. This method leverages deep neural network models to make decisions based on observations of a…

Machine Learning · Computer Science 2022-09-13 Anthony Dowling

How an agent can act optimally in stochastic, partially observable domains is a challenge problem, the standard approach to address this issue is to learn the domain model firstly and then based on the learned model to find the (near)…

Artificial Intelligence · Computer Science 2019-06-13 Yunlong Liu , Jianyang Zheng

The ability to form complex plans based on raw visual input is a litmus test for current capabilities of artificial intelligence, as it requires a seamless combination of visual processing and abstract algorithmic execution, two…

Machine Learning · Computer Science 2022-03-21 Marco Bagatella , Mirek Olšák , Michal Rolínek , Georg Martius

Most reinforcement learning practitioners evaluate their policies with online Monte Carlo estimators for either hyperparameter tuning or testing different algorithmic design choices, where the policy is repeatedly executed in the…

Machine Learning · Computer Science 2024-10-03 Shuze Liu , Shangtong Zhang

In partially observable (PO) environments, deep reinforcement learning (RL) agents often suffer from unsatisfactory performance, since two problems need to be tackled together: how to extract information from the raw observations to solve…

Machine Learning · Computer Science 2019-12-25 Dongqi Han , Kenji Doya , Jun Tani

We present an end-to-end, model-based deep reinforcement learning agent which dynamically attends to relevant parts of its state during planning. The agent uses a bottleneck mechanism over a set-based representation to force the number of…

Artificial Intelligence · Computer Science 2021-11-05 Mingde Zhao , Zhen Liu , Sitao Luan , Shuyuan Zhang , Doina Precup , Yoshua Bengio

In complex environments with large discrete action spaces, effective decision-making is critical in reinforcement learning (RL). Despite the widespread use of value-based RL approaches like Q-learning, they come with a computational burden,…

Machine Learning · Computer Science 2024-05-17 Fares Fourati , Vaneet Aggarwal , Mohamed-Slim Alouini

Learning and planning with latent space dynamics has been shown to be useful for sample efficiency in model-based reinforcement learning (MBRL) for discrete and continuous control tasks. In particular, recent work, for discrete action…

Machine Learning · Computer Science 2020-10-21 Anurag Koul , Varun V. Kumar , Alan Fern , Somdeb Majumdar

Reinforcement learning (RL) agents make decisions using nothing but observations from the environment, and consequently, heavily rely on the representations of those observations. Though some recent breakthroughs have used vector-based…

Machine Learning · Computer Science 2024-07-16 Edan Meyer , Adam White , Marlos C. Machado

Model-based planning is often thought to be necessary for deep, careful reasoning and generalization in artificial agents. While recent successes of model-based reinforcement learning (MBRL) with deep function approximation have…

Using LLMs not to predict plans but to formalize an environment into the Planning Domain Definition Language (PDDL) has been shown to improve performance and control. While most existing methodology only applies to fully observable…

Artificial Intelligence · Computer Science 2026-04-10 Liancheng Gong , Wang Zhu , Jesse Thomason , Li Zhang

We investigate a human-machine collaborative drawing environment in which an autonomous agent sketches images while optionally allowing a user to directly influence the agent's trajectory. We combine Monte Carlo Tree Search with image…

Artificial Intelligence · Computer Science 2016-12-16 Memo Akten , Mick Grierson
‹ Prev 1 2 3 10 Next ›