Related papers: On Planning while Learning

Plan Explicability and Predictability for Robot Task Planning

Intelligent robots and machines are becoming pervasive in human populated environments. A desirable capability of these agents is to respond to goal-oriented commands by autonomously constructing task plans. However, such autonomy can add…

Artificial Intelligence · Computer Science 2016-04-14 Yu Zhang , Sarath Sreedharan , Anagha Kulkarni , Tathagata Chakraborti , Hankz Hankui Zhuo , Subbarao Kambhampati

Latent Skill Planning for Exploration and Transfer

To quickly solve new tasks in complex environments, intelligent agents need to build up reusable knowledge. For example, a learned world model captures knowledge about the environment that applies to new tasks. Similarly, skills capture…

Machine Learning · Computer Science 2021-05-04 Kevin Xie , Homanga Bharadhwaj , Danijar Hafner , Animesh Garg , Florian Shkurti

Learning and Planning for Temporally Extended Tasks in Unknown Environments

We propose a novel planning technique for satisfying tasks specified in temporal logic in partially revealed environments. We define high-level actions derived from the environment and the given task itself, and estimate how each action…

Robotics · Computer Science 2021-04-29 Christopher Bradley , Adam Pacheck , Gregory J. Stein , Sebastian Castro , Hadas Kress-Gazit , Nicholas Roy

On the Job Training

We propose a new framework for building and evaluating machine learning algorithms. We argue that many real-world problems require an agent which must quickly learn to respond to demands, yet can continue to perform and respond to new…

Machine Learning · Computer Science 2007-05-23 Jason E. Holt

Conservative classifiers do consistently well with improving agents: characterizing statistical and online learning

Machine learning is now ubiquitous in societal decision-making, for example in evaluating job candidates or loan applications, and it is increasingly important to take into account how classified agents will react to the learning…

Machine Learning · Computer Science 2025-08-08 Dravyansh Sharma , Alec Sun

Composable Planning with Attributes

The tasks that an agent will need to solve often are not known during training. However, if the agent knows which properties of the environment are important then, after learning how its actions affect those properties, it may be able to…

Artificial Intelligence · Computer Science 2019-04-29 Amy Zhang , Adam Lerer , Sainbayar Sukhbaatar , Rob Fergus , Arthur Szlam

An intelligent tutor for planning in large partially observable environments

AI can not only outperform people in many planning tasks, but it can also teach them how to plan better. A recent and promising approach to improving human decision-making is to create intelligent tutors that utilize AI to discover and…

Artificial Intelligence · Computer Science 2025-06-24 Lovis Heindrich , Saksham Consul , Falk Lieder

Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning

Learning and planning in partially-observable domains is one of the most difficult problems in reinforcement learning. Traditional methods consider these two problems as independent, resulting in a classical two-stage paradigm: first learn…

Artificial Intelligence · Computer Science 2019-11-25 Tianyu Li , Bogdan Mazoure , Doina Precup , Guillaume Rabusseau

Predict Responsibly: Improving Fairness and Accuracy by Learning to Defer

In many machine learning applications, there are multiple decision-makers involved, both automated and human. The interaction between these agents often goes unaddressed in algorithmic development. In this work, we explore a simple version…

Machine Learning · Statistics 2018-09-10 David Madras , Toniann Pitassi , Richard Zemel

Intent-aware Multi-agent Reinforcement Learning

This paper proposes an intent-aware multi-agent planning framework as well as a learning algorithm. Under this framework, an agent plans in the goal space to maximize the expected utility. The planning process takes the belief of other…

Artificial Intelligence · Computer Science 2018-03-07 Siyuan Qi , Song-Chun Zhu

Online Learning and Planning in Partially Observable Domains without Prior Knowledge

How an agent can act optimally in stochastic, partially observable domains is a challenge problem, the standard approach to address this issue is to learn the domain model firstly and then based on the learned model to find the (near)…

Artificial Intelligence · Computer Science 2019-06-13 Yunlong Liu , Jianyang Zheng

Efficient, Safe, and Probably Approximately Complete Learning of Action Models

In this paper we explore the theoretical boundaries of planning in a setting where no model of the agent's actions is given. Instead of an action model, a set of successfully executed plans are given and the task is to generate a plan that…

Artificial Intelligence · Computer Science 2017-05-26 Roni Stern , Brendan Juba

When Remembering and Planning are Worth it: Navigating under Change

We explore how different types and uses of memory can aid spatial navigation in changing uncertain environments. In the simple foraging task we study, every day, our agent has to find its way from its home, through barriers, to food.…

Artificial Intelligence · Computer Science 2026-02-18 Omid Madani , J. Brian Burns , Reza Eghbali , Thomas L. Dean

Offline Learning for Planning: A Summary

The training of autonomous agents often requires expensive and unsafe trial-and-error interactions with the environment. Nowadays several data sets containing recorded experiences of intelligent agents performing various tasks, spanning…

Machine Learning · Computer Science 2020-10-06 Giorgio Angelotti , Nicolas Drougard , Caroline Ponzoni Carvalho Chanel

What's the Plan? Evaluating and Developing Planning-Aware Techniques for Language Models

Planning is a fundamental task in artificial intelligence that involves finding a sequence of actions that achieve a specified goal in a given environment. Large language models (LLMs) are increasingly used for applications that require…

Computation and Language · Computer Science 2024-05-24 Eran Hirsch , Guy Uziel , Ateret Anaby-Tavor

On Avoidance Learning with Partial Observability

We study a framework where agents have to avoid aversive signals. The agents are given only partial information, in the form of features that are projections of task states. Additionally, the agents have to cope with non-determinism,…

Artificial Intelligence · Computer Science 2016-05-17 Tom J. Ameloot

From Undecidability of Non-Triviality and Finiteness to Undecidability of Learnability

Machine learning researchers and practitioners steadily enlarge the multitude of successful learning models. They achieve this through in-depth theoretical analyses and experiential heuristics. However, there is no known general-purpose…

Computational Complexity · Computer Science 2023-10-18 Matthias C. Caro

Active Learning within Constrained Environments through Imitation of an Expert Questioner

Active learning agents typically employ a query selection algorithm which solely considers the agent's learning objectives. However, this may be insufficient in more realistic human domains. This work uses imitation learning to enable an…

Machine Learning · Computer Science 2019-07-02 Kalesha Bullard , Yannick Schroecker , Sonia Chernova

Explicablility as Minimizing Distance from Expected Behavior

In order to have effective human-AI collaboration, it is necessary to address how the AI agent's behavior is being perceived by the humans-in-the-loop. When the agent's task plans are generated without such considerations, they may often…

Artificial Intelligence · Computer Science 2019-03-15 Anagha Kulkarni , Yantian Zha , Tathagata Chakraborti , Satya Gautam Vadlamudi , Yu Zhang , Subbarao Kambhampati

Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization

In the sequential decision making setting, an agent aims to achieve systematic generalization over a large, possibly infinite, set of environments. Such environments are modeled as discrete Markov decision processes with both states and…

Machine Learning · Computer Science 2023-03-31 Mirco Mutti , Riccardo De Santi , Emanuele Rossi , Juan Felipe Calderon , Michael Bronstein , Marcello Restelli