Related papers: STAND: Self-Aware Precondition Induction for Inter…

Learning Agent State Online with Recurrent Generate-and-Test

Learning continually and online from a continuous stream of data is challenging, especially for a reinforcement learning agent with sequential data. When the environment only provides observations giving partial information about the state…

Machine Learning · Computer Science 2022-01-03 Amir Samani , Richard S. Sutton

Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks

A longstanding goal of artificial intelligence is to create artificial agents capable of learning to perform tasks that require sequential decision making. Importantly, while it is the artificial agent that learns and acts, it is still up…

Artificial Intelligence · Computer Science 2021-07-14 Ruohan Zhang , Faraz Torabi , Garrett Warnell , Peter Stone

When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning

A long-term goal of reinforcement learning is to design agents that can autonomously interact and learn in the world. A critical challenge to such autonomy is the presence of irreversible states which require external assistance to recover…

Machine Learning · Computer Science 2022-10-20 Annie Xie , Fahim Tajwar , Archit Sharma , Chelsea Finn

Learning by Distilling Context

Language models significantly benefit from context tokens, such as prompts or scratchpads. They perform better when prompted with informative instructions, and they acquire new reasoning capabilities by generating a scratch-pad before…

Computation and Language · Computer Science 2022-10-03 Charlie Snell , Dan Klein , Ruiqi Zhong

Agentic Knowledgeable Self-awareness

Large Language Models (LLMs) have achieved considerable performance across various agentic planning tasks. However, traditional agent planning approaches adopt a "flood irrigation" methodology that indiscriminately injects gold…

Computation and Language · Computer Science 2025-05-30 Shuofei Qiao , Zhisong Qiu , Baochang Ren , Xiaobin Wang , Xiangyuan Ru , Ningyu Zhang , Xiang Chen , Yong Jiang , Pengjun Xie , Fei Huang , Huajun Chen

Autonomous Reinforcement Learning via Subgoal Curricula

Reinforcement learning (RL) promises to enable autonomous acquisition of complex behaviors for diverse agents. However, the success of current reinforcement learning algorithms is predicated on an often under-emphasised requirement -- each…

Machine Learning · Computer Science 2021-10-29 Archit Sharma , Abhishek Gupta , Sergey Levine , Karol Hausman , Chelsea Finn

LaND: Learning to Navigate from Disengagements

Consistently testing autonomous mobile robots in real world scenarios is a necessary aspect of developing autonomous navigation systems. Each time the human safety monitor disengages the robot's autonomy system due to the robot performing…

Robotics · Computer Science 2020-10-12 Gregory Kahn , Pieter Abbeel , Sergey Levine

Episodic Memory for Learning Subjective-Timescale Models

In model-based learning, an agent's model is commonly defined over transitions between consecutive states of an environment even though planning often requires reasoning over multi-step timescales, with intermediate states either…

Machine Learning · Computer Science 2020-10-06 Alexey Zakharov , Matthew Crosby , Zafeirios Fountas

Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation

Natural Language-conditioned reinforcement learning (RL) enables the agents to follow human instructions. Previous approaches generally implemented language-conditioned RL by providing human instructions in natural language (NL) and…

Computation and Language · Computer Science 2023-02-21 Jing-Cheng Pang , Xin-Yu Yang , Si-Hang Yang , Yang Yu

Learning Occlusion-aware Decision-making from Agent Interaction via Active Perception

Occlusion-aware decision-making is essential in autonomous driving due to the high uncertainty of various occlusions. Recent occlusion-aware decision-making methods encounter issues such as high computational complexity, scenario…

Robotics · Computer Science 2025-04-10 Jie Jia , Yiming Shu , Zhongxue Gan , Wenchao Ding

From Learning to Meta-Learning: Reduced Training Overhead and Complexity for Communication Systems

Machine learning methods adapt the parameters of a model, constrained to lie in a given model class, by using a fixed learning procedure based on data or active observations. Adaptation is done on a per-task basis, and retraining is needed…

Machine Learning · Computer Science 2021-10-22 Osvaldo Simeone , Sangwoo Park , Joonhyuk Kang

Reinforcement Learning for Long-Horizon Interactive LLM Agents

Interactive digital agents (IDAs) leverage APIs of stateful digital environments to perform tasks in response to user requests. While IDAs powered by instruction-tuned large language models (LLMs) can react to feedback from interface…

Machine Learning · Computer Science 2025-03-11 Kevin Chen , Marco Cusumano-Towner , Brody Huval , Aleksei Petrenko , Jackson Hamburger , Vladlen Koltun , Philipp Krähenbühl

Learning in Context, Guided by Choice: A Reward-Free Paradigm for Reinforcement Learning with Transformers

In-context reinforcement learning (ICRL) leverages the in-context learning capabilities of transformer models (TMs) to efficiently generalize to unseen sequential decision-making tasks without parameter updates. However, existing ICRL…

Machine Learning · Computer Science 2026-02-10 Juncheng Dong , Bowen He , Moyang Guo , Ethan X. Fang , Zhuoran Yang , Vahid Tarokh

Pre-Learning Environment Representations for Data-Efficient Neural Instruction Following

We consider the problem of learning to map from natural language instructions to state transitions (actions) in a data-efficient manner. Our method takes inspiration from the idea that it should be easier to ground language to concepts that…

Computation and Language · Computer Science 2019-07-24 David Gaddy , Dan Klein

Raising Student Completion Rates with Adaptive Curriculum and Contextual Bandits

We present an adaptive learning Intelligent Tutoring System, which uses model-based reinforcement learning in the form of contextual bandits to assign learning activities to students. The model is trained on the trajectories of thousands of…

Computation and Language · Computer Science 2022-07-29 Robert Belfer , Ekaterina Kochmar , Iulian Vlad Serban

Dynamic Conditional Imitation Learning for Autonomous Driving

Conditional imitation learning (CIL) trains deep neural networks, in an end-to-end manner, to mimic human driving. This approach has demonstrated suitable vehicle control when following roads, avoiding obstacles, or taking specific turns at…

Robotics · Computer Science 2022-11-22 Hesham M. Eraqi , Mohamed N. Moustafa , Jens Honer

Language Conditioned Imitation Learning over Unstructured Data

Natural language is perhaps the most flexible and intuitive way for humans to communicate tasks to a robot. Prior work in imitation learning typically requires each task be specified with a task id or goal image -- something that is often…

Robotics · Computer Science 2021-07-09 Corey Lynch , Pierre Sermanet

Dual Operating Modes of In-Context Learning

In-context learning (ICL) exhibits dual operating modes: task learning, i.e., acquiring a new skill from in-context samples, and task retrieval, i.e., locating and activating a relevant pretrained skill. Recent theoretical work investigates…

Machine Learning · Computer Science 2024-08-05 Ziqian Lin , Kangwook Lee

MANSA: Learning Fast and Slow in Multi-Agent Systems

In multi-agent reinforcement learning (MARL), independent learning (IL) often shows remarkable performance and easily scales with the number of agents. Yet, using IL can be inefficient and runs the risk of failing to successfully train,…

Multiagent Systems · Computer Science 2023-06-06 David Mguni , Haojun Chen , Taher Jafferjee , Jianhong Wang , Long Fei , Xidong Feng , Stephen McAleer , Feifei Tong , Jun Wang , Yaodong Yang

Learning a Better Initialization for Soft Prompts via Meta-Learning

Prompt tuning (PT) is an effective approach to adapting pre-trained language models to downstream tasks. Without a good initialization, prompt tuning doesn't perform well under few-shot settings. So pre-trained prompt tuning (PPT) is…

Computation and Language · Computer Science 2022-05-26 Yukun Huang , Kun Qian , Zhou Yu