Related papers: Training a Generally Curious Agent

Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings

This paper introduces a new approach for continual planning and model learning in relational, non-stationary stochastic environments. Such capabilities are essential for the deployment of sequential decision-making systems in the uncertain…

Artificial Intelligence · Computer Science 2024-07-24 Rushang Karia, Pulkit Verma, Alberto Speranzon, Siddharth Srivastava

GRAPPA: Generalizing and Adapting Robot Policies via Online Agentic Guidance

Robot learning approaches such as behavior cloning and reinforcement learning have shown great promise in synthesizing robot skills from human demonstrations in specific environments. However, these approaches often require task-specific…

Robotics · Computer Science 2025-04-09 Arthur Bucker, Pablo Ortega-Kral, Jonathan Francis, Jean Oh

Generalization of Agent Behavior through Explicit Representation of Context

In order to deploy autonomous agents in digital interactive environments, they must be able to act robustly in unseen situations. The standard machine learning approach is to include as much variation as possible into training these agents.…

Neural and Evolutionary Computing · Computer Science 2021-02-11 Cem C Tutum, Suhaib Abdulquddos, Risto Miikkulainen

Active Example Selection for In-Context Learning

With a handful of demonstration examples, large-scale language models show strong capability to perform various tasks by in-context learning from these examples, without any fine-tuning. We demonstrate that in-context learning performance…

Computation and Language · Computer Science 2022-11-10 Yiming Zhang, Shi Feng, Chenhao Tan

Adaptive Elicitation of Latent Information Using Natural Language

Eliciting information to reduce uncertainty about a latent entity is a critical task in many application domains, e.g., assessing individual student learning outcomes, diagnosing underlying diseases, or learning user preferences. Though…

Computation and Language · Computer Science 2025-07-10 Jimmy Wang, Thomas Zollo, Richard Zemel, Hongseok Namkoong

Language and Experience: A Computational Model of Social Learning in Complex Tasks

The ability to combine linguistic guidance from others with direct experience is central to human development, enabling safe and rapid learning in new environments. How do people integrate these two sources of knowledge, and how might AI…

Artificial Intelligence · Computer Science 2026-02-19 Cédric Colas, Tracey Mills, Ben Prystawski, Michael Henry Tessler, Noah Goodman, Jacob Andreas, Joshua Tenenbaum

Learning to Generalize for Sequential Decision Making

We consider problems of making sequences of decisions to accomplish tasks, interacting via the medium of language. These problems are often tackled with reinforcement learning approaches. We find that these models do not generalize well…

Computation and Language · Computer Science 2020-10-07 Xusen Yin, Ralph Weischedel, Jonathan May

CARMA: Collective Adaptive Resource-sharing Markovian Agents

In this paper we present CARMA, a language recently defined to support specification and analysis of collective adaptive systems. CARMA is a stochastic process algebra equipped with linguistic constructs specifically developed for modelling…

Programming Languages · Computer Science 2015-09-30 Luca Bortolussi, Rocco De Nicola, Vashti Galpin, Stephen Gilmore, Jane Hillston, Diego Latella, Michele Loreti, Mieke Massink

Training Agents with Weakly Supervised Feedback from Large Language Models

Large Language Models (LLMs) offer a promising basis for creating agents that can tackle complex tasks through iterative environmental interaction. Existing methods either require these agents to mimic expert-provided trajectories or rely…

Computation and Language · Computer Science 2024-12-02 Dihong Gong, Pu Lu, Zelong Wang, Meng Zhou, Xiuqiang He

Active Learning within Constrained Environments through Imitation of an Expert Questioner

Active learning agents typically employ a query selection algorithm which solely considers the agent's learning objectives. However, this may be insufficient in more realistic human domains. This work uses imitation learning to enable an…

Machine Learning · Computer Science 2019-07-02 Kalesha Bullard, Yannick Schroecker, Sonia Chernova

Sample-efficient Deep Reinforcement Learning with Imaginary Rollouts for Human-Robot Interaction

Deep reinforcement learning has proven to be a great success in allowing agents to learn complex tasks. However, its application to actual robots can be prohibitively expensive. Furthermore, the unpredictability of human behavior in…

Robotics · Computer Science 2019-08-16 Mohammad Thabet, Massimiliano Patacchiola, Angelo Cangelosi

Generative Prompt Internalization

Prompts used in recent large language model based applications are often fixed and lengthy, leading to significant computational overhead. To address this challenge, we propose Generative Prompt Internalization (GenPI), a lightweight method…

Computation and Language · Computer Science 2026-02-12 Haebin Shin, Lei Ji, Yeyun Gong, Sungdong Kim, Eunbi Choi, Minjoon Seo

Towards General Agentic Intelligence via Environment Scaling

Advanced agentic intelligence is a prerequisite for deploying Large Language Models in practical, real-world applications. Diverse real-world APIs demand precise, robust function-calling intelligence, which needs agents to develop these…

Computation and Language · Computer Science 2025-09-17 Runnan Fang, Shihao Cai, Baixuan Li, Jialong Wu, Guangyu Li, Wenbiao Yin, Xinyu Wang, Xiaobin Wang, Liangcai Su, Zhen Zhang, Shibin Wu, Zhengwei Tao, Yong Jiang, Pengjun Xie, Fei Huang, Jingren Zhou

Personalized Dialogue Generation with Persona-Adaptive Attention

Persona-based dialogue systems aim to generate consistent responses based on historical context and predefined persona. Unlike conventional dialogue generation, the persona-based dialogue needs to consider both dialogue context and persona,…

Computation and Language · Computer Science 2024-01-11 Qiushi Huang, Yu Zhang, Tom Ko, Xubo Liu, Bo Wu, Wenwu Wang, Lilian Tang

Learning To Explore With Predictive World Model Via Self-Supervised Learning

Autonomous artificial agents must be able to learn behaviors in complex environments without humans to design tasks and rewards. Designing these functions for each environment is not feasible, thus, motivating the development of intrinsic…

Machine Learning · Computer Science 2025-02-20 Alana Santana, Paula P. Costa, Esther L. Colombini

Focus on What's Informative and Ignore What's not: Communication Strategies in a Referential Game

Research in multi-agent cooperation has shown that artificial agents are able to learn to play a simple referential game while developing a shared lexicon. This lexicon is not easy to analyze, as it does not show many properties of a…

Computation and Language · Computer Science 2019-11-06 Roberto Dessì, Diane Bouchacourt, Davide Crepaldi, Marco Baroni

Composable Planning with Attributes

The tasks that an agent will need to solve often are not known during training. However, if the agent knows which properties of the environment are important then, after learning how its actions affect those properties, it may be able to…

Artificial Intelligence · Computer Science 2019-04-29 Amy Zhang, Adam Lerer, Sainbayar Sukhbaatar, Rob Fergus, Arthur Szlam

Emergence of Communication in an Interactive World with Consistent Speakers

Training agents to communicate with one another given task-based supervision only has attracted considerable attention recently, due to the growing interest in developing models for human-agent interaction. Prior work on the topic focused…

Computation and Language · Computer Science 2019-03-26 Ben Bogin, Mor Geva, Jonathan Berant

Learn What Is Possible, Then Choose What Is Best: Disentangling One-To-Many Relations in Language Through Text-based Games

Language models pre-trained on large self-supervised corpora, followed by task-specific fine-tuning has become the dominant paradigm in NLP. These pre-training datasets often have a one-to-many structure--e.g. in dialogue there are many…

Computation and Language · Computer Science 2023-04-27 Benjamin Towle, Ke Zhou

Emergence of Language with Multi-agent Games: Learning to Communicate with Sequences of Symbols

Learning to communicate through interaction, rather than relying on explicit supervision, is often considered a prerequisite for developing a general AI. We study a setting where two agents engage in playing a referential game and, from…

Machine Learning · Computer Science 2017-11-07 Serhii Havrylov, Ivan Titov