Related papers: Learning to Generalize for Sequential Decision Mak…

RetICL: Sequential Retrieval of In-Context Examples with Reinforcement Learning

Recent developments in large pre-trained language models have enabled unprecedented performance on a variety of downstream tasks. Achieving best performance with these models often leverages in-context learning, where a model performs a…

Computation and Language · Computer Science 2024-04-17 Alexander Scarlatos, Andrew Lan

Active Example Selection for In-Context Learning

With a handful of demonstration examples, large-scale language models show strong capability to perform various tasks by in-context learning from these examples, without any fine-tuning. We demonstrate that in-context learning performance…

Computation and Language · Computer Science 2022-11-10 Yiming Zhang, Shi Feng, Chenhao Tan

Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning

Complex, multi-task problems have proven to be difficult to solve efficiently in a sparse-reward reinforcement learning setting. In order to be sample efficient, multi-task learning requires reuse and sharing of low-level policies. To…

Machine Learning · Computer Science 2021-09-28 Valerie Chen, Abhinav Gupta, Kenneth Marino

On the Modeling Capabilities of Large Language Models for Sequential Decision Making

Large pretrained models are showing increasingly better performance in reasoning and planning tasks across different modalities, opening the possibility to leverage them for complex sequential decision making problems. In this paper, we…

Artificial Intelligence · Computer Science 2024-10-10 Martin Klissarov, Devon Hjelm, Alexander Toshev, Bogdan Mazoure

Pre-Trained Language Models for Interactive Decision-Making

Language model (LM) pre-training is useful in many language processing tasks. But can pre-trained LMs be further leveraged for more general machine learning problems? We propose an approach for using LMs to scaffold learning and…

Machine Learning · Computer Science 2022-11-01 Shuang Li, Xavier Puig, Chris Paxton, Yilun Du, Clinton Wang, Linxi Fan, Tao Chen, De-An Huang, Ekin Akyürek, Anima Anandkumar, Jacob Andreas, Igor Mordatch, Antonio Torralba, Yuke Zhu

Teacher-student curriculum learning for reinforcement learning

Reinforcement learning (RL) is a popular paradigm for sequential decision making problems. The past decade's advances in RL have led to breakthroughs in many challenging domains such as video games, board games, robotics, and chip design.…

Machine Learning · Computer Science 2022-11-01 Yanick Schraner

Representation Learning for Grounded Spatial Reasoning

The interpretation of spatial references is highly contextual, requiring joint inference over both language and the environment. We consider the task of spatial reasoning in a simulated environment, where an agent can act and receive…

Computation and Language · Computer Science 2017-11-15 Michael Janner, Karthik Narasimhan, Regina Barzilay

Improving Generalization in Reinforcement Learning Training Regimes for Social Robot Navigation

In order for autonomous mobile robots to navigate in human spaces, they must abide by our social norms. Reinforcement learning (RL) has emerged as an effective method to train sequential decision-making policies that are able to respect…

Robotics · Computer Science 2024-03-01 Adam Sigal, Hsiu-Chin Lin, AJung Moon

Deep Generative Models for Decision-Making and Control

Deep model-based reinforcement learning methods offer a conceptually simple approach to the decision-making and control problem: use learning for the purpose of estimating an approximate dynamics model, and offload the rest of the work to…

Machine Learning · Computer Science 2023-07-13 Michael Janner

A Sequential Model for Multi-Class Classification

Many classification problems require decisions among a large number of competing classes. These tasks, however, are not handled well by general purpose learning methods and are usually addressed in an ad-hoc fashion. We suggest a general…

Artificial Intelligence · Computer Science 2007-05-23 Yair Even-Zohar, Dan Roth

Lexicon Learning for Few-Shot Neural Sequence Modeling

Sequence-to-sequence transduction is the core problem in language processing applications as diverse as semantic parsing, machine translation, and instruction following. The neural network models that provide the dominant solution to these…

Computation and Language · Computer Science 2021-06-09 Ekin Akyürek, Jacob Andreas

Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning

Modern approaches to text to speech require the entire input character sequence to be processed before any audio is synthesised. This latency limits the suitability of such models for time-sensitive tasks like simultaneous interpretation.…

Audio and Speech Processing · Electrical Eng. & Systems 2021-06-16 Devang S Ram Mohan, Raphael Lenain, Lorenzo Foglianti, Tian Huey Teh, Marlene Staib, Alexandra Torresquintero, Jiameng Gao

Refine and Imitate: Reducing Repetition and Inconsistency in Persuasion Dialogues via Reinforcement Learning and Human Demonstration

Persuasion dialogue systems reflect the machine's ability to make strategic moves beyond verbal communication, and therefore differentiate themselves from task-oriented or open-domain dialogue systems and have their own unique values.…

Computation and Language · Computer Science 2022-10-25 Weiyan Shi, Yu Li, Saurav Sahay, Zhou Yu

Improving Latent Generalization Using Test-time Compute

Language Models (LMs) exhibit two distinct mechanisms for knowledge acquisition: in-weights learning (i.e., encoding information within the model weights) and in-context learning (ICL). Although these two modes offer complementary…

Machine Learning · Computer Science 2026-04-03 Arslan Chaudhry, Sridhar Thiagarajan, Andrew Lampinen

Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-Alignment

The natural world is abundant with concepts expressed via visual, acoustic, tactile, and linguistic modalities. Much of the existing progress in multimodal learning, however, focuses primarily on problems where the same set of modalities…

Machine Learning · Computer Science 2020-12-08 Paul Pu Liang, Peter Wu, Liu Ziyin, Louis-Philippe Morency, Ruslan Salakhutdinov

Exploring Length Generalization in Large Language Models

The ability to extrapolate from short problem instances to longer ones is an important form of out-of-distribution generalization in reasoning tasks, and is crucial when learning from datasets where longer problem instances are rare. These…

Computation and Language · Computer Science 2022-11-15 Cem Anil, Yuhuai Wu, Anders Andreassen, Aitor Lewkowycz, Vedant Misra, Vinay Ramasesh, Ambrose Slone, Guy Gur-Ari, Ethan Dyer, Behnam Neyshabur

Improving Systematic Generalization Through Modularity and Augmentation

Systematic generalization is the ability to combine known parts into novel meaning; an important aspect of efficient human learning, but a weakness of neural network learning. In this work, we investigate how two well-known modeling…

Artificial Intelligence · Computer Science 2022-02-23 Laura Ruis, Brenden Lake

Building Decision Making Models Through Language Model Regime

We propose a novel approach for decision making problems leveraging the generalization capabilities of large language models (LLMs). Traditional methods such as expert systems, planning algorithms, and reinforcement learning often exhibit…

Computation and Language · Computer Science 2024-08-13 Yu Zhang, Haoxiang Liu, Feijun Jiang, Weihua Luo, Kaifu Zhang

Learning Invariances for Policy Generalization

While recent progress has spawned very powerful machine learning systems, those agents remain extremely specialized and fail to transfer the knowledge they gain to similar yet unseen tasks. In this paper, we study a simple reinforcement…

Machine Learning · Computer Science 2020-12-15 Remi Tachet, Philip Bachman, Harm van Seijen

Towards Understanding the Relationship between In-context Learning and Compositional Generalization

According to the principle of compositional generalization, the meaning of a complex expression can be understood as a function of the meaning of its parts and of how they are combined. This principle is crucial for human language…

Computation and Language · Computer Science 2024-03-19 Sungjun Han, Sebastian Padó