Related papers: Preference-Conditioned Language-Guided Abstraction

Learning with Language-Guided State Abstractions

We describe a framework for using natural language to design state abstractions for imitation learning. Generalizable policy learning in high-dimensional observation spaces is facilitated by well-designed state representations, which can…

Robotics · Computer Science 2024-03-07 Andi Peng , Ilia Sucholutsky , Belinda Z. Li , Theodore R. Sumers , Thomas L. Griffiths , Jacob Andreas , Julie A. Shah

Humans and transformer LMs: Abstraction drives language learning

Categorization is a core component of human linguistic competence. We investigate how a transformer-based language model (LM) learns linguistic categories by comparing its behaviour over the course of training to behaviours which…

Computation and Language · Computer Science 2026-03-19 Jasper Jian , Christopher D. Manning

PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning

Preference-based reinforcement learning (RL) has emerged as a new field in robot learning, where humans play a pivotal role in shaping robot behavior by expressing preferences on different sequences of state-action pairs. However,…

Robotics · Computer Science 2024-02-26 Simon Holk , Daniel Marta , Iolanda Leite

Abstracting Probabilistic Models: A Logical Perspective

Abstraction is a powerful idea widely used in science, to model, reason and explain the behavior of systems in a more tractable search space, by omitting irrelevant details. While notions of abstraction have matured for deterministic…

Artificial Intelligence · Computer Science 2020-01-14 Vaishak Belle

HELM: Human-Preferred Exploration with Language Models

In autonomous exploration tasks, robots are required to explore and map unknown environments while efficiently planning in dynamic and uncertain conditions. Given the significant variability of environments, human operators often have…

Robotics · Computer Science 2025-03-11 Shuhao Liao , Xuxin Lv , Yuhong Cao , Jeric Lew , Wenjun Wu , Guillaume Sartoretti

Adaptive Language-Guided Abstraction from Contrastive Explanations

Many approaches to robot learning begin by inferring a reward function from a set of human demonstrations. To learn a good reward, it is necessary to determine which features of the environment are relevant before determining how these…

Robotics · Computer Science 2024-09-17 Andi Peng , Belinda Z. Li , Ilia Sucholutsky , Nishanth Kumar , Julie A. Shah , Jacob Andreas , Andreea Bobu

Efficiently Generating Expressive Quadruped Behaviors via Language-Guided Preference Learning

Expressive robotic behavior is essential for the widespread acceptance of robots in social environments. Recent advancements in learned legged locomotion controllers have enabled more dynamic and versatile robot behaviors. However,…

Robotics · Computer Science 2025-04-02 Jaden Clark , Joey Hejna , Dorsa Sadigh

LAPP: Large Language Model Feedback for Preference-Driven Reinforcement Learning

We introduce Large Language Model-Assisted Preference Prediction (LAPP), a novel framework for robot learning that enables efficient, customizable, and expressive behavior acquisition with minimum human effort. Unlike prior approaches that…

Robotics · Computer Science 2025-04-23 Pingcheng Jian , Xiao Wei , Yanbaihui Liu , Samuel A. Moore , Michael M. Zavlanos , Boyuan Chen

Interactive Learning of State Representation through Natural Language Instruction and Explanation

One significant simplification in most previous work on robot learning is the closed-world assumption where the robot is assumed to know ahead of time a complete set of predicates describing the state of the physical world. However, robots…

Artificial Intelligence · Computer Science 2017-10-10 Qiaozi Gao , Lanbo She , Joyce Y. Chai

Embodied Active Learning of Relational State Abstractions for Bilevel Planning

State abstraction is an effective technique for planning in robotics environments with continuous states and actions, long task horizons, and sparse feedback. In object-oriented environments, predicates are a particularly useful form of…

Robotics · Computer Science 2023-06-21 Amber Li , Tom Silver

VLP: Vision-Language Preference Learning for Embodied Manipulation

Reward engineering is one of the key challenges in Reinforcement Learning (RL). Preference-based RL effectively addresses this issue by learning from human feedback. However, it is both time-consuming and expensive to collect human…

Machine Learning · Computer Science 2025-02-18 Runze Liu , Chenjia Bai , Jiafei Lyu , Shengjie Sun , Yali Du , Xiu Li

(How) Do Language Models Track State?

Transformer language models (LMs) exhibit behaviors -- from storytelling to code generation -- that seem to require tracking the unobserved state of an evolving world. How do they do this? We study state tracking in LMs trained or…

Computation and Language · Computer Science 2025-11-03 Belinda Z. Li , Zifan Carl Guo , Jacob Andreas

Evidence from fMRI Supports a Two-Phase Abstraction Process in Language Models

Research has repeatedly demonstrated that intermediate hidden states extracted from large language models are able to predict measured brain response to natural language stimuli. Yet, very little is known about the representation properties…

Computation and Language · Computer Science 2026-03-16 Emily Cheng , Richard J. Antonello

Using Large Language Models for Abstraction of Planning Domains - Extended Version

Generating an abstraction of a dynamic domain that aligns with a given purpose remains a significant challenge given that the choice of such an abstraction can impact an agent's ability to plan, reason, and provide explanations effectively.…

Artificial Intelligence · Computer Science 2025-10-24 Bita Banihashemi , Megh Patel , Yves Lespérance

A Survey on Human Preference Learning for Large Language Models

The recent surge of versatile large language models (LLMs) largely depends on aligning increasingly capable foundation models with human intentions by preference learning, enhancing LLMs with excellent applicability and effectiveness in a…

Computation and Language · Computer Science 2024-06-19 Ruili Jiang , Kehai Chen , Xuefeng Bai , Zhixuan He , Juntao Li , Muyun Yang , Tiejun Zhao , Liqiang Nie , Min Zhang

Does Deep Learning Learn to Abstract? A Systematic Probing Framework

Abstraction is a desirable capability for deep learning models, which means to induce abstract concepts from concrete instances and flexibly apply them beyond the learning context. At the same time, there is a lack of clear understanding…

Machine Learning · Computer Science 2023-02-24 Shengnan An , Zeqi Lin , Bei Chen , Qiang Fu , Nanning Zheng , Jian-Guang Lou

Towards Preference Learning for Autonomous Ground Robot Navigation Tasks

We are interested in the design of autonomous robot behaviors that learn the preferences of users over continued interactions, with the goal of efficiently executing navigation behaviors in a way that the user expects. In this paper, we…

Robotics · Computer Science 2020-11-06 Cory Hayes , Matthew Marge

Pretraining Language Models with Human Preferences

Language models (LMs) are pretrained to imitate internet text, including content that would violate human preferences if generated by an LM: falsehoods, offensive comments, personally identifiable information, low-quality or buggy code, and…

Computation and Language · Computer Science 2023-06-16 Tomasz Korbak , Kejian Shi , Angelica Chen , Rasika Bhalerao , Christopher L. Buckley , Jason Phang , Samuel R. Bowman , Ethan Perez

Behavior and Representation in Large Language Models for Combinatorial Optimization: From Feature Extraction to Algorithm Selection

Recent advances in Large Language Models (LLMs) have opened new perspectives for automation in optimization. While several studies have explored how LLMs can generate or solve optimization models, far less is understood about what these…

Artificial Intelligence · Computer Science 2025-12-16 Francesca Da Ros , Luca Di Gaspero , Kevin Roitero

Preferences Implicit in the State of the World

Reinforcement learning (RL) agents optimize only the features specified in a reward function and are indifferent to anything left out inadvertently. This means that we must not only specify what to do, but also the much larger space of what…

Machine Learning · Computer Science 2019-04-22 Rohin Shah , Dmitrii Krasheninnikov , Jordan Alexander , Pieter Abbeel , Anca Dragan