Related papers: An Extensible Interactive Interface for Agent Desi…

Learning to Interactively Learn and Assist

When deploying autonomous agents in the real world, we need effective ways of communicating objectives to them. Traditional skill learning has revolved around reinforcement and imitation learning, each with rigid constraints on the format…

Artificial Intelligence · Computer Science 2019-11-21 Mark Woodward, Chelsea Finn, Karol Hausman

Guiding Policies with Language via Meta-Learning

Behavioral skills or policies for autonomous agents are conventionally learned from reward functions, via reinforcement learning, or from demonstrations, via imitation learning. However, both modes of task specification have their…

Machine Learning · Computer Science 2019-01-30 John D. Co-Reyes, Abhishek Gupta, Suvansh Sanjeev, Nick Altieri, Jacob Andreas, John DeNero, Pieter Abbeel, Sergey Levine

Learning User Preferences via Reinforcement Learning with Spatial Interface Valuing

Interactive Machine Learning is concerned with creating systems that operate in environments alongside humans to achieve a task. A typical use is to extend or amplify the capabilities of a human in cognitive or physical ways, requiring the…

Machine Learning · Computer Science 2019-02-05 Miguel Alonso

Learning Dense Rewards for Contact-Rich Manipulation Tasks

Rewards play a crucial role in reinforcement learning. To arrive at the desired policy, the design of a suitable reward function often requires significant domain expertise as well as trial-and-error. Here, we aim to minimize the effort…

Robotics · Computer Science 2020-11-18 Zheng Wu, Wenzhao Lian, Vaibhav Unhelkar, Masayoshi Tomizuka, Stefan Schaal

Towards better dense rewards in Reinforcement Learning Applications

Finding meaningful and accurate dense rewards is a fundamental task in the field of reinforcement learning (RL) that enables agents to explore environments more efficiently. In traditional RL settings, agents learn optimal policies through…

Artificial Intelligence · Computer Science 2025-12-05 Shuyuan Zhang

Effective Reward Specification in Deep Reinforcement Learning

In the last decade, Deep Reinforcement Learning has evolved into a powerful tool for complex sequential decision-making problems. It combines deep learning's proficiency in processing rich input signals with reinforcement learning's…

Machine Learning · Computer Science 2024-12-11 Julien Roy

Meta-Inverse Reinforcement Learning with Probabilistic Context Variables

Providing a suitable reward function to reinforcement learning can be difficult in many real world applications. While inverse reinforcement learning (IRL) holds promise for automatically learning reward functions from demonstrations,…

Machine Learning · Computer Science 2019-10-29 Lantao Yu, Tianhe Yu, Chelsea Finn, Stefano Ermon

Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning

Adaptive interfaces can help users perform sequential decision-making tasks like robotic teleoperation given noisy, high-dimensional command signals (e.g., from a brain-computer interface). Recent advances in human-in-the-loop machine…

Robotics · Computer Science 2023-09-08 Jensen Gao, Siddharth Reddy, Glen Berseth, Anca D. Dragan, Sergey Levine

Improving interactive reinforcement learning: What makes a good teacher?

Interactive reinforcement learning has become an important apprenticeship approach to speed up convergence in classic reinforcement learning problems. In this regard, a variant of interactive reinforcement learning is policy shaping which…

Artificial Intelligence · Computer Science 2019-04-16 Francisco Cruz, Sven Magg, Yukie Nagai, Stefan Wermter

Informativeness of Reward Functions in Reinforcement Learning

Reward functions are central in specifying the task we want a reinforcement learning agent to perform. Given a task and desired optimal behavior, we study the problem of designing informative reward functions so that the designed rewards…

Machine Learning · Computer Science 2024-02-13 Rati Devidze, Parameswaran Kamalaruban, Adish Singla

Discovering Reinforcement Learning Interfaces with Large Language Models

Reinforcement learning systems rely on environment interfaces that specify observations and reward functions, yet constructing these interfaces for new tasks often requires substantial manual effort. While recent work has automated reward…

Machine Learning · Computer Science 2026-05-06 Akshat Singh Jaswal, Ashish Baghel, Paras Chopra

Dynamic Policy Fusion for User Alignment Without Re-Interaction

Deep reinforcement learning (RL) policies, although optimal in terms of task rewards, may not align with the personal preferences of human users. To ensure this alignment, a naive solution would be to retrain the agent using a reward…

Artificial Intelligence · Computer Science 2025-09-22 Ajsal Shereef Palattuparambil, Thommen George Karimpanal, Santu Rana

A Survey on Reinforcement Learning Methods in Character Animation

Reinforcement Learning is an area of Machine Learning focused on how agents can be trained to make sequential decisions, and achieve a particular goal within an arbitrary environment. While learning, they repeatedly take actions based on…

Graphics · Computer Science 2022-05-26 Ariel Kwiatkowski, Eduardo Alvarado, Vicky Kalogeiton, C. Karen Liu, Julien Pettré, Michiel van de Panne, Marie-Paule Cani

Reinforcement Learning Agent Training with Goals for Real World Tasks

Reinforcement Learning (RL) is a promising approach for solving various control, optimization, and sequential decision making tasks. However, designing reward functions for complex tasks (e.g., with multiple objectives and safety…

Artificial Intelligence · Computer Science 2021-07-23 Xuan Zhao, Marcos Campos

Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Reinforcement learning (RL) agents improve through trial-and-error, but when reward is sparse and the agent cannot discover successful action sequences, learning stagnates. This has been a notable problem in training deep RL agents to…

Artificial Intelligence · Computer Science 2018-02-27 Evan Zheran Liu, Kelvin Guu, Panupong Pasupat, Tianlin Shi, Percy Liang

Scalable agent alignment via reward modeling: a research direction

One obstacle to applying reinforcement learning algorithms to real-world problems is the lack of suitable reward functions. Designing such reward functions is difficult in part because the user only has an implicit understanding of the task…

Machine Learning · Computer Science 2018-11-20 Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg

MARLUI: Multi-Agent Reinforcement Learning for Adaptive UIs

Adaptive user interfaces (UIs) automatically change an interface to better support users' tasks. Recently, machine learning techniques have enabled the transition to more powerful and complex adaptive UIs. However, a core challenge for…

Human-Computer Interaction · Computer Science 2023-10-30 Thomas Langerak, Sammy Christen, Mert Albaba, Christoph Gebhardt, Otmar Hilliges

Unsupervised Perceptual Rewards for Imitation Learning

Reward function design and exploration time are arguably the biggest obstacles to the deployment of reinforcement learning (RL) agents in the real world. In many real-world tasks, designing a reward function takes considerable hand…

Computer Vision and Pattern Recognition · Computer Science 2017-06-14 Pierre Sermanet, Kelvin Xu, Sergey Levine

Automata Guided Reinforcement Learning With Demonstrations

Tasks with complex temporal structures and long horizons pose a challenge for reinforcement learning agents due to the difficulty in specifying the tasks in terms of reward functions as well as large variances in the learning signals. We…

Artificial Intelligence · Computer Science 2018-09-27 Xiao Li, Yao Ma, Calin Belta

Generating Diverse Programs with Instruction Conditioned Reinforced Adversarial Learning

Advances in Deep Reinforcement Learning have led to agents that perform well across a variety of sensory-motor domains. In this work, we study the setting in which an agent must learn to generate programs for diverse scenes conditioned on a…

Machine Learning · Computer Science 2018-12-04 Aishwarya Agrawal, Mateusz Malinowski, Felix Hill, Ali Eslami, Oriol Vinyals, Tejas Kulkarni