Related papers: Predictive Preference Learning from Human Interven…

OHP-RL: Online Human Preference as Guidance in Reinforcement Learning for Robot Manipulation

While reinforcement learning (RL) enables robots to acquire skills autonomously, its real-world deployment is severely limited by inefficient and unsafe exploration. Human-in-the-loop interventions offer a practical solution, yet existing…

Robotics · Computer Science 2026-05-26 Yunyang Mo , Jian Li , Qiwei Wu , Yihang Kang , Renjing Xu

Policy-labeled Preference Learning: Is Preference Enough for RLHF?

To design rewards that align with human goals, Reinforcement Learning from Human Feedback (RLHF) has emerged as a prominent technique for learning reward functions from human preferences and optimizing policies via reinforcement learning…

Machine Learning · Computer Science 2025-05-14 Taehyun Cho , Seokhun Ju , Seungyub Han , Dohyeong Kim , Kyungjae Lee , Jungwoo Lee

Personalization in Human-Robot Interaction through Preference-based Action Representation Learning

Preference-based reinforcement learning (PbRL) has shown significant promise for personalization in human-robot interaction (HRI) by explicitly integrating human preferences into the robot learning process. However, existing practices often…

Robotics · Computer Science 2025-03-12 Ruiqi Wang , Dezhong Zhao , Dayoon Suh , Ziqin Yuan , Guohua Chen , Byung-Cheol Min

Pareto-Optimal Learning from Preferences with Hidden Context

Ensuring AI models align with human values is essential for their safety and functionality. Reinforcement learning from human feedback (RLHF) leverages human preferences to achieve this alignment. However, when preferences are sourced from…

Machine Learning · Computer Science 2025-02-10 Ryan Bahlous-Boldi , Li Ding , Lee Spector , Scott Niekum

Swap-guided Preference Learning for Personalized Reinforcement Learning from Human Feedback

Reinforcement Learning from Human Feedback (RLHF) is a widely used approach to align large-scale AI systems with human values. However, RLHF typically assumes a single, universal reward, which overlooks diverse preferences and limits…

Machine Learning · Computer Science 2026-03-16 Gihoon Kim , Euntai Kim

Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation

We study human-in-the-loop reinforcement learning (RL) with trajectory preferences, where instead of receiving a numeric reward at each step, the agent only receives preferences over trajectory pairs from a human overseer. The goal of the…

Machine Learning · Computer Science 2022-05-25 Xiaoyu Chen , Han Zhong , Zhuoran Yang , Zhaoran Wang , Liwei Wang

Learning the Preferences of a Learning Agent

For AI systems to be useful to humans, they must understand and act in accordance with our values and preferences. Since specifying preferences is a hard task, inverse reinforcement learning (IRL) aims to develop methods that allow for…

Artificial Intelligence · Computer Science 2026-05-12 Karim Abdel Sadek , Mark Bedaywi , Rhys Gould , Stuart Russell

Hindsight Preference Learning for Offline Preference-based Reinforcement Learning

Offline preference-based reinforcement learning (RL), which focuses on optimizing policies using human preferences between pairs of trajectory segments selected from an offline dataset, has emerged as a practical avenue for RL applications.…

Machine Learning · Computer Science 2024-07-08 Chen-Xiao Gao , Shengjun Fang , Chenjun Xiao , Yang Yu , Zongzhang Zhang

Contrastive Preference Learning: Learning from Human Feedback without RL

Reinforcement Learning from Human Feedback (RLHF) has emerged as a popular paradigm for aligning models with human intent. Typically RLHF algorithms operate in two phases: first, use human preferences to learn a reward function and second,…

Machine Learning · Computer Science 2024-05-01 Joey Hejna , Rafael Rafailov , Harshit Sikchi , Chelsea Finn , Scott Niekum , W. Bradley Knox , Dorsa Sadigh

Meta Preference Learning for Fast User Adaptation in Human-Supervisory Multi-Robot Deployments

As multi-robot systems (MRS) are widely used in various tasks such as natural disaster response and social security, people enthusiastically expect an MRS to be ubiquitous that a general user without heavy training can easily operate.…

Robotics · Computer Science 2021-03-16 Chao Huang , Wenhao Luo , Rui Liu

Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences

Customizing robotic behaviors to be aligned with diverse human preferences is an underexplored challenge in the field of embodied AI. In this paper, we present Promptable Behaviors, a novel framework that facilitates efficient…

Computer Vision and Pattern Recognition · Computer Science 2023-12-18 Minyoung Hwang , Luca Weihs , Chanwoo Park , Kimin Lee , Aniruddha Kembhavi , Kiana Ehsani

Preference VLM: Leveraging VLMs for Scalable Preference-Based Reinforcement Learning

Preference-based reinforcement learning (RL) offers a promising approach for aligning policies with human intent but is often constrained by the high cost of human feedback. In this work, we introduce PrefVLM, a framework that integrates…

Machine Learning · Computer Science 2025-02-04 Udita Ghosh , Dripta S. Raychaudhuri , Jiachen Li , Konstantinos Karydis , Amit Roy-Chowdhury

Efficiently Generating Expressive Quadruped Behaviors via Language-Guided Preference Learning

Expressive robotic behavior is essential for the widespread acceptance of robots in social environments. Recent advancements in learned legged locomotion controllers have enabled more dynamic and versatile robot behaviors. However,…

Robotics · Computer Science 2025-04-02 Jaden Clark , Joey Hejna , Dorsa Sadigh

Preference Transformer: Modeling Human Preferences using Transformers for RL

Preference-based reinforcement learning (RL) provides a framework to train agents using human preferences between two behaviors. However, preference-based RL has been challenging to scale since it requires a large amount of human feedback…

Machine Learning · Computer Science 2023-03-03 Changyeon Kim , Jongjin Park , Jinwoo Shin , Honglak Lee , Pieter Abbeel , Kimin Lee

PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences

Large foundation models pretrained on raw web-scale data are not readily deployable without additional step of extensive alignment to human preferences. Such alignment is typically done by collecting large amounts of pairwise comparisons…

Machine Learning · Computer Science 2024-06-13 Daiwei Chen , Yi Chen , Aniket Rege , Ramya Korlakai Vinayak

Accounting for Human Learning when Inferring Human Preferences

Inverse reinforcement learning (IRL) is a common technique for inferring human preferences from data. Standard IRL techniques tend to assume that the human demonstrator is stationary, that is that their policy $\pi$ doesn't change over…

Machine Learning · Computer Science 2020-12-02 Harry Giles , Lawrence Chan

Tell me why: Training preferences-based RL with human preferences and step-level explanations

Human-in-the-loop reinforcement learning allows the training of agents through various interfaces, even for non-expert humans. Recently, preference-based methods (PbRL), where the human has to give his preference over two trajectories,…

Artificial Intelligence · Computer Science 2024-08-06 Jakob Karalus

RA-PbRL: Provably Efficient Risk-Aware Preference-Based Reinforcement Learning

Reinforcement Learning from Human Feedback (RLHF) has recently surged in popularity, particularly for aligning large language models and other AI systems with human intentions. At its core, RLHF can be viewed as a specialized instance of…

Machine Learning · Computer Science 2025-01-10 Yujie Zhao , Jose Efraim Aguilar Escamill , Weyl Lu , Huazheng Wang

Prioritized Experience-based Reinforcement Learning with Human Guidance for Autonomous Driving

Reinforcement learning (RL) requires skillful definition and remarkable computational efforts to solve optimization and control problems, which could impair its prospect. Introducing human guidance into reinforcement learning is a promising…

Machine Learning · Computer Science 2022-11-30 Jingda Wu , Zhiyu Huang , Wenhui Huang , Chen Lv

Human-in-the-Loop Policy Optimization for Preference-Based Multi-Objective Reinforcement Learning

Multi-objective reinforcement learning (MORL) aims to find a set of high-performing and diverse policies that address trade-offs between multiple conflicting objectives. However, in practice, decision makers (DMs) often deploy only one or a…

Neural and Evolutionary Computing · Computer Science 2024-01-05 Ke Li , Han Guo