Robotics · Computer Science
POLAR: Preference Optimization and Learning Algorithms for Robotics
Maegan Tucker, Kejun Li, Yisong Yue, Aaron D. Ames
2022-08-10
Computation and Language · Computer Science
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback
Hamish Ivison, Yizhong Wang, Jiacheng Liu, Zeqiu Wu +5
2024-10-10
Computation and Language · Computer Science
Multi-Response Preference Optimization with Augmented Ranking Dataset
Hansle Gwon, Imjin Ahn, Young-Hak Kim, Sanghyun Park +1
2024-12-12
Machine Learning · Computer Science
What Is Preference Optimization Doing, and Why?
Yue Wang, Qizhou Wang, Zizhuo Zhang, Gang Niu +2
2026-05-18
Computation and Language · Computer Science
Cross-Preference Learning for Sentence-Level and Context-Aware Machine Translation
Ying Li, Xinglin Lyu, Junhui Li, Jinlong Yang +4
2026-03-27
Computation and Language · Computer Science
Preference Curriculum: LLMs Should Always Be Pretrained on Their Preferred Data
Xuemiao Zhang, Liangyu Xu, Feiyu Duan, Yongwei Zhou +4
2025-02-18
Computation and Language · Computer Science
PLOT: Enhancing Preference Learning via Optimal Transport
Liang Zhu, Yuelin Bai, Xiankun Ren, Jiaxi Yang +5
2026-04-03
Machine Learning · Computer Science
Multi-Type Preference Learning: Empowering Preference-Based Reinforcement Learning with Equal Preferences
Ziang Liu, Junjie Xu, Xingjiao Wu, Jing Yang +1
2024-10-16
Machine Learning · Computer Science
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
Chen-Xiao Gao, Shengjun Fang, Chenjun Xiao, Yang Yu +1
2024-07-08
Machine Learning · Computer Science
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
Fahim Tajwar, Anikait Singh, Archit Sharma, Rafael Rafailov +5
2024-06-04
Accelerator Physics · Physics
Opportunities in Machine Learning for Particle Accelerators
Auralee Edelen, Christopher Mayes, Daniel Bowring, Daniel Ratner +9
2018-11-09
Artificial Intelligence · Computer Science
ICPL: Few-shot In-context Preference Learning via LLMs
Chao Yu, Qixin Tan, Hong Lu, Jiaxuan Gao +4
2025-04-04
Computation and Language · Computer Science
Towards a Unified View of Preference Learning for Large Language Models: A Survey
Bofei Gao, Feifan Song, Yibo Miao, Zefan Cai +21
2024-11-01
Machine Learning · Computer Science
Pareto-Optimal Learning from Preferences with Hidden Context
Ryan Bahlous-Boldi, Li Ding, Lee Spector, Scott Niekum
2025-02-10
Machine Learning · Computer Science
Debiasing Online Preference Learning via Preference Feature Preservation
Dongyoung Kim, Jinsung Yoon, Jinwoo Shin, Jaehyung Kim
2025-06-16
Computation and Language · Computer Science
Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey
Genta Indra Winata, Hanyang Zhao, Anirban Das, Wenpin Tang +3
2024-11-05
Systems and Control · Electrical Eng. & Systems
Active Learning MPC Objective Functions from Preferences
Hasna El Hasnaouy, Pablo Krupa, Mario Zanon, Alberto Bemporad
2026-05-18