Related papers: Preference-Based Batch and Sequential Teaching

Preference-Based Batch and Sequential Teaching: Towards a Unified View of Models

Algorithmic machine teaching studies the interaction between a teacher and a learner where the teacher selects labeled examples aiming at teaching a target hypothesis. In a quest to lower teaching complexity and to achieve more natural…

Machine Learning · Computer Science 2019-10-25 Farnam Mansouri, Yuxin Chen, Ara Vartanian, Xiaojin Zhu, Adish Singla

Preference-based Teaching

We introduce a new model of teaching named "preference-based teaching" and a corresponding complexity parameter, the preference-based teaching dimension (PBTD), representing the worst-case number of examples needed to teach any concept in…

Machine Learning · Computer Science 2017-02-09 Ziyuan Gao, Christoph Ries, Hans Ulrich Simon, Sandra Zilles

Adaptive Teaching of Temporal Logic Formulas to Learners with Preferences

Machine teaching is an algorithmic framework for teaching a target hypothesis via a sequence of examples or demonstrations. We investigate machine teaching for temporal logic formulas, a novel and expressive hypothesis class amenable to…

Artificial Intelligence · Computer Science 2020-01-28 Zhe Xu, Yuxin Chen, Ufuk Topcu

Batch Active Preference-Based Learning of Reward Functions

Data generation and labeling are usually an expensive part of learning for robotics. While active learning methods are commonly used to tackle the former problem, preference-based learning is a concept that attempts to solve the latter by…

Machine Learning · Computer Science 2018-10-11 Erdem Bıyık, Dorsa Sadigh

Dynamic Teaching in Sequential Decision Making Environments

We describe theoretical bounds and a practical algorithm for teaching a model by demonstration in a sequential decision making environment. Unlike previous efforts that have optimized learners that watch a teacher demonstrate a static…

Machine Learning · Computer Science 2012-10-19 Thomas J. Walsh, Sergiu Goschin

A Systematic Examination of Preference Learning through the Lens of Instruction-Following

Preference learning is a widely adopted post-training technique that aligns large language models (LLMs) to human preferences and improves specific downstream task capabilities. In this work we systematically investigate how specific…

Computation and Language · Computer Science 2024-12-23 Joongwon Kim, Anirudh Goyal, Aston Zhang, Bo Xiong, Rui Hou, Melanie Kambadur, Dhruv Mahajan, Hannaneh Hajishirzi, Liang Tan

A preference learning framework for multiple criteria sorting with diverse additive value models and valued assignment examples

We present a preference learning framework for multiple criteria sorting. We consider sorting procedures applying an additive value model with diverse types of marginal value functions (including linear, piecewise-linear, splined, and…

Machine Learning · Computer Science 2019-10-15 Jiapeng Liu, Milosz Kadzinski, Xiuwu Liao, Xiaoxin Mao, Yao Wang

Batch Active Learning of Reward Functions from Human Preferences

Data generation and labeling are often expensive in robot learning. Preference-based learning is a concept that enables reliable labeling by querying users with preference questions. Active querying methods are commonly employed in…

Machine Learning · Computer Science 2024-02-27 Erdem Bıyık, Nima Anari, Dorsa Sadigh

Combining Outcome-Based and Preference-Based Matching: A Constrained Priority Mechanism

We introduce a constrained priority mechanism that combines outcome-based matching from machine-learning with preference-based allocation schemes common in market design. Using real-world data, we illustrate how our mechanism could be…

General Economics · Economics 2020-08-13 Avidit Acharya, Kirk Bansak, Jens Hainmueller

A Framework for Interactive Knowledge-Aided Machine Teaching

Machine Teaching (MT) is an interactive process where humans train a machine learning model by playing the role of a teacher. The process of designing an MT system involves decisions that can impact both efficiency of human teachers and…

Artificial Intelligence · Computer Science 2022-04-25 Karan Taneja, Harshvardhan Sikka, Ashok Goel

The Complexity of Learning Acyclic Conditional Preference Networks

Learning of user preferences, as represented by, for example, Conditional Preference Networks (CP-nets), has become a core issue in AI research. Recent studies investigate learning of CP-nets from randomly chosen examples or from membership…

Artificial Intelligence · Computer Science 2019-02-06 Eisa Alanazi, Malek Mouhoub, Sandra Zilles

A Generalized Acquisition Function for Preference-based Reward Learning

Preference-based reward learning is a popular technique for teaching robots and autonomous systems how a human user wants them to perform a task. Previous works have shown that actively synthesizing preference queries to maximize…

Robotics · Computer Science 2024-03-12 Evan Ellis, Gaurav R. Ghosal, Stuart J. Russell, Anca Dragan, Erdem Bıyık

Preference Curriculum: LLMs Should Always Be Pretrained on Their Preferred Data

Large language models (LLMs) generally utilize a consistent data distribution throughout the pretraining process. However, as the model's capability improves, it is intuitive that its data preferences dynamically change, indicating the need…

Computation and Language · Computer Science 2025-02-18 Xuemiao Zhang, Liangyu Xu, Feiyu Duan, Yongwei Zhou, Sirui Wang, Rongxiang Weng, Jingang Wang, Xunliang Cai

Iterative Machine Teaching

In this paper, we consider the problem of machine teaching, the inverse problem of machine learning. Different from traditional machine teaching which views the learners as batch algorithms, we study a new paradigm where the learner uses an…

Machine Learning · Statistics 2017-11-21 Weiyang Liu, Bo Dai, Ahmad Humayun, Charlene Tay, Chen Yu, Linda B. Smith, James M. Rehg, Le Song

Understanding the Role of Adaptivity in Machine Teaching: The Case of Version Space Learners

In real-world applications of education, an effective teacher adaptively chooses the next example to teach based on the learner's current state. However, most existing work in algorithmic machine teaching focuses on the batch setting, where…

Machine Learning · Computer Science 2018-12-11 Yuxin Chen, Adish Singla, Oisin Mac Aodha, Pietro Perona, Yisong Yue

Let the Model Decide its Curriculum for Multitask Learning

Curriculum learning strategies in prior multi-task learning approaches arrange datasets in a difficulty hierarchy either based on human perception or by exhaustively searching the optimal arrangement. However, human perception of difficulty…

Machine Learning · Computer Science 2022-05-30 Neeraj Varshney, Swaroop Mishra, Chitta Baral

Adaptive Helpfulness-Harmlessness Alignment with Preference Vectors

Ensuring that large language models (LLMs) are both helpful and harmless is a critical challenge, as overly strict constraints can lead to excessive refusals, while permissive models risk generating harmful content. Existing approaches,…

Machine Learning · Computer Science 2026-02-05 Ren-Wei Liang, Chin-Ting Hsu, Chan-Hung Yu, Saransh Agrawal, Shih-Cheng Huang, Chieh-Yen Lin, Shang-Tse Chen, Kuan-Hao Huang, Shao-Hua Sun

SYNAPSE: SYmbolic Neural-Aided Preference Synthesis Engine

This paper addresses the problem of preference learning, which aims to align robot behaviors through learning user specific preferences (e.g. "good pull-over location") from visual demonstrations. Despite its similarity to learning factual…

Robotics · Computer Science 2025-01-16 Sadanand Modak, Noah Patton, Isil Dillig, Joydeep Biswas

Machine Teaching of Active Sequential Learners

Machine teaching addresses the problem of finding the best training data that can guide a learning algorithm to a target model with minimal effort. In conventional settings, a teacher provides data that are consistent with the true data…

Machine Learning · Computer Science 2019-11-04 Tomi Peltola, Mustafa Mert Çelikok, Pedram Daee, Samuel Kaski

The Sample Complexity of Teaching-by-Reinforcement on Q-Learning

We study the sample complexity of teaching, termed as "teaching dimension" (TDim) in the literature, for the teaching-by-reinforcement paradigm, where the teacher guides the student through rewards. This is distinct from the…

Machine Learning · Computer Science 2021-03-09 Xuezhou Zhang, Shubham Kumar Bharti, Yuzhe Ma, Adish Singla, Xiaojin Zhu