Related papers: Regularized GLISp for sensor-guided human-in-the-l…

GLISp-r: A preference-based optimization algorithm with convergence guarantees

Preference-based optimization algorithms are iterative procedures that seek the optimal calibration of a decision vector based only on comparisons between couples of different tunings. At each iteration, a human decision-maker expresses a…

Optimization and Control · Mathematics 2023-10-03 Davide Previtali, Mirko Mazzoleni, Antonio Ferramosca, Fabio Previdi

Preference-based MPC calibration

Automating the calibration of the parameters of a control policy by means of global optimization requires quantifying a closed-loop performance function. As this can be impractical in many situations, in this paper we suggest a…

Optimization and Control · Mathematics 2021-05-27 Mengjia Zhu, Alberto Bemporad, Dario Piga

C-GLISp: Preference-Based Global Optimization under Unknown Constraints with Applications to Controller Calibration

Preference-based global optimization algorithms minimize an unknown objective function only based on whether the function is better, worse, or similar for given pairs of candidate optimization vectors. Such optimization problems arise in…

Optimization and Control · Mathematics 2021-12-21 Mengjia Zhu, Dario Piga, Alberto Bemporad

Human-in-the-loop: Real-time Preference Optimization

Optimization with preference feedback is an active research area with many applications in engineering systems where humans play a central role, such as building control and autonomous vehicles. While most existing studies focus on…

Optimization and Control · Mathematics 2026-03-31 Wenbin Wang, Wenjie Xu, Colin N. Jones

Bayesian Preference Elicitation: Human-In-The-Loop Optimization of An Active Prosthesis

Tuning active prostheses for people with amputation is time-consuming and relies on metrics that may not fully reflect user needs. We introduce a human-in-the-loop optimization (HILO) approach that leverages direct user preferences to…

Robotics · Computer Science 2026-02-27 Sophia Taddei, Wouter Koppen, Eligia Alfio, Stefano Nuzzo, Louis Flynn, Maria Alejandra Diaz, Sebastian Rojas Gonzalez, Tom Dhaene, Kevin De Pauw, Ivo Couckuyt, Tom Verstraten

POLAR: Preference Optimization and Learning Algorithms for Robotics

Parameter tuning for robotic systems is a time-consuming and challenging task that often relies on domain expertise of the human operator. Moreover, existing learning methods are not well suited for parameter tuning for many reasons…

Robotics · Computer Science 2022-08-10 Maegan Tucker, Kejun Li, Yisong Yue, Aaron D. Ames

Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation

We study human-in-the-loop reinforcement learning (RL) with trajectory preferences, where instead of receiving a numeric reward at each step, the agent only receives preferences over trajectory pairs from a human overseer. The goal of the…

Machine Learning · Computer Science 2022-05-25 Xiaoyu Chen, Han Zhong, Zhuoran Yang, Zhaoran Wang, Liwei Wang

Human-in-the-Loop Mixup

Aligning model representations to humans has been found to improve robustness and generalization. However, such methods often focus on standard observational data. Synthetic data is proliferating and powering many advances in machine…

Machine Learning · Computer Science 2023-08-01 Katherine M. Collins, Umang Bhatt, Weiyang Liu, Vihari Piratla, Ilia Sucholutsky, Bradley Love, Adrian Weller

Mixed Likelihood Variational Gaussian Processes

Gaussian processes (GPs) are powerful models for human-in-the-loop experiments due to their flexibility and well-calibrated uncertainty. However, GPs modeling human responses typically ignore auxiliary information, including a priori domain…

Machine Learning · Computer Science 2025-03-07 Kaiwen Wu, Craig Sanders, Benjamin Letham, Phillip Guan

A dynamic Bayesian optimized active recommender system for curiosity-driven Human-in-the-loop automated experiments

Optimization of experimental materials synthesis and characterization through active learning methods has been growing over the last decade, with examples ranging from measurements of diffraction on combinatorial alloys at synchrotrons, to…

Machine Learning · Computer Science 2023-04-06 Arpan Biswas, Yongtao Liu, Nicole Creange, Yu-Chen Liu, Stephen Jesse, Jan-Chi Yang, Sergei V. Kalinin, Maxim A. Ziatdinov, Rama K. Vasudevan

Gray-Box Nonlinear Feedback Optimization

Feedback optimization enables autonomous optimality seeking of a dynamical system through its closed-loop interconnection with iterative optimization algorithms. Among various iteration structures, model-based approaches require the…

Optimization and Control · Mathematics 2026-05-26 Zhiyu He, Saverio Bolognani, Michael Muehlebach, Florian Dörfler

Time-Series-Informed Closed-loop Learning for Sequential Decision Making and Control

Closed-loop performance of sequential decision making algorithms, such as model predictive control, depends strongly on the choice of controller parameters. Bayesian optimization allows learning of parameters from closed-loop experiments,…

Systems and Control · Electrical Eng. & Systems 2025-11-18 Sebastian Hirt, Lukas Theiner, Rolf Findeisen

Continual Human-in-the-Loop Optimization

Optimal input settings vary across users due to differences in motor abilities and personal preferences, which are typically addressed by manual tuning or calibration. Although human-in-the-loop optimization has the potential to identify…

Human-Computer Interaction · Computer Science 2025-03-10 Yi-Chi Liao, Paul Streli, Zhipeng Li, Christoph Gebhardt, Christian Holz

Self-Paced Learning: an Implicit Regularization Perspective

Self-paced learning (SPL) mimics the cognitive mechanism of humans and animals, which gradually learn from easy to hard samples. One key issue in SPL is to obtain a better weighting strategy, which is determined by the minimizer function. Existing…

Machine Learning · Computer Science 2016-09-20 Yanbo Fan, Ran He, Jian Liang, Bao-Gang Hu

STRAPPER: Preference-based Reinforcement Learning via Self-training Augmentation and Peer Regularization

Preference-based reinforcement learning (PbRL) promises to learn a complex reward function with binary human preference. However, such human-in-the-loop formulation requires considerable human effort to assign preference labels to segment…

Machine Learning · Computer Science 2023-07-20 Yachen Kang, Li He, Jinxin Liu, Zifeng Zhuang, Donglin Wang

Human-in-the-Loop Policy Optimization for Preference-Based Multi-Objective Reinforcement Learning

Multi-objective reinforcement learning (MORL) aims to find a set of high-performing and diverse policies that address trade-offs between multiple conflicting objectives. However, in practice, decision makers (DMs) often deploy only one or a…

Neural and Evolutionary Computing · Computer Science 2024-01-05 Ke Li, Han Guo

Beyond entropic regularization: Debiased Gaussian estimators for discrete optimal transport and general linear programs

This work proposes new estimators for discrete optimal transport plans that enjoy Gaussian limits centered at the true solution. This behavior stands in stark contrast with the performance of existing estimators, including those based on…

Statistics Theory · Mathematics 2025-05-08 Shuyu Liu, Florentina Bunea, Jonathan Niles-Weed

Asymptotic Consistency and Generalization in Hybrid Models of Regularized Selection and Nonlinear Learning

This study explores how different types of supervised models perform in the task of predicting and selecting relevant variables in high-dimensional contexts, especially when the data is very noisy. We analyzed three approaches: regularized…

Other Statistics · Statistics 2025-09-03 Luciano Ribeiro Galvão, Rafael de Andrade Mora

AI-Guided Human-In-the-Loop Inverse Design of High Performance Engineering Structures

Inverse design tools such as Topology Optimization (TO) can achieve new levels of improvement for high-performance engineered structures. However, widespread use is hindered by high computational times and a black-box nature that inhibits…

Machine Learning · Computer Science 2026-01-19 Dat Quoc Ha, Md Ferdous Alam, Markus J. Buehler, Faez Ahmed, Josephine V. Carstensen

The Implicit Bias of Logit Regularization

Logit regularization, the addition of a convex penalty directly in logit space, is widely used in modern classifiers, with label smoothing as a prominent example. While such methods often improve calibration and generalization, their…

Machine Learning · Statistics 2026-02-16 Alon Beck, Yohai Bar Sinai, Noam Levi