Related papers: Regularized GLISp for sensor-guided human-in-the-l…

200 papers

Preference-based optimization algorithms are iterative procedures that seek the optimal calibration of a decision vector based only on comparisons between couples of different tunings. At each iteration, a human decision-maker expresses a…

Optimization and Control · Mathematics 2023-10-03 Davide Previtali, Mirko Mazzoleni, Antonio Ferramosca, Fabio Previdi

Automating the calibration of the parameters of a control policy by means of global optimization requires quantifying a closed-loop performance function. As this can be impractical in many situations, in this paper we suggest a…

Optimization and Control · Mathematics 2021-05-27 Mengjia Zhu, Alberto Bemporad, Dario Piga

Preference-based global optimization algorithms minimize an unknown objective function only based on whether the function is better, worse, or similar for given pairs of candidate optimization vectors. Such optimization problems arise in…

Optimization and Control · Mathematics 2021-12-21 Mengjia Zhu, Dario Piga, Alberto Bemporad

Optimization with preference feedback is an active research area with many applications in engineering systems where humans play a central role, such as building control and autonomous vehicles. While most existing studies focus on…

Optimization and Control · Mathematics 2026-03-31 Wenbin Wang, Wenjie Xu, Colin N. Jones

Tuning active prostheses for people with amputation is time-consuming and relies on metrics that may not fully reflect user needs. We introduce a human-in-the-loop optimization (HILO) approach that leverages direct user preferences to…

Parameter tuning for robotic systems is a time-consuming and challenging task that often relies on domain expertise of the human operator. Moreover, existing learning methods are not well suited for parameter tuning for many reasons…

Robotics · Computer Science 2022-08-10 Maegan Tucker, Kejun Li, Yisong Yue, Aaron D. Ames

We study human-in-the-loop reinforcement learning (RL) with trajectory preferences, where instead of receiving a numeric reward at each step, the agent only receives preferences over trajectory pairs from a human overseer. The goal of the…

Machine Learning · Computer Science 2022-05-25 Xiaoyu Chen, Han Zhong, Zhuoran Yang, Zhaoran Wang, Liwei Wang

Aligning model representations to humans has been found to improve robustness and generalization. However, such methods often focus on standard observational data. Synthetic data is proliferating and powering many advances in machine…

Machine Learning · Computer Science 2023-08-01 Katherine M. Collins, Umang Bhatt, Weiyang Liu, Vihari Piratla, Ilia Sucholutsky, Bradley Love, Adrian Weller

Gaussian processes (GPs) are powerful models for human-in-the-loop experiments due to their flexibility and well-calibrated uncertainty. However, GPs modeling human responses typically ignore auxiliary information, including a priori domain…

Machine Learning · Computer Science 2025-03-07 Kaiwen Wu, Craig Sanders, Benjamin Letham, Phillip Guan

Optimization of experimental materials synthesis and characterization through active learning methods has been growing over the last decade, with examples ranging from measurements of diffraction on combinatorial alloys at synchrotrons, to…

Feedback optimization enables autonomous optimality seeking of a dynamical system through its closed-loop interconnection with iterative optimization algorithms. Among various iteration structures, model-based approaches require the…

Optimization and Control · Mathematics 2026-05-26 Zhiyu He, Saverio Bolognani, Michael Muehlebach, Florian Dörfler

Closed-loop performance of sequential decision making algorithms, such as model predictive control, depends strongly on the choice of controller parameters. Bayesian optimization allows learning of parameters from closed-loop experiments,…

Systems and Control · Electrical Eng. & Systems 2025-11-18 Sebastian Hirt, Lukas Theiner, Rolf Findeisen

Optimal input settings vary across users due to differences in motor abilities and personal preferences, which are typically addressed by manual tuning or calibration. Although human-in-the-loop optimization has the potential to identify…

Human-Computer Interaction · Computer Science 2025-03-10 Yi-Chi Liao, Paul Streli, Zhipeng Li, Christoph Gebhardt, Christian Holz

Self-paced learning (SPL) mimics the cognitive mechanism of humans and animals, who gradually learn from easy to hard samples. One key issue in SPL is obtaining a better weighting strategy, which is determined by the minimizer function. Existing…

Machine Learning · Computer Science 2016-09-20 Yanbo Fan, Ran He, Jian Liang, Bao-Gang Hu

Preference-based reinforcement learning (PbRL) promises to learn a complex reward function with binary human preference. However, such human-in-the-loop formulation requires considerable human effort to assign preference labels to segment…

Machine Learning · Computer Science 2023-07-20 Yachen Kang, Li He, Jinxin Liu, Zifeng Zhuang, Donglin Wang

Multi-objective reinforcement learning (MORL) aims to find a set of high-performing and diverse policies that address trade-offs between multiple conflicting objectives. However, in practice, decision makers (DMs) often deploy only one or a…

Neural and Evolutionary Computing · Computer Science 2024-01-05 Ke Li, Han Guo

This work proposes new estimators for discrete optimal transport plans that enjoy Gaussian limits centered at the true solution. This behavior stands in stark contrast with the performance of existing estimators, including those based on…

Statistics Theory · Mathematics 2025-05-08 Shuyu Liu, Florentina Bunea, Jonathan Niles-Weed

This study explores how different types of supervised models perform in the task of predicting and selecting relevant variables in high-dimensional contexts, especially when the data is very noisy. We analyzed three approaches: regularized…

Other Statistics · Statistics 2025-09-03 Luciano Ribeiro Galvão, Rafael de Andrade Mora

Inverse design tools such as Topology Optimization (TO) can achieve new levels of improvement for high-performance engineered structures. However, widespread use is hindered by high computational times and a black-box nature that inhibits…

Machine Learning · Computer Science 2026-01-19 Dat Quoc Ha, Md Ferdous Alam, Markus J. Buehler, Faez Ahmed, Josephine V. Carstensen

Logit regularization, the addition of a convex penalty directly in logit space, is widely used in modern classifiers, with label smoothing as a prominent example. While such methods often improve calibration and generalization, their…

Machine Learning · Statistics 2026-02-16 Alon Beck, Yohai Bar Sinai, Noam Levi