Related papers: Dynamic Parameter Optimization for Highly Transfer…

Improving the Transferability of Adversarial Attacks on Face Recognition with Diverse Parameters Augmentation

Face Recognition (FR) models are vulnerable to adversarial examples that subtly manipulate benign face images, underscoring the urgent need to improve the transferability of adversarial attacks in order to expose the blind spots of these…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-01 · Fengfan Zhou, Bangjie Yin, Hefei Ling, Qianyu Zhou, Wenxuan Wang

Transferable Neural Processes for Hyperparameter Optimization

Automated machine learning aims to automate the whole process of machine learning, including model configuration. In this paper, we focus on automated hyperparameter optimization (HPO) based on sequential model-based optimization (SMBO).…

Machine Learning · Computer Science · 2019-09-11 · Ying Wei, Peilin Zhao, Huaxiu Yao, Junzhou Huang

Dynamic Parameterized Network for CTR Prediction

Learning to capture feature relations effectively and efficiently is essential in click-through rate (CTR) prediction of modern recommendation systems. Most existing CTR prediction methods model such relations either through tedious…

Information Retrieval · Computer Science · 2021-11-10 · Jian Zhu, Congcong Liu, Pei Wang, Xiwei Zhao, Guangpeng Chen, Junsheng Jin, Changping Peng, Zhangang Lin, Jingping Shao

Improving Hyperparameter Optimization by Planning Ahead

Hyperparameter optimization (HPO) is generally treated as a bi-level optimization problem that involves fitting a (probabilistic) surrogate model to a set of observed hyperparameter responses, e.g. validation loss, and consequently…

Machine Learning · Computer Science · 2021-10-18 · Hadi S. Jomaa, Jonas Falkner, Lars Schmidt-Thieme

Enhancing Adversarial Attacks via Parameter Adaptive Adversarial Attack

In recent times, the swift evolution of adversarial attacks has captured widespread attention, particularly concerning their transferability and other performance attributes. These techniques are primarily executed at the sample level,…

Machine Learning · Computer Science · 2024-08-16 · Zhibo Jin, Jiayu Zhang, Zhiyu Zhu, Chenyu Zhang, Jiahao Huang, Jianlong Zhou, Fang Chen

Hyper-Parameter Optimization: A Review of Algorithms and Applications

Since deep neural networks were developed, they have made huge contributions to everyday lives. Machine learning provides more rational advice than humans are capable of in almost every aspect of daily life. However, despite this…

Machine Learning · Computer Science · 2020-03-13 · Tong Yu, Hong Zhu

TransBO: Hyperparameter Optimization via Two-Phase Transfer Learning

With the extensive applications of machine learning models, automatic hyperparameter optimization (HPO) has become increasingly important. Motivated by the tuning behaviors of human experts, it is intuitive to leverage auxiliary knowledge…

Machine Learning · Computer Science · 2022-06-07 · Yang Li, Yu Shen, Huaijun Jiang, Wentao Zhang, Zhi Yang, Ce Zhang, Bin Cui

DynamicPO: Dynamic Preference Optimization for Recommendation

In large language model (LLM)-based recommendation systems, direct preference optimization (DPO) effectively aligns recommendations with user preferences, requiring multi-negative objective functions to leverage abundant implicit-feedback…

Information Retrieval · Computer Science · 2026-05-04 · Xingyu Hu, Kai Zhang, Jiancan Wu, Shuli Wang, Chi Wang, Wenshuai Chen, Yinhua Zhu, Haitao Wang, Xingxing Wang, Xiang Wang

PDPO: Parametric Density Path Optimization

We introduce Parametric Density Path Optimization (PDPO), a novel method for computing action-minimizing paths between probability densities. The core idea is to represent the target probability path as the pushforward of a reference…

Optimization and Control · Mathematics · 2025-12-08 · Sebastian Gutierrez Hernandez, Peng Chen, Haomin Zhou

Lightweight Robust Direct Preference Optimization

Direct Preference Optimization (DPO) has become a popular method for fine-tuning large language models (LLMs) due to its stability and simplicity. However, it is also known to be sensitive to noise in the data and prone to overfitting.…

Machine Learning · Computer Science · 2025-10-28 · Cheol Woo Kim, Shresth Verma, Mauricio Tec, Milind Tambe

Practical and sample efficient zero-shot HPO

Zero-shot hyperparameter optimization (HPO) is a simple yet effective use of transfer learning for constructing a small list of hyperparameter (HP) configurations that complement each other. That is to say, for any given dataset, at least…

Machine Learning · Statistics · 2020-07-28 · Fela Winkelmolen, Nikita Ivkin, H. Furkan Bozkurt, Zohar Karnin

Towards Analyzing and Understanding the Limitations of DPO: A Theoretical Perspective

Direct Preference Optimization (DPO), which derives reward signals directly from pairwise preference data, has shown its effectiveness on aligning Large Language Models (LLMs) with human preferences. Despite its widespread use across…

Computation and Language · Computer Science · 2024-04-09 · Duanyu Feng, Bowen Qin, Chen Huang, Zheng Zhang, Wenqiang Lei

Direct Multi-Turn Preference Optimization for Language Agents

Adapting Large Language Models (LLMs) for agent tasks is critical in developing language agents. Direct Preference Optimization (DPO) is a promising technique for this adaptation with the alleviation of compounding errors, offering a means…

Computation and Language · Computer Science · 2025-02-25 · Wentao Shi, Mengqi Yuan, Junkang Wu, Qifan Wang, Fuli Feng

Rethinking DPO: The Role of Rejected Responses in Preference Misalignment

Direct Preference Optimization (DPO) is a simple and efficient framework that has attracted substantial attention. However, it often struggles to meet its primary objectives -- increasing the generation probability of chosen responses while…

Artificial Intelligence · Computer Science · 2025-06-17 · Jay Hyeon Cho, JunHyeok Oh, Myunsoo Kim, Byung-Jun Lee

DPO: Dual-Perturbation Optimization for Test-time Adaptation in 3D Object Detection

LiDAR-based 3D object detection has seen impressive advances in recent times. However, deploying trained 3D detectors in the real world often yields unsatisfactory performance when the distribution of the test data significantly deviates…

Computer Vision and Pattern Recognition · Computer Science · 2024-07-30 · Zhuoxiao Chen, Zixin Wang, Yadan Luo, Sen Wang, Zi Huang

Effect of Hyper-Parameter Optimization on the Deep Learning Model Proposed for Distributed Attack Detection in Internet of Things Environment

This paper studies the effect of various hyper-parameters and their selection for the best performance of the deep learning model proposed in [1] for distributed attack detection in the Internet of Things (IoT). The findings show that there…

Machine Learning · Computer Science · 2018-06-20 · Md Mohaimenuzzaman, Zahraa Said Abdallah, Joarder Kamruzzaman, Bala Srinivasan

Iterative Deepening Hyperband

Hyperparameter optimization (HPO) is concerned with the automated search for the most appropriate hyperparameter configuration (HPC) of a parameterized machine learning algorithm. A state-of-the-art HPO method is Hyperband, which, however,…

Machine Learning · Computer Science · 2023-02-07 · Jasmin Brandt, Marcel Wever, Dimitrios Iliadis, Viktor Bengs, Eyke Hüllermeier

Transfer Attacks Revisited: A Large-Scale Empirical Study in Real Computer Vision Settings

One intriguing property of adversarial attacks is their "transferability" -- an adversarial example crafted with respect to one deep neural network (DNN) model is often found effective against other DNNs as well. Intensive research has been…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-11 · Yuhao Mao, Chong Fu, Saizhuo Wang, Shouling Ji, Xuhong Zhang, Zhenguang Liu, Jun Zhou, Alex X. Liu, Raheem Beyah, Ting Wang

$\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$

Direct Preference Optimization (DPO) has emerged as a compelling approach for training Large Language Models (LLMs) to adhere to human preferences. However, the performance of DPO is sensitive to the fine-tuning of its trade-off parameter…

Artificial Intelligence · Computer Science · 2024-10-15 · Junkang Wu, Yuexiang Xie, Zhengyi Yang, Jiancan Wu, Jinyang Gao, Bolin Ding, Xiang Wang, Xiangnan He

A Practical Analysis of Human Alignment with *PO

At the forefront of state-of-the-art human alignment methods are preference optimization methods (*PO). Prior research has often concentrated on identifying the best-performing method, typically involving a grid search over hyperparameters,…

Computation and Language · Computer Science · 2025-04-30 · Kian Ahrabian, Xihui Lin, Barun Patra, Vishrav Chaudhary, Alon Benhaim, Jay Pujara, Xia Song