Related papers: Using Large Language Models for Hyperparameter Opt…

An investigation on the use of Large Language Models for hyperparameter tuning in Evolutionary Algorithms

Hyperparameter optimization is a crucial problem in Evolutionary Computation. In fact, the values of the hyperparameters directly impact the trajectory taken by the optimization process, and their choice requires extensive reasoning by…

Neural and Evolutionary Computing · Computer Science 2024-08-06 Leonardo Lucio Custode, Fabio Caraffini, Anil Yaman, Giovanni Iacca

Large Language Model Agent for Hyper-Parameter Optimization

Hyperparameter optimization is critical in modern machine learning, requiring expert knowledge, numerous trials, and high computational and human resources. Despite the advancements in Automated Machine Learning (AutoML), challenges in…

Machine Learning · Computer Science 2025-02-27 Siyi Liu, Chen Gao, Yong Li

Hyperparameter Optimization: Foundations, Algorithms, Best Practices and Open Challenges

Most machine learning algorithms are configured by one or several hyperparameters that must be carefully chosen and often considerably impact performance. To avoid a time consuming and unreproducible manual trial-and-error process to find…

Machine Learning · Statistics 2021-11-29 Bernd Bischl, Martin Binder, Michel Lang, Tobias Pielok, Jakob Richter, Stefan Coors, Janek Thomas, Theresa Ullmann, Marc Becker, Anne-Laure Boulesteix, Difan Deng, Marius Lindauer

An Empirical Study on Hyperparameter Optimization for Fine-Tuning Pre-trained Language Models

The performance of fine-tuning pre-trained language models largely depends on the hyperparameter configuration. In this paper, we investigate the performance of modern hyperparameter optimization methods (HPO) on fine-tuning pre-trained…

Computation and Language · Computer Science 2021-06-18 Xueqing Liu, Chi Wang

Sequential Large Language Model-Based Hyper-parameter Optimization

This study introduces SLLMBO, an innovative framework leveraging large language models (LLMs) for hyperparameter optimization (HPO), incorporating dynamic search space adaptability, enhanced parameter space exploitation, and a novel…

Machine Learning · Computer Science 2025-01-06 Kanan Mahammadli, Seyda Ertekin

On Hyperparameter Optimization of Machine Learning Algorithms: Theory and Practice

Machine learning algorithms have been used widely in various applications and areas. To fit a machine learning model into different problems, its hyper-parameters must be tuned. Selecting the best hyper-parameter configuration for machine…

Machine Learning · Computer Science 2022-10-06 Li Yang, Abdallah Shami

In-the-loop Hyper-Parameter Optimization for LLM-Based Automated Design of Heuristics

Large Language Models (LLMs) have shown great potential in automatically generating and optimizing (meta)heuristics, making them valuable tools in heuristic optimization tasks. However, LLMs are generally inefficient when it comes to…

Neural and Evolutionary Computing · Computer Science 2025-05-23 Niki van Stein, Diederick Vermetten, Thomas Bäck

A Meta-Knowledge-Augmented LLM Framework for Hyperparameter Optimization in Time-Series Forecasting

Hyperparameter optimization (HPO) plays a central role in the performance of deep learning models, yet remains computationally expensive and difficult to interpret, particularly for time-series forecasting. While Bayesian Optimization (BO)…

Machine Learning · Computer Science 2026-02-17 Ons Saadallah, Mátyás Andó, Tamás Gábor Orosz

Optuna vs Code Llama: Are LLMs a New Paradigm for Hyperparameter Tuning?

Optimal hyperparameter selection is critical for maximizing the performance of neural networks in computer vision, particularly as architectures become more complex. This work explores the use of large language models (LLMs) for…

Machine Learning · Computer Science 2025-09-30 Roman Kochnev, Arash Torabi Goodarzi, Zofia Antonina Bentyn, Dmitry Ignatov, Radu Timofte

Do Large Language Models Understand Performance Optimization?

Large Language Models (LLMs) have emerged as powerful tools for software development tasks such as code completion, translation, and optimization. However, their ability to generate efficient and correct code, particularly in complex…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-19 Bowen Cui, Tejas Ramesh, Oscar Hernandez, Keren Zhou

Hyper-Parameter Optimization: A Review of Algorithms and Applications

Since deep neural networks were developed, they have made huge contributions to everyday lives. Machine learning provides more rational advice than humans are capable of in almost every aspect of daily life. However, despite this…

Machine Learning · Computer Science 2020-03-13 Tong Yu, Hong Zhu

Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models

Large Language Models (LLMs) have seen great advance in both academia and industry, and their popularity results in numerous open-source frameworks and techniques in accelerating LLM pre-training, fine-tuning, and inference. Training and…

Performance · Computer Science 2023-12-04 Longteng Zhang, Xiang Liu, Zeyu Li, Xinglin Pan, Peijie Dong, Ruibo Fan, Rui Guo, Xin Wang, Qiong Luo, Shaohuai Shi, Xiaowen Chu

Can LLMs Configure Software Tools

In software engineering, the meticulous configuration of software tools is crucial in ensuring optimal performance within intricate systems. However, the complexity inherent in selecting optimal configurations is exacerbated by the…

Software Engineering · Computer Science 2023-12-12 Jai Kannan

Hyperparameter Optimization for Large Language Model Instruction-Tuning

The fine-tuning of Large Language Models (LLMs) has enabled them to recently achieve milestones in natural language processing applications. The emergence of ever larger LLMs has paved the way for more efficient fine-tuning methods. Among…

Computation and Language · Computer Science 2024-02-01 Christophe Tribes, Sacha Benarroch-Lelong, Peng Lu, Ivan Kobyzev

Benchmarking Optimizers for Large Language Model Pretraining

The recent development of Large Language Models (LLMs) has been accompanied by an effervescence of novel ideas and methods to better optimize the loss of deep learning models. Claims from those methods are myriad: from faster convergence to…

Machine Learning · Computer Science 2025-09-03 Andrei Semenov, Matteo Pagliardini, Martin Jaggi

Evaluating Large Language Models in Process Mining: Capabilities, Benchmarks, and Evaluation Strategies

Using Large Language Models (LLMs) for Process Mining (PM) tasks is becoming increasingly essential, and initial approaches yield promising results. However, little attention has been given to developing strategies for evaluating and…

Databases · Computer Science 2024-07-01 Alessandro Berti, Humam Kourani, Hannes Hafke, Chiao-Yun Li, Daniel Schuster

Multi-Objective Hierarchical Optimization with Large Language Models

Despite their widespread adoption in various domains, especially due to their powerful reasoning capabilities, Large Language Models (LLMs) are not the off-the-shelf choice to drive multi-objective optimization yet. Conventional strategies…

Machine Learning · Computer Science 2026-01-21 Andrej Schwanke, Lyubomir Ivanov, David Salinas, Frank Hutter, Arber Zela

When Large Language Model Meets Optimization

Optimization algorithms and large language models (LLMs) enhance decision-making in dynamic environments by integrating artificial intelligence with traditional techniques. LLMs, with extensive domain knowledge, facilitate intelligent…

Neural and Evolutionary Computing · Computer Science 2024-05-17 Sen Huang, Kaixiang Yang, Sheng Qi, Rui Wang

Large Language Models to Enhance Bayesian Optimization

Bayesian optimization (BO) is a powerful approach for optimizing complex and expensive-to-evaluate black-box functions. Its importance is underscored in many applications, notably including hyperparameter tuning, but its efficacy depends on…

Machine Learning · Computer Science 2024-03-11 Tennison Liu, Nicolás Astorga, Nabeel Seedat, Mihaela van der Schaar

Achieving Peak Performance for Large Language Models: A Systematic Review

In recent years, large language models (LLMs) have achieved remarkable success in natural language processing (NLP). LLMs require an extreme amount of parameters to attain high performance. As models grow into the trillion-parameter range,…

Computation and Language · Computer Science 2024-09-10 Zhyar Rzgar K Rostam, Sándor Szénási, Gábor Kertész