English
Related papers

Related papers: Sherpa: Robust Hyperparameter Optimization for Mac…

200 papers

We present an overview of Sherpa, an open source Python project, and discuss its development history, broad design concepts and capabilities. Sherpa contains powerful tools for combining parametric models into complex expressions that can…

Modern learning models are characterized by large hyperparameter spaces and long training times. These properties, coupled with the rise of parallel computing and the growing demand to productionize machine learning workloads, motivate the…

Machine Learning · Computer Science 2020-03-17 Liam Li , Kevin Jamieson , Afshin Rostamizadeh , Ekaterina Gonina , Moritz Hardt , Benjamin Recht , Ameet Talwalkar

Hyper-parameter optimization is crucial for pushing the accuracy of a deep learning model to its limits. A hyper-parameter optimization job, referred to as a study, involves numerous trials of training a model using different training…

Machine Learning · Computer Science 2020-06-23 Ahnjae Shin , Do Yoon Kim , Joo Seong Jeong , Byung-Gon Chun

The ever-increasing quality and complexity of astronomical data underscores the need for new and powerful data analysis applications. This need has led to the development of Sherpa, a modeling and fitting program in the CIAO software…

Astrophysics · Physics 2009-11-07 Peter E. Freeman , Stephen Doe , Aneta Siemiginowska

Machine learning algorithms have been used widely in various applications and areas. To fit a machine learning model into different problems, its hyper-parameters must be tuned. Selecting the best hyper-parameter configuration for machine…

Machine Learning · Computer Science 2022-10-06 Li Yang , Abdallah Shami

Since deep neural networks were developed, they have made huge contributions to everyday lives. Machine learning provides more rational advice than humans are capable of in almost every aspect of daily life. However, despite this…

Machine Learning · Computer Science 2020-03-13 Tong Yu , Hong Zhu

Most machine learning algorithms are configured by one or several hyperparameters that must be carefully chosen and often considerably impact performance. To avoid a time consuming and unreproducible manual trial-and-error process to find…

We present a new software, HYPPO, that enables the automatic tuning of hyperparameters of various deep learning (DL) models. Unlike other hyperparameter optimization (HPO) methods, HYPPO uses adaptive surrogate models and directly accounts…

Recently, large language models (LLMs) have achieved widespread application across various fields. Despite their impressive capabilities, LLMs suffer from a lack of structured reasoning ability, particularly for complex tasks requiring…

Artificial Intelligence · Computer Science 2025-09-03 Boqi Chen , Kua Chen , José Antonio Hernández López , Gunter Mussbacher , Dániel Varró , Amir Feizpour

Hyperparameters tuning is a fundamental, yet computationally expensive, step in optimizing machine learning models. Beyond optimization, understanding the relative importance and interaction of hyperparameters is critical to efficient model…

Machine Learning · Computer Science 2025-12-23 Moncef Garouani , Ayah Barhrhouj

Hyperparameter optimization (HPO) is a necessary step to ensure the best possible performance of Machine Learning (ML) algorithms. Several methods have been developed to perform HPO; most of these are focused on optimizing one performance…

Machine Learning · Computer Science 2022-11-16 Alejandro Morales-Hernández , Inneke Van Nieuwenhuyse , Sebastian Rojas Gonzalez

Hyperparameter optimization (HPO) and neural architecture search (NAS) are methods of choice to obtain the best-in-class machine learning models, but in practice they can be costly to run. When models are trained on large datasets, tuning…

Machine Learning · Computer Science 2023-03-09 Ondrej Bohdal , Lukas Balles , Martin Wistuba , Beyza Ermis , Cédric Archambeau , Giovanni Zappella

Tuning machine learning models at scale, especially finding the right hyperparameter values, can be difficult and time-consuming. In addition to the computational effort required, this process also requires some ancillary efforts including…

Machine Learning · Computer Science 2019-11-07 Jiayi Liu , Samarth Tripathi , Unmesh Kurup , Mohak Shah

It is typical for a machine learning system to have numerous hyperparameters that affect its learning rate and prediction quality. Finding a good combination of the hyperparameters is, however, a challenging job. This is mainly because…

Machine Learning · Computer Science 2019-08-08 Dobromir Marinov , Daniel Karapetyan

Most of the machine learning models have associated hyper-parameters along with their parameters. While the algorithm gives the solution for parameters, its utility for model performance is highly dependent on the choice of hyperparameters.…

Machine Learning · Computer Science 2022-01-19 Shashank Shekhar , Adesh Bansode , Asif Salim

Hyperparameters are configuration variables controlling the behavior of machine learning algorithms. They are ubiquitous in machine learning and artificial intelligence and the choice of their values determines the effectiveness of systems…

Hyperparameter selection in continual learning scenarios is a challenging and underexplored aspect, especially in practical non-stationary environments. Traditional approaches, such as grid searches with held-out validation data from all…

Machine Learning · Computer Science 2024-06-21 Rudy Semola , Julio Hurtado , Vincenzo Lomonaco , Davide Bacciu

While modern parallel computing systems offer high performance, utilizing these powerful computing resources to the highest possible extent demands advanced knowledge of various hardware architectures and parallel programming models.…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-05-03 Suejb Memeti , Sabri Pllana , Alecio Binotto , Joanna Kolodziej , Ivona Brandic

Hyperparameter tuning is a fundamental aspect of machine learning research. Setting up the infrastructure for systematic optimization of hyperparameters can take a significant amount of time. Here, we present PyHopper, a black-box…

Machine Learning · Computer Science 2022-10-11 Mathias Lechner , Ramin Hasani , Philipp Neubauer , Sophie Neubauer , Daniela Rus

Prior research in resource scheduling for machine learning training workloads has largely focused on minimizing job completion times. Commonly, these model training workloads collectively search over a large number of parameter values that…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-01-09 Richard Liaw , Romil Bhardwaj , Lisa Dunlap , Yitian Zou , Joseph Gonzalez , Ion Stoica , Alexey Tumanov
‹ Prev 1 2 3 10 Next ›