English
Related papers

Related papers: Hyperparameter Tuning Cookbook: A guide for scikit…

200 papers

The goal of hyperparameter tuning (or hyperparameter optimization) is to optimize the hyperparameters to improve the performance of the machine or deep learning model. spotPython (``Sequential Parameter Optimization Toolbox in Python'') is…

Machine Learning · Computer Science 2023-06-08 Thomas Bartz-Beielstein

Hyper-parameter tuning (HPT) is crucial for many machine learning (ML) algorithms. But due to the large searching space, HPT is usually time-consuming and resource-intensive. Nowadays, many researchers use public cloud resources to train…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-12-08 Yan Li , Bo An , Junming Ma , Donggang Cao , Yasha Wang , Hong Mei

Batch Machine Learning (BML) reaches its limits when dealing with very large amounts of streaming data. This is especially true for available memory, handling drift in data streams, and processing new, unknown data. Online Machine Learning…

Machine Learning · Computer Science 2024-02-20 Thomas Bartz-Beielstein

A surrogate model based hyperparameter tuning approach for deep learning is presented. This article demonstrates how the architecture-level parameters (hyperparameters) of deep learning models that were implemented in Keras/tensorflow can…

Machine Learning · Computer Science 2021-07-07 Thomas Bartz-Beielstein , Frederik Rehbach , Amrita Sen , Martin Zaefferer

Machine learning algorithms such as random forests or xgboost are gaining more importance and are increasingly incorporated into production processes in order to enable comprehensive digitization and, if possible, automation of processes.…

Machine Learning · Computer Science 2021-07-20 Eva Bartz , Martin Zaefferer , Olaf Mersmann , Thomas Bartz-Beielstein

Three-dimensional (3D) point cloud analysis has become central to applications ranging from autonomous driving and robotics to forestry and ecological monitoring. Although numerous deep learning methods have been proposed for point cloud…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Said Ohamouddou , Hanaa El Afia , Abdellatif El Afia , Raddouane Chiheb

Modern LLMs typically require multistage training pipelines to achieve strong downstream performance, with post-training serving as the main interface for adapting open-weight models. We introduce torchtune, a PyTorch-native library…

The ever-growing demand and complexity of machine learning are putting pressure on hyper-parameter tuning systems: while the evaluation cost of models continues to increase, the scalability of state-of-the-arts starts to become a crucial…

Machine Learning · Computer Science 2022-01-19 Yang Li , Yu Shen , Huaijun Jiang , Wentao Zhang , Jixiang Li , Ji Liu , Ce Zhang , Bin Cui

Hyper-parameters of time series models play an important role in time series analysis. Slight differences in hyper-parameters might lead to very different forecast results for a given model, and therefore, selecting good hyper-parameter…

Machine Learning · Computer Science 2021-02-12 Peiyi Zhang , Xiaodong Jiang , Ginger M Holt , Nikolay Pavlovich Laptev , Caner Komurlu , Peng Gao , Yang Yu

Working with any gradient-based machine learning algorithm involves the tedious task of tuning the optimizer's hyperparameters, such as its step size. Recent work has shown how the step size can itself be optimized alongside the model…

Machine Learning · Computer Science 2022-10-18 Kartik Chandra , Audrey Xie , Jonathan Ragan-Kelley , Erik Meijer

TerraTorch is a fine-tuning and benchmarking toolkit for Geospatial Foundation Models built on PyTorch Lightning and tailored for satellite, weather, and climate data. It integrates domain-specific data modules, pre-defined tasks, and a…

Prompt tuning prepends a soft prompt to the input embeddings or hidden states and only optimizes the prompt to adapt pretrained models (PTMs) to downstream tasks. The previous work manually selects prompt layers which are far from optimal…

Computation and Language · Computer Science 2023-11-01 Wei Zhu , Ming Tan

We present a new software, HYPPO, that enables the automatic tuning of hyperparameters of various deep learning (DL) models. Unlike other hyperparameter optimization (HPO) methods, HYPPO uses adaptive surrogate models and directly accounts…

The rapid expansion of spiking neural networks (SNNs) has led to a proliferation of training algorithms that differ widely in biological inspiration, computational structure, and hardware suitability. Despite this progress, the field lacks…

Neural and Evolutionary Computing · Computer Science 2026-05-15 Alessio Caviglia , Filippo Marostica , Roberta Bardini , Alessandro Savino , Stefano Di Carlo

Hyperparameter (HP) tuning in deep learning is an expensive process, prohibitively so for neural networks (NNs) with billions of parameters. We show that, in the recently discovered Maximal Update Parametrization (muP), many optimal HPs…

Neuromorphic systems require user-friendly software to support the design and optimization of experiments. In this work, we address this need by presenting our development of a machine learning-based modeling framework for the BrainScaleS-2…

Neural and Evolutionary Computing · Computer Science 2022-12-26 Philipp Spilger , Elias Arnold , Luca Blessing , Christian Mauch , Christian Pehle , Eric Müller , Johannes Schemmel

Machine learning is a powerful method for modeling in different fields such as education. Its capability to accurately predict students' success makes it an ideal tool for decision-making tasks related to higher education. The accuracy of…

Machine Learning · Computer Science 2021-05-03 Leila Zahedi , Farid Ghareh Mohammadi , Shabnam Rezapour , Matthew W. Ohland , M. Hadi Amini

The `spotoptim` package implements surrogate-model-based optimization of expensive black-box functions in Python. Building on two decades of Sequential Parameter Optimization (SPO) methodology, it provides a Kriging-based optimization loop…

Machine Learning · Computer Science 2026-04-16 Thomas Bartz-Beielstein

DNN learning jobs are common in today's clusters due to the advances in AI driven services such as machine translation and image recognition. The most critical phase of these jobs for model performance and learning cost is the tuning of…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-10-05 Isabelly Rocha , Nathaniel Morris , Lydia Y. Chen , Pascal Felber , Robert Birke , Valerio Schiavoni

Transfer learning, which allows a source task to affect the inductive bias of the target task, is widely used in computer vision. The typical way of conducting transfer learning with deep neural networks is to fine-tune a model pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2018-11-26 Yunhui Guo , Honghui Shi , Abhishek Kumar , Kristen Grauman , Tajana Rosing , Rogerio Feris
‹ Prev 1 2 3 10 Next ›