Related papers: Hyperparameter Tuning Cookbook: A guide for scikit…

PyTorch Hyperparameter Tuning - A Tutorial for spotPython

The goal of hyperparameter tuning (or hyperparameter optimization) is to optimize the hyperparameters to improve the performance of the machine or deep learning model. spotPython (``Sequential Parameter Optimization Toolbox in Python'') is…

Machine Learning · Computer Science 2023-06-08 Thomas Bartz-Beielstein

SpotTune: Leveraging Transient Resources for Cost-efficient Hyper-parameter Tuning in the Public Cloud

Hyper-parameter tuning (HPT) is crucial for many machine learning (ML) algorithms. But due to the large searching space, HPT is usually time-consuming and resource-intensive. Nowadays, many researchers use public cloud resources to train…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-12-08 Yan Li , Bo An , Junming Ma , Donggang Cao , Yasha Wang , Hong Mei

Simplifying Hyperparameter Tuning in Online Machine Learning -- The spotRiverGUI

Batch Machine Learning (BML) reaches its limits when dealing with very large amounts of streaming data. This is especially true for available memory, handling drift in data streams, and processing new, unknown data. Online Machine Learning…

Machine Learning · Computer Science 2024-02-20 Thomas Bartz-Beielstein

Surrogate Model Based Hyperparameter Tuning for Deep Learning with SPOT

A surrogate model based hyperparameter tuning approach for deep learning is presented. This article demonstrates how the architecture-level parameters (hyperparameters) of deep learning models that were implemented in Keras/tensorflow can…

Machine Learning · Computer Science 2021-07-07 Thomas Bartz-Beielstein , Frederik Rehbach , Amrita Sen , Martin Zaefferer

Experimental Investigation and Evaluation of Model-based Hyperparameter Optimization

Machine learning algorithms such as random forests or xgboost are gaining more importance and are increasingly incorporated into production processes in order to enable comprehensive digitization and, if possible, automation of processes.…

Machine Learning · Computer Science 2021-07-20 Eva Bartz , Martin Zaefferer , Olaf Mersmann , Thomas Bartz-Beielstein

LIDARLearn: A Unified Deep Learning Library for 3D Point Cloud Classification, Segmentation, and Self-Supervised Representation Learning

Three-dimensional (3D) point cloud analysis has become central to applications ranging from autonomous driving and robotics to forestry and ecological monitoring. Although numerous deep learning methods have been proposed for point cloud…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Said Ohamouddou , Hanaa El Afia , Abdellatif El Afia , Raddouane Chiheb

torchtune: PyTorch native post-training library

Modern LLMs typically require multistage training pipelines to achieve strong downstream performance, with post-training serving as the main interface for adapting open-weight models. We introduce torchtune, a PyTorch-native library…

Machine Learning · Computer Science 2026-05-21 Mark Obozov , Maxime Griot , Joseph Cummings , Evan Smothers , Felipe Mello , Rafi Ayub , Philip John Bontrager , Salman Mohammadi , Ariel Kwiatkowski , Nathan Azrak , Mircea Mironenco

Hyper-Tune: Towards Efficient Hyper-parameter Tuning at Scale

The ever-growing demand and complexity of machine learning are putting pressure on hyper-parameter tuning systems: while the evaluation cost of models continues to increase, the scalability of state-of-the-arts starts to become a crucial…

Machine Learning · Computer Science 2022-01-19 Yang Li , Yu Shen , Huaijun Jiang , Wentao Zhang , Jixiang Li , Ji Liu , Ce Zhang , Bin Cui

Self-supervised learning for fast and scalable time series hyper-parameter tuning

Hyper-parameters of time series models play an important role in time series analysis. Slight differences in hyper-parameters might lead to very different forecast results for a given model, and therefore, selecting good hyper-parameter…

Machine Learning · Computer Science 2021-02-12 Peiyi Zhang , Xiaodong Jiang , Ginger M Holt , Nikolay Pavlovich Laptev , Caner Komurlu , Peng Gao , Yang Yu

Gradient Descent: The Ultimate Optimizer

Working with any gradient-based machine learning algorithm involves the tedious task of tuning the optimizer's hyperparameters, such as its step size. Recent work has shown how the step size can itself be optimized alongside the model…

Machine Learning · Computer Science 2022-10-18 Kartik Chandra , Audrey Xie , Jonathan Ragan-Kelley , Erik Meijer

TerraTorch: The Geospatial Foundation Models Toolkit

TerraTorch is a fine-tuning and benchmarking toolkit for Geospatial Foundation Models built on PyTorch Lightning and tailored for satellite, weather, and climate data. It integrates domain-specific data modules, pre-defined tasks, and a…

Computer Vision and Pattern Recognition · Computer Science 2025-03-27 Carlos Gomes , Benedikt Blumenstiel , Joao Lucas de Sousa Almeida , Pedro Henrique de Oliveira , Paolo Fraccaro , Francesc Marti Escofet , Daniela Szwarcman , Naomi Simumba , Romeo Kienzler , Bianca Zadrozny

Improving Prompt Tuning with Learned Prompting Layers

Prompt tuning prepends a soft prompt to the input embeddings or hidden states and only optimizes the prompt to adapt pretrained models (PTMs) to downstream tasks. The previous work manually selects prompt layers which are far from optimal…

Computation and Language · Computer Science 2023-11-01 Wei Zhu , Ming Tan

HYPPO: A Surrogate-Based Multi-Level Parallelism Tool for Hyperparameter Optimization

We present a new software, HYPPO, that enables the automatic tuning of hyperparameters of various deep learning (DL) models. Unlike other hyperparameter optimization (HPO) methods, HYPPO uses adaptive surrogate models and directly accounts…

Machine Learning · Computer Science 2022-01-03 Vincent Dumont , Casey Garner , Anuradha Trivedi , Chelsea Jones , Vidya Ganapati , Juliane Mueller , Talita Perciano , Mariam Kiran , Marc Day

NeuroTrain: Surveying Local Learning Rules for Spiking Neural Networks with an Open Benchmarking Framework

The rapid expansion of spiking neural networks (SNNs) has led to a proliferation of training algorithms that differ widely in biological inspiration, computational structure, and hardware suitability. Despite this progress, the field lacks…

Neural and Evolutionary Computing · Computer Science 2026-05-15 Alessio Caviglia , Filippo Marostica , Roberta Bardini , Alessandro Savino , Stefano Di Carlo

Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer

Hyperparameter (HP) tuning in deep learning is an expensive process, prohibitively so for neural networks (NNs) with billions of parameters. We show that, in the recently discovered Maximal Update Parametrization (muP), many optimal HPs…

Machine Learning · Computer Science 2022-03-29 Greg Yang , Edward J. Hu , Igor Babuschkin , Szymon Sidor , Xiaodong Liu , David Farhi , Nick Ryder , Jakub Pachocki , Weizhu Chen , Jianfeng Gao

hxtorch.snn: Machine-learning-inspired Spiking Neural Network Modeling on BrainScaleS-2

Neuromorphic systems require user-friendly software to support the design and optimization of experiments. In this work, we address this need by presenting our development of a machine learning-based modeling framework for the BrainScaleS-2…

Neural and Evolutionary Computing · Computer Science 2022-12-26 Philipp Spilger , Elias Arnold , Luca Blessing , Christian Mauch , Christian Pehle , Eric Müller , Johannes Schemmel

Search Algorithms for Automated Hyper-Parameter Tuning

Machine learning is a powerful method for modeling in different fields such as education. Its capability to accurately predict students' success makes it an ideal tool for decision-making tasks related to higher education. The accuracy of…

Machine Learning · Computer Science 2021-05-03 Leila Zahedi , Farid Ghareh Mohammadi , Shabnam Rezapour , Matthew W. Ohland , M. Hadi Amini

Optimization with SpotOptim

The `spotoptim` package implements surrogate-model-based optimization of expensive black-box functions in Python. Building on two decades of Sequential Parameter Optimization (SPO) methodology, it provides a Kriging-based optimization loop…

Machine Learning · Computer Science 2026-04-16 Thomas Bartz-Beielstein

PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep Learning Clusters

DNN learning jobs are common in today's clusters due to the advances in AI driven services such as machine translation and image recognition. The most critical phase of these jobs for model performance and learning cost is the tuning of…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-10-05 Isabelly Rocha , Nathaniel Morris , Lydia Y. Chen , Pascal Felber , Robert Birke , Valerio Schiavoni

SpotTune: Transfer Learning through Adaptive Fine-tuning

Transfer learning, which allows a source task to affect the inductive bias of the target task, is widely used in computer vision. The typical way of conducting transfer learning with deep neural networks is to fine-tune a model pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2018-11-26 Yunhui Guo , Honghui Shi , Abhishek Kumar , Kristen Grauman , Tajana Rosing , Rogerio Feris