Related papers: Is Hyper-Parameter Optimization Different for Soft…

Simpler Hyperparameter Optimization for Software Analytics: Why, How, When?

How can we make software analytics simpler and faster? One method is to match the complexity of analysis to the intrinsic complexity of the data being explored. For example, hyperparameter optimizers find the control settings for data…

Software Engineering · Computer Science 2021-04-23 Amritanshu Agrawal , Xueqi Yang , Rishabh Agrawal , Rahul Yedida , Xipeng Shen , Tim Menzies

Simpler Hyperparameter Optimization for Software Analytics: Why, How, When?

How to make software analytics simpler and faster? One method is to match the complexity of analysis to the intrinsic complexity of the data being explored. For example, hyperparameter optimizers find the control settings for data miners…

Software Engineering · Computer Science 2021-04-26 Amritanshu Agrawal , Xueqi Yang , Rishabh Agrawal , Xipeng Shen , Tim Menzies

How Low Can You Go? The Data-Light SE Challenge

Much of Software Engineering (SE) research assumes that progress depends on massive datasets and CPU-intensive optimizers. Yet has this assumption been rigorously tested? The counter-evidence presented in this paper suggests otherwise. For…

Software Engineering · Computer Science 2026-03-31 Kishan Kumar Ganguly , Tim Menzies

How to "DODGE" Complex Software Analytics?

Machine learning techniques applied to software engineering tasks can be improved by hyperparameter optimization, i.e., automatic tools that find good settings for a learner's control parameters. We show that such hyperparameter…

Software Engineering · Computer Science 2019-12-03 Amritanshu Agrawal , Wei Fu , Di Chen , Xipeng Shen , Tim Menzies

Is "Better Data" Better than "Better Data Miners"? (On the Benefits of Tuning SMOTE for Defect Prediction)

We report and fix an important systematic error in prior studies that ranked classifiers for software analytics. Those studies did not (a) assess classifiers on multiple criteria and they did not (b) study how variations in the data affect…

Software Engineering · Computer Science 2018-03-16 Amritanshu Agrawal , Tim Menzies

Meta-Semi: A Meta-learning Approach for Semi-supervised Learning

Deep learning based semi-supervised learning (SSL) algorithms have led to promising results in recent years. However, they tend to introduce multiple tunable hyper-parameters, making them less practical in real SSL scenarios where the…

Machine Learning · Computer Science 2024-10-30 Yulin Wang , Jiayi Guo , Shiji Song , Gao Huang

Lessons learned from hyper-parameter tuning for microservice candidate identification

When optimizing software for the cloud, monolithic applications need to be partitioned into many smaller *microservices*. While many tools have been proposed for this task, we warn that the evaluation of those approaches has been…

Software Engineering · Computer Science 2021-08-11 Rahul Yedida , Rahul Krishna , Anup Kalia , Tim Menzies , Jin Xiao , Maja Vukovic

How to Improve AI Tools (by Adding in SE Knowledge): Experiments with the TimeLIME Defect Reduction Tool

AI algorithms are being used with increased frequency in SE research and practice. Such algorithms are usually commissioned and certified using data from outside the SE domain. Can we assume that such algorithms can be used…

Software Engineering · Computer Science 2020-03-17 Kewen Peng , Tim Menzies

Synthetic Simplicity: Unveiling Bias in Medical Data Augmentation

Synthetic data is becoming increasingly integral in data-scarce fields such as medical imaging, serving as a substitute for real data. However, its inherent statistical characteristics can significantly impact downstream tasks, potentially…

Computer Vision and Pattern Recognition · Computer Science 2024-12-23 Krishan Agyakari Raja Babu , Rachana Sathish , Mrunal Pattanaik , Rahul Venkataramani

Enhancing Machine Learning Model Performance with Hyper Parameter Optimization: A Comparative Study

One of the most critical issues in machine learning is the selection of appropriate hyper parameters for training models. Machine learning models may be able to reach the best training performance and may increase the ability to generalize…

Machine Learning · Computer Science 2023-02-23 Caner Erden , Halil Ibrahim Demir , Abdullah Hulusi Kökçam

Be aware of overfitting by hyperparameter optimization!

Hyperparameter optimization is very frequently employed in machine learning. However, an optimization of a large space of parameters could result in overfitting of models. In recent studies on solubility prediction the authors collected…

Machine Learning · Computer Science 2024-11-26 Igor V. Tetko , Ruud van Deursen , Guillaume Godin

On the Performance of Differential Evolution for Hyperparameter Tuning

Automated hyperparameter tuning aspires to facilitate the application of machine learning for non-experts. In the literature, different optimization approaches are applied for that purpose. This paper investigates the performance of…

Machine Learning · Computer Science 2019-04-16 Mischa Schmidt , Shahd Safarani , Julia Gastinger , Tobias Jacobs , Sebastien Nicolas , Anett Schülke

The Potential Benefits of Filtering Versus Hyper-Parameter Optimization

The quality of an induced model by a learning algorithm is dependent on the quality of the training data and the hyper-parameters supplied to the learning algorithm. Prior work has shown that improving the quality of the training data…

Machine Learning · Statistics 2014-03-14 Michael R. Smith , Tony Martinez , Christophe Giraud-Carrier

Hyperparameter Optimization: Foundations, Algorithms, Best Practices and Open Challenges

Most machine learning algorithms are configured by one or several hyperparameters that must be carefully chosen and often considerably impact performance. To avoid a time consuming and unreproducible manual trial-and-error process to find…

Machine Learning · Statistics 2021-11-29 Bernd Bischl , Martin Binder , Michel Lang , Tobias Pielok , Jakob Richter , Stefan Coors , Janek Thomas , Theresa Ullmann , Marc Becker , Anne-Laure Boulesteix , Difan Deng , Marius Lindauer

Differentially Private Bayesian Optimization

Bayesian optimization is a powerful tool for fine-tuning the hyper-parameters of a wide variety of machine learning models. The success of machine learning has led practitioners in diverse real-world settings to learn classifiers for…

Machine Learning · Statistics 2015-02-24 Matt J. Kusner , Jacob R. Gardner , Roman Garnett , Kilian Q. Weinberger

Self-supervised learning for fast and scalable time series hyper-parameter tuning

Hyper-parameters of time series models play an important role in time series analysis. Slight differences in hyper-parameters might lead to very different forecast results for a given model, and therefore, selecting good hyper-parameter…

Machine Learning · Computer Science 2021-02-12 Peiyi Zhang , Xiaodong Jiang , Ginger M Holt , Nikolay Pavlovich Laptev , Caner Komurlu , Peng Gao , Yang Yu

Is One Hyperparameter Optimizer Enough?

Hyperparameter tuning is the black art of automatically finding a good combination of control parameters for a data miner. While widely applied in empirical Software Engineering, there has not been much discussion on which hyperparameter…

Artificial Intelligence · Computer Science 2018-10-04 Huy Tu , Vivek Nair

Effect of Hyper-Parameter Optimization on the Deep Learning Model Proposed for Distributed Attack Detection in Internet of Things Environment

This paper studies the effect of various hyper-parameters and their selection for the best performance of the deep learning model proposed in [1] for distributed attack detection in the Internet of Things (IoT). The findings show that there…

Machine Learning · Computer Science 2018-06-20 Md Mohaimenuzzaman , Zahraa Said Abdallah , Joarder Kamruzzaman , Bala Srinivasan

A Hybrid Deep Learning Architecture for Privacy-Preserving Mobile Analytics

Internet of Things (IoT) devices and applications are being deployed in our homes and workplaces. These devices often rely on continuous data collection to feed machine learning models. However, this approach introduces several privacy and…

Machine Learning · Computer Science 2020-06-08 Seyed Ali Osia , Ali Shahin Shamsabadi , Sina Sajadmanesh , Ali Taheri , Kleomenis Katevas , Hamid R. Rabiee , Nicholas D. Lane , Hamed Haddadi

Utility Assessment of Synthetic Data Generation Methods

Big data analysis poses the dual problem of privacy preservation and utility, i.e., how accurate data analyses remain after transforming original data in order to protect the privacy of the individuals that the data is about - and whether…

Machine Learning · Computer Science 2022-11-29 Md Sakib Nizam Khan , Niklas Reje , Sonja Buchegger