Related papers: Hyperparameter Optimization for AST Differencing

SAT-DIFF: A Tree Diffing Framework Using SAT Solving

Computing differences between tree-structured data is a critical but challenging problem in software analysis. In this paper, we propose a novel tree diffing approach called SatDiff, which reformulates the structural diffing problem into a…

Programming Languages · Computer Science 2024-04-09 Chuqin Geng, Haolin Ye, Yihan Zhang, Brigitte Pientka, Xujie Si

DAT: Dynamic Alpha Tuning for Hybrid Retrieval in Retrieval-Augmented Generation

Hybrid retrieval techniques in Retrieval-Augmented Generation (RAG) systems enhance information retrieval by combining dense and sparse (e.g., BM25-based) retrieval methods. However, existing approaches struggle with adaptability, as fixed…

Information Retrieval · Computer Science 2025-04-01 Hsin-Ling Hsu, Jengnan Tzeng

A Differential Testing Approach for Evaluating Abstract Syntax Tree Mapping Algorithms

Abstract syntax tree (AST) mapping algorithms are widely used to analyze changes in source code. Despite the foundational role of AST mapping algorithms, little effort has been made to evaluate the accuracy of AST mapping algorithms, i.e.,…

Software Engineering · Computer Science 2021-03-02 Yuanrui Fan, Xin Xia, David Lo, Ahmed E. Hassan, Yuan Wang, Shanping Li

On the Performance of Differential Evolution for Hyperparameter Tuning

Automated hyperparameter tuning aspires to facilitate the application of machine learning for non-experts. In the literature, different optimization approaches are applied for that purpose. This paper investigates the performance of…

Machine Learning · Computer Science 2019-04-16 Mischa Schmidt, Shahd Safarani, Julia Gastinger, Tobias Jacobs, Sebastien Nicolas, Anett Schülke

Automated Dynamic Algorithm Configuration

The performance of an algorithm often critically depends on its parameter configuration. While a variety of automated algorithm configuration methods have been proposed to relieve users from the tedious and error-prone task of manually…

Artificial Intelligence · Computer Science 2022-05-30 Steven Adriaensen, André Biedenkapp, Gresa Shala, Noor Awad, Theresa Eimer, Marius Lindauer, Frank Hutter

DiffTune+: Hyperparameter-Free Auto-Tuning using Auto-Differentiation

Controller tuning is a vital step to ensure the controller delivers its designed performance. DiffTune has been proposed as an automatic tuning method that unrolls the dynamical system and controller into a computational graph and uses…

Robotics · Computer Science 2023-05-16 Sheng Cheng, Lin Song, Minkyung Kim, Shenlong Wang, Naira Hovakimyan

A Novel Refactoring and Semantic Aware Abstract Syntax Tree Differencing Tool and a Benchmark for Evaluating the Accuracy of Diff Tools

Software undergoes constant changes to support new requirements, address bugs, enhance performance, and ensure maintainability. Thus, developers spend a great portion of their workday trying to understand and review the code changes of…

Software Engineering · Computer Science 2024-08-26 Pouria Alikhanifard, Nikolaos Tsantalis

AUTOMATA: Gradient Based Data Subset Selection for Compute-Efficient Hyper-parameter Tuning

Deep neural networks have seen great success in recent years; however, training a deep model is often challenging as its performance heavily depends on the hyper-parameters used. In addition, finding the optimal hyper-parameter…

Machine Learning · Computer Science 2022-03-17 Krishnateja Killamsetty, Guttu Sai Abhishek, Aakriti, Alexandre V. Evfimievski, Lucian Popa, Ganesh Ramakrishnan, Rishabh Iyer

AGD: an Auto-switchable Optimizer using Stepwise Gradient Difference for Preconditioning Matrix

Adaptive optimizers, such as Adam, have achieved remarkable success in deep learning. A key component of these optimizers is the so-called preconditioning matrix, providing enhanced gradient information and regulating the step size of each…

Machine Learning · Computer Science 2024-12-10 Yun Yue, Zhiling Ye, Jiadi Jiang, Yongchao Liu, Ke Zhang

GAST: Gradient-aligned Sparse Tuning of Large Language Models with Data-layer Selection

Parameter-Efficient Fine-Tuning (PEFT) has become a key strategy for adapting large language models, with recent advances in sparse tuning reducing overhead by selectively updating key parameters or subsets of data. Existing approaches…

Machine Learning · Computer Science 2026-03-11 Kai Yao, Zhenghan Song, Kaixin Wu, Mingjie Zhong, Danzhao Cheng, Zhaorui Tan, Yixin Ji, Penglei Gao

Tuning the Tuner: Introducing Hyperparameter Optimization for Auto-Tuning

Automatic performance tuning (auto-tuning) is widely used to optimize performance-critical applications across many scientific domains by finding the best program variant among many choices. Efficient optimization algorithms are crucial for…

Machine Learning · Computer Science 2025-10-10 Floris-Jan Willemsen, Rob V. van Nieuwpoort, Ben van Werkhoven

Optimizer Benchmarking Needs to Account for Hyperparameter Tuning

The performance of optimizers, particularly in deep learning, depends considerably on their chosen hyperparameter configuration. The efficacy of optimizers is often studied under near-optimal problem-specific hyperparameters, and finding…

Machine Learning · Computer Science 2020-08-18 Prabhu Teja Sivaprasad, Florian Mai, Thijs Vogels, Martin Jaggi, François Fleuret

Automated Configuration of Genetic Algorithms by Tuning for Anytime Performance

Finding the best configuration of algorithms' hyperparameters for a given optimization problem is an important task in evolutionary computation. We compare in this work the results of four different hyperparameter tuning approaches for a…

Neural and Evolutionary Computing · Computer Science 2022-03-18 Furong Ye, Carola Doerr, Hao Wang, Thomas Bäck

Model-based Clustering using Automatic Differentiation: Confronting Misspecification and High-Dimensional Data

We study two practically important cases of model based clustering using Gaussian Mixture Models: (1) when there is misspecification and (2) on high dimensional data, in the light of recent advances in Gradient Descent (GD) based…

Machine Learning · Statistics 2020-07-28 Siva Rajesh Kasa, Vaibhav Rajan

DADA: Differentiable Automatic Data Augmentation

Data augmentation (DA) techniques aim to increase data variability, and thus train deep networks with better generalisation. The pioneering AutoAugment automated the search for optimal DA policies with reinforcement learning. However,…

Computer Vision and Pattern Recognition · Computer Science 2020-07-31 Yonggang Li, Guosheng Hu, Yongtao Wang, Timothy Hospedales, Neil M. Robertson, Yongxin Yang

Modified Adaptive Tree-Structured Parzen Estimator for Hyperparameter Optimization

In this paper, we review hyperparameter optimization methods for machine learning models, with a particular focus on the Adaptive Tree-Structured Parzen Estimator (ATPE) algorithm. We propose several modifications to ATPE and assess their…

Machine Learning · Computer Science 2025-02-04 Szymon Sieradzki, Jacek Mańdziuk

Learning to Adapt SFT Data for Better Reasoning Generalization

Large language models (LLMs) have achieved remarkable progress, with post-training playing a crucial role in enhancing their reasoning capabilities. Among post-training paradigms, supervised fine-tuning (SFT) is widely used: it leverages…

Computation and Language · Computer Science 2026-05-27 Lisong Sun, Li Wang, Chen Zhang, Jinyang Wu, Kui Zhang, Tianhao Peng, Wenjun Wu

Autotuning by Changing Directives and Number of Threads in OpenMP using ppOpen-AT

Recently, computers have diversified architectures. To achieve high numerical calculation software performance, it is necessary to tune the software according to the target computer architecture. However, code optimization for each…

Performance · Computer Science 2023-12-12 Toma Sakurai, Satoshi Ohshima, Takahiro Katagiri, Toru Nagai

A Memory-Efficient Framework for Deformable Transformer with Neural Architecture Search

Deformable Attention Transformers (DAT) have shown remarkable performance in computer vision tasks by adaptively focusing on informative image regions. However, their data-dependent sampling mechanism introduces irregular memory access…

Computer Vision and Pattern Recognition · Computer Science 2025-07-29 Wendong Mao, Mingfan Zhao, Jianfeng Guan, Qiwei Dong, Zhongfeng Wang

The Role of Adaptive Optimizers for Honest Private Hyperparameter Selection

Hyperparameter optimization is a ubiquitous challenge in machine learning, and the performance of a trained model depends crucially upon their effective selection. While a rich set of tools exist for this purpose, there are currently no…

Machine Learning · Statistics 2021-11-10 Shubhankar Mohapatra, Sajin Sasy, Xi He, Gautam Kamath, Om Thakkar