Related papers: Stacking of Hyperparameter Tuned Models for Taggin…

Enhancing binary classification: A new stacking method via leveraging computational geometry

Stacking, a potent ensemble learning method, leverages a meta-model to harness the strengths of multiple base models, thereby enhancing prediction accuracy. Traditional stacking techniques typically utilize established learning models, such…

Machine Learning · Computer Science 2024-10-31 Wei Wu, Liang Tang, Zhongjie Zhao, Chung-Piaw Teo

A Generalized Stacking for Implementing Ensembles of Gradient Boosting Machines

The gradient boosting machine is a powerful tool for solving regression problems. In order to cope with its shortcomings, an approach for constructing ensembles of gradient boosting models is proposed. The main idea behind the…

Machine Learning · Computer Science 2020-10-14 Andrei V. Konstantinov, Lev V. Utkin

Modeling Programs Hierarchically with Stack-Augmented LSTM

Programming language modeling has attracted extensive attention in recent years, and it plays an essential role in program processing fields. Statistical language models, which are initially designed for natural languages, have been…

Software Engineering · Computer Science 2020-02-12 Fang Liu, Lu Zhang, Zhi Jin

Tuning the Tuner: Introducing Hyperparameter Optimization for Auto-Tuning

Automatic performance tuning (auto-tuning) is widely used to optimize performance-critical applications across many scientific domains by finding the best program variant among many choices. Efficient optimization algorithms are crucial for…

Machine Learning · Computer Science 2025-10-10 Floris-Jan Willemsen, Rob V. van Nieuwpoort, Ben van Werkhoven

Stacking Neural Network Models for Automatic Short Answer Scoring

Automatic short answer scoring is one of the text classification problems to assess students' answers during exams automatically. Several challenges can arise in making an automatic short answer scoring system, one of which is the quantity…

Computation and Language · Computer Science 2021-03-24 Rian Adam Rajagede, Rochana Prih Hastuti

To tune or not to tune? An Approach for Recommending Important Hyperparameters

Novel technologies in automated machine learning ease the complexity of algorithm selection and hyperparameter optimization. Hyperparameters are important for machine learning models as they significantly influence the performance of…

Machine Learning · Computer Science 2021-08-31 Mohamadjavad Bahmani, Radwa El Shawi, Nshan Potikyan, Sherif Sakr

On Hyper-parameter Tuning for Stochastic Optimization Algorithms

This paper proposes the first-ever reinforcement-learning-based algorithmic framework for tuning the hyper-parameters of stochastic optimization algorithms. Hyper-parameters have a significant influence on the performance of stochastic…

Machine Learning · Computer Science 2020-03-11 Haotian Zhang, Jianyong Sun, Zongben Xu

Augmenting Parameter-Efficient Pre-trained Language Models with Large Language Models

Training AI models in cybersecurity with the help of vast datasets offers significant opportunities to mimic real-world behaviors effectively. However, challenges like data drift and scarcity of labelled data lead to frequent updates of models…

Machine Learning · Computer Science 2026-02-04 Saurabh Anand, Shubham Malaviya, Manish Shukla, Sachin Lodha

Optimizing Ensemble Weights and Hyperparameters of Machine Learning Models for Regression Problems

Aggregating multiple learners through an ensemble of models aims to make better predictions by capturing the underlying distribution of the data more accurately. Different ensembling methods, such as bagging, boosting, and stacking/blending,…

Machine Learning · Statistics 2020-11-03 Mohsen Shahhosseini, Guiping Hu, Hieu Pham

Random Sampling for Distributed Coded Matrix Multiplication

Matrix multiplication is a fundamental building block for large scale computations arising in various applications, including machine learning. There has been significant recent interest in using coding to speed up distributed matrix…

Information Theory · Computer Science 2019-05-17 Wei-Ting Chang, Ravi Tandon

Stacking With Auxiliary Features

Ensembling methods are well known for improving prediction accuracy. However, they are limited in the sense that they cannot discriminate among component models effectively. In this paper, we propose stacking with auxiliary features that…

Computation and Language · Computer Science 2016-05-30 Nazneen Fatema Rajani, Raymond J. Mooney

Predicting Tags For Programming Tasks by Combining Textual And Source Code Data

Competitive programming remains a very popular activity that combines both software engineering and education. In order to prepare and to practice, contestants use extensive archives of problems from past contests available on various…

Software Engineering · Computer Science 2023-01-12 Artyom Lobanov, Egor Bogomolov, Yaroslav Golubev, Mikhail Mirzayanov, Timofey Bryksin

Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems

Training next-generation code generation models requires high-quality datasets, yet existing datasets face difficulty imbalance, format inconsistency, and data quality problems. We address these challenges through systematic data processing…

Computation and Language · Computer Science 2026-03-10 Zongqian Li, Tengchao Lv, Shaohan Huang, Yixuan Su, Qinzheng Sun, Qiufeng Yin, Ying Xin, Scarlett Li, Lei Cui, Nigel Collier, Furu Wei

On Hyperparameter Optimization of Machine Learning Algorithms: Theory and Practice

Machine learning algorithms have been used widely in various applications and areas. To fit a machine learning model into different problems, its hyper-parameters must be tuned. Selecting the best hyper-parameter configuration for machine…

Machine Learning · Computer Science 2022-10-06 Li Yang, Abdallah Shami

A Channel Coding Benchmark for Meta-Learning

Meta-learning provides a popular and effective family of methods for data-efficient learning of new tasks. However, several important issues in meta-learning have proven hard to study thus far. For example, performance degrades in…

Machine Learning · Computer Science 2021-12-03 Rui Li, Ondrej Bohdal, Rajesh Mishra, Hyeji Kim, Da Li, Nicholas Lane, Timothy Hospedales

From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers

Instruction tuning -- tuning large language models on instruction-output pairs -- is a promising technique for making models better adapted to the real world. Yet, the key factors driving the model's capability to understand and follow…

Computation and Language · Computer Science 2024-06-03 Dylan Zhang, Justin Wang, Francois Charton

Making a Science of Model Search

Many computer vision algorithms depend on a variety of parameter choices and settings that are typically hand-tuned in the course of evaluating the algorithm. While such parameter tuning is often presented as being incidental to the…

Computer Vision and Pattern Recognition · Computer Science 2012-09-25 J. Bergstra, D. Yamins, D. D. Cox

Tuning support vector machines and boosted trees using optimization algorithms

Statistical learning methods have been growing in popularity in recent years. Many of these procedures have parameters that must be tuned for models to perform well. Research has been extensive in neural networks, but not for many other…

Machine Learning · Statistics 2023-03-15 Jill F. Lundell

AI-Assisted Unit Test Writing and Test-Driven Code Refactoring: A Case Study

Many software systems originate as prototypes or minimum viable products (MVPs), developed with an emphasis on delivery speed and responsiveness to changing requirements rather than long-term code maintainability. While effective for rapid…

Software Engineering · Computer Science 2026-04-06 Ema Smolic, Mario Brcic, Luka Hobor, Mihael Kovac

Approximate Distributed Coded Computing: Polynomial Codes and Randomized Sketching

Coded computing is a distributed paradigm that uses coding theory to introduce redundancy and overcome bottlenecks in large-scale systems. In the same vein, randomized numerical linear algebra employs probabilistic methods to…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-19 Neophytos Charalambides, Arya Mazumdar