Related papers: Amazon SageMaker Automatic Model Tuning: Scalable …

Amazon SageMaker Autopilot: a white box AutoML solution at scale

AutoML systems provide a black-box solution to machine learning problems by selecting the right way of processing features, choosing an algorithm and tuning the hyperparameters of the entire pipeline. Although these systems perform well on…

Machine Learning · Computer Science 2020-12-17 Piali Das , Valerio Perrone , Nikita Ivkin , Tanya Bansal , Zohar Karnin , Huibin Shen , Iaroslav Shcherbatyi , Yotam Elor , Wilton Wu , Aida Zolic , Thibaut Lienart , Alex Tang , Amr Ahmed , Jean Baptiste Faddoul , Rodolphe Jenatton , Fela Winkelmolen , Philip Gautier , Leo Dirac , Andre Perunicic , Miroslav Miladinovic , Giovanni Zappella , Cédric Archambeau , Matthias Seeger , Bhaskar Dutt , Laurence Rouesnel

Automatic Gradient Boosting

Automatic machine learning performs predictive modeling with high performing machine learning tools without human interference. This is achieved by making machine learning applications parameter-free, i.e. only a dataset is provided while…

Machine Learning · Statistics 2018-07-16 Janek Thomas , Stefan Coors , Bernd Bischl

AUTOMATA: Gradient Based Data Subset Selection for Compute-Efficient Hyper-parameter Tuning

Deep neural networks have seen great success in recent years; however, training a deep model is often challenging as its performance heavily depends on the hyper-parameters used. In addition, finding the optimal hyper-parameter…

Machine Learning · Computer Science 2022-03-17 Krishnateja Killamsetty , Guttu Sai Abhishek , Aakriti , Alexandre V. Evfimievski , Lucian Popa , Ganesh Ramakrishnan , Rishabh Iyer

Automated Algorithm Design for Auto-Tuning Optimizers

Automatic performance tuning (auto-tuning) is essential for optimizing high-performance applications, where vast and irregular search spaces make manual exploration infeasible. While auto-tuners traditionally rely on classical approaches…

Machine Learning · Computer Science 2026-04-01 Floris-Jan Willemsen , Niki van Stein , Ben van Werkhoven

Amazon SageMaker Model Monitor: A System for Real-Time Insights into Deployed Machine Learning Models

With the increasing adoption of machine learning (ML) models and systems in high-stakes settings across different industries, guaranteeing a model's performance after deployment has become crucial. Monitoring models in production is a…

Machine Learning · Computer Science 2022-08-08 David Nigenda , Zohar Karnin , Muhammad Bilal Zafar , Raghu Ramesha , Alan Tan , Michele Donini , Krishnaram Kenthapadi

Automatic Model Selection for Neural Networks

Neural networks and deep learning are changing the way that artificial intelligence is being done. Efficiently choosing a suitable network architecture and fine-tune its hyper-parameters for a specific dataset is a time-consuming task given…

Machine Learning · Computer Science 2019-05-16 David Laredo , Yulin Qin , Oliver Schütze , Jian-Qiao Sun

Towards Stability of Parameter-free Optimization

Hyperparameter tuning, particularly the selection of an appropriate learning rate in adaptive gradient training methods, remains a challenge. To tackle this challenge, in this paper, we propose a novel parameter-free optimizer,…

Machine Learning · Computer Science 2024-05-28 Yijiang Pang , Shuyang Yu , Bao Hoang , Jiayu Zhou

Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions

Pruning is a promising approach to compress deep learning models in order to deploy them on resource-constrained edge devices. However, many existing pruning solutions are based on unstructured pruning, which yields models that cannot…

Machine Learning · Computer Science 2023-03-16 Kaiqi Zhao , Animesh Jain , Ming Zhao

Delve into the Applicability of Advanced Optimizers for Multi-Task Learning

Multi-Task Learning (MTL) is a foundational machine learning problem that has seen extensive development over the past decade. Recently, various optimization-based MTL approaches have been proposed to learn multiple tasks simultaneously by…

Machine Learning · Computer Science 2026-04-13 Zhipeng Zhou , Linxiao Cao , Pengcheng Wu , Peilin Zhao , Chunyan Miao

To tune or not to tune? An Approach for Recommending Important Hyperparameters

Novel technologies in automated machine learning ease the complexity of algorithm selection and hyperparameter optimization. Hyperparameters are important for machine learning models as they significantly influence the performance of…

Machine Learning · Computer Science 2021-08-31 Mohamadjavad Bahmani , Radwa El Shawi , Nshan Potikyan , Sherif Sakr

Gradient-free Multi-domain Optimization for Autonomous Systems

Autonomous systems are composed of several subsystems such as mechanical, propulsion, perception, planning and control. These are traditionally designed separately which makes performance optimization of the integrated system a significant…

Robotics · Computer Science 2022-03-01 Hongrui Zheng , Johannes Betz , Rahul Mangharam

Techniques for Automated Machine Learning

Automated machine learning (AutoML) aims to find optimal machine learning solutions automatically given a machine learning problem. It could release the burden of data scientists from the multifarious manual tuning process and enable the…

Machine Learning · Computer Science 2019-07-23 Yi-Wei Chen , Qingquan Song , Xia Hu

A Trainable Optimizer

The concept of learning to optimize involves utilizing a trainable optimization strategy rather than relying on manually defined full gradient estimations such as ADAM. We present a framework that jointly trains the full gradient estimator…

Machine Learning · Computer Science 2026-01-30 Ruiqi Wang , Diego Klabjan

PROFIT: A Specialized Optimizer for Deep Fine Tuning

The fine-tuning of pre-trained models has become ubiquitous in generative AI, computer vision, and robotics. Although much attention has been paid to improving the efficiency of fine-tuning model, there has been less scholarship around…

Computer Vision and Pattern Recognition · Computer Science 2025-11-03 Anirudh S Chakravarthy , Shuai Kyle Zheng , Xin Huang , Sachithra Hemachandra , Xiao Zhang , Yuning Chai , Zhao Chen

Optimizing ML Training with Metagradient Descent

A major challenge in training large-scale machine learning models is configuring the training process to maximize model performance, i.e., finding the best training setup from a vast design space. In this work, we unlock a gradient-based…

Machine Learning · Statistics 2025-03-19 Logan Engstrom , Andrew Ilyas , Benjamin Chen , Axel Feldmann , William Moses , Aleksander Madry

ARDA: Automatic Relational Data Augmentation for Machine Learning

Automatic machine learning (\AML) is a family of techniques to automate the process of training predictive models, aiming to both improve performance and make machine learning more accessible. While many recent works have focused on aspects…

Machine Learning · Computer Science 2020-03-24 Nadiia Chepurko , Ryan Marcus , Emanuel Zgraggen , Raul Castro Fernandez , Tim Kraska , David Karger

Search Algorithms for Automated Hyper-Parameter Tuning

Machine learning is a powerful method for modeling in different fields such as education. Its capability to accurately predict students' success makes it an ideal tool for decision-making tasks related to higher education. The accuracy of…

Machine Learning · Computer Science 2021-05-03 Leila Zahedi , Farid Ghareh Mohammadi , Shabnam Rezapour , Matthew W. Ohland , M. Hadi Amini

Active Thinking Model: A Goal-Directed Self-Improving Framework for Real-World Adaptive Intelligence

Real-world artificial intelligence (AI) systems are increasingly required to operate autonomously in dynamic, uncertain, and continuously changing environments. However, most existing AI models rely on predefined objectives, static training…

Artificial Intelligence · Computer Science 2025-11-04 Hong Su

Train Faster, Perform Better: Modular Adaptive Training in Over-Parameterized Models

Despite their prevalence in deep-learning communities, over-parameterized models convey high demands of computational costs for proper training. This work studies the fine-grained, modular-level learning dynamics of over-parameterized…

Machine Learning · Computer Science 2024-05-14 Yubin Shi , Yixuan Chen , Mingzhi Dong , Xiaochen Yang , Dongsheng Li , Yujiang Wang , Robert P. Dick , Qin Lv , Yingying Zhao , Fan Yang , Tun Lu , Ning Gu , Li Shang

AutoAMG($\theta$): An Auto-tuned AMG Method Based on Deep Learning for Strong Threshold

Algebraic Multigrid (AMG) is one of the most used iterative algorithms for solving large sparse linear equations $Ax=b$. In AMG, the coarse grid is a key component that affects the efficiency of the algorithm, the construction of which…

Numerical Analysis · Mathematics 2023-08-17 Haifeng Zou , Xiaowen Xu , Chen-Song Zhang , Zeyao Mo