English
Related papers

Related papers: Amazon SageMaker Automatic Model Tuning: Scalable …

200 papers

AutoML systems provide a black-box solution to machine learning problems by selecting the right way of processing features, choosing an algorithm and tuning the hyperparameters of the entire pipeline. Although these systems perform well on…

Automatic machine learning performs predictive modeling with high performing machine learning tools without human interference. This is achieved by making machine learning applications parameter-free, i.e. only a dataset is provided while…

Machine Learning · Statistics 2018-07-16 Janek Thomas , Stefan Coors , Bernd Bischl

Deep neural networks have seen great success in recent years; however, training a deep model is often challenging as its performance heavily depends on the hyper-parameters used. In addition, finding the optimal hyper-parameter…

Automatic performance tuning (auto-tuning) is essential for optimizing high-performance applications, where vast and irregular search spaces make manual exploration infeasible. While auto-tuners traditionally rely on classical approaches…

Machine Learning · Computer Science 2026-04-01 Floris-Jan Willemsen , Niki van Stein , Ben van Werkhoven

With the increasing adoption of machine learning (ML) models and systems in high-stakes settings across different industries, guaranteeing a model's performance after deployment has become crucial. Monitoring models in production is a…

Machine Learning · Computer Science 2022-08-08 David Nigenda , Zohar Karnin , Muhammad Bilal Zafar , Raghu Ramesha , Alan Tan , Michele Donini , Krishnaram Kenthapadi

Neural networks and deep learning are changing the way that artificial intelligence is being done. Efficiently choosing a suitable network architecture and fine-tune its hyper-parameters for a specific dataset is a time-consuming task given…

Machine Learning · Computer Science 2019-05-16 David Laredo , Yulin Qin , Oliver Schütze , Jian-Qiao Sun

Hyperparameter tuning, particularly the selection of an appropriate learning rate in adaptive gradient training methods, remains a challenge. To tackle this challenge, in this paper, we propose a novel parameter-free optimizer,…

Machine Learning · Computer Science 2024-05-28 Yijiang Pang , Shuyang Yu , Bao Hoang , Jiayu Zhou

Pruning is a promising approach to compress deep learning models in order to deploy them on resource-constrained edge devices. However, many existing pruning solutions are based on unstructured pruning, which yields models that cannot…

Machine Learning · Computer Science 2023-03-16 Kaiqi Zhao , Animesh Jain , Ming Zhao

Multi-Task Learning (MTL) is a foundational machine learning problem that has seen extensive development over the past decade. Recently, various optimization-based MTL approaches have been proposed to learn multiple tasks simultaneously by…

Machine Learning · Computer Science 2026-04-13 Zhipeng Zhou , Linxiao Cao , Pengcheng Wu , Peilin Zhao , Chunyan Miao

Novel technologies in automated machine learning ease the complexity of algorithm selection and hyperparameter optimization. Hyperparameters are important for machine learning models as they significantly influence the performance of…

Machine Learning · Computer Science 2021-08-31 Mohamadjavad Bahmani , Radwa El Shawi , Nshan Potikyan , Sherif Sakr

Autonomous systems are composed of several subsystems such as mechanical, propulsion, perception, planning and control. These are traditionally designed separately which makes performance optimization of the integrated system a significant…

Robotics · Computer Science 2022-03-01 Hongrui Zheng , Johannes Betz , Rahul Mangharam

Automated machine learning (AutoML) aims to find optimal machine learning solutions automatically given a machine learning problem. It could release the burden of data scientists from the multifarious manual tuning process and enable the…

Machine Learning · Computer Science 2019-07-23 Yi-Wei Chen , Qingquan Song , Xia Hu

The concept of learning to optimize involves utilizing a trainable optimization strategy rather than relying on manually defined full gradient estimations such as ADAM. We present a framework that jointly trains the full gradient estimator…

Machine Learning · Computer Science 2026-01-30 Ruiqi Wang , Diego Klabjan

The fine-tuning of pre-trained models has become ubiquitous in generative AI, computer vision, and robotics. Although much attention has been paid to improving the efficiency of fine-tuning model, there has been less scholarship around…

Computer Vision and Pattern Recognition · Computer Science 2025-11-03 Anirudh S Chakravarthy , Shuai Kyle Zheng , Xin Huang , Sachithra Hemachandra , Xiao Zhang , Yuning Chai , Zhao Chen

A major challenge in training large-scale machine learning models is configuring the training process to maximize model performance, i.e., finding the best training setup from a vast design space. In this work, we unlock a gradient-based…

Machine Learning · Statistics 2025-03-19 Logan Engstrom , Andrew Ilyas , Benjamin Chen , Axel Feldmann , William Moses , Aleksander Madry

Automatic machine learning (\AML) is a family of techniques to automate the process of training predictive models, aiming to both improve performance and make machine learning more accessible. While many recent works have focused on aspects…

Machine Learning · Computer Science 2020-03-24 Nadiia Chepurko , Ryan Marcus , Emanuel Zgraggen , Raul Castro Fernandez , Tim Kraska , David Karger

Machine learning is a powerful method for modeling in different fields such as education. Its capability to accurately predict students' success makes it an ideal tool for decision-making tasks related to higher education. The accuracy of…

Machine Learning · Computer Science 2021-05-03 Leila Zahedi , Farid Ghareh Mohammadi , Shabnam Rezapour , Matthew W. Ohland , M. Hadi Amini

Real-world artificial intelligence (AI) systems are increasingly required to operate autonomously in dynamic, uncertain, and continuously changing environments. However, most existing AI models rely on predefined objectives, static training…

Artificial Intelligence · Computer Science 2025-11-04 Hong Su

Despite their prevalence in deep-learning communities, over-parameterized models convey high demands of computational costs for proper training. This work studies the fine-grained, modular-level learning dynamics of over-parameterized…

Algebraic Multigrid (AMG) is one of the most used iterative algorithms for solving large sparse linear equations $Ax=b$. In AMG, the coarse grid is a key component that affects the efficiency of the algorithm, the construction of which…

Numerical Analysis · Mathematics 2023-08-17 Haifeng Zou , Xiaowen Xu , Chen-Song Zhang , Zeyao Mo
‹ Prev 1 2 3 10 Next ›