English
Related papers

Related papers: Model-based Asynchronous Hyperparameter and Neural…

200 papers

While existing work on neural architecture search (NAS) tunes hyperparameters in a separate post-processing step, we demonstrate that architectural choices and other hyperparameter settings interact in a way that can render this separation…

Machine Learning · Computer Science 2018-07-19 Arber Zela , Aaron Klein , Stefan Falkner , Frank Hutter

In the literature on hyper-parameter tuning, a number of recent solutions rely on low-fidelity observations (e.g., training with sub-sampled datasets) in order to efficiently identify promising configurations to be then tested via…

Machine Learning · Computer Science 2022-12-05 Pedro Mendes , Maria Casimiro , Paolo Romano , David Garlan

Performance of machine learning algorithms depends critically on identifying a good set of hyperparameters. While recent approaches use Bayesian optimization to adaptively select configurations, we focus on speeding up random search through…

Machine Learning · Computer Science 2018-06-20 Lisha Li , Kevin Jamieson , Giulia DeSalvo , Afshin Rostamizadeh , Ameet Talwalkar

Neural architecture search (NAS) is an attractive approach to automate the design of optimized architectures but is constrained by high computational budget, especially when optimizing for multiple, important conflicting objectives. To…

Machine Learning · Computer Science 2025-09-03 Zhao Wei , Chin Chun Ooi , Yew-Soon Ong

Much recent research has been conducted in the area of Bayesian learning, particularly with regard to the optimization of hyper-parameters via Gaussian process regression. The methodologies rely chiefly on the method of maximizing the…

Machine Learning · Statistics 2014-05-13 James Brofos

A fully Bayesian approach is proposed for ultrahigh-dimensional nonparametric additive models in which the number of additive components may be larger than the sample size, though ideally the true model is believed to include only a small…

Methodology · Statistics 2013-09-24 Zuofeng Shang , Ping Li

Inference of latent feature models in the Bayesian nonparametric setting is generally difficult, especially in high dimensional settings, because it usually requires proposing features from some prior distribution. In special cases, where…

Machine Learning · Statistics 2022-06-14 Michael Minyi Zhang , Sinead A. Williamson , Fernando Perez-Cruz

Methods for neural network hyperparameter optimization and meta-modeling are computationally expensive due to the need to train a large number of model configurations. In this paper, we show that standard frequentist regression models can…

Machine Learning · Computer Science 2017-11-09 Bowen Baker , Otkrist Gupta , Ramesh Raskar , Nikhil Naik

Recent years have witnessed tremendously improved efficiency of Automated Machine Learning (AutoML), especially Automated Deep Learning (AutoDL) systems, but recent work focuses on tabular, image, or NLP tasks. So far, little attention has…

Machine Learning · Computer Science 2022-07-25 Difan Deng , Florian Karl , Frank Hutter , Bernd Bischl , Marius Lindauer

Modern statistical applications involving large data sets have focused attention on statistical methodologies which are both efficient computationally and able to deal with the screening of large numbers of different candidate models. Here…

Methodology · Statistics 2014-02-26 David J. Nott , Minh-Ngoc Tran , Chenlei Leng

Due to the high computational demands executing a rigorous comparison between hyperparameter optimization (HPO) methods is often cumbersome. The goal of this paper is to facilitate a better empirical evaluation of HPO methods by providing…

Machine Learning · Computer Science 2019-05-14 Aaron Klein , Frank Hutter

Modern Neural Architecture Search methods have repeatedly broken state-of-the-art results for several disciplines. The super-network, a central component of many such methods, enables quick estimates of accuracy or loss statistics for any…

Machine Learning · Computer Science 2021-12-16 Kevin Alexander Laube , Andreas Zell

Hardware-Software Co-Design is a highly successful strategy for improving performance of domain-specific computing systems. We argue for the application of the same methodology to deep learning; specifically, we propose to extend neural…

Machine Learning · Computer Science 2020-01-10 Andrew Anderson , Jing Su , Rozenn Dahyot , David Gregg

Neural Architecture Search (NAS) automates the design of high-performing neural networks but typically targets a single predefined task, thereby restricting its real-world applicability. To address this, Meta Neural Architecture Search…

Machine Learning · Computer Science 2025-08-14 Zijun Sun , Yanning Shen

The growing capacity of neural networks has strongly contributed to their success at complex machine learning tasks and the computational demand of such large models has, in turn, stimulated a significant improvement in the hardware…

Machine Learning · Computer Science 2021-04-27 Mohammed Amer , Tomás Maul , Iman Yi Liao

Neural Architecture Search (NAS) has been quite successful in constructing state-of-the-art models on a variety of tasks. Unfortunately, the computational cost can make it difficult to scale. In this paper, we make the first attempt to…

Machine Learning · Computer Science 2019-11-18 Albert Shaw , Wei Wei , Weiyang Liu , Le Song , Bo Dai

Modern deep learning methods are very sensitive to many hyperparameters, and, due to the long training times of state-of-the-art models, vanilla Bayesian hyperparameter optimization is typically computationally infeasible. On the other…

Machine Learning · Computer Science 2018-07-06 Stefan Falkner , Aaron Klein , Frank Hutter

We present two novel hyperparameter optimization strategies for optimization of deep learning models with a modular architecture constructed of multiple subnetworks. As complex networks with multiple subnetworks become more frequently…

Machine Learning · Computer Science 2022-02-25 Alex H. Treacher , Albert Montillo

Several novel frameworks for hyperparameter search have emerged in the last decade, but most rely on strict, often normal, distributional assumptions, limiting search model flexibility. This paper proposes a novel optimization framework…

Machine Learning · Computer Science 2023-11-29 Riccardo Doyle

This paper proposes a neural architecture search space using ResNet as a framework, with search objectives including parameters for convolution, pooling, fully connected layers, and connectivity of the residual network. In addition to…

Neural and Evolutionary Computing · Computer Science 2025-11-03 Shang Wang , Huanrong Tang , Jianquan Ouyang
‹ Prev 1 2 3 10 Next ›