English
Related papers

Related papers: Stabilizing Bi-Level Hyperparameter Optimization u…

200 papers

We tackle highly nonconvex, nonsmooth composite optimization problems whose objectives comprise a Moreau-Yosida regularized term. Classical nonconvex proximal splitting algorithms, such as nonconvex ADMM, suffer from lack of convergence for…

Optimization and Control · Mathematics 2018-02-28 Emanuel Laude , Tao Wu , Daniel Cremers

The task of hyper-parameter optimization (HPO) is burdened with heavy computational costs due to the intractability of optimizing both a model's weights and its hyper-parameters simultaneously. In this work, we introduce a new class of HPO…

Machine Learning · Computer Science 2021-12-14 Mathieu Tuli , Mahdi S. Hosseini , Konstantinos N. Plataniotis

The higher-order guaranteed lower eigenvalue bounds of the Laplacian in the recent work by Carstensen, Ern, and Puttkammer [Numer. Math. 149, 2021] require a parameter $C_{\mathrm{st},1}$ that is found $\textit{not}$ robust as the…

Numerical Analysis · Mathematics 2024-07-03 Carsten Carstensen , Benedikt Gräßle , Ngoc Tien Tran

This work focuses on addressing two major challenges in the context of large-scale nonconvex Bi-Level Optimization (BLO) problems, which are increasingly applied in machine learning due to their ability to model nested structures. These…

Optimization and Control · Mathematics 2024-05-17 Risheng Liu , Zhu Liu , Wei Yao , Shangzhi Zeng , Jin Zhang

Hyperparameter optimization (HPO) is a powerful technique for automating the tuning of machine learning (ML) models. However, in many real-world applications, accuracy is only one of multiple performance criteria that must be considered.…

Machine Learning · Computer Science 2023-05-12 Noor Awad , Ayushi Sharma , Philipp Muller , Janek Thomas , Frank Hutter

We consider a recently proposed class of MCMC methods which uses proximity maps instead of gradients to build proposal mechanisms which can be employed for both differentiable and non-differentiable targets. These methods have been shown to…

Computation · Statistics 2024-06-21 Francesca R. Crucinio , Alain Durmus , Pablo Jiménez , Gareth O. Roberts

In recent years, bilevel approaches have become very popular to efficiently estimate high-dimensional hyperparameters of machine learning models. However, to date, binary parameters are handled by continuous relaxation and rounding…

Machine Learning · Computer Science 2025-03-20 Sara Venturini , Marianna de Santis , Jordan Patracone , Francesco Rinaldi , Saverio Salzo , Martin Schmidt

Bayesian Optimization (BO) is a common solution to search optimal hyperparameters based on sample observations of a machine learning model. Existing BO algorithms could converge slowly even collapse when the potential observation noise…

Computer Vision and Pattern Recognition · Computer Science 2022-10-10 Lei Cui , Yangguang Li , Xin Lu , Dong An , Fenggang Liu

Multi-fidelity (gray-box) hyperparameter optimization techniques (HPO) have recently emerged as a promising direction for tuning Deep Learning methods. However, existing methods suffer from a sub-optimal allocation of the HPO budget to the…

Machine Learning · Computer Science 2023-06-02 Martin Wistuba , Arlind Kadra , Josif Grabocka

Hyperparameter optimization (HPO) is a necessary step to ensure the best possible performance of Machine Learning (ML) algorithms. Several methods have been developed to perform HPO; most of these are focused on optimizing one performance…

Machine Learning · Computer Science 2022-11-16 Alejandro Morales-Hernández , Inneke Van Nieuwenhuyse , Sebastian Rojas Gonzalez

In multistage decision problems, it is often the case that an initial strategic decision (such as investment) is followed by many operational ones (operating the investment). Such initial strategic decision can be seen as a parameter…

Optimization and Control · Mathematics 2026-03-17 Adrien Le Franc , Pierre Carpentier , Jean-Philippe Chancelier , Michel de Lara

Recent empirical work shows that inconsistent results based on choice of hyperparameter optimization (HPO) configuration are a widespread problem in ML research. When comparing two algorithms J and K searching one subspace can yield the…

Machine Learning · Computer Science 2022-02-18 A. Feder Cooper , Yucheng Lu , Jessica Zosa Forde , Christopher De Sa

The present work aims at the application of finite element discretizations to a class of equilibrium problems involving moving constraints. Therefore, a Moreau--Yosida based regularization technique, controlled by a parameter, is discussed…

Numerical Analysis · Mathematics 2021-10-07 Steven-Marian Stengl

We consider a general optimization problem of minimizing a composite objective functional defined over a class of probability distributions. The objective is composed of two functionals: one is assumed to possess the variational…

Machine Learning · Computer Science 2024-08-20 Dai Hai Nguyen , Tetsuya Sakurai

Recently the authors have studied a state and control constrained optimal control problem with fractional elliptic PDE as constraints. The goal of this paper is to continue that program forward and introduce an algorithm to solve such…

Optimization and Control · Mathematics 2019-12-12 Harbir Antil , Thomas S. Brown , Deepanshu Verma

Hyperparameter optimization (HPO) is concerned with the automated search for the most appropriate hyperparameter configuration (HPC) of a parameterized machine learning algorithm. A state-of-the-art HPO method is Hyperband, which, however,…

Machine Learning · Computer Science 2023-02-07 Jasmin Brandt , Marcel Wever , Dimitrios Iliadis , Viktor Bengs , Eyke Hüllermeier

We propose an algorithm to solve optimization problems constrained by partial (ordinary) differential equations under uncertainty, with almost sure constraints on the state variable. To alleviate the computational burden of high-dimensional…

Optimization and Control · Mathematics 2024-07-08 Harbir Antil , Sergey Dolgov , Akwum Onwunta

This paper consider solving a class of nonconvex-strongly-convex distributed stochastic bilevel optimization (DSBO) problems with personalized inner-level objectives. Most existing algorithms require computational loops for hypergradient…

Optimization and Control · Mathematics 2025-04-08 Youcheng Niu , Jinming Xu , Ying Sun , Yan Huang , Li Chai

Most machine learning algorithms are configured by one or several hyperparameters that must be carefully chosen and often considerably impact performance. To avoid a time consuming and unreproducible manual trial-and-error process to find…

Moreau-Yosida regularization is introduced into the framework of exact DFT. Moreau-Yosida regularization is a lossless operation on lower semicontinuous proper convex functions over separable Hilbert spaces, and when applied to the…

Numerical Analysis · Mathematics 2022-08-11 Simen Kvaal
‹ Prev 1 2 3 10 Next ›