Related papers: Stabilizing Bi-Level Hyperparameter Optimization u…

A Nonconvex Proximal Splitting Algorithm under Moreau-Yosida Regularization

We tackle highly nonconvex, nonsmooth composite optimization problems whose objectives comprise a Moreau-Yosida regularized term. Classical nonconvex proximal splitting algorithms, such as nonconvex ADMM, suffer from lack of convergence for…

Optimization and Control · Mathematics 2018-02-28 Emanuel Laude , Tao Wu , Daniel Cremers

Towards Robust and Automatic Hyper-Parameter Tunning

The task of hyper-parameter optimization (HPO) is burdened with heavy computational costs due to the intractability of optimizing both a model's weights and its hyper-parameters simultaneously. In this work, we introduce a new class of HPO…

Machine Learning · Computer Science 2021-12-14 Mathieu Tuli , Mahdi S. Hosseini , Konstantinos N. Plataniotis

Adaptive hybrid high-order method for guaranteed lower eigenvalue bounds

The higher-order guaranteed lower eigenvalue bounds of the Laplacian in the recent work by Carstensen, Ern, and Puttkammer [Numer. Math. 149, 2021] require a parameter $C_{\mathrm{st},1}$ that is found $\textit{not}$ robust as the…

Numerical Analysis · Mathematics 2024-07-03 Carsten Carstensen , Benedikt Gräßle , Ngoc Tien Tran

Moreau Envelope for Nonconvex Bi-Level Optimization: A Single-loop and Hessian-free Solution Strategy

This work focuses on addressing two major challenges in the context of large-scale nonconvex Bi-Level Optimization (BLO) problems, which are increasingly applied in machine learning due to their ability to model nested structures. These…

Optimization and Control · Mathematics 2024-05-17 Risheng Liu , Zhu Liu , Wei Yao , Shangzhi Zeng , Jin Zhang

MO-DEHB: Evolutionary-based Hyperband for Multi-Objective Optimization

Hyperparameter optimization (HPO) is a powerful technique for automating the tuning of machine learning (ML) models. However, in many real-world applications, accuracy is only one of multiple performance criteria that must be considered.…

Machine Learning · Computer Science 2023-05-12 Noor Awad , Ayushi Sharma , Philipp Muller , Janek Thomas , Frank Hutter

Optimal Scaling Results for Moreau-Yosida Metropolis-adjusted Langevin Algorithms

We consider a recently proposed class of MCMC methods which uses proximity maps instead of gradients to build proposal mechanisms which can be employed for both differentiable and non-differentiable targets. These methods have been shown to…

Computation · Statistics 2024-06-21 Francesca R. Crucinio , Alain Durmus , Pablo Jiménez , Gareth O. Roberts

Relax and penalize: a new bilevel approach to mixed-binary hyperparameter optimization

In recent years, bilevel approaches have become very popular to efficiently estimate high-dimensional hyperparameters of machine learning models. However, to date, binary parameters are handled by continuous relaxation and rounding…

Machine Learning · Computer Science 2025-03-20 Sara Venturini , Marianna de Santis , Jordan Patracone , Francesco Rinaldi , Saverio Salzo , Martin Schmidt

Neighbor Regularized Bayesian Optimization for Hyperparameter Optimization

Bayesian Optimization (BO) is a common solution to search optimal hyperparameters based on sample observations of a machine learning model. Existing BO algorithms could converge slowly even collapse when the potential observation noise…

Computer Vision and Pattern Recognition · Computer Science 2022-10-10 Lei Cui , Yangguang Li , Xin Lu , Dong An , Fenggang Liu

Supervising the Multi-Fidelity Race of Hyperparameter Configurations

Multi-fidelity (gray-box) hyperparameter optimization techniques (HPO) have recently emerged as a promising direction for tuning Deep Learning methods. However, existing methods suffer from a sub-optimal allocation of the HPO budget to the…

Machine Learning · Computer Science 2023-06-02 Martin Wistuba , Arlind Kadra , Josif Grabocka

A survey on multi-objective hyperparameter optimization algorithms for Machine Learning

Hyperparameter optimization (HPO) is a necessary step to ensure the best possible performance of Machine Learning (ML) algorithms. Several methods have been developed to perform HPO; most of these are focused on optimizing one performance…

Machine Learning · Computer Science 2022-11-16 Alejandro Morales-Hernández , Inneke Van Nieuwenhuyse , Sebastian Rojas Gonzalez

Differentiability and Regularization of Parametric Convex Value Functions in Stochastic Multistage Optimization

In multistage decision problems, it is often the case that an initial strategic decision (such as investment) is followed by many operational ones (operating the investment). Such initial strategic decision can be seen as a parameter…

Optimization and Control · Mathematics 2026-03-17 Adrien Le Franc , Pierre Carpentier , Jean-Philippe Chancelier , Michel de Lara

Hyperparameter Optimization Is Deceiving Us, and How to Stop It

Recent empirical work shows that inconsistent results based on choice of hyperparameter optimization (HPO) configuration are a widespread problem in ML research. When comparing two algorithms J and K searching one subspace can yield the…

Machine Learning · Computer Science 2022-02-18 A. Feder Cooper , Yucheng Lu , Jessica Zosa Forde , Christopher De Sa

Combined Regularization and Discretization of Equilibrium Problems and Primal-Dual Gap Estimators

The present work aims at the application of finite element discretizations to a class of equilibrium problems involving moving constraints. Therefore, a Moreau--Yosida based regularization technique, controlled by a parameter, is discussed…

Numerical Analysis · Mathematics 2021-10-07 Steven-Marian Stengl

Moreau-Yoshida Variational Transport: A General Framework For Solving Regularized Distributional Optimization Problems

We consider a general optimization problem of minimizing a composite objective functional defined over a class of probability distributions. The objective is composed of two functionals: one is assumed to possess the variational…

Machine Learning · Computer Science 2024-08-20 Dai Hai Nguyen , Tetsuya Sakurai

Moreau-Yosida Regularization for Optimal Control of Fractional Elliptic Problems with State and Control Constraints

Recently the authors have studied a state and control constrained optimal control problem with fractional elliptic PDE as constraints. The goal of this paper is to continue that program forward and introduce an algorithm to solve such…

Optimization and Control · Mathematics 2019-12-12 Harbir Antil , Thomas S. Brown , Deepanshu Verma

Iterative Deepening Hyperband

Hyperparameter optimization (HPO) is concerned with the automated search for the most appropriate hyperparameter configuration (HPC) of a parameterized machine learning algorithm. A state-of-the-art HPO method is Hyperband, which, however,…

Machine Learning · Computer Science 2023-02-07 Jasmin Brandt , Marcel Wever , Dimitrios Iliadis , Viktor Bengs , Eyke Hüllermeier

Smoothed Moreau-Yosida Tensor Train Approximation of State-constrained Optimization Problems under Uncertainty

We propose an algorithm to solve optimization problems constrained by partial (ordinary) differential equations under uncertainty, with almost sure constraints on the state variable. To alleviate the computational burden of high-dimensional…

Optimization and Control · Mathematics 2024-07-08 Harbir Antil , Sergey Dolgov , Akwum Onwunta

Distributed Stochastic Bilevel Optimization: Improved Complexity and Heterogeneity Analysis

This paper consider solving a class of nonconvex-strongly-convex distributed stochastic bilevel optimization (DSBO) problems with personalized inner-level objectives. Most existing algorithms require computational loops for hypergradient…

Optimization and Control · Mathematics 2025-04-08 Youcheng Niu , Jinming Xu , Ying Sun , Yan Huang , Li Chai

Hyperparameter Optimization: Foundations, Algorithms, Best Practices and Open Challenges

Most machine learning algorithms are configured by one or several hyperparameters that must be carefully chosen and often considerably impact performance. To avoid a time consuming and unreproducible manual trial-and-error process to find…

Machine Learning · Statistics 2021-11-29 Bernd Bischl , Martin Binder , Michel Lang , Tobias Pielok , Jakob Richter , Stefan Coors , Janek Thomas , Theresa Ullmann , Marc Becker , Anne-Laure Boulesteix , Difan Deng , Marius Lindauer

Moreau--Yosida regularization in DFT

Moreau-Yosida regularization is introduced into the framework of exact DFT. Moreau-Yosida regularization is a lossless operation on lower semicontinuous proper convex functions over separable Hilbert spaces, and when applied to the…

Numerical Analysis · Mathematics 2022-08-11 Simen Kvaal