Related papers: Parametric Majorization for Data-Driven Energy Min…

Optimization for Supervised Machine Learning: Randomized Algorithms for Data and Parameters

Many key problems in machine learning and data science are routinely modeled as optimization problems and solved via optimization algorithms. With the increase of the volume of data and the size and complexity of the statistical models used…

Optimization and Control · Mathematics 2020-08-28 Filip Hanzely

A Survey on Multi-Objective based Parameter Optimization for Deep Learning

Deep learning models form one of the most powerful machine learning models for the extraction of important features. Most of the designs of deep neural models, i.e., the initialization of parameters, are still manually tuned. Hence,…

Machine Learning · Computer Science 2023-05-18 Mrittika Chakraborty , Wreetbhas Pal , Sanghamitra Bandyopadhyay , Ujjwal Maulik

A Challenge in Reweighting Data with Bilevel Optimization

In many scenarios, one uses a large training set to train a model with the goal of performing well on a smaller testing set with a different distribution. Learning a weight for each data point of the training set is an appealing solution,…

Machine Learning · Statistics 2023-10-27 Anastasia Ivanova , Pierre Ablin

Learning Iterative Reasoning through Energy Minimization

Deep learning has excelled on complex pattern recognition tasks such as image classification and object recognition. However, it struggles with tasks requiring nontrivial reasoning, such as algorithmic computation. Humans are able to solve…

Machine Learning · Computer Science 2022-07-01 Yilun Du , Shuang Li , Joshua B. Tenenbaum , Igor Mordatch

Deep-Energy: Unsupervised Training of Deep Neural Networks

The success of deep learning has been due, in no small part, to the availability of large annotated datasets. Thus, a major bottleneck in current learning pipelines is the time-consuming human annotation of data. In scenarios where such…

Machine Learning · Computer Science 2021-01-29 Alona Golts , Daniel Freedman , Michael Elad

Bilevel Programming for Hyperparameter Optimization and Meta-Learning

We introduce a framework based on bilevel programming that unifies gradient-based hyperparameter optimization and meta-learning. We show that an approximate version of the bilevel problem can be solved by taking into explicit account the…

Machine Learning · Statistics 2018-07-04 Luca Franceschi , Paolo Frasconi , Saverio Salzo , Riccardo Grazzi , Massimilano Pontil

Guaranteed Parameter Estimation for Discrete Energy Minimization

Structural learning, a method to estimate the parameters for discrete energy minimization, has been proven to be effective in solving computer vision problems, especially in 3D scene parsing. As the complexity of the models increases,…

Computer Vision and Pattern Recognition · Computer Science 2017-01-13 Mengtian Li , Daniel Huber

On Differentiating Parameterized Argmin and Argmax Problems with Application to Bi-level Optimization

Some recent works in machine learning and computer vision involve the solution of a bi-level optimization problem. Here the solution of a parameterized lower-level problem binds variables that appear in the objective of an upper-level…

Computer Vision and Pattern Recognition · Computer Science 2016-07-22 Stephen Gould , Basura Fernando , Anoop Cherian , Peter Anderson , Rodrigo Santa Cruz , Edison Guo

Parameterized Machine Learning for High-Energy Physics

We investigate a new structure for machine learning classifiers applied to problems in high-energy physics by expanding the inputs to include not only measured features but also physics parameters. The physics parameters represent a…

High Energy Physics - Experiment · Physics 2016-05-25 Pierre Baldi , Kyle Cranmer , Taylor Faucett , Peter Sadowski , Daniel Whiteson

Large-Scale Linear Energy System Optimization: A Systematic Review on Parallelization Strategies via Decomposition

As renewable energy integration, sector coupling, and spatiotemporal detail increase, energy system optimization models grow in size and complexity, often pushing solvers to their performance limits. This systematic review explores…

Optimization and Control · Mathematics 2025-08-11 Lars Hadidi , Leonard Göke , Maximilian Hoffmann , Mario Klostermeier , Shima Sasanpour , Tim Varelmann , Vassilios Yfantis , Jochen Linßen , Detlef Stolten , Jann M. Weinand

Distributed Training and Optimization Of Neural Networks

Deep learning models are yielding increasingly better performances thanks to multiple factors. To be successful, model may have large number of parameters or complex architectures and be trained on large dataset. This leads to large…

Machine Learning · Computer Science 2022-12-20 Jean-Roch Vlimant , Junqi Yin

A Unified Multiscale Framework for Discrete Energy Minimization

Discrete energy minimization is a ubiquitous task in computer vision, yet is NP-hard in most cases. In this work we propose a multiscale framework for coping with the NP-hardness of discrete optimization. Our approach utilizes algebraic…

Computer Vision and Pattern Recognition · Computer Science 2012-04-24 Shai Bagon , Meirav Galun

Functional Bilevel Optimization for Machine Learning

In this paper, we introduce a new functional point of view on bilevel optimization problems for machine learning, where the inner objective is minimized over a function space. These types of problems are most often solved by using methods…

Machine Learning · Statistics 2024-12-10 Ieva Petrulionyte , Julien Mairal , Michael Arbel

Lower-level Duality Based Reformulation and Majorization Minimization Algorithm for Hyperparameter Optimization

Hyperparameter tuning is an important task of machine learning, which can be formulated as a bilevel program (BLP). However, most existing algorithms are not applicable for BLP with non-smooth lower-level problems. To address this, we…

Optimization and Control · Mathematics 2024-03-04 He Chen , Haochen Xu , Rujun Jiang , Anthony Man-Cho So

Efficient Compression of Overparameterized Deep Models through Low-Dimensional Learning Dynamics

Overparameterized models have proven to be powerful tools for solving various machine learning tasks. However, overparameterization often leads to a substantial increase in computational and memory costs, which in turn requires extensive…

Machine Learning · Computer Science 2024-03-13 Soo Min Kwon , Zekai Zhang , Dogyoon Song , Laura Balzano , Qing Qu

On Hyperparameter Optimization of Machine Learning Algorithms: Theory and Practice

Machine learning algorithms have been used widely in various applications and areas. To fit a machine learning model into different problems, its hyper-parameters must be tuned. Selecting the best hyper-parameter configuration for machine…

Machine Learning · Computer Science 2022-10-06 Li Yang , Abdallah Shami

A Globally Convergent Gradient-based Bilevel Hyperparameter Optimization Method

Hyperparameter optimization in machine learning is often achieved using naive techniques that only lead to an approximate set of hyperparameters. Although techniques such as Bayesian optimization perform an intelligent search on a given…

Machine Learning · Computer Science 2023-06-21 Ankur Sinha , Satender Gunwal , Shivam Kumar

Learning to Solve Constrained Bilevel Control Co-Design Problems

Learning to Optimize (L2O) is a subfield of machine learning (ML) in which ML models are trained to solve parametric optimization problems. The general goal is to learn a fast approximator of solutions to constrained optimization problems,…

Optimization and Control · Mathematics 2025-12-04 James Kotary , Himanshu Sharma , Ethan King , Draguna Vrabie , Ferdinando Fioretto , Jan Drgona

A Gradient-based Bilevel Optimization Approach for Tuning Hyperparameters in Machine Learning

Hyperparameter tuning is an active area of research in machine learning, where the aim is to identify the optimal hyperparameters that provide the best performance on the validation set. Hyperparameter tuning is often achieved using naive…

Machine Learning · Computer Science 2020-07-23 Ankur Sinha , Tanmay Khandait , Raja Mohanty

Open Problems in Applied Deep Learning

This work formulates the machine learning mechanism as a bi-level optimization problem. The inner level optimization loop entails minimizing a properly chosen loss function evaluated on the training data. This is nothing but the…

Machine Learning · Computer Science 2023-01-27 Maziar Raissi