Related papers: Optimizer Amalgamation

Student Becoming the Master: Knowledge Amalgamation for Joint Scene Parsing, Depth Estimation, and More

In this paper, we investigate a novel deep-model reusing task. Our goal is to train a lightweight and versatile student model, without human-labelled annotations, that amalgamates the knowledge and masters the expertise of two pretrained…

Computer Vision and Pattern Recognition · Computer Science 2019-04-24 Jingwen Ye , Yixin Ji , Xinchao Wang , Kairi Ou , Dapeng Tao , Mingli Song

Amalgamating Knowledge towards Comprehensive Classification

With the rapid development of deep learning, there have been an unprecedentedly large number of trained deep network models available online. Reusing such trained models can significantly reduce the cost of training the new models from…

Computer Vision and Pattern Recognition · Computer Science 2018-12-04 Chengchao Shen , Xinchao Wang , Jie Song , Li Sun , Mingli Song

Customizing Student Networks From Heterogeneous Teachers via Adaptive Knowledge Amalgamation

A massive number of well-trained deep networks have been released by developers online. These networks may focus on different tasks and in many cases are optimized for different datasets. In this paper, we study how to exploit such…

Computer Vision and Pattern Recognition · Computer Science 2019-08-21 Chengchao Shen , Mengqi Xue , Xinchao Wang , Jie Song , Li Sun , Mingli Song

Active learning to optimise time-expensive algorithm selection

Hard optimisation problems such as Boolean Satisfiability typically have long solving times and can usually be solved by many algorithms, although the performance can vary widely in practice. Research has shown that no single algorithm…

Machine Learning · Computer Science 2019-09-10 Riccardo Volpato , Guangyan Song

The Analysis of Optimization Algorithms, A Dissipativity Approach

Optimization problems in engineering and applied mathematics are typically solved in an iterative fashion, by systematically adjusting the variables of interest until an adequate solution is found. The iterative algorithms that govern these…

Optimization and Control · Mathematics 2022-05-31 Laurent Lessard

Investigation into the Training Dynamics of Learned Optimizers

Optimization is an integral part of modern deep learning. Recently, the concept of learned optimizers has emerged as a way to accelerate this optimization process by replacing traditional, hand-crafted algorithms with meta-learned…

Machine Learning · Computer Science 2023-12-13 Jan Sobotka , Petr Šimánek , Daniel Vašata

Mixing ADAM and SGD: a Combined Optimization Method

Optimization methods (optimizers) get special attention for the efficient training of neural networks in the field of deep learning. In literature there are many papers that compare neural models trained with the use of different…

Machine Learning · Computer Science 2020-11-17 Nicola Landro , Ignazio Gallo , Riccardo La Grassa

Optimization Networks for Integrated Machine Learning

Optimization networks are a new methodology for holistically solving interrelated problems that have been developed with combinatorial optimization problems in mind. In this contribution we revisit the core principles of optimization…

Machine Learning · Computer Science 2021-10-04 Michael Kommenda , Johannes Karder , Andreas Beham , Bogdan Burlacu , Gabriel Kronberger , Stefan Wagner , Michael Affenzeller

A Sequential Deep Learning Algorithm for Sampled Mixed-integer Optimisation Problems

Mixed-integer optimisation problems can be computationally challenging. Here, we introduce and analyse two efficient algorithms with a specific sequential design that are aimed at dealing with sampled problems within this class. At each…

Optimization and Control · Mathematics 2023-03-07 Mohammadreza Chamanbaz , Roland Bouffanais

Joint Optimization of Tokenization and Downstream Model

Since traditional tokenizers are isolated from a downstream task and model, they cannot output an appropriate tokenization depending on the task and model, although recent studies imply that the appropriate tokenization improves the…

Computation and Language · Computer Science 2021-05-27 Tatsuya Hiraoka , Sho Takase , Kei Uchiumi , Atsushi Keyaki , Naoaki Okazaki

AutoMixAlign: Adaptive Data Mixing for Multi-Task Preference Optimization in LLMs

When aligning large language models (LLMs), their performance on various tasks (such as being helpful, harmless, and honest) depends heavily on the composition of their training data. However, selecting a data mixture that achieves strong…

Machine Learning · Computer Science 2025-06-03 Nicholas E. Corrado , Julian Katz-Samuels , Adithya Devraj , Hyokun Yun , Chao Zhang , Yi Xu , Yi Pan , Bing Yin , Trishul Chilimbi

Active Learning Meets Optimized Item Selection

Designing recommendation systems with limited or no available training data remains a challenge. To that end, a new combinatorial optimization problem is formulated to generate optimized item selection for experimentation with the goal to…

Information Retrieval · Computer Science 2021-12-07 Bernard Kleynhans , Xin Wang , Serdar Kadıoğlu

Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves

Much as replacing hand-designed features with learned functions has revolutionized how we solve perceptual tasks, we believe learned algorithms will transform how we train models. In this work we focus on general-purpose learned optimizers…

Machine Learning · Computer Science 2020-09-24 Luke Metz , Niru Maheswaranathan , C. Daniel Freeman , Ben Poole , Jascha Sohl-Dickstein

Tutorial on amortized optimization

Optimization is a ubiquitous modeling tool and is often deployed in settings which repeatedly solve similar instances of the same problem. Amortized optimization methods use learning to predict the solutions to problems in these settings,…

Machine Learning · Computer Science 2025-10-07 Brandon Amos

Learning Randomized Algorithms with Transformers

Randomization is a powerful tool that endows algorithms with remarkable properties. For instance, randomized algorithms excel in adversarial settings, often surpassing the worst-case performance of deterministic algorithms with large…

Machine Learning · Computer Science 2024-08-21 Johannes von Oswald , Seijin Kobayashi , Yassir Akram , Angelika Steger

Optimization of the Asymptotic Property of Mutual Learning Involving an Integration Mechanism of Ensemble Learning

We propose an optimization method of mutual learning which converges into the identical state of optimum ensemble learning within the framework of on-line learning, and have analyzed its asymptotic property through the statistical mechanics…

Disordered Systems and Neural Networks · Physics 2015-05-13 Kazuyuki Hara , Takahiro Yamada

Learning with Differentiable Perturbed Optimizers

Machine learning pipelines often rely on optimization procedures to make discrete decisions (e.g., sorting, picking closest neighbors, or shortest paths). Although these discrete decisions are easily computed, they break the…

Machine Learning · Computer Science 2020-06-11 Quentin Berthet , Mathieu Blondel , Olivier Teboul , Marco Cuturi , Jean-Philippe Vert , Francis Bach

A naive aggregation algorithm for improving generalization in a class of learning problems

In this brief paper, we present a naive aggregation algorithm for a typical learning problem with expert advice setting, in which the task of improving generalization, i.e., model validation, is embedded in the learning process as a…

Machine Learning · Computer Science 2024-09-09 Getachew K Befekadu

Narrowing the Focus: Learned Optimizers for Pretrained Models

In modern deep learning, the models are learned by applying gradient updates using an optimizer, which transforms the updates based on various statistics. Optimizers are often hand-designed and tuning their hyperparameters is a big part of…

Machine Learning · Computer Science 2024-10-08 Gus Kristiansen , Mark Sandler , Andrey Zhmoginov , Nolan Miller , Anirudh Goyal , Jihwan Lee , Max Vladymyrov

optimizn: a Python Library for Developing Customized Optimization Algorithms

Combinatorial optimization problems are prevalent across a wide variety of domains. These problems are often nuanced, their optimal solutions might not be efficiently obtainable, and they may require lots of time and compute resources to…

Machine Learning · Computer Science 2025-07-03 Akshay Sathiya , Rohit Pandey