Related papers: First-order Optimization for Superquantile-based S…

Superquantile-based learning: a direct approach using gradient-based optimization

We consider a formulation of supervised learning that endows models with robustness to distributional shifts from training to testing. The formulation hinges upon the superquantile risk measure, also known as the conditional value-at-risk,…

Optimization and Control · Mathematics 2022-01-04 Yassine Laguel, Jérôme Malick, Zaid Harchaoui

Semi-Supervised Ordinal Regression Based on Empirical Risk Minimization

Ordinal regression is aimed at predicting an ordinal class label. In this paper, we consider its semi-supervised formulation, in which we have unlabeled data along with ordinal-labeled data to train an ordinal regressor. There are several…

Machine Learning · Computer Science 2021-06-11 Taira Tsuchiya, Nontawat Charoenphakdee, Issei Sato, Masashi Sugiyama

Minimax Excess Risk of First-Order Methods for Statistical Learning with Data-Dependent Oracles

In this paper, our aim is to analyse the generalization capabilities of first-order methods for statistical learning in multiple, different yet related, scenarios including supervised learning, transfer learning, robust learning and…

Machine Learning · Computer Science 2024-07-02 Kevin Scaman, Mathieu Even, Batiste Le Bars, Laurent Massoulié

Stochastic first-order methods: non-asymptotic and computer-aided analyses via potential functions

We provide a novel computer-assisted technique for systematically analyzing first-order methods for optimization. In contrast with previous works, the approach is particularly suited for handling sublinear convergence rates and stochastic…

Optimization and Control · Mathematics 2021-12-22 Adrien Taylor, Francis Bach

Taylor Learning

Empirical risk minimization stands behind most optimization in supervised machine learning. Under this scheme, labeled data is used to approximate an expected cost (risk), and a learning algorithm updates model-defining parameters in search…

Machine Learning · Statistics 2023-05-25 James Schmidt

RNN-based Online Learning: An Efficient First-Order Optimization Algorithm with a Convergence Guarantee

We investigate online nonlinear regression with continually running recurrent neural networks (RNNs), i.e., RNN-based online learning. For RNN-based online learning, we introduce an efficient first-order training algorithm that…

Machine Learning · Computer Science 2021-06-01 N. Mert Vural, Selim F. Yilmaz, Fatih Ilhan, Suleyman S. Kozat

Optimization Methods for Supervised Machine Learning: From Linear Models to Deep Learning

The goal of this tutorial is to introduce key models, algorithms, and open questions related to the use of optimization methods for solving problems arising in machine learning. It is written with an INFORMS audience in mind, specifically…

Machine Learning · Statistics 2017-07-03 Frank E. Curtis, Katya Scheinberg

Optimization with First-Order Surrogate Functions

In this paper, we study optimization methods consisting of iteratively minimizing surrogates of an objective function. By proposing several algorithmic variants and simple convergence analyses, we make two main contributions. First, we…

Machine Learning · Statistics 2013-05-15 Julien Mairal

Near-optimal and Efficient First-Order Algorithm for Multi-Task Learning with Shared Linear Representation

Multi-task learning (MTL) has emerged as a pivotal paradigm in machine learning by leveraging shared structures across multiple related tasks. Despite its empirical success, the development of likelihood-based efficiently solvable…

Machine Learning · Computer Science 2026-05-07 Shihong Ding, Fangyu Du, Cong Fang

A Comparison of First-order Algorithms for Machine Learning

Using an optimization algorithm to solve a machine learning problem is one of the mainstream approaches in the field of science. In this work, we present a comprehensive comparison of some state-of-the-art first-order optimization algorithms for…

Machine Learning · Computer Science 2014-04-29 Yu Wei, Pock Thomas

Optimization for Supervised Machine Learning: Randomized Algorithms for Data and Parameters

Many key problems in machine learning and data science are routinely modeled as optimization problems and solved via optimization algorithms. With the increase of the volume of data and the size and complexity of the statistical models used…

Optimization and Control · Mathematics 2020-08-28 Filip Hanzely

Second-Order Stochastic Optimization for Machine Learning in Linear Time

First-order stochastic methods are the state-of-the-art in large-scale machine learning optimization owing to efficient per-iteration complexity. Second-order methods, while able to provide faster convergence, have been much less explored…

Machine Learning · Statistics 2017-12-01 Naman Agarwal, Brian Bullins, Elad Hazan

Optimizing the Efficiency of First-Order Methods for Decreasing the Gradient of Smooth Convex Functions

This paper optimizes the step coefficients of first-order methods for smooth convex minimization in terms of the worst-case convergence bound (i.e., efficiency) of the decrease in the gradient norm. This work is based on the performance…

Optimization and Control · Mathematics 2020-10-28 Donghwan Kim, Jeffrey A. Fessler

Superquantiles at Work: Machine Learning Applications and Efficient Subgradient Computation

R. Tyrell Rockafellar and collaborators introduced, in a series of works, new regression modeling methods based on the notion of superquantile (or conditional value-at-risk). These methods have been influential in economics, finance,…

Optimization and Control · Mathematics 2022-01-04 Yassine Laguel, Krishna Pillutla, Jérôme Malick, Zaid Harchaoui

A New First-Order Meta-Learning Algorithm with Convergence Guarantees

Learning new tasks by drawing on prior experience gathered from other (related) tasks is a core property of any intelligent system. Gradient-based meta-learning, especially MAML and its variants, has emerged as a viable solution to…

Machine Learning · Computer Science 2024-09-06 El Mahdi Chayti, Martin Jaggi

From inexact optimization to learning via gradient concentration

Optimization in machine learning typically deals with the minimization of empirical objectives defined by training data. However, the ultimate goal of learning is to minimize the error on future data (test error), for which the training…

Machine Learning · Statistics 2021-11-08 Bernhard Stankewitz, Nicole Mücke, Lorenzo Rosasco

First-Order Methods for Convex Optimization

First-order methods for solving convex optimization problems have been at the forefront of mathematical optimization in the last 20 years. The rapid development of this important class of algorithms is motivated by the success stories…

Optimization and Control · Mathematics 2021-01-07 Pavel Dvurechensky, Mathias Staudigl, Shimrit Shtern

Fourier Transform Approach to Machine Learning I: Fourier Regression

We propose a supervised learning algorithm for machine learning applications. Contrary to model development in classical methods, which treat training, validation, and test as separate steps, in the presented approach, there is a…

Machine Learning · Computer Science 2019-09-24 Soheil Mehrabkhani

Algorithms and Theory for Supervised Gradual Domain Adaptation

The phenomenon of data distributions evolving over time has been observed in a range of applications, calling for adaptive learning algorithms. We thus study the problem of supervised gradual domain adaptation, where labeled data…

Machine Learning · Computer Science 2022-11-15 Jing Dong, Shiji Zhou, Baoxiang Wang, Han Zhao

On First-Order Meta-Learning Algorithms

This paper considers meta-learning problems, where there is a distribution of tasks, and we would like to obtain an agent that performs well (i.e., learns quickly) when presented with a previously unseen task sampled from this distribution.…

Machine Learning · Computer Science 2018-10-23 Alex Nichol, Joshua Achiam, John Schulman