Related papers: Superquantile-based learning: a direct approach us…

Superquantiles at Work: Machine Learning Applications and Efficient Subgradient Computation

R. Tyrell Rockafellar and collaborators introduced, in a series of works, new regression modeling methods based on the notion of superquantile (or conditional value-at-risk). These methods have been influential in economics, finance,…

Optimization and Control · Mathematics 2022-01-04 Yassine Laguel , Krishna Pillutla , Jérôme Malick , Zaid Harchaoui

First-order Optimization for Superquantile-based Supervised Learning

Classical supervised learning via empirical risk (or negative log-likelihood) minimization hinges upon the assumption that the testing distribution coincides with the training distribution. This assumption can be challenged in modern…

Optimization and Control · Mathematics 2020-10-02 Yassine Laguel , Jérôme Malick , Zaid Harchaoui

Robust supervised learning with coordinate gradient descent

This paper considers the problem of supervised learning with linear methods when both features and labels can be corrupted, either in the form of heavy tailed data and/or corrupted rows. We introduce a combination of coordinate gradient…

Machine Learning · Statistics 2022-02-01 Stéphane Gaïffas , Ibrahim Merad

Bundled Gradients through Contact via Randomized Smoothing

The empirical success of derivative-free methods in reinforcement learning for planning through contact seems at odds with the perceived fragility of classical gradient-based optimization methods in these domains. What is causing this gap,…

Robotics · Computer Science 2022-01-25 H. J. Terry Suh , Tao Pang , Russ Tedrake

An Adaptive Gradient Method with Energy and Momentum

We introduce a novel algorithm for gradient-based optimization of stochastic objective functions. The method may be seen as a variant of SGD with momentum equipped with an adaptive learning rate automatically adjusted by an 'energy'…

Optimization and Control · Mathematics 2022-03-24 Hailiang Liu , Xuping Tian

Reinforcement Learning by Value Gradients

The concept of the value-gradient is introduced and developed in the context of reinforcement learning. It is shown that by learning the value-gradients exploration or stochastic behaviour is no longer needed to find locally optimal…

Neural and Evolutionary Computing · Computer Science 2008-03-26 Michael Fairbank

Gradients as Features for Deep Representation Learning

We address the challenging problem of deep representation learning--the efficient adaption of a pre-trained deep network to different tasks. Specifically, we propose to explore gradient-based features. These features are gradients of the…

Machine Learning · Computer Science 2020-04-14 Fangzhou Mu , Yingyu Liang , Yin Li

Quantum Semi-Supervised Kernel Learning

Quantum computing leverages quantum effects to build algorithms that are faster then their classical variants. In machine learning, for a given model architecture, the speed of training the model is typically determined by the size of the…

Machine Learning · Computer Science 2022-04-25 Seyran Saeedi , Aliakbar Panahi , Tom Arodz

Optimal Learning for Sequential Decision Making for Expensive Cost Functions with Stochastic Binary Feedbacks

We consider the problem of sequentially making decisions that are rewarded by "successes" and "failures" which can be predicted through an unknown relationship that depends on a partially controllable vector of attributes for each instance.…

Machine Learning · Statistics 2017-09-18 Yingfei Wang , Chu Wang , Warren Powell

Approximate Stochastic Subgradient Estimation Training for Support Vector Machines

Subgradient algorithms for training support vector machines have been quite successful for solving large-scale and online learning problems. However, they have been restricted to linear kernels and strongly convex formulations. This paper…

Machine Learning · Computer Science 2011-11-04 Sangkyun Lee , Stephen J. Wright

Gradient Descent with Provably Tuned Learning-rate Schedules

Gradient-based iterative optimization methods are the workhorse of modern machine learning. They crucially rely on careful tuning of parameters like learning rate and momentum. However, one typically sets them using heuristic approaches…

Machine Learning · Computer Science 2025-12-05 Dravyansh Sharma

Discriminative, Generative and Self-Supervised Approaches for Target-Agnostic Learning

Supervised learning, characterized by both discriminative and generative learning, seeks to predict the values of single (or sometimes multiple) predefined target attributes based on a predefined set of predictor attributes. For…

Machine Learning · Computer Science 2020-11-13 Yuan Jin , Wray Buntine , Francois Petitjean , Geoffrey I. Webb

Data-Driven Qubit Characterization and Optimal Control using Deep Learning

Quantum computing requires the optimization of control pulses to achieve high-fidelity quantum gates. We propose a machine learning-based protocol to address the challenges of evaluating gradients and modeling complex system dynamics. By…

Quantum Physics · Physics 2026-01-27 Paul Surrey , Julian D. Teske , Tobias Hangleiter , Hendrik Bluhm , Pascal Cerfontaine

Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies

Gradient-based methods have been widely used for system design and optimization in diverse application domains. Recently, there has been a renewed interest in studying theoretical properties of these methods in the context of control and…

Optimization and Control · Mathematics 2022-10-11 Bin Hu , Kaiqing Zhang , Na Li , Mehran Mesbahi , Maryam Fazel , Tamer Başar

Soft Gradient Boosting with Learnable Feature Transforms for Sequential Regression

We propose a soft gradient boosting framework for sequential regression that embeds a learnable linear feature transform within the boosting procedure. At each boosting iteration, we train a soft decision tree and learn a linear input…

Machine Learning · Computer Science 2025-09-17 Huseyin Karaca , Suleyman Serdar Kozat

Robust Stochastic Optimization via Gradient Quantile Clipping

We introduce a clipping strategy for Stochastic Gradient Descent (SGD) which uses quantiles of the gradient norm as clipping thresholds. We prove that this new strategy provides a robust and efficient optimization algorithm for smooth…

Machine Learning · Statistics 2024-10-15 Ibrahim Merad , Stéphane Gaïffas

Towards Poisoning of Deep Learning Algorithms with Back-gradient Optimization

A number of online services nowadays rely upon machine learning to extract valuable information from data collected in the wild. This exposes learning algorithms to the threat of data poisoning, i.e., a coordinate attack in which a fraction…

Machine Learning · Computer Science 2017-08-30 Luis Muñoz-González , Battista Biggio , Ambra Demontis , Andrea Paudice , Vasin Wongrassamee , Emil C. Lupu , Fabio Roli

Using Deep Q-Learning to Control Optimization Hyperparameters

We present a novel definition of the reinforcement learning state, actions and reward function that allows a deep Q-network (DQN) to learn to control an optimization hyperparameter. Using Q-learning with experience replay, we train two DQNs…

Optimization and Control · Mathematics 2016-06-21 Samantha Hansen

A Stochastic Gradient Method with an Exponential Convergence Rate for Finite Training Sets

We propose a new stochastic gradient method for optimizing the sum of a finite set of smooth functions, where the sum is strongly convex. While standard stochastic gradient methods converge at sublinear rates for this problem, the proposed…

Optimization and Control · Mathematics 2013-03-12 Nicolas Le Roux , Mark Schmidt , Francis Bach

Proximal boosting: aggregating weak learners to minimize non-differentiable losses

Gradient boosting is a prediction method that iteratively combines weak learners to produce a complex and accurate model. From an optimization point of view, the learning procedure of gradient boosting mimics a gradient descent on a…

Machine Learning · Computer Science 2022-11-30 Erwan Fouillen , Claire Boyer , Maxime Sangnier