Related papers: Decision-Focused Learning with Directional Gradien…

Robust Losses for Decision-Focused Learning

Optimization models used to make discrete decisions often contain uncertain parameters that are context-dependent and estimated through prediction. To account for the quality of the decision made based on the prediction, decision-focused…

Machine Learning · Computer Science 2024-07-30 Noah Schutte , Krzysztof Postek , Neil Yorke-Smith

Surrogate Losses for Online Learning of Stepsizes in Stochastic Non-Convex Optimization

Stochastic Gradient Descent (SGD) has played a central role in machine learning. However, it requires a carefully hand-picked stepsize for fast convergence, which is notoriously tedious and time-consuming to tune. Over the last several…

Machine Learning · Computer Science 2019-06-10 Zhenxun Zhuang , Ashok Cutkosky , Francesco Orabona

A Dual Perspective on Decision-Focused Learning: Scalable Training via Dual-Guided Surrogates

Many real-world decisions are made under uncertainty by solving optimization problems using predicted quantities. This predict-then-optimize paradigm has motivated decision-focused learning, which trains models with awareness of how the…

Machine Learning · Computer Science 2025-11-10 Paula Rodriguez-Diaz , Kirk Bansak Elisabeth Paulson

Zero Grads: Learning Local Surrogate Losses for Non-Differentiable Graphics

Gradient-based optimization is now ubiquitous across graphics, but unfortunately can not be applied to problems with undefined or zero gradients. To circumvent this issue, the loss function can be manually replaced by a ``surrogate'' that…

Computer Vision and Pattern Recognition · Computer Science 2024-05-08 Michael Fischer , Tobias Ritschel

Minimizing Surrogate Losses for Decision-Focused Learning using Differentiable Optimization

Decision-focused learning (DFL) trains a machine learning (ML) model to predict parameters of an optimization problem, to directly minimize decision regret, i.e., maximize decision quality. Gradient-based DFL requires computing the…

Machine Learning · Computer Science 2025-08-26 Jayanta Mandi , Ali İrfan Mahmutoğulları , Senne Berden , Tias Guns

Prediction Loss Guided Decision-Focused Learning

Decision-making under uncertainty is often considered in two stages: predicting the unknown parameters, and then optimizing decisions based on predictions. While traditional prediction-focused learning (PFL) treats these two stages…

Machine Learning · Computer Science 2025-09-11 Haeun Jeon , Hyunglip Bae , Chanyeong Kim , Yongjae Lee , Woo Chang Kim

Projected Gradient Descent for Constrained Decision-Dependent Optimization

This paper considers the decision-dependent optimization problem, where the data distributions react in response to decisions affecting both the objective function and linear constraints. We propose a new method termed repeated projected…

Optimization and Control · Mathematics 2025-08-13 Zifan Wang , Changxin Liu , Thomas Parisini , Michael M. Zavlanos , Karl H. Johansson

On the Outsized Importance of Learning Rates in Local Update Methods

We study a family of algorithms, which we refer to as local update methods, that generalize many federated learning and meta-learning algorithms. We prove that for quadratic objectives, local update methods perform stochastic gradient…

Machine Learning · Computer Science 2020-07-03 Zachary Charles , Jakub Konečný

Learning Differentiable Surrogate Losses for Structured Prediction

Structured prediction involves learning to predict complex structures rather than simple scalar values. The main challenge arises from the non-Euclidean nature of the output space, which generally requires relaxing the problem formulation.…

Machine Learning · Statistics 2024-11-19 Junjie Yang , Matthieu Labeau , Florence d'Alché-Buc

Learning Surrogate Losses

The minimization of loss functions is the heart and soul of Machine Learning. In this paper, we propose an off-the-shelf optimization approach that can minimize virtually any non-differentiable and non-decomposable loss function (e.g.…

Machine Learning · Computer Science 2019-05-27 Josif Grabocka , Randolf Scholz , Lars Schmidt-Thieme

Elucidating the theoretical underpinnings of surrogate gradient learning in spiking neural networks

Training spiking neural networks to approximate universal functions is essential for studying information processing in the brain and for neuromorphic computing. Yet the binary nature of spikes poses a challenge for direct gradient-based…

Neural and Evolutionary Computing · Computer Science 2024-11-19 Julia Gygax , Friedemann Zenke

Gradient Perturbation: Learning to Perturb Gradients for Adaptive Training

Deep neural network training involves both forward propagation (from features through logits to loss) and backward propagation (from loss through gradients to parameter updates). While perturbations along the forward chain, including…

Machine Learning · Computer Science 2026-05-29 Hua Li

Goal-oriented learning of stochastic dynamical systems using error bounds on path-space observables

The governing equations of stochastic dynamical systems often become cost-prohibitive for numerical simulation at large scales. Surrogate models of the governing equations, learned from data of the high-fidelity system, are routinely used…

Methodology · Statistics 2026-03-24 Joanna Zou , Han Cheng Lie , Youssef Marzouk

On Structured Prediction Theory with Calibrated Convex Surrogate Losses

We provide novel theoretical insights on structured prediction in the context of efficient convex surrogate loss minimization with consistency guarantees. For any task loss, we construct a convex surrogate that can be optimized via…

Machine Learning · Computer Science 2018-01-30 Anton Osokin , Francis Bach , Simon Lacoste-Julien

Guided evolutionary strategies: Augmenting random search with surrogate gradients

Many applications in machine learning require optimizing a function whose true gradient is unknown, but where surrogate gradient information (directions that may be correlated with, but not necessarily identical to, the true gradient) is…

Neural and Evolutionary Computing · Computer Science 2019-06-12 Niru Maheswaranathan , Luke Metz , George Tucker , Dami Choi , Jascha Sohl-Dickstein

Offline Model-Based Optimization via Policy-Guided Gradient Search

Offline optimization is an emerging problem in many experimental engineering domains including protein, drug or aircraft design, where online experimentation to collect evaluation data is too expensive or dangerous. To avoid that, one has…

Machine Learning · Computer Science 2024-05-10 Yassine Chemingui , Aryan Deshwal , Trong Nghia Hoang , Janardhan Rao Doppa

Decision-Focused Learning without Differentiable Optimization: Learning Locally Optimized Decision Losses

Decision-Focused Learning (DFL) is a paradigm for tailoring a predictive model to a downstream optimization task that uses its predictions in order to perform better on that specific task. The main technical challenge associated with DFL is…

Machine Learning · Computer Science 2022-11-10 Sanket Shah , Kai Wang , Bryan Wilder , Andrew Perrault , Milind Tambe

Laplacian Smoothing Gradient Descent

We propose a class of very simple modifications of gradient descent and stochastic gradient descent. We show that when applied to a large variety of machine learning problems, ranging from logistic regression to deep neural nets, the…

Machine Learning · Computer Science 2019-04-30 Stanley Osher , Bao Wang , Penghang Yin , Xiyang Luo , Farzin Barekat , Minh Pham , Alex Lin

Improving Gradient Estimation in Evolutionary Strategies With Past Descent Directions

Evolutionary Strategies (ES) are known to be an effective black-box optimization technique for deep neural networks when the true gradients cannot be computed, such as in Reinforcement Learning. We continue a recent line of research that…

Neural and Evolutionary Computing · Computer Science 2019-10-14 Florian Meier , Asier Mujika , Marcelo Matheus Gauy , Angelika Steger

Landscape Surrogate: Learning Decision Losses for Mathematical Optimization Under Partial Information

Recent works in learning-integrated optimization have shown promise in settings where the optimization problem is only partially observed or where general-purpose optimizers perform poorly without expert tuning. By learning an optimizer…

Machine Learning · Computer Science 2023-11-06 Arman Zharmagambetov , Brandon Amos , Aaron Ferber , Taoan Huang , Bistra Dilkina , Yuandong Tian