Related papers: Decision-Focused Learning with Directional Gradien…

200 papers

Optimization models used to make discrete decisions often contain uncertain parameters that are context-dependent and estimated through prediction. To account for the quality of the decision made based on the prediction, decision-focused…

Machine Learning · Computer Science 2024-07-30 Noah Schutte , Krzysztof Postek , Neil Yorke-Smith

Stochastic Gradient Descent (SGD) has played a central role in machine learning. However, it requires a carefully hand-picked stepsize for fast convergence, which is notoriously tedious and time-consuming to tune. Over the last several…

Machine Learning · Computer Science 2019-06-10 Zhenxun Zhuang , Ashok Cutkosky , Francesco Orabona

Many real-world decisions are made under uncertainty by solving optimization problems using predicted quantities. This predict-then-optimize paradigm has motivated decision-focused learning, which trains models with awareness of how the…

Machine Learning · Computer Science 2025-11-10 Paula Rodriguez-Diaz , Kirk Bansak , Elisabeth Paulson

Gradient-based optimization is now ubiquitous across graphics, but unfortunately cannot be applied to problems with undefined or zero gradients. To circumvent this issue, the loss function can be manually replaced by a "surrogate" that…

Computer Vision and Pattern Recognition · Computer Science 2024-05-08 Michael Fischer , Tobias Ritschel

Decision-focused learning (DFL) trains a machine learning (ML) model to predict parameters of an optimization problem, to directly minimize decision regret, i.e., maximize decision quality. Gradient-based DFL requires computing the…

Machine Learning · Computer Science 2025-08-26 Jayanta Mandi , Ali İrfan Mahmutoğulları , Senne Berden , Tias Guns

Decision-making under uncertainty is often considered in two stages: predicting the unknown parameters, and then optimizing decisions based on predictions. While traditional prediction-focused learning (PFL) treats these two stages…

Machine Learning · Computer Science 2025-09-11 Haeun Jeon , Hyunglip Bae , Chanyeong Kim , Yongjae Lee , Woo Chang Kim

This paper considers the decision-dependent optimization problem, where the data distributions react in response to decisions affecting both the objective function and linear constraints. We propose a new method termed repeated projected…

Optimization and Control · Mathematics 2025-08-13 Zifan Wang , Changxin Liu , Thomas Parisini , Michael M. Zavlanos , Karl H. Johansson

We study a family of algorithms, which we refer to as local update methods, that generalize many federated learning and meta-learning algorithms. We prove that for quadratic objectives, local update methods perform stochastic gradient…

Machine Learning · Computer Science 2020-07-03 Zachary Charles , Jakub Konečný

Structured prediction involves learning to predict complex structures rather than simple scalar values. The main challenge arises from the non-Euclidean nature of the output space, which generally requires relaxing the problem formulation.…

Machine Learning · Statistics 2024-11-19 Junjie Yang , Matthieu Labeau , Florence d'Alché-Buc

The minimization of loss functions is the heart and soul of Machine Learning. In this paper, we propose an off-the-shelf optimization approach that can minimize virtually any non-differentiable and non-decomposable loss function (e.g.…

Machine Learning · Computer Science 2019-05-27 Josif Grabocka , Randolf Scholz , Lars Schmidt-Thieme

Training spiking neural networks to approximate universal functions is essential for studying information processing in the brain and for neuromorphic computing. Yet the binary nature of spikes poses a challenge for direct gradient-based…

Neural and Evolutionary Computing · Computer Science 2024-11-19 Julia Gygax , Friedemann Zenke

Deep neural network training involves both forward propagation (from features through logits to loss) and backward propagation (from loss through gradients to parameter updates). While perturbations along the forward chain, including…

Machine Learning · Computer Science 2026-05-29 Hua Li

The governing equations of stochastic dynamical systems often become cost-prohibitive for numerical simulation at large scales. Surrogate models of the governing equations, learned from data of the high-fidelity system, are routinely used…

Methodology · Statistics 2026-03-24 Joanna Zou , Han Cheng Lie , Youssef Marzouk

We provide novel theoretical insights on structured prediction in the context of efficient convex surrogate loss minimization with consistency guarantees. For any task loss, we construct a convex surrogate that can be optimized via…

Machine Learning · Computer Science 2018-01-30 Anton Osokin , Francis Bach , Simon Lacoste-Julien

Many applications in machine learning require optimizing a function whose true gradient is unknown, but where surrogate gradient information (directions that may be correlated with, but not necessarily identical to, the true gradient) is…

Neural and Evolutionary Computing · Computer Science 2019-06-12 Niru Maheswaranathan , Luke Metz , George Tucker , Dami Choi , Jascha Sohl-Dickstein

Offline optimization is an emerging problem in many experimental engineering domains including protein, drug or aircraft design, where online experimentation to collect evaluation data is too expensive or dangerous. To avoid that, one has…

Machine Learning · Computer Science 2024-05-10 Yassine Chemingui , Aryan Deshwal , Trong Nghia Hoang , Janardhan Rao Doppa

Decision-Focused Learning (DFL) is a paradigm for tailoring a predictive model to a downstream optimization task that uses its predictions in order to perform better on that specific task. The main technical challenge associated with DFL is…

Machine Learning · Computer Science 2022-11-10 Sanket Shah , Kai Wang , Bryan Wilder , Andrew Perrault , Milind Tambe

We propose a class of very simple modifications of gradient descent and stochastic gradient descent. We show that when applied to a large variety of machine learning problems, ranging from logistic regression to deep neural nets, the…

Machine Learning · Computer Science 2019-04-30 Stanley Osher , Bao Wang , Penghang Yin , Xiyang Luo , Farzin Barekat , Minh Pham , Alex Lin

Evolutionary Strategies (ES) are known to be an effective black-box optimization technique for deep neural networks when the true gradients cannot be computed, such as in Reinforcement Learning. We continue a recent line of research that…

Neural and Evolutionary Computing · Computer Science 2019-10-14 Florian Meier , Asier Mujika , Marcelo Matheus Gauy , Angelika Steger

Recent works in learning-integrated optimization have shown promise in settings where the optimization problem is only partially observed or where general-purpose optimizers perform poorly without expert tuning. By learning an optimizer…

Machine Learning · Computer Science 2023-11-06 Arman Zharmagambetov , Brandon Amos , Aaron Ferber , Taoan Huang , Bistra Dilkina , Yuandong Tian