Related papers: Conservative set valued fields, automatic differen…

The structure of conservative gradient fields

The classical Clarke subdifferential alone is inadequate for understanding automatic differentiation in nonsmooth contexts. Instead, we can sometimes rely on enlarged generalized gradients called "conservative fields", defined through the…

Optimization and Control · Mathematics 2021-01-05 Adrian Lewis, Tonghua Tian

On the complexity of nonsmooth automatic differentiation

Using the notion of conservative gradient, we provide a simple model to estimate the computational costs of the backward and forward modes of algorithmic differentiation for a wide class of nonsmooth programs. The overhead complexity of the…

Numerical Analysis · Mathematics 2023-02-07 Jérôme Bolte, Ryan Boustany, Edouard Pauwels, Béatrice Pesquet-Popescu

Automatic differentiation of nonsmooth iterative algorithms

Differentiation along algorithms, i.e., piggyback propagation of derivatives, is now routinely used to differentiate iterative solvers in differentiable programming. Asymptotics is well understood for many smooth problems but the…

Optimization and Control · Mathematics 2022-06-02 Jérôme Bolte, Edouard Pauwels, Samuel Vaiter

Path differentiability of ODE flows

We consider flows of ordinary differential equations (ODEs) driven by path differentiable vector fields. Path differentiable functions constitute a proper subclass of Lipschitz functions which admit conservative gradients, a notion of…

Machine Learning · Computer Science 2022-01-12 Swann Marx, Edouard Pauwels

Nonsmooth Implicit Differentiation for Machine Learning and Optimization

In view of training increasingly complex learning architectures, we establish a nonsmooth implicit function theorem with an operational calculus. Our result applies to most practical problems (i.e., definable problems) provided that a…

Machine Learning · Computer Science 2022-04-06 Jérôme Bolte, Tam Le, Edouard Pauwels, Antonio Silveti-Falls

A mathematical model for automatic differentiation in machine learning

Automatic differentiation, as implemented today, does not have a simple mathematical model adapted to the needs of modern machine learning. In this work we articulate the relationships between differentiation of programs as implemented in…

Machine Learning · Computer Science 2020-10-30 Jerome Bolte, Edouard Pauwels

Computation of Generalized Derivatives for Abs-Smooth Functions by Backward Mode Algorithmic Differentiation and Implications to Deep Learning

Algorithmic differentiation (AD) tools allow to obtain gradient information of a continuously differentiable objective function in a computationally cheap way using the so-called backward mode. It is common practice to use the same tools…

Optimization and Control · Mathematics 2024-12-02 Lukas Baumgärtner, Franz Bethke

A Review of automatic differentiation and its efficient implementation

Derivatives play a critical role in computational statistics, examples being Bayesian inference using Hamiltonian Monte Carlo sampling and the training of neural networks. Automatic differentiation is a powerful tool to automate the…

Mathematical Software · Computer Science 2019-03-27 Charles C. Margossian

Subgradient sampling for nonsmooth nonconvex minimization

Risk minimization for nonsmooth nonconvex problems naturally leads to first-order sampling or, by an abuse of terminology, to stochastic subgradient descent. We establish the convergence of this method in the path-differentiable case and…

Optimization and Control · Mathematics 2024-07-24 Jérôme Bolte, Tam Le, Edouard Pauwels

A novel discrete variational derivative method using "average-difference methods"

We consider structure-preserving methods for conservative systems, which rigorously replicate the conservation property yielding better numerical solutions. There, corresponding to the skew-symmetry of the differential operator, that of…

Numerical Analysis · Mathematics 2016-07-19 Daisuke Furihata, Shun Sato, Takayasu Matsuo

Derivative-free discrete gradient methods

Discrete gradient methods are a class of numerical integrators producing solutions with exact preservation of first integrals of ordinary differential equations. In this paper, we apply order theory combined with the symmetrized Itoh–Abe…

Numerical Analysis · Mathematics 2026-01-13 Håkon Noren Myhr, Sølve Eidnes

Understanding Automatic Differentiation Pitfalls

Automatic differentiation, also known as backpropagation, AD, autodiff, or algorithmic differentiation, is a popular technique for computing derivatives of computer programs accurately and efficiently. Sometimes, however, the derivatives…

Numerical Analysis · Mathematics 2023-05-15 Jan Hückelheim, Harshitha Menon, William Moses, Bruce Christianson, Paul Hovland, Laurent Hascoët

First-order variational analysis of non-amenable composite functions

This paper is devoted to studying the first-order variational analysis of non-convex and non-differentiable functions that may not be subdifferentially regular. To achieve this goal, we entirely rely on two concepts of directional…

Optimization and Control · Mathematics 2022-04-22 Ashkan Mohammadi

Drifting Fields are not Conservative

Drifting models have recently gained attention for generating high-quality samples in a single forward pass. During training, they learn a push-forward map by following a vector-valued field, the drift field. We ask whether this procedure…

Machine Learning · Computer Science 2026-05-11 Leonard T. Franz, Sebastian Hoffmann, Tim Weiland, Bernhard Schölkopf, Georg Martius

Storchastic: A Framework for General Stochastic Automatic Differentiation

Modelers use automatic differentiation (AD) of computation graphs to implement complex Deep Learning models without defining gradient computations. Stochastic AD extends AD to stochastic computation graphs with sampling steps, which arise…

Machine Learning · Statistics 2021-10-27 Emile van Krieken, Jakub M. Tomczak, Annette ten Teije

Set-valued differentiation as an operator

We introduce real vector spaces composed of set-valued maps on an open set. They are also complete metric spaces, lattices, commutative rings. The set of differentiable functions is a dense subset of these spaces and the classical gradient…

Optimization and Control · Mathematics 2007-05-23 Serguei Samborski

On Correctness of Automatic Differentiation for Non-Differentiable Functions

Differentiation lies at the core of many machine-learning algorithms, and is well-supported by popular autodiff systems, such as TensorFlow and PyTorch. Originally, these systems have been developed to compute derivatives of differentiable…

Machine Learning · Computer Science 2020-10-27 Wonyeol Lee, Hangyeol Yu, Xavier Rival, Hongseok Yang

Iterated Vector Fields and Conservatism, with Applications to Federated Learning

We study whether iterated vector fields (vector fields composed with themselves) are conservative. We give explicit examples of vector fields for which this self-composition preserves conservatism. Notably, this includes gradient vector…

Optimization and Control · Mathematics 2021-11-16 Zachary Charles, Keith Rush

Gradients without Backpropagation

Using backpropagation to compute gradients of objective functions for optimization has remained a mainstay of machine learning. Backpropagation, or reverse-mode differentiation, is a special case within the general family of automatic…

Machine Learning · Computer Science 2022-02-18 Atılım Güneş Baydin, Barak A. Pearlmutter, Don Syme, Frank Wood, Philip Torr

Gradients on Sets

For a locally Lipschitz continuous function $f:X\to\mathbb{R}$ the generalized gradient $\partial f(x)$ of Clarke is used to develop some (set-valued) gradient on a set $A\subset X$. Existence, uniqueness and some approximation are…

Optimization and Control · Mathematics 2018-03-19 Jan Mankau, Friedemann Schuricht