Related papers: Learning in Integer Latent Variable Models with Ne…

Randomized Automatic Differentiation

The successes of deep learning, variational inference, and many other fields have been aided by specialized implementations of reverse-mode automatic differentiation (AD) to compute gradients of mega-dimensional objectives. The AD…

Machine Learning · Computer Science 2021-03-16 Deniz Oktay , Nick McGreivy , Joshua Aduol , Alex Beatson , Ryan P. Adams

A Brief Introduction to Automatic Differentiation for Machine Learning

Machine learning and neural network models in particular have been improving the state of the art performance on many artificial intelligence related tasks. Neural network models are typically implemented using frameworks that perform…

Machine Learning · Computer Science 2021-10-18 Davan Harrison

Learning Hidden Dynamics using Intelligent Automatic Differentiation

Many engineering problems involve learning hidden dynamics from indirect observations, where the physical processes are described by systems of partial differential equations (PDE). Gradient-based optimization methods are considered…

Numerical Analysis · Mathematics 2019-12-17 Kailai Xu , Dongzhuo Li , Eric Darve , Jerry M. Harris

Demystifying Differentiable Programming: Shift/Reset the Penultimate Backpropagator

Deep learning has seen tremendous success over the past decade in computer vision, machine translation, and gameplay. This success rests in crucial ways on gradient-descent optimization and the ability to learn parameters of a neural…

Machine Learning · Computer Science 2019-08-30 Fei Wang , Daniel Zheng , James Decker , Xilun Wu , Grégory M. Essertel , Tiark Rompf

Accelerated Inference for Partially Observed Markov Processes using Automatic Differentiation

Automatic differentiation (AD) has driven recent advances in machine learning, including deep neural networks and Hamiltonian Markov Chain Monte Carlo methods. Partially observed nonlinear stochastic dynamical systems have proved resistant…

Methodology · Statistics 2024-07-04 Kevin Tan , Giles Hooker , Edward L. Ionides

Automatic Differentiation is Essential in Training Neural Networks for Solving Differential Equations

Neural network-based approaches have recently shown significant promise in solving partial differential equations (PDEs) in science and engineering, especially in scenarios featuring complex domains or incorporation of empirical data. One…

Machine Learning · Computer Science 2025-03-19 Chuqi Chen , Yahong Yang , Yang Xiang , Wenrui Hao

Automating Steady and Unsteady Adjoints: Efficiently Utilizing Implicit and Algorithmic Differentiation

Algorithmic differentiation (AD) has become increasingly capable and straightforward to use. However, AD is inefficient when applied directly to solvers, a feature of most engineering analyses. We can leverage implicit differentiation to…

Optimization and Control · Mathematics 2023-06-28 Andrew Ning , Taylor McDonnell

Automatic Relevance Determination For Deep Generative Models

A recurring problem when building probabilistic latent variable models is regularization and model selection, for instance, the choice of the dimensionality of the latent space. In the context of belief networks with latent variables, this…

Machine Learning · Statistics 2015-08-27 Theofanis Karaletsos , Gunnar Rätsch

Peering Beyond the Gradient Veil with Distributed Auto Differentiation

Although distributed machine learning has opened up many new and exciting research frontiers, fragmentation of models and data across different machines, nodes, and sites still results in considerable communication overhead, impeding…

Machine Learning · Computer Science 2022-02-04 Bradley T. Baker , Aashis Khanal , Vince D. Calhoun , Barak Pearlmutter , Sergey M. Plis

Differentiable Agent-Based Simulation for Gradient-Guided Simulation-Based Optimization

Simulation-based optimization using agent-based models is typically carried out under the assumption that the gradient describing the sensitivity of the simulation output to the input cannot be evaluated directly. To still apply…

Machine Learning · Computer Science 2021-03-24 Philipp Andelfinger

Nested Learning: The Illusion of Deep Learning Architectures

Despite the recent progresses, particularly in developing Language Models, there are fundamental challenges and unanswered questions about how such models can continually learn/memorize, self-improve, and find effective solutions. In this…

Machine Learning · Computer Science 2026-01-01 Ali Behrouz , Meisam Razaviyayn , Peilin Zhong , Vahab Mirrokni

RangeAD: Fast On-Model Anomaly Detection

In practice, machine learning methods commonly require anomaly detection (AD) to filter inputs or detect distributional shifts. Typically, this is implemented by running a separate AD model alongside the primary model. However, this…

Machine Learning · Computer Science 2026-03-19 Luca Hinkamp , Simon Klüttermann , Emmanuel Müller

AD for an Array Language with Nested Parallelism

We present a technique for applying (forward and) reverse-mode automatic differentiation (AD) on a non-recursive second-order functional array language that supports nested parallelism and is primarily aimed at efficient GPU execution. The…

Programming Languages · Computer Science 2022-02-22 Robert Schenck , Ola Rønning , Troels Henriksen , Cosmin E. Oancea

The simple essence of automatic differentiation

Automatic differentiation (AD) in reverse mode (RAD) is a central component of deep learning and other uses of large-scale optimization. Commonly used RAD algorithms such as backpropagation, however, are complex and stateful, hindering deep…

Programming Languages · Computer Science 2018-10-03 Conal Elliott

Stable and efficient differentiation of tensor network algorithms

Gradient based optimization methods are the established state-of-the-art paradigm to study strongly entangled quantum systems in two dimensions with Projected Entangled Pair States. However, the key ingredient, the gradient itself, has…

Quantum Physics · Physics 2025-04-15 Anna Francuz , Norbert Schuch , Bram Vanhecke

Tricks from Deep Learning

The deep learning community has devised a diverse set of methods to make gradient optimization, using large datasets, of large and highly complex models with deeply cascaded nonlinearities, practical. Taken as a whole, these methods…

Machine Learning · Computer Science 2016-11-14 Atılım Güneş Baydin , Barak A. Pearlmutter , Jeffrey Mark Siskind

Fixed-Point Automatic Differentiation of Forward--Backward Splitting Algorithms for Partly Smooth Functions

A large class of non-smooth practical optimization problems can be written as minimization of a sum of smooth and partly smooth functions. We examine such structured problems which also depend on a parameter vector and study the problem of…

Optimization and Control · Mathematics 2024-10-28 Sheheryar Mehmood , Peter Ochs

Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training

An activation function is an element-wise mathematical function and plays a crucial role in deep neural networks (DNN). Many novel and sophisticated activation functions have been proposed to improve the DNN accuracy but also consume…

Machine Learning · Computer Science 2022-09-23 Cong Guo , Yuxian Qiu , Jingwen Leng , Chen Zhang , Ying Cao , Quanlu Zhang , Yunxin Liu , Fan Yang , Minyi Guo

You Only Linearize Once: Tangents Transpose to Gradients

Automatic differentiation (AD) is conventionally understood as a family of distinct algorithms, rooted in two "modes" -- forward and reverse -- which are typically presented (and implemented) separately. Can there be only one? Following up…

Programming Languages · Computer Science 2022-12-07 Alexey Radul , Adam Paszke , Roy Frostig , Matthew Johnson , Dougal Maclaurin

Efficient and Sound Differentiable Programming in a Functional Array-Processing Language

Automatic differentiation (AD) is a technique for computing the derivative of a function represented by a program. This technique is considered as the de-facto standard for computing the differentiation in many machine learning and…

Programming Languages · Computer Science 2022-12-21 Amir Shaikhha , Mathieu Huot , Shabnam Ghasemirad , Andrew Fitzgibbon , Simon Peyton Jones , Dimitrios Vytiniotis