Related papers: Optimizing Automatic Differentiation with Deep Rei…

Relative gradient optimization of the Jacobian term in unsupervised deep learning

Learning expressive probabilistic models correctly describing the data is a ubiquitous problem in machine learning. A popular approach for solving it is mapping the observations into a representation space with a simple joint distribution,…

Machine Learning · Statistics 2020-10-28 Luigi Gresele , Giancarlo Fissore , Adrián Javaloy , Bernhard Schölkopf , Aapo Hyvärinen

The Curse of Unrolling: Rate of Differentiating Through Optimization

Computing the Jacobian of the solution of an optimization problem is a central problem in machine learning, with applications in hyperparameter optimization, meta-learning, optimization as a layer, and dataset distillation, to name a few.…

Optimization and Control · Mathematics 2023-08-28 Damien Scieur , Quentin Bertrand , Gauthier Gidel , Fabian Pedregosa

Jacobian Descent for Multi-Objective Optimization

Many optimization problems require balancing multiple conflicting objectives. As gradient descent is limited to single-objective optimization, we introduce its direct generalization: Jacobian descent (JD). This algorithm iteratively updates…

Machine Learning · Computer Science 2025-02-04 Pierre Quinton , Valérian Rey

Computing Derivatives for PETSc Adjoint Solvers using Algorithmic Differentiation

Most nonlinear partial differential equation (PDE) solvers require the Jacobian matrix associated to the differential operator. In PETSc, this is typically achieved by either an analytic derivation or numerical approximation method such as…

Mathematical Software · Computer Science 2019-09-09 J. G. Wallwork , P. Hovland , H. Zhang , O. Marin

One-step differentiation of iterative algorithms

In appropriate frameworks, automatic differentiation is transparent to the user at the cost of being a significant computational burden when the number of operations is large. For iterative algorithms, implicit differentiation alleviates…

Optimization and Control · Mathematics 2023-05-24 Jérôme Bolte , Edouard Pauwels , Samuel Vaiter

Continuous-time reinforcement learning for optimal switching over multiple regimes

This paper studies the continuous-time reinforcement learning (RL) for optimal switching problems across multiple regimes. We consider a type of exploratory formulation under entropy regularization where the agent randomizes both the timing…

Optimization and Control · Mathematics 2025-12-23 Yijie Huang , Mengge Li , Xiang Yu , Zhou Zhou

Breaking the Computational Barrier: Provably Efficient Actor-Critic for Low-Rank MDPs

Reinforcement learning (RL) is a fundamental framework for sequential decision-making, in which an agent learns an optimal policy through interactions with an unknown environment. In settings with function approximation, many existing RL…

Machine Learning · Computer Science 2026-05-05 Ruiquan Huang , Donghao Li , Yingbin Liang , Jing Yang

Exploratory Combinatorial Optimization with Reinforcement Learning

Many real-world problems can be reduced to combinatorial optimization on a graph, where the subset or ordering of vertices that maximize some objective function must be found. With such tasks often NP-hard and analytically intractable,…

Machine Learning · Computer Science 2021-03-22 Thomas D. Barrett , William R. Clements , Jakob N. Foerster , A. I. Lvovsky

Improvements to Jacobian Arithmetic in Global Function Fields

We present two improvements to arithmetic in the Jacobian of global function fields based on the approach of Hess. The first reduces the number of expensive reduction steps by optimizing for typical inputs rather than worst-case behavior,…

Number Theory · Mathematics 2026-05-18 Vincent Macri , Michael Jacobson , Renate Scheidler

Jacobian Exploratory Dual-Phase Reinforcement Learning for Dynamic Endoluminal Navigation of Deformable Continuum Robots

Deformable continuum robots (DCRs) present unique planning challenges due to nonlinear deformation mechanics and partial state observability, violating the Markov assumptions of conventional reinforcement learning (RL) methods. While…

Robotics · Computer Science 2025-09-03 Yu Tian , Chi Kit Ng , Hongliang Ren

A Review of automatic differentiation and its efficient implementation

Derivatives play a critical role in computational statistics, examples being Bayesian inference using Hamiltonian Monte Carlo sampling and the training of neural networks. Automatic differentiation is a powerful tool to automate the…

Mathematical Software · Computer Science 2019-03-27 Charles C. Margossian

Achieving Sample and Computational Efficient Reinforcement Learning by Action Space Reduction via Grouping

Reinforcement learning often needs to deal with the exponential growth of states and actions when exploring optimal control in high-dimensional spaces (often known as the curse of dimensionality). In this work, we address this issue by…

Machine Learning · Computer Science 2023-06-23 Yining Li , Peizhong Ju , Ness Shroff

Automatic differentiation and the optimization of differential equation models in biology

A computational revolution unleashed the power of artificial neural networks. At the heart of that revolution is automatic differentiation, which calculates the derivative of a performance measure relative to a large number of parameters.…

Quantitative Methods · Quantitative Biology 2023-12-27 Steven A. Frank

Fractional-order Jacobian Matrix Differentiation and Its Application in Artificial Neural Networks

Fractional-order differentiation has many characteristics different from integer-order differentiation. These characteristics can be applied to the optimization algorithms of artificial neural networks to obtain better results. However, due…

Machine Learning · Computer Science 2025-06-10 Xiaojun zhou , Chunna Zhao , Yaqun Huang , Chengli Zhou , Junjie Ye , Kemeng Xiang

Deep Reinforcement Learning for Exact Combinatorial Optimization: Learning to Branch

Branch-and-bound is a systematic enumerative method for combinatorial optimization, where the performance highly relies on the variable selection strategy. State-of-the-art handcrafted heuristic strategies suffer from relatively slow…

Machine Learning · Computer Science 2022-06-15 Tianyu Zhang , Amin Banitalebi-Dehkordi , Yong Zhang

Bridging Optimal Transport and Jacobian Regularization by Optimal Trajectory for Enhanced Adversarial Defense

Deep neural networks, particularly in vision tasks, are notably susceptible to adversarial perturbations. To overcome this challenge, developing a robust classifier is crucial. In light of the recent advancements in the robustness of…

Computer Vision and Pattern Recognition · Computer Science 2024-02-14 Binh M. Le , Shahroz Tariq , Simon S. Woo

Differentiable Spline Approximations

The paradigm of differentiable programming has significantly enhanced the scope of machine learning via the judicious use of gradient-based optimization. However, standard differentiable programming methods (such as autodiff) typically…

Machine Learning · Computer Science 2021-10-05 Minsu Cho , Aditya Balu , Ameya Joshi , Anjana Deva Prasad , Biswajit Khara , Soumik Sarkar , Baskar Ganapathysubramanian , Adarsh Krishnamurthy , Chinmay Hegde

Understanding the Curse of Unrolling

Algorithm unrolling is ubiquitous in machine learning, particularly in hyperparameter optimization and meta-learning, where Jacobians of solution mappings are computed by differentiating through iterative algorithms. Although unrolling is…

Machine Learning · Computer Science 2026-02-24 Sheheryar Mehmood , Florian Knoll , Peter Ochs

Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution

Optimal execution is a sequential decision-making problem for cost-saving in algorithmic trading. Studies have found that reinforcement learning (RL) can help decide the order-splitting sizes. However, a problem remains unsolved: how to…

Trading and Market Microstructure · Quantitative Finance 2022-07-25 Feiyang Pan , Tongzhe Zhang , Ling Luo , Jia He , Shuoling Liu

Jacobian-free Efficient Pseudo-Likelihood (EPL) Algorithm

This study proposes a simple procedure to compute Efficient Pseudo Likelihood (EPL) estimator proposed by Dearing and Blevins (2024) for estimating dynamic discrete games, without computing Jacobians of equilibrium constraints. EPL…

Econometrics · Economics 2024-10-29 Takeshi Fukasawa