Related papers: Efficient optimization of ODE neuron models using …

Gradient Descent for Spiking Neural Networks

Much of studies on neural computation are based on network models of static neurons that produce analog output, despite the fact that information processing in the brain is predominantly carried out by dynamic neurons that produce discrete…

Neurons and Cognition · Quantitative Biology 2017-06-21 Dongsung Huh , Terrence J. Sejnowski

Gradient-based optimization of spintronic devices

The optimization of physical parameters serves various purposes, such as system identification and efficiency in developing devices. Spin-torque oscillators have been applied to neuromorphic computing experimentally and theoretically, but…

Mesoscale and Nanoscale Physics · Physics 2024-09-17 Yusuke Imai , Shuhong Liu , Nozomi Akashi , Kohei Nakajima

Multi-scale Neural ODEs for 3D Medical Image Registration

Image registration plays an important role in medical image analysis. Conventional optimization based methods provide an accurate estimation due to the iterative process at the cost of expensive computation. Deep learning methods such as…

Computer Vision and Pattern Recognition · Computer Science 2021-06-21 Junshen Xu , Eric Z. Chen , Xiao Chen , Terrence Chen , Shanhui Sun

Gradient Descent based Optimization Algorithms for Deep Learning Models Training

In this paper, we aim at providing an introduction to the gradient descent based optimization algorithms for learning deep neural network models. Deep learning models involving multiple nonlinear projection layers are very challenging to…

Machine Learning · Computer Science 2019-03-12 Jiawei Zhang

A memory-efficient neural ODE framework based on high-level adjoint differentiation

Neural ordinary differential equations (neural ODEs) have emerged as a novel network architecture that bridges dynamical systems and deep learning. However, the gradient obtained with the continuous adjoint method in the vanilla neural ODE…

Machine Learning · Computer Science 2023-06-12 Hong Zhang , Wenjun Zhao

On Tuning Neural ODE for Stability, Consistency and Faster Convergence

Neural-ODE parameterize a differential equation using continuous depth neural network and solve it using numerical ODE-integrator. These models offer a constant memory cost compared to models with discrete sequence of hidden layers in which…

Machine Learning · Computer Science 2025-03-27 Sheikh Waqas Akhtar

Distributional Gradient Matching for Learning Uncertain Neural Dynamics Models

Differential equations in general and neural ODEs in particular are an essential technique in continuous-time system identification. While many deterministic learning algorithms have been designed based on numerical integration via the…

Machine Learning · Computer Science 2021-10-18 Lenart Treven , Philippe Wenk , Florian Dörfler , Andreas Krause

Exploring the Optimized Value of Each Hyperparameter in Various Gradient Descent Algorithms

In the recent years, various gradient descent algorithms including the methods of gradient descent, gradient descent with momentum, adaptive gradient (AdaGrad), root-mean-square propagation (RMSProp) and adaptive moment estimation (Adam)…

Machine Learning · Computer Science 2024-09-19 Abel C. H. Chen

Identification of Dynamical Systems using Symbolic Regression

We describe a method for the identification of models for dynamical systems from observational data. The method is based on the concept of symbolic regression and uses genetic programming to evolve a system of ordinary differential…

Machine Learning · Computer Science 2021-07-14 Gabriel Kronberger , Lukas Kammerer , Michael Kommenda

Rethinking Optimization with Differentiable Simulation from a Global Perspective

Differentiable simulation is a promising toolkit for fast gradient-based policy optimization and system identification. However, existing approaches to differentiable simulation have largely tackled scenarios where obtaining smooth…

Machine Learning · Statistics 2022-07-04 Rika Antonova , Jingyun Yang , Krishna Murthy Jatavallabhula , Jeannette Bohg

Improving the Efficiency of Gradient Descent Algorithms Applied to Optimization Problems with Dynamical Constraints

We introduce two block coordinate descent algorithms for solving optimization problems with ordinary differential equations (ODEs) as dynamical constraints. The algorithms do not need to implement direct or adjoint sensitivity analysis…

Machine Learning · Computer Science 2022-08-30 Ion Matei , Maksym Zhenirovskyy , Johan de Kleer , John Maxwell

A Comprehensive Study on Optimization Strategies for Gradient Descent In Deep Learning

One of the most important parts of Artificial Neural Networks is minimizing the loss functions which tells us how good or bad our model is. To minimize these losses we need to tune the weights and biases. Also to calculate the minimum value…

Machine Learning · Computer Science 2021-01-08 Kaustubh Yadav

Variational Neural Networks: Every Layer and Neuron Can Be Unique

The choice of activation function can significantly influence the performance of neural networks. The lack of guiding principles for the selection of activation function is lamentable. We try to address this issue by introducing our…

Machine Learning · Computer Science 2018-10-16 Yiwei Li , Enzhi Li

Optimizing ML Training with Metagradient Descent

A major challenge in training large-scale machine learning models is configuring the training process to maximize model performance, i.e., finding the best training setup from a vast design space. In this work, we unlock a gradient-based…

Machine Learning · Statistics 2025-03-19 Logan Engstrom , Andrew Ilyas , Benjamin Chen , Axel Feldmann , William Moses , Aleksander Madry

ANODE: Unconditionally Accurate Memory-Efficient Gradients for Neural ODEs

Residual neural networks can be viewed as the forward Euler discretization of an Ordinary Differential Equation (ODE) with a unit time step. This has recently motivated researchers to explore other discretization approaches and train ODE…

Machine Learning · Computer Science 2019-07-02 Amir Gholami , Kurt Keutzer , George Biros

Continuous vs. Discrete Optimization of Deep Neural Networks

Existing analyses of optimization in deep learning are either continuous, focusing on (variants of) gradient flow, or discrete, directly treating (variants of) gradient descent. Gradient flow is amenable to theoretical analysis, but is…

Machine Learning · Computer Science 2021-12-30 Omer Elkabetz , Nadav Cohen

Efficient Search for Customized Activation Functions with Gradient Descent

Different activation functions work best for different deep learning models. To exploit this, we leverage recent advancements in gradient-based search techniques for neural architectures to efficiently identify high-performing activation…

Machine Learning · Computer Science 2024-08-14 Lukas Strack , Mahmoud Safari , Frank Hutter

Efficient Differentiable Simulation of Articulated Bodies

We present a method for efficient differentiable simulation of articulated bodies. This enables integration of articulated body dynamics into deep learning frameworks, and gradient-based optimization of neural networks that operate on…

Machine Learning · Computer Science 2021-09-17 Yi-Ling Qiao , Junbang Liang , Vladlen Koltun , Ming C. Lin

Towards Guided Descent: Optimization Algorithms for Training Neural Networks At Scale

Neural network optimization remains one of the most consequential yet poorly understood challenges in modern AI research, where improvements in training algorithms can lead to enhanced feature learning in foundation models,…

Machine Learning · Computer Science 2025-12-23 Ansh Nagwekar

Accelerating PDE-Constrained Optimization by the Derivative of Neural Operators

PDE-Constrained Optimization (PDECO) problems can be accelerated significantly by employing gradient-based methods with surrogate models like neural operators compared to traditional numerical solvers. However, this approach faces two key…

Machine Learning · Computer Science 2025-06-17 Ze Cheng , Zhuoyu Li , Xiaoqiang Wang , Jianing Huang , Zhizhou Zhang , Zhongkai Hao , Hang Su