Related papers: Speed Limits for Deep Learning

Training Energy-Efficient Deep Spiking Neural Networks with Time-to-First-Spike Coding

The tremendous energy consumption of deep neural networks (DNNs) has become a serious problem in deep learning. Spiking neural networks (SNNs), which mimic the operations in the human brain, have been studied as prominent energy-efficient…

Neural and Evolutionary Computing · Computer Science 2021-06-07 Seongsik Park , Sungroh Yoon

T2FSNN: Deep Spiking Neural Networks with Time-to-first-spike Coding

Spiking neural networks (SNNs) have gained considerable interest due to their energy-efficient characteristics, yet lack of a scalable training algorithm has restricted their applicability in practical machine learning problems. The deep…

Neural and Evolutionary Computing · Computer Science 2020-03-27 Seongsik Park , Seijoon Kim , Byunggook Na , Sungroh Yoon

When do Convolutional Neural Networks Stop Learning?

Convolutional Neural Networks (CNNs) have demonstrated outstanding performance in computer vision tasks such as image classification, detection, segmentation, and medical image analysis. In general, an arbitrary number of epochs is used to…

Computer Vision and Pattern Recognition · Computer Science 2024-03-06 Sahan Ahmad , Gabriel Trahan , Aminul Islam

Efficiently Training Time-to-First-Spike Spiking Neural Networks from Scratch

Spiking Neural Networks (SNNs), with their event-driven and biologically inspired operation, are well-suited for energy-efficient neuromorphic hardware. Neural coding, critical to SNNs, determines how information is represented via spikes.…

Neural and Evolutionary Computing · Computer Science 2025-03-11 Kaiwei Che , Wei Fang , Zhengyu Ma , Yifan Huang , Peng Xue , Li Yuan , Timothée Masquelier , Yonghong Tian

Efficiently Training Low-Curvature Neural Networks

The highly non-linear nature of deep neural networks causes them to be susceptible to adversarial examples and have unstable gradients which hinders interpretability. However, existing methods to solve these issues, such as adversarial…

Machine Learning · Computer Science 2023-01-11 Suraj Srinivas , Kyle Matoba , Himabindu Lakkaraju , Francois Fleuret

To Spike or Not to Spike, that is the Question

Neuromorphic computing has recently gained momentum with the emergence of various neuromorphic processors. As the field advances, there is an increasing focus on developing training methods that can effectively leverage the unique…

Emerging Technologies · Computer Science 2025-04-15 Sanaz Mahmoodi Takaghaj , Jack Sampson

Fast Recurrent Fully Convolutional Networks for Direct Perception in Autonomous Driving

Deep convolutional neural networks (CNNs) have been shown to perform extremely well at a variety of tasks including subtasks of autonomous driving such as image segmentation and object classification. However, networks designed for these…

Computer Vision and Pattern Recognition · Computer Science 2017-11-21 Yiqi Hou , Sascha Hornauer , Karl Zipser

Fast Training of Deep Neural Networks Robust to Adversarial Perturbations

Deep neural networks are capable of training fast and generalizing well within many domains. Despite their promising performance, deep networks have shown sensitivities to perturbations of their inputs (e.g., adversarial examples) and their…

Machine Learning · Computer Science 2020-07-09 Justin Goodwin , Olivia Brown , Victoria Helus

NL-CNN: A Resources-Constrained Deep Learning Model based on Nonlinear Convolution

A novel convolution neural network model, abbreviated NL-CNN is proposed, where nonlinear convolution is emulated in a cascade of convolution + nonlinearity layers. The code for its implementation and some trained models are made publicly…

Machine Learning · Computer Science 2021-02-03 Radu Dogaru , Ioana Dogaru

Convolutional Networks for Fast, Energy-Efficient Neuromorphic Computing

Deep networks are now able to achieve human-level performance on a broad spectrum of recognition tasks. Independently, neuromorphic computing has now demonstrated unprecedented energy-efficiency through a new chip architecture based on…

Neural and Evolutionary Computing · Computer Science 2016-10-13 Steven K. Esser , Paul A. Merolla , John V. Arthur , Andrew S. Cassidy , Rathinakumar Appuswamy , Alexander Andreopoulos , David J. Berg , Jeffrey L. McKinstry , Timothy Melano , Davis R. Barch , Carmelo di Nolfo , Pallab Datta , Arnon Amir , Brian Taba , Myron D. Flickner , Dharmendra S. Modha

Designing Interpretable Approximations to Deep Reinforcement Learning

In an ever expanding set of research and application areas, deep neural networks (DNNs) set the bar for algorithm performance. However, depending upon additional constraints such as processing power and execution time limits, or…

Machine Learning · Computer Science 2021-06-22 Nathan Dahlin , Krishna Chaitanya Kalagarla , Nikhil Naik , Rahul Jain , Pierluigi Nuzzo

TCT: Convexifying Federated Learning using Bootstrapped Neural Tangent Kernels

State-of-the-art federated learning methods can perform far worse than their centralized counterparts when clients have dissimilar data distributions. For neural networks, even when centralized SGD easily finds a solution that is…

Machine Learning · Computer Science 2022-10-06 Yaodong Yu , Alexander Wei , Sai Praneeth Karimireddy , Yi Ma , Michael I. Jordan

Physical learning beyond the quasistatic limit

Physical networks, such as biological neural networks, can learn desired functions without a central processor, using local learning rules in space and time to learn in a fully distributed manner. Learning approaches such as equilibrium…

Disordered Systems and Neural Networks · Physics 2022-05-17 Menachem Stern , Sam Dillavou , Marc Z. Miskin , Douglas J. Durian , Andrea J. Liu

Flatter, faster: scaling momentum for optimal speedup of SGD

Commonly used optimization algorithms often show a trade-off between good generalization and fast training times. For instance, stochastic gradient descent (SGD) tends to have good generalization; however, adaptive gradient methods have…

Machine Learning · Computer Science 2023-06-14 Aditya Cowsik , Tankut Can , Paolo Glorioso

Training NTK to Generalize with KARE

The performance of the data-dependent neural tangent kernel (NTK; Jacot et al. (2018)) associated with a trained deep neural network (DNN) often matches or exceeds that of the full network. This implies that DNN training via gradient…

Machine Learning · Computer Science 2025-05-22 Johannes Schwab , Bryan Kelly , Semyon Malamud , Teng Andrea Xu

Low Rank Optimization for Efficient Deep Learning: Making A Balance between Compact Architecture and Fast Training

Deep neural networks have achieved great success in many data processing applications. However, the high computational complexity and storage cost makes deep learning hard to be used on resource-constrained devices, and it is not…

Machine Learning · Computer Science 2023-03-27 Xinwei Ou , Zhangxin Chen , Ce Zhu , Yipeng Liu

Neural (Tangent Kernel) Collapse

This work bridges two important concepts: the Neural Tangent Kernel (NTK), which captures the evolution of deep neural networks (DNNs) during training, and the Neural Collapse (NC) phenomenon, which refers to the emergence of symmetry and…

Machine Learning · Computer Science 2023-11-07 Mariia Seleznova , Dana Weitzner , Raja Giryes , Gitta Kutyniok , Hung-Hsu Chou

Optimal Convergence Rates for Neural Operators

We introduce the neural tangent kernel (NTK) regime for two-layer neural operators and analyze their generalization properties. For early-stopped gradient descent (GD), we derive fast convergence rates that are known to be minimax optimal…

Machine Learning · Statistics 2024-12-24 Mike Nguyen , Nicole Mücke

On the rates of convergence for learning with convolutional neural networks

We study approximation and learning capacities of convolutional neural networks (CNNs) with one-side zero-padding and multiple channels. Our first result proves a new approximation bound for CNNs with certain constraint on the weights. Our…

Machine Learning · Computer Science 2025-07-29 Yunfei Yang , Han Feng , Ding-Xuan Zhou

Finite-Width Neural Tangent Kernels from Feynman Diagrams

Neural tangent kernels (NTKs) are a powerful tool for analyzing deep, non-linear neural networks. In the infinite-width limit, NTKs can easily be computed for most common architectures, yielding full analytic control over the training…

Machine Learning · Computer Science 2026-02-16 Max Guillen , Philipp Misof , Jan E. Gerken