Related papers: Stochastic Adaptive Activation Function

Learning Activation Functions: A new paradigm for understanding Neural Networks

The scope of research in the domain of activation functions remains limited and centered around improving the ease of optimization or generalization quality of neural networks (NNs). However, to develop a deeper understanding of deep…

Machine Learning · Computer Science 2020-12-10 Mohit Goyal , Rajan Goyal , Brejesh Lall

Understanding Activation Patterns in Artificial Neural Networks by Exploring Stochastic Processes

To gain a deeper understanding of the behavior and learning dynamics of (deep) artificial neural networks, it is valuable to employ mathematical abstractions and models. These tools provide a simplified perspective on network performance…

Machine Learning · Computer Science 2023-08-03 Stephan Johann Lehmler , Muhammad Saif-ur-Rehman , Tobias Glasmachers , Ioannis Iossifidis

A Non-monotonic Smooth Activation Function

Activation functions are crucial in deep learning models since they introduce non-linearity into the networks, allowing them to learn from errors and make adjustments, which is essential for learning complex patterns. The essential purpose…

Machine Learning · Computer Science 2023-10-17 Koushik Biswas , Meghana Karri , Ulaş Bağcı

APTx: better activation function than MISH, SWISH, and ReLU's variants used in deep learning

Activation Functions introduce non-linearity in the deep neural networks. This nonlinearity helps the neural networks learn faster and efficiently from the dataset. In deep learning, many activation functions are developed and used based on…

Machine Learning · Computer Science 2025-09-29 Ravin Kumar

A survey on recently proposed activation functions for Deep Learning

Artificial neural networks (ANN), typically referred to as neural networks, are a class of Machine Learning algorithms and have achieved widespread success, having been inspired by the biological structure of the human brain. Neural…

Machine Learning · Computer Science 2022-04-08 Murilo Gustineli

Adaptively Customizing Activation Functions for Various Layers

To enhance the nonlinearity of neural networks and increase their mapping abilities between the inputs and response variables, activation functions play a crucial role to model more complex relationships and patterns in the data. In this…

Computer Vision and Pattern Recognition · Computer Science 2021-12-20 Haigen Hu , Aizhu Liu , Qiu Guan , Xiaoxin Li , Shengyong Chen , Qianwei Zhou

Activate or Not: Learning Customized Activation

We present a simple, effective, and general activation function we term ACON which learns to activate the neurons or not. Interestingly, we find Swish, the recent popular NAS-searched activation, can be interpreted as a smooth approximation…

Computer Vision and Pattern Recognition · Computer Science 2021-04-19 Ningning Ma , Xiangyu Zhang , Ming Liu , Jian Sun

The Quest for the Golden Activation Function

Deep Neural Networks have been shown to be beneficial for a variety of tasks, in particular allowing for end-to-end learning and reducing the requirement for manual design decisions. However, still many parameters have to be chosen in…

Neural and Evolutionary Computing · Computer Science 2018-08-03 Mina Basirat , Peter M. Roth

Learning Activation Functions to Improve Deep Neural Networks

Artificial neural networks typically have a fixed, non-linear activation function at each neuron. We have designed a novel form of piecewise linear activation function that is learned independently for each neuron using gradient descent.…

Neural and Evolutionary Computing · Computer Science 2015-04-22 Forest Agostinelli , Matthew Hoffman , Peter Sadowski , Pierre Baldi

Neural Networks with Smooth Adaptive Activation Functions for Regression

In Neural Networks (NN), Adaptive Activation Functions (AAF) have parameters that control the shapes of activation functions. These parameters are trained along with other parameters in the NN. AAFs have improved performance of Neural…

Computer Vision and Pattern Recognition · Computer Science 2016-08-24 Le Hou , Dimitris Samaras , Tahsin M. Kurc , Yi Gao , Joel H. Saltz

Hard ASH: Sparsity and the right optimizer make a continual learner

In class incremental learning, neural networks typically suffer from catastrophic forgetting. We show that an MLP featuring a sparse activation function and an adaptive learning rate optimizer can compete with established regularization…

Machine Learning · Computer Science 2024-04-30 Santtu Keskinen

Competing nonlinearities, criticality, and order-to-chaos transition in deep networks

Deep neural networks owe their expressive power to nonlinear activation functions. The effective field theory of signal propagation at initialization reveals a few distinct universality classes of activations that exhibit different depth…

Disordered Systems and Neural Networks · Physics 2026-05-08 Omri Lesser , Debanjan Chowdhury

ASU-CNN: An Efficient Deep Architecture for Image Classification and Feature Visualizations

Activation functions play a decisive role in determining the capacity of Deep Neural Networks as they enable neural networks to capture inherent nonlinearities present in data fed to them. The prior research on activation functions…

Computer Vision and Pattern Recognition · Computer Science 2023-05-31 Jamshaid Ul Rahman , Faiza Makhdoom , Dianchen Lu

E-swish: Adjusting Activations to Different Network Depths

Activation functions have a notorious impact on neural networks on both training and testing the models against the desired problem. Currently, the most used activation function is the Rectified Linear Unit (ReLU). This paper introduces a…

Computer Vision and Pattern Recognition · Computer Science 2018-01-23 Eric Alcaide

A Theoretical View on Sparsely Activated Networks

Deep and wide neural networks successfully fit very complex functions today, but dense models are starting to be prohibitively expensive for inference. To mitigate this, one promising direction is networks that activate a sparse subgraph of…

Machine Learning · Computer Science 2022-08-10 Cenk Baykal , Nishanth Dikkala , Rina Panigrahy , Cyrus Rashtchian , Xin Wang

Is it Time to Swish? Comparing Deep Learning Activation Functions Across NLP tasks

Activation functions play a crucial role in neural networks because they are the nonlinearities which have been attributed to the success story of deep learning. One of the currently most popular activation functions is ReLU, but several…

Computation and Language · Computer Science 2019-01-10 Steffen Eger , Paul Youssef , Iryna Gurevych

Switchable Activation Networks

Deep neural networks, and more recently large-scale generative models such as large language models (LLMs) and large vision-action models (LVAs), achieve remarkable performance across diverse domains, yet their prohibitive computational…

Machine Learning · Computer Science 2026-03-10 Laha Ale , Ning Zhang , Scott A. King , Pingzhi Fan

Evolution of Activation Functions: An Empirical Investigation

The hyper-parameters of a neural network are traditionally designed through a time consuming process of trial and error that requires substantial expert knowledge. Neural Architecture Search (NAS) algorithms aim to take the human out of the…

Neural and Evolutionary Computing · Computer Science 2021-06-01 Andrew Nader , Danielle Azar

Exploring the Relationship: Transformative Adaptive Activation Functions in Comparison to Other Activation Functions

Neural networks are the state-of-the-art approach for many tasks and the activation function is one of the main building blocks that allow such performance. Recently, a novel transformative adaptive activation function (TAAF) allowing for…

Machine Learning · Computer Science 2024-02-15 Vladimír Kunc

Deep Network Approximation: Beyond ReLU to Diverse Activation Functions

This paper explores the expressive power of deep neural networks for a diverse range of activation functions. An activation function set $\mathscr{A}$ is defined to encompass the majority of commonly used activation functions, such as…

Machine Learning · Computer Science 2024-02-28 Shijun Zhang , Jianfeng Lu , Hongkai Zhao