Related papers: Stochastic Adaptive Activation Function

200 papers

The scope of research in the domain of activation functions remains limited and centered around improving the ease of optimization or generalization quality of neural networks (NNs). However, to develop a deeper understanding of deep…

Machine Learning · Computer Science 2020-12-10 Mohit Goyal, Rajan Goyal, Brejesh Lall

To gain a deeper understanding of the behavior and learning dynamics of (deep) artificial neural networks, it is valuable to employ mathematical abstractions and models. These tools provide a simplified perspective on network performance…

Machine Learning · Computer Science 2023-08-03 Stephan Johann Lehmler, Muhammad Saif-ur-Rehman, Tobias Glasmachers, Ioannis Iossifidis

Activation functions are crucial in deep learning models since they introduce non-linearity into the networks, allowing them to learn from errors and make adjustments, which is essential for learning complex patterns. The essential purpose…

Machine Learning · Computer Science 2023-10-17 Koushik Biswas, Meghana Karri, Ulaş Bağcı

Activation Functions introduce non-linearity in the deep neural networks. This nonlinearity helps the neural networks learn faster and efficiently from the dataset. In deep learning, many activation functions are developed and used based on…

Machine Learning · Computer Science 2025-09-29 Ravin Kumar

Artificial neural networks (ANN), typically referred to as neural networks, are a class of Machine Learning algorithms and have achieved widespread success, having been inspired by the biological structure of the human brain. Neural…

Machine Learning · Computer Science 2022-04-08 Murilo Gustineli

To enhance the nonlinearity of neural networks and increase their mapping abilities between the inputs and response variables, activation functions play a crucial role to model more complex relationships and patterns in the data. In this…

Computer Vision and Pattern Recognition · Computer Science 2021-12-20 Haigen Hu, Aizhu Liu, Qiu Guan, Xiaoxin Li, Shengyong Chen, Qianwei Zhou

We present a simple, effective, and general activation function we term ACON which learns to activate the neurons or not. Interestingly, we find Swish, the recent popular NAS-searched activation, can be interpreted as a smooth approximation…

Computer Vision and Pattern Recognition · Computer Science 2021-04-19 Ningning Ma, Xiangyu Zhang, Ming Liu, Jian Sun

Deep Neural Networks have been shown to be beneficial for a variety of tasks, in particular allowing for end-to-end learning and reducing the requirement for manual design decisions. However, still many parameters have to be chosen in…

Neural and Evolutionary Computing · Computer Science 2018-08-03 Mina Basirat, Peter M. Roth

Artificial neural networks typically have a fixed, non-linear activation function at each neuron. We have designed a novel form of piecewise linear activation function that is learned independently for each neuron using gradient descent.…

Neural and Evolutionary Computing · Computer Science 2015-04-22 Forest Agostinelli, Matthew Hoffman, Peter Sadowski, Pierre Baldi

In Neural Networks (NN), Adaptive Activation Functions (AAF) have parameters that control the shapes of activation functions. These parameters are trained along with other parameters in the NN. AAFs have improved performance of Neural…

Computer Vision and Pattern Recognition · Computer Science 2016-08-24 Le Hou, Dimitris Samaras, Tahsin M. Kurc, Yi Gao, Joel H. Saltz

In class incremental learning, neural networks typically suffer from catastrophic forgetting. We show that an MLP featuring a sparse activation function and an adaptive learning rate optimizer can compete with established regularization…

Machine Learning · Computer Science 2024-04-30 Santtu Keskinen

Deep neural networks owe their expressive power to nonlinear activation functions. The effective field theory of signal propagation at initialization reveals a few distinct universality classes of activations that exhibit different depth…

Disordered Systems and Neural Networks · Physics 2026-05-08 Omri Lesser, Debanjan Chowdhury

Activation functions play a decisive role in determining the capacity of Deep Neural Networks as they enable neural networks to capture inherent nonlinearities present in data fed to them. The prior research on activation functions…

Computer Vision and Pattern Recognition · Computer Science 2023-05-31 Jamshaid Ul Rahman, Faiza Makhdoom, Dianchen Lu

Activation functions have a notorious impact on neural networks on both training and testing the models against the desired problem. Currently, the most used activation function is the Rectified Linear Unit (ReLU). This paper introduces a…

Computer Vision and Pattern Recognition · Computer Science 2018-01-23 Eric Alcaide

Deep and wide neural networks successfully fit very complex functions today, but dense models are starting to be prohibitively expensive for inference. To mitigate this, one promising direction is networks that activate a sparse subgraph of…

Machine Learning · Computer Science 2022-08-10 Cenk Baykal, Nishanth Dikkala, Rina Panigrahy, Cyrus Rashtchian, Xin Wang

Activation functions play a crucial role in neural networks because they are the nonlinearities which have been attributed to the success story of deep learning. One of the currently most popular activation functions is ReLU, but several…

Computation and Language · Computer Science 2019-01-10 Steffen Eger, Paul Youssef, Iryna Gurevych

Deep neural networks, and more recently large-scale generative models such as large language models (LLMs) and large vision-action models (LVAs), achieve remarkable performance across diverse domains, yet their prohibitive computational…

Machine Learning · Computer Science 2026-03-10 Laha Ale, Ning Zhang, Scott A. King, Pingzhi Fan

The hyper-parameters of a neural network are traditionally designed through a time consuming process of trial and error that requires substantial expert knowledge. Neural Architecture Search (NAS) algorithms aim to take the human out of the…

Neural and Evolutionary Computing · Computer Science 2021-06-01 Andrew Nader, Danielle Azar

Neural networks are the state-of-the-art approach for many tasks and the activation function is one of the main building blocks that allow such performance. Recently, a novel transformative adaptive activation function (TAAF) allowing for…

Machine Learning · Computer Science 2024-02-15 Vladimír Kunc

This paper explores the expressive power of deep neural networks for a diverse range of activation functions. An activation function set $\mathscr{A}$ is defined to encompass the majority of commonly used activation functions, such as…

Machine Learning · Computer Science 2024-02-28 Shijun Zhang, Jianfeng Lu, Hongkai Zhao