Related papers: Cooperative Initialization based Deep Neural Netwo…

Discovering Parametric Activation Functions

Recent studies have shown that the choice of activation function can significantly affect the performance of deep learning networks. However, the benefits of novel activation functions have been inconsistent and task dependent, and…

Machine Learning · Computer Science 2022-01-25 Garrett Bingham , Risto Miikkulainen

Improved weight initialization for deep and narrow feedforward neural network

Appropriate weight initialization settings, along with the ReLU activation function, have become cornerstones of modern deep learning, enabling the training and deployment of highly effective and efficient neural network models across…

Machine Learning · Computer Science 2024-04-02 Hyunwoo Lee , Yunho Kim , Seung Yeop Yang , Hayoung Choi

Neural networks with trainable matrix activation functions

The training process of neural networks usually optimize weights and bias parameters of linear transformations, while nonlinear activation functions are pre-specified and fixed. This work develops a systematic approach to constructing…

Machine Learning · Computer Science 2024-10-29 Zhengqi Liu , Shuhao Cao , Yuwen Li , Ludmil Zikatanov

Learning Combinations of Activation Functions

In the last decade, an active area of research has been devoted to design novel activation functions that are able to help deep neural networks to converge, obtaining better performance. The training procedure of these architectures usually…

Machine Learning · Computer Science 2019-04-26 Franco Manessi , Alessandro Rozza

A Methodology for Automatic Selection of Activation Functions to Design Hybrid Deep Neural Networks

Activation functions influence behavior and performance of DNNs. Nonlinear activation functions, like Rectified Linear Units (ReLU), Exponential Linear Units (ELU) and Scaled Exponential Linear Units (SELU), outperform the linear…

Neural and Evolutionary Computing · Computer Science 2019-02-05 Alberto Marchisio , Muhammad Abdullah Hanif , Semeen Rehman , Maurizio Martina , Muhammad Shafique

Effects of the Nonlinearity in Activation Functions on the Performance of Deep Learning Models

The nonlinearity of activation functions used in deep learning models are crucial for the success of predictive models. There are several commonly used simple nonlinear functions, including Rectified Linear Unit (ReLU) and Leaky-ReLU…

Machine Learning · Computer Science 2020-10-16 Nalinda Kulathunga , Nishath Rajiv Ranasinghe , Daniel Vrinceanu , Zackary Kinsman , Lei Huang , Yunjiao Wang

DSReLU: A Novel Dynamic Slope Function for Superior Model Training

This study introduces a novel activation function, characterized by a dynamic slope that adjusts throughout the training process, aimed at enhancing adaptability and performance in deep neural networks for computer vision tasks. The…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Archisman Chakraborti , Bidyut B Chaudhuri

Evolutionary Optimization of Deep Learning Activation Functions

The choice of activation function can have a large effect on the performance of a neural network. While there have been some attempts to hand-engineer novel activation functions, the Rectified Linear Unit (ReLU) remains the most…

Machine Learning · Computer Science 2020-04-14 Garrett Bingham , William Macke , Risto Miikkulainen

Competition-based Adaptive ReLU for Deep Neural Networks

Activation functions introduce nonlinearity into deep neural networks. Most popular activation functions allow positive values to pass through while blocking or suppressing negative values. From the idea that positive values and negative…

Neural and Evolutionary Computing · Computer Science 2024-07-30 Junjia Chen , Zhibin Pan

Deeper Learning with CoLU Activation

In neural networks, non-linearity is introduced by activation functions. One commonly used activation function is Rectified Linear Unit (ReLU). ReLU has been a popular choice as an activation but has flaws. State-of-the-art functions like…

Machine Learning · Computer Science 2021-12-23 Advait Vagerwal

Adaptively Customizing Activation Functions for Various Layers

To enhance the nonlinearity of neural networks and increase their mapping abilities between the inputs and response variables, activation functions play a crucial role to model more complex relationships and patterns in the data. In this…

Computer Vision and Pattern Recognition · Computer Science 2021-12-20 Haigen Hu , Aizhu Liu , Qiu Guan , Xiaoxin Li , Shengyong Chen , Qianwei Zhou

Rotate the ReLU to implicitly sparsify deep networks

In the era of Deep Neural Network based solutions for a variety of real-life tasks, having a compact and energy-efficient deployable model has become fairly important. Most of the existing deep architectures use Rectifier Linear Unit (ReLU)…

Machine Learning · Computer Science 2022-06-02 Nancy Nayak , Sheetal Kalyani

Natural-Logarithm-Rectified Activation Function in Convolutional Neural Networks

Activation functions play a key role in providing remarkable performance in deep neural networks, and the rectified linear unit (ReLU) is one of the most widely used activation functions. Various new activation functions and improvements on…

Machine Learning · Computer Science 2019-08-27 Yang Liu , Jianpeng Zhang , Chao Gao , Jinghua Qu , Lixin Ji

Studying Various Activation Functions and Non-IID Data for Machine Learning Model Robustness

Adversarial training is an effective method to improve the machine learning (ML) model robustness. Most existing studies typically consider the Rectified linear unit (ReLU) activation function and centralized training environments. In this…

Machine Learning · Computer Science 2025-12-05 Long Dang , Thushari Hapuarachchi , Kaiqi Xiong , Jing Lin

Activation function design for deep networks: linearity and effective initialisation

The activation function deployed in a deep neural network has great influence on the performance of the network at initialisation, which in turn has implications for training. In this paper we study how to avoid two problems at…

Machine Learning · Computer Science 2021-05-18 Michael Murray , Vinayak Abrol , Jared Tanner

Improving Deep Neural Network with Multiple Parametric Exponential Linear Units

Activation function is crucial to the recent successes of deep neural networks. In this paper, we first propose a new activation function, Multiple Parametric Exponential Linear Units (MPELU), aiming to generalize and unify the rectified…

Computer Vision and Pattern Recognition · Computer Science 2017-01-18 Yang Li , Chunxiao Fan , Yong Li , Qiong Wu , Yue Ming

On weight initialization in deep neural networks

A proper initialization of the weights in a neural network is critical to its convergence. Current insights into weight initialization come primarily from linear activation functions. In this paper, I develop a theory for weight…

Machine Learning · Computer Science 2017-05-04 Siddharth Krishna Kumar

GELU Activation Function in Deep Learning: A Comprehensive Mathematical Analysis and Performance

Selecting the most suitable activation function is a critical factor in the effectiveness of deep learning models, as it influences their learning capacity, stability, and computational efficiency. In recent years, the Gaussian Error Linear…

Machine Learning · Computer Science 2023-08-02 Minhyeok Lee

Ensemble of Convolutional Neural Networks Trained with Different Activation Functions

Activation functions play a vital role in the training of Convolutional Neural Networks. For this reason, to develop efficient and performing functions is a crucial problem in the deep learning community. Key to these approaches is to…

Computer Vision and Pattern Recognition · Computer Science 2020-09-23 Gianluca Maguolo , Loris Nanni , Stefano Ghidoni

An Analysis of State-of-the-art Activation Functions For Supervised Deep Neural Network

This paper provides an analysis of state-of-the-art activation functions with respect to supervised classification of deep neural network. These activation functions comprise of Rectified Linear Units (ReLU), Exponential Linear Unit (ELU),…

Machine Learning · Computer Science 2021-04-07 Anh Nguyen , Khoa Pham , Dat Ngo , Thanh Ngo , Lam Pham