Related papers: Cooperative Initialization based Deep Neural Netwo…

200 papers

Recent studies have shown that the choice of activation function can significantly affect the performance of deep learning networks. However, the benefits of novel activation functions have been inconsistent and task dependent, and…

Machine Learning · Computer Science 2022-01-25 Garrett Bingham, Risto Miikkulainen

Appropriate weight initialization settings, along with the ReLU activation function, have become cornerstones of modern deep learning, enabling the training and deployment of highly effective and efficient neural network models across…

Machine Learning · Computer Science 2024-04-02 Hyunwoo Lee, Yunho Kim, Seung Yeop Yang, Hayoung Choi

The training process of neural networks usually optimizes the weight and bias parameters of linear transformations, while nonlinear activation functions are pre-specified and fixed. This work develops a systematic approach to constructing…

Machine Learning · Computer Science 2024-10-29 Zhengqi Liu, Shuhao Cao, Yuwen Li, Ludmil Zikatanov

In the last decade, an active area of research has been devoted to designing novel activation functions that help deep neural networks converge and obtain better performance. The training procedure of these architectures usually…

Machine Learning · Computer Science 2019-04-26 Franco Manessi, Alessandro Rozza

Activation functions influence the behavior and performance of DNNs. Nonlinear activation functions, like Rectified Linear Units (ReLU), Exponential Linear Units (ELU) and Scaled Exponential Linear Units (SELU), outperform the linear…

Neural and Evolutionary Computing · Computer Science 2019-02-05 Alberto Marchisio, Muhammad Abdullah Hanif, Semeen Rehman, Maurizio Martina, Muhammad Shafique

The nonlinearity of activation functions used in deep learning models is crucial for the success of predictive models. There are several commonly used simple nonlinear functions, including Rectified Linear Unit (ReLU) and Leaky-ReLU…

Machine Learning · Computer Science 2020-10-16 Nalinda Kulathunga, Nishath Rajiv Ranasinghe, Daniel Vrinceanu, Zackary Kinsman, Lei Huang, Yunjiao Wang

This study introduces a novel activation function, characterized by a dynamic slope that adjusts throughout the training process, aimed at enhancing adaptability and performance in deep neural networks for computer vision tasks. The…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Archisman Chakraborti, Bidyut B Chaudhuri

The choice of activation function can have a large effect on the performance of a neural network. While there have been some attempts to hand-engineer novel activation functions, the Rectified Linear Unit (ReLU) remains the most…

Machine Learning · Computer Science 2020-04-14 Garrett Bingham, William Macke, Risto Miikkulainen

Activation functions introduce nonlinearity into deep neural networks. Most popular activation functions allow positive values to pass through while blocking or suppressing negative values. From the idea that positive values and negative…

Neural and Evolutionary Computing · Computer Science 2024-07-30 Junjia Chen, Zhibin Pan

In neural networks, non-linearity is introduced by activation functions. One commonly used activation function is the Rectified Linear Unit (ReLU). ReLU has been a popular choice of activation but has flaws. State-of-the-art functions like…

Machine Learning · Computer Science 2021-12-23 Advait Vagerwal

Activation functions play a crucial role in enhancing the nonlinearity of neural networks and their ability to map inputs to response variables, allowing them to model more complex relationships and patterns in the data. In this…

Computer Vision and Pattern Recognition · Computer Science 2021-12-20 Haigen Hu, Aizhu Liu, Qiu Guan, Xiaoxin Li, Shengyong Chen, Qianwei Zhou

In the era of deep neural network based solutions for a variety of real-life tasks, having a compact and energy-efficient deployable model has become important. Most of the existing deep architectures use Rectifier Linear Unit (ReLU)…

Machine Learning · Computer Science 2022-06-02 Nancy Nayak, Sheetal Kalyani

Activation functions play a key role in providing remarkable performance in deep neural networks, and the rectified linear unit (ReLU) is one of the most widely used activation functions. Various new activation functions and improvements on…

Machine Learning · Computer Science 2019-08-27 Yang Liu, Jianpeng Zhang, Chao Gao, Jinghua Qu, Lixin Ji

Adversarial training is an effective method for improving the robustness of machine learning (ML) models. Most existing studies consider the rectified linear unit (ReLU) activation function and centralized training environments. In this…

Machine Learning · Computer Science 2025-12-05 Long Dang, Thushari Hapuarachchi, Kaiqi Xiong, Jing Lin

The activation function deployed in a deep neural network has a great influence on the performance of the network at initialisation, which in turn has implications for training. In this paper we study how to avoid two problems at…

Machine Learning · Computer Science 2021-05-18 Michael Murray, Vinayak Abrol, Jared Tanner

The activation function is crucial to the recent successes of deep neural networks. In this paper, we first propose a new activation function, Multiple Parametric Exponential Linear Units (MPELU), aiming to generalize and unify the rectified…

Computer Vision and Pattern Recognition · Computer Science 2017-01-18 Yang Li, Chunxiao Fan, Yong Li, Qiong Wu, Yue Ming

A proper initialization of the weights in a neural network is critical to its convergence. Current insights into weight initialization come primarily from linear activation functions. In this paper, I develop a theory for weight…

Machine Learning · Computer Science 2017-05-04 Siddharth Krishna Kumar

Selecting the most suitable activation function is a critical factor in the effectiveness of deep learning models, as it influences their learning capacity, stability, and computational efficiency. In recent years, the Gaussian Error Linear…

Machine Learning · Computer Science 2023-08-02 Minhyeok Lee

Activation functions play a vital role in the training of Convolutional Neural Networks. For this reason, developing efficient and well-performing functions is a crucial problem in the deep learning community. Key to these approaches is to…

Computer Vision and Pattern Recognition · Computer Science 2020-09-23 Gianluca Maguolo, Loris Nanni, Stefano Ghidoni

This paper provides an analysis of state-of-the-art activation functions with respect to supervised classification with deep neural networks. These activation functions comprise Rectified Linear Units (ReLU), Exponential Linear Unit (ELU),…

Machine Learning · Computer Science 2021-04-07 Anh Nguyen, Khoa Pham, Dat Ngo, Thanh Ngo, Lam Pham