English
Related papers

Related papers: Improving Neural Network Learning Through Dual Var…

200 papers

The choice of learning rate (LR) functions and policies has evolved from a simple fixed LR to the decaying LR and the cyclic LR, aiming to improve the accuracy and reduce the training time of Deep Neural Networks (DNNs). This paper presents…

Machine Learning · Computer Science 2022-10-25 Yanzhao Wu , Ling Liu

The increasing complexity of deep learning architectures is resulting in training time requiring weeks or even months. This slow training is due in part to vanishing gradients, in which the gradients used by back-propagation are extremely…

Computer Vision and Pattern Recognition · Computer Science 2015-10-16 Bharat Singh , Soham De , Yangmuzi Zhang , Thomas Goldstein , Gavin Taylor

Motivated by the observation that humans can learn patterns from two given images at one time, we propose a dual pattern learning network architecture in this paper. Unlike conventional networks, the proposed architecture has two input…

Computer Vision and Pattern Recognition · Computer Science 2018-06-12 Haimin Zhang , Min Xu

Training neural networks can be challenging, especially as the complexity of the problem increases. Despite using wider or deeper networks, training them can be a tedious process, especially if a wrong choice of the hyperparameter is made.…

Computational Engineering, Finance, and Science · Computer Science 2025-07-30 D. Veerababu , Ashwin A. Raikar , Prasanta K. Ghosh

Deep neural networks (DNN) have achieved remarkable success in various fields, including computer vision and natural language processing. However, training an effective DNN model still poses challenges. This paper aims to propose a method…

Machine Learning · Computer Science 2024-07-03 Hejie Ying , Mengmeng Song , Yaohong Tang , Shungen Xiao , Zimin Xiao

Artificial Intelligence algorithms have been steadily increasing in popularity and usage. Deep Learning, allows neural networks to be trained using huge datasets and also removes the need for human extracted features, as it automates the…

Neural and Evolutionary Computing · Computer Science 2020-05-11 Vasco Lopes , Paulo Fazendeiro

We propose multirate training of neural networks: partitioning neural network parameters into "fast" and "slow" parts which are trained on different time scales, where slow parts are updated less frequently. By choosing appropriate…

Machine Learning · Computer Science 2022-11-02 Tiffany Vlaar , Benedict Leimkuhler

Deep learning is a subset of a broader family of machine learning methods based on learning data representations. These models are inspired by human biological nervous systems, even if there are various differences pertaining to the…

Neural and Evolutionary Computing · Computer Science 2019-05-22 Adriano Baldeschi , Raffaella Margutti , Adam Miller

Deep reinforcement learning (DRL) has achieved significant breakthroughs in various tasks. However, most DRL algorithms suffer a problem of generalizing the learned policy which makes the learning performance largely affected even by minor…

Machine Learning · Computer Science 2019-07-11 Zhengyao Jiang , Shan Luo

Recurrent neural networks (RNNs) are widely used as a memory model for sequence-related problems. Many variants of RNN have been proposed to solve the gradient problems of training RNNs and process long sequences. Although some classical…

Neural and Evolutionary Computing · Computer Science 2020-05-29 Chenpeng Zhang , Shuai Li , Mao Ye , Ce Zhu , Xue Li

Differential learning rate (DLR), a technique that applies different learning rates to different model parameters, has been widely used in deep learning and achieved empirical success via its various forms. For example, parameter-efficient…

Machine Learning · Computer Science 2025-05-20 Shiyun Xu , Zhiqi Bu , Yiliang Zhang , Ian Barnett

This paper proposes a set of new error criteria and learning approaches, Adaptive Normalized Risk-Averting Training (ANRAT), to attack the non-convex optimization problem in training deep neural networks (DNNs). Theoretically, we…

Machine Learning · Computer Science 2016-06-10 Zhiguang Wang , Tim Oates , James Lo

The online learning of deep neural networks is an interesting problem of machine learning because, for example, major IT companies want to manage the information of the massive data uploaded on the web daily, and this technology can…

Machine Learning · Computer Science 2015-06-16 Sang-Woo Lee , Min-Oh Heo , Jiwon Kim , Jeonghee Kim , Byoung-Tak Zhang

Deep reinforcement learning (DRL) agents are trained through trial-and-error interactions with the environment. This leads to a long training time for dense neural networks to achieve good performance. Hence, prohibitive computation and…

Machine Learning · Computer Science 2022-05-09 Ghada Sokar , Elena Mocanu , Decebal Constantin Mocanu , Mykola Pechenizkiy , Peter Stone

Deep neural networks (DNNs) are often trained on the premise that the complete training data set is provided ahead of time. However, in real-world scenarios, data often arrive in chunks over time. This leads to important considerations…

Machine Learning · Computer Science 2023-03-21 Vijaya Raghavan T. Ramkumar , Elahe Arani , Bahram Zonooz

Learning Rate (LR) is an important hyper-parameter to tune for effective training of deep neural networks (DNNs). Even for the baseline of a constant learning rate, it is non-trivial to choose a good constant value for training a DNN.…

Machine Learning · Computer Science 2019-10-29 Yanzhao Wu , Ling Liu , Juhyun Bae , Ka-Ho Chow , Arun Iyengar , Calton Pu , Wenqi Wei , Lei Yu , Qi Zhang

In the context of classification problems, Deep Learning (DL) approaches represent state of art. Many DL approaches are based on variations of standard multi-layer feed-forward neural networks. These are also referred to as deep networks.…

Machine Learning · Computer Science 2023-11-21 Andrea Apicella , Francesco Isgrò , Roberto Prevete

We scrutinize the structural and operational aspects of deep learning models, particularly focusing on the nuances of learnable parameters (weight) statistics, distribution, node interaction, and visualization. By establishing correlations…

Machine Learning · Computer Science 2024-08-22 Ziwei Zheng , Huizhi Liang , Vaclav Snasel , Vito Latora , Panos Pardalos , Giuseppe Nicosia , Varun Ojha

Small neural networks with a constrained number of trainable parameters, can be suitable resource-efficient candidates for many simple tasks, where now excessively large models are used. However, such models face several problems during the…

Machine Learning · Computer Science 2021-09-21 Alexander Kovalenko , Pavel Kordík , Magda Friedjungová

The success of deep learning in the computer vision and natural language processing communities can be attributed to training of very deep neural networks with millions or billions of parameters which can then be trained with massive…

Machine Learning · Computer Science 2021-02-17 Kei Ota , Devesh K. Jha , Asako Kanezaki
‹ Prev 1 2 3 10 Next ›