Related papers: Principled Deep Neural Network Training through Li…

200 papers

In the past decade, deep learning became the prevalent methodology for predictive modeling thanks to the remarkable accuracy of deep neural networks in tasks such as computer vision and natural language processing. Meanwhile, the structure…

Optimization and Control · Mathematics 2025-09-16 Joey Huchette, Gonzalo Muñoz, Thiago Serra, Calvin Tsay

The holy grail of deep learning is to come up with an automatic method to design optimal architectures for different applications. In other words, how can we effectively dimension and organize neurons along the network layers based on the…

Optimization and Control · Mathematics 2018-06-19 Thiago Serra, Christian Tjandraatmadja, Srikumar Ramalingam

Deep learning models are yielding increasingly better performances thanks to multiple factors. To be successful, model may have large number of parameters or complex architectures and be trained on large dataset. This leads to large…

Machine Learning · Computer Science 2022-12-20 Jean-Roch Vlimant, Junqi Yin

Deep learning (DL) has transformed applications in a variety of domains, including computer vision, natural language processing, and tabular data analysis. The search for improved DL model accuracy has led practitioners to explore…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-01-10 Kabir Nagrecha

Neural network models and deep models are one of the leading and state of the art models in machine learning. Most successful deep neural models are the ones with many layers which highly increases their number of parameters. Training such…

Machine Learning · Computer Science 2018-07-17 Soufiane Belharbi

The increasing complexity of modern deep neural network models and the expanding sizes of datasets necessitate the development of optimized and scalable training methods. In this white paper, we addressed the challenge of efficiently…

Machine Learning · Computer Science 2024-04-29 Raphael Ruschel, A. S. M. Iftekhar, B. S. Manjunath, Suya You

Deep learning has shown promising results in many machine learning applications. The hierarchical feature representation built by deep networks enable compact and precise encoding of the data. A kernel analysis of the trained deep networks…

Machine Learning · Computer Science 2017-03-22 Mandar Kulkarni, Shirish Karande

Deep learning, a branch of artificial intelligence, is a data-driven method that uses multiple layers of interconnected units or neurons to learn intricate patterns and representations directly from raw input data. Empowered by this…

Machine Learning · Computer Science 2025-07-28 Mohd Halim Mohd Noor, Ayokunle Olalekan Ige

Deep learning, in general, focuses on training a neural network from large labeled datasets. Yet, in many cases there is value in training a network just from the input at hand. This is particularly relevant in many signal and image…

Machine Learning · Computer Science 2024-04-09 Tom Tirer, Raja Giryes, Se Young Chun, Yonina C. Eldar

Deep neural networks are widely used for classification. These deep models often suffer from a lack of interpretability -- they are particularly difficult to understand because of their non-linear nature. As a result, neural networks are…

Artificial Intelligence · Computer Science 2017-11-22 Oscar Li, Hao Liu, Chaofan Chen, Cynthia Rudin

Given a neural network, training data, and a threshold, it was known that it is NP-hard to find weights for the neural network such that the total error is below the threshold. We determine the algorithmic complexity of this fundamental…

Computational Complexity · Computer Science 2021-11-22 Mikkel Abrahamsen, Linda Kleist, Tillmann Miltzow

The field of deep learning has witnessed a remarkable shift towards extremely compute- and memory-intensive neural networks. These newer larger models have enabled researchers to advance state-of-the-art tools across a variety of fields.…

Machine Learning · Computer Science 2022-07-04 Daniel Nichols, Siddharth Singh, Shu-Huai Lin, Abhinav Bhatele

We consider deep neural networks, in which the output of each node is a quadratic function of its inputs. Similar to other deep architectures, these networks can compactly represent any function on a finite training set. The main goal of…

Machine Learning · Computer Science 2014-02-21 Roi Livni, Shai Shalev-Shwartz, Ohad Shamir

Machine Learning algorithms have had a profound impact on the field of computer science over the past few decades. These algorithms performance is greatly influenced by the representations that are derived from the data in the learning…

Deep neural network has recently shown very promising applications in different research directions and attracted the industry attention as well. Although the idea was introduced in the past but just recently the main limitation of using…

Signal Processing · Electrical Eng. & Systems 2019-04-16 Amin Abbasloo, Alan Salari

Recent success in training deep neural networks have prompted active investigation into the features learned on their intermediate layers. Such research is difficult because it requires making sense of non-linear computations performed by…

Machine Learning · Computer Science 2016-03-01 Yixuan Li, Jason Yosinski, Jeff Clune, Hod Lipson, John Hopcroft

It is widely believed that learning good representations is one of the main reasons for the success of deep neural networks. Although highly intuitive, there is a lack of theory and systematic approach quantitatively characterizing what…

Machine Learning · Computer Science 2018-11-30 Liwei Wang, Lunjia Hu, Jiayuan Gu, Yue Wu, Zhiqiang Hu, Kun He, John Hopcroft

Despite the remarkable success of deep learning in pattern recognition, deep network models face the problem of training a large number of parameters. In this paper, we propose and evaluate a novel multi-path wavelet neural network…

Computer Vision and Pattern Recognition · Computer Science 2019-08-27 D. D. N. De Silva, H. W. M. K. Vithanage, K. S. D. Fernando, I. T. S. Piyatilake

Deep learning relies on a very specific kind of neural networks: those superposing several neural layers. In the last few years, deep learning achieved major breakthroughs in many tasks such as image analysis, speech recognition, natural…

Artificial Intelligence · Computer Science 2018-02-01 Lê Nguyên Hoang, Rachid Guerraoui

Deep neural networks can struggle to learn continually in the face of non-stationarity. This phenomenon is known as loss of plasticity. In this paper, we identify underlying principles that lead to plastic algorithms. In particular, we…

Machine Learning · Computer Science 2024-10-29 Alex Lewandowski, Dale Schuurmans, Marlos C. Machado