English
Related papers

Related papers: Predicting Parameters in Deep Learning

200 papers

We show experimentally that the accuracy of a trained neural network can be predicted surprisingly well by looking only at its weights, without evaluating it on input data. We motivate this task and introduce a formal setting for it. Even…

Machine Learning · Statistics 2021-04-12 Thomas Unterthiner , Daniel Keysers , Sylvain Gelly , Olivier Bousquet , Ilya Tolstikhin

Training deep neural networks results in strong learned representations that show good generalization capabilities. In most cases, training involves iterative modification of all weights inside the network via back-propagation. In Extreme…

Machine Learning · Computer Science 2018-02-06 Amir Rosenfeld , John K. Tsotsos

Deep Neural Networks are highly over-parameterized and the size of the neural networks can be reduced significantly after training without any decrease in performance. One can clearly see this phenomenon in a wide range of architectures…

Machine Learning · Computer Science 2018-06-19 Utku Evci

The customizable nature of deep learning models have allowed them to be successful predictors in various disciplines. These models are often trained with respect to thousands or millions of instances for complicated problems, but the…

Machine Learning · Computer Science 2019-12-24 Drimik Roy Chowdhury , Muhammad Firmansyah Kasim

Deep neural networks with millions of parameters are at the heart of many state of the art machine learning models today. However, recent works have shown that models with much smaller number of parameters can also perform just as well. In…

Machine Learning · Computer Science 2016-08-03 Suraj Srinivas , R. Venkatesh Babu

We propose to use deep learning to estimate parameters in statistical models when standard likelihood estimation methods are computationally infeasible. We show how to estimate parameters from max-stable processes, where inference is…

Methodology · Statistics 2021-08-02 Amanda Lenzi , Julie Bessac , Johann Rudi , Michael L. Stein

Recent research has shown the existence of significant redundancy in large Transformer models. One can prune the redundant parameters without significantly sacrificing the generalization performance. However, we question whether the…

Computation and Language · Computer Science 2022-02-15 Chen Liang , Haoming Jiang , Simiao Zuo , Pengcheng He , Xiaodong Liu , Jianfeng Gao , Weizhu Chen , Tuo Zhao

It has been observed \citep{zhang2016understanding} that deep neural networks can memorize: they achieve 100\% accuracy on training data. Recent theoretical results explained such behavior in highly overparametrized regimes, where the…

Machine Learning · Computer Science 2019-09-27 Rong Ge , Runzhe Wang , Haoyu Zhao

One of the distinguishing characteristics of modern deep learning systems is that they typically employ neural network architectures that utilize enormous numbers of parameters, often in the millions and sometimes even in the billions.…

Machine Learning · Statistics 2021-11-15 Ben Adlam , Jake Levinson , Jeffrey Pennington

Not all neural network architectures are created equal, some perform much better than others for certain tasks. But how important are the weight parameters of a neural network compared to its architecture? In this work, we question to what…

Machine Learning · Computer Science 2019-09-06 Adam Gaier , David Ha

We present a deep transformation model for probabilistic regression. Deep learning is known for outstandingly accurate predictions on complex data but in regression tasks, it is predominantly used to just predict a single number. This…

Machine Learning · Statistics 2020-04-02 Beate Sick , Torsten Hothorn , Oliver Dürr

One of the arguments to explain the success of deep learning is the powerful approximation capacity of deep neural networks. Such capacity is generally accompanied by the explosive growth of the number of parameters, which, in turn, leads…

Machine Learning · Computer Science 2022-09-15 Zuowei Shen , Haizhao Yang , Shijun Zhang

We propose to impose symmetry in neural network parameters to improve parameter usage and make use of dedicated convolution and matrix multiplication routines. Due to significant reduction in the number of parameters as a result of the…

Machine Learning · Computer Science 2019-01-11 Xu Shell Hu , Sergey Zagoruyko , Nikos Komodakis

Deep learning has been successful in automating the design of features in machine learning pipelines. However, the algorithms optimizing neural network parameters remain largely hand-designed and computationally inefficient. We study if we…

Machine Learning · Computer Science 2021-10-26 Boris Knyazev , Michal Drozdzal , Graham W. Taylor , Adriana Romero-Soriano

This paper investigates the learnability of the nonlinearity property of Boolean functions using neural networks. We train encoder style deep neural networks to learn to predict the nonlinearity of Boolean functions from examples of…

Machine Learning · Computer Science 2025-02-04 Sriram Ranga , Nandish Chattopadhyay , Anupam Chattopadhyay

We scrutinize the structural and operational aspects of deep learning models, particularly focusing on the nuances of learnable parameters (weight) statistics, distribution, node interaction, and visualization. By establishing correlations…

Machine Learning · Computer Science 2024-08-22 Ziwei Zheng , Huizhi Liang , Vaclav Snasel , Vito Latora , Panos Pardalos , Giuseppe Nicosia , Varun Ojha

Machine learning models have achieved human-level performance on various tasks. This success comes at a high cost of computation and storage overhead, which makes machine learning algorithms difficult to deploy on edge devices. Typically,…

Machine Learning · Computer Science 2021-11-12 Giuseppina Carannante , Dimah Dera , Ghulam Rasool , Nidhal C. Bouaynaya

In this work, we investigate the fundamental trade-off regarding accuracy and parameter efficiency in the parameterization of neural network weights using predictor networks. We present a surprising finding that, when recovering the…

Machine Learning · Computer Science 2024-07-02 Hongjun Choi , Jayaraman J. Thiagarajan , Ruben Glatt , Shusen Liu

Deep Neural nets (NNs) with millions of parameters are at the heart of many state-of-the-art computer vision systems today. However, recent works have shown that much smaller models can achieve similar levels of performance. In this work,…

Computer Vision and Pattern Recognition · Computer Science 2015-07-23 Suraj Srinivas , R. Venkatesh Babu

Our goal is to provide a review of deep learning methods which provide insight into structured high-dimensional data. Rather than using shallow additive architectures common to most statistical models, deep learning uses layers of…

Machine Learning · Statistics 2023-10-11 Nick Polson , Vadim Sokolov
‹ Prev 1 2 3 10 Next ›