English
Related papers

Related papers: Understanding Generalization in Deep Learning via …

200 papers

Deep nets generalize well despite having more parameters than the number of training samples. Recent works try to give an explanation using PAC-Bayes and Margin-based analyses, but do not as yet result in sample complexity bounds better…

Machine Learning · Computer Science 2018-11-28 Sanjeev Arora , Rong Ge , Behnam Neyshabur , Yi Zhang

While there has been progress in developing non-vacuous generalization bounds for deep neural networks, these bounds tend to be uninformative about why deep learning works. In this paper, we develop a compression approach based on…

Machine Learning · Computer Science 2022-11-28 Sanae Lotfi , Marc Finzi , Sanyam Kapoor , Andres Potapczynski , Micah Goldblum , Andrew Gordon Wilson

Explaining the surprising generalization performance of deep neural networks is an active and important line of research in theoretical machine learning. Influential work by Arora et al. (ICML'18) showed that, noise stability properties of…

Machine Learning · Computer Science 2021-07-02 Allan Grønlund , Mikael Høgsgaard , Lior Kamma , Kasper Green Larsen

Tensorizing a neural network involves reshaping some or all of its dense weight matrices into higher-order tensors and approximating them using low-rank tensor network decompositions. This technique has shown promise as a model compression…

Machine Learning · Computer Science 2025-05-27 Safa Hamreras , Sukhbinder Singh , Román Orús

We propose tensorial neural networks (TNNs), a generalization of existing neural networks by extending tensor operations on low order operands to those on high order ones. The problem of parameter learning is challenging, as it corresponds…

Machine Learning · Statistics 2018-12-11 Jiahao Su , Jingling Li , Bobby Bhattacharjee , Furong Huang

Modern neural networks are highly overparameterized, with capacity to substantially overfit to training data. Nevertheless, these networks often generalize well in practice. It has also been observed that trained networks can often be…

Machine Learning · Statistics 2019-02-26 Wenda Zhou , Victor Veitch , Morgane Austern , Ryan P. Adams , Peter Orbanz

We present an efficient coresets-based neural network compression algorithm that sparsifies the parameters of a trained fully-connected neural network in a manner that provably approximates the network's output. Our approach is based on an…

Machine Learning · Computer Science 2019-05-21 Cenk Baykal , Lucas Liebenwein , Igor Gilitschenski , Dan Feldman , Daniela Rus

Deep Neural Networks can generalize despite being significantly overparametrized. Recent research has tried to examine this phenomenon from various view points and to provide bounds on the generalization error or measures predictive of the…

Machine Learning · Computer Science 2020-12-07 Parth Natekar , Manik Sharma

We present a new framework to measure the intrinsic properties of (deep) neural networks. While we focus on convolutional networks, our framework can be extrapolated to any network architecture. In particular, we evaluate two network…

Machine Learning · Computer Science 2022-05-10 Alberto Badias , Ashis Banerjee

Both PAC-Bayesian and Sample Compress learning frameworks are instrumental for deriving tight (non-vacuous) generalization bounds for neural networks. We leverage these results in a meta-learning scheme, relying on a hypernetwork that…

Machine Learning · Computer Science 2025-06-06 Benjamin Leblanc , Mathieu Bazinet , Nathaniel D'Amours , Alexandre Drouin , Pascal Germain

Tensor networks have found a wide use in a variety of applications in physics and computer science, recently leading to both theoretical insights as well as practical algorithms in machine learning. In this work we explore the connection…

Quantum Physics · Physics 2019-12-04 Ivan Glasser , Nicola Pancotti , J. Ignacio Cirac

The primary objective of learning methods is generalization. Classic uniform generalization bounds, which rely on VC-dimension or Rademacher complexity, fail to explain the significant attribute that over-parameterized models in deep…

Machine Learning · Computer Science 2025-03-07 Lijia Yu , Yibo Miao , Yifan Zhu , Xiao-Shan Gao , Lijun Zhang

In recent years, great progress has been made in a variety of application domains thanks to the development of increasingly deeper neural networks. Unfortunately, the huge number of units of these networks makes them expensive both…

Computer Vision and Pattern Recognition · Computer Science 2018-10-12 Jose M. Alvarez , Mathieu Salzmann

Various iterative reconstruction algorithms for inverse problems can be unfolded as neural networks. Empirically, this approach has often led to improved results, but theoretical guarantees are still scarce. While some progress on…

Statistics Theory · Mathematics 2021-08-16 Arash Behboodi , Holger Rauhut , Ekkehard Schnoor

Tensor regression networks achieve high compression rate of neural networks while having slight impact on performances. They do so by imposing low tensor rank structure on the weight matrices of fully connected layers. In recent years,…

Machine Learning · Computer Science 2018-11-30 Xingwei Cao , Guillaume Rabusseau

One of the biggest issues in deep learning theory is the generalization ability of networks with huge model size. The classical learning theory suggests that overparameterized models cause overfitting. However, practically used large deep…

Machine Learning · Computer Science 2020-06-23 Taiji Suzuki , Hiroshi Abe , Tomoaki Nishimura

The ability of overparameterized deep networks to generalize well has been linked to the fact that stochastic gradient descent (SGD) finds solutions that lie in flat, wide minima in the training loss -- minima where the output of the…

Machine Learning · Computer Science 2019-06-03 Vaishnavh Nagarajan , J. Zico Kolter

In the context of neural network models, overparametrization refers to the phenomena whereby these models appear to generalize well on the unseen data, even though the number of parameters significantly exceeds the sample sizes, and the…

Machine Learning · Statistics 2020-03-25 Matt Emschwiller , David Gamarnik , Eren C. Kızıldağ , Ilias Zadik

Understanding the generalization behavior of deep neural networks remains a fundamental challenge in modern statistical learning theory. Among existing approaches, PAC-Bayesian norm-based bounds have demonstrated particular promise due to…

Machine Learning · Statistics 2026-01-14 Xinping Yi , Gaojie Jin , Xiaowei Huang , Shi Jin

Along with the rapid development of deep learning in practice, the theoretical explanations for its success become urgent. Generalization and expressivity are two widely used measurements to quantify theoretical behaviors of deep learning.…

Machine Learning · Computer Science 2018-03-26 Shao-Bo Lin
‹ Prev 1 2 3 10 Next ›