Related papers: Statistical guarantees for sparse deep learning

200 papers

Sparse deep learning has become a popular technique for improving the performance of deep neural networks in areas such as uncertainty quantification, variable selection, and large-scale network compression. However, most existing research…

Machine Learning · Statistics 2023-10-06 Mingxuan Zhang, Yan Sun, Faming Liang

Sparse deep learning aims to address the challenge of huge storage consumption by deep neural networks, and to recover the sparse structure of target functions. Although tremendous empirical successes have been achieved, most sparse deep…

Machine Learning · Statistics 2020-11-17 Jincheng Bai, Qifan Song, Guang Cheng

Deep neural networks have emerged as powerful tools for learning operators defined over infinite-dimensional function spaces. However, existing theories frequently encounter difficulties related to dimensionality and limited…

Machine Learning · Computer Science 2026-05-12 Jianfei Li, Shuo Huang, Han Feng, Ding-Xuan Zhou, Gitta Kutyniok

While deep learning has demonstrated impressive progress, it remains a daunting challenge to learn from hard samples as these samples are usually noisy and intricate. These hard samples play a crucial role in the optimal performance of deep…

Computer Vision and Pattern Recognition · Computer Science 2024-12-30 Qiao Xiao, Boqian Wu, Lu Yin, Christopher Neil Gadzinski, Tianjin Huang, Mykola Pechenizkiy, Decebal Constantin Mocanu

Deep learning has powered recent successes of artificial intelligence (AI). However, the deep neural network, as the basic model of deep learning, has suffered from issues such as local traps and miscalibration. In this paper, we provide a…

Machine Learning · Statistics 2021-12-03 Yan Sun, Wenjun Xiong, Faming Liang

Since statistical guarantees for neural networks are usually restricted to global optima of intricate objective functions, it is unclear whether these theories explain the performances of actual outputs of neural network pipelines. The goal…

Machine Learning · Computer Science 2025-12-08 Mahsa Taheri, Fang Xie, Johannes Lederer

Sparsity has become popular in machine learning, because it can save computational resources, facilitate interpretations, and prevent overfitting. In this paper, we discuss sparsity in the framework of neural networks. In particular, we…

Machine Learning · Computer Science 2020-06-30 Mohamed Hebiri, Johannes Lederer

Deep learning has been the engine powering many successes of data science. However, the deep neural network (DNN), as the basic model of deep learning, is often excessively over-parameterized, causing many difficulties in training,…

Machine Learning · Statistics 2021-03-09 Yan Sun, Qifan Song, Faming Liang

In this thesis we discuss machine learning methods performing automated variable selection for learning sparse predictive models. There are multiple reasons for promoting sparsity in the predictive models. By relying on a limited set of…

Machine Learning · Computer Science 2019-03-27 Magda Gregorova

Recent work on deep neural network pruning has shown there exist sparse subnetworks that achieve equal or improved accuracy, training time, and loss using fewer network parameters when compared to their dense counterparts. Orthogonal to…

Machine Learning · Computer Science 2019-12-06 Justin Cosentino, Federico Zaiter, Dan Pei, Jun Zhu

Sparse connectivity is a hallmark of the brain and a desired property of artificial neural networks. It promotes energy efficiency, simplifies training, and enhances the robustness of network function. Thus, a detailed understanding of how…

Disordered Systems and Neural Networks · Physics 2024-09-10 Mirza M. Junaid Baig, Armen Stepanyants

Deep learning has arguably achieved tremendous success in recent years. In simple words, deep learning uses the composition of many nonlinear functions to model the complex dependency between input features and labels. While neural networks…

Machine Learning · Statistics 2019-04-16 Jianqing Fan, Cong Ma, Yiqiao Zhong

Sparse regression on a library of candidate features has developed as the prime method to discover the partial differential equation underlying a spatio-temporal data-set. These features consist of higher order derivatives, limiting model…

Machine Learning · Computer Science 2021-05-05 Gert-Jan Both, Gijs Vermarien, Remy Kusters

Variational inference is becoming more and more popular for approximating intractable posterior distributions in Bayesian statistics and machine learning. Meanwhile, a few recent works have provided theoretical justification and new…

Statistics Theory · Mathematics 2019-09-09 Badr-Eddine Chérief-Abdellatif

Sparse neural networks are highly desirable in deep learning in reducing its complexity. The goal of this paper is to study how choices of regularization parameters influence the sparsity level of learned neural networks. We first derive…

Machine Learning · Computer Science 2024-08-07 Lixin Shen, Rui Wang, Yuesheng Xu, Mingsong Yan

Deep artificial neural networks achieve surprising generalization abilities that remain poorly understood. In this paper, we present a new approach to analyzing generalization for deep feed-forward ReLU networks that takes advantage of the…

Machine Learning · Computer Science 2023-07-06 Ramchandran Muthukumar, Jeremias Sulam

Balancing predictive power and interpretability has long been a challenging research area, particularly in powerful yet complex models like neural networks, where nonlinearity obstructs direct interpretation. This paper introduces a novel…

Machine Learning · Computer Science 2025-02-20 Antoine Ledent, Peng Liu

Coded recurrent neural networks with three levels of sparsity are introduced. The first level is related to the size of messages, much smaller than the number of available neurons. The second one is provided by a particular coding rule,…

Machine Learning · Computer Science 2011-02-22 Vincent Gripon, Claude Berrou

Even though probabilistic treatments of neural networks have a long history, they have not found widespread use in practice. Sampling approaches are often too slow already for simple networks. The size of the inputs and the depth of typical…

Computer Vision and Pattern Recognition · Computer Science 2018-05-30 Jochen Gast, Stefan Roth

The computational demands of modern Deep Neural Networks (DNNs) are immense and constantly growing. While training costs usually capture public attention, inference demands are also contributing in significant computational, energy and…
