English
Related papers

Related papers: Singular Value Perturbation and Deep Network Optim…

200 papers

Deep neural networks have been one of the dominant machine learning approaches in recent years. Several new network structures are proposed and have better performance than the traditional feedforward neural network structure.…

Computer Vision and Pattern Recognition · Computer Science 2018-10-04 Huan Li , Yibo Yang , Dongmin Chen , Zhouchen Lin

Recent results in the literature indicate that a residual network (ResNet) composed of a single residual block outperforms linear predictors, in the sense that all local minima in its optimization landscape are at least as good as the best…

Machine Learning · Computer Science 2019-10-30 Chulhee Yun , Suvrit Sra , Ali Jadbabaie

Advanced deep neural networks (DNNs), designed by either human or AutoML algorithms, are growing increasingly complex. Diverse operations are connected by complicated connectivity patterns, e.g., various types of skip connections. Those…

Machine Learning · Computer Science 2022-10-13 Wuyang Chen , Wei Huang , Xinyu Gong , Boris Hanin , Zhangyang Wang

Skip connections made the training of very deep networks possible and have become an indispensable component in a variety of neural architectures. A completely satisfactory explanation for their success remains elusive. Here, we present a…

Neural and Evolutionary Computing · Computer Science 2018-03-06 A. Emin Orhan , Xaq Pitkow

A residual network (or ResNet) is a standard deep neural net architecture, with state-of-the-art performance across numerous applications. The main premise of ResNets is that they allow the training of each layer to focus on fitting just…

Machine Learning · Computer Science 2018-09-28 Ohad Shamir

The trend towards increasingly deep neural networks has been driven by a general observation that increasing depth increases the performance of a network. Recently, however, evidence has been amassing that simply increasing depth may not be…

Computer Vision and Pattern Recognition · Computer Science 2016-12-01 Zifeng Wu , Chunhua Shen , Anton van den Hengel

There are many surprising and perhaps counter-intuitive properties of optimization of deep neural networks. We propose and experimentally verify a unified phenomenological model of the loss landscape that incorporates many of them. High…

Machine Learning · Computer Science 2019-06-12 Stanislav Fort , Stanislaw Jastrzebski

Deep neural networks are highly expressive machine learning models with the ability to interpolate arbitrary datasets. Deep nets are typically optimized via first-order methods and the optimization process crucially depends on the…

Machine Learning · Statistics 2019-11-12 Talha Cihad Gulcu

Deep learning demonstrated major abilities in solving many kinds of different real-world problems in computer vision literature. However, they are still strained by simple reasoning tasks that humans consider easy to solve. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2021-01-25 Nicola Messina , Giuseppe Amato , Fabio Carrara , Claudio Gennaro , Fabrizio Falchi

In deep learning, dense layer connectivity has become a key design principle in deep neural networks (DNNs), enabling efficient information flow and strong performance across a range of applications. In this work, we model densely connected…

Machine Learning · Computer Science 2025-10-03 Jinshu Huang , Haibin Su , Xue-Cheng Tai , Chunlin Wu

While deep learning is successful in a number of applications, it is not yet well understood theoretically. A satisfactory theoretical characterization of deep learning however, is beginning to emerge. It covers the following questions: 1)…

Machine Learning · Computer Science 2019-08-27 Tomaso Poggio , Andrzej Banburski , Qianli Liao

We introduce a general theoretical framework, designed for the study of gradient optimisation of deep neural networks, that encompasses ubiquitous architecture choices including batch normalisation, weight normalisation and skip…

Machine Learning · Computer Science 2023-12-05 Lachlan Ewen MacDonald , Jack Valmadre , Hemanth Saratchandran , Simon Lucey

Augmenting neural networks with skip connections, as introduced in the so-called ResNet architecture, surprised the community by enabling the training of networks of more than 1,000 layers with significant performance gains. This paper…

Computer Vision and Pattern Recognition · Computer Science 2020-04-24 Alireza Zaeemzadeh , Nazanin Rahnavard , Mubarak Shah

Deep neural networks have a good success record and are thus viewed as the best architecture choice for complex applications. Their main shortcoming has been, for a long time, the vanishing gradient which prevented the numerical…

Machine Learning · Computer Science 2024-05-02 Bernhard Bermeitinger , Tomas Hrycej , Siegfried Handschuh

In comparison to classical shallow representation learning techniques, deep neural networks have achieved superior performance in nearly every application benchmark. But despite their clear empirical advantages, it is still not well…

Machine Learning · Computer Science 2022-01-11 Calvin Murdock , George Cazenavette , Simon Lucey

Deep convolutional neural networks, assisted by architectural design strategies, make extensive use of data augmentation techniques and layers with a high number of feature maps to embed object transformations. That is highly inefficient…

Computer Vision and Pattern Recognition · Computer Science 2021-12-21 Vittorio Mazzia , Francesco Salvetti , Marcello Chiaberge

We propose to impose symmetry in neural network parameters to improve parameter usage and make use of dedicated convolution and matrix multiplication routines. Due to significant reduction in the number of parameters as a result of the…

Machine Learning · Computer Science 2019-01-11 Xu Shell Hu , Sergey Zagoruyko , Nikos Komodakis

Various powerful deep neural network architectures have made great contribution to the exciting successes of deep learning in the past two decades. Among them, deep Residual Networks (ResNets) are of particular importance because they…

Machine Learning · Computer Science 2022-05-16 Wentao Huang , Haizhang Zhang

Convolution Neural Networks, known as ConvNets exceptionally perform well in many complex machine learning tasks. The architecture of ConvNets demands the huge and rich amount of data and involves with a vast number of parameters that leads…

Computer Vision and Pattern Recognition · Computer Science 2017-12-14 Pushparaja Murugan , Shanmugasundaram Durairaj

Deep convolutional neural networks (DCNNs) have shown remarkable performance in image classification tasks in recent years. Generally, deep neural network architectures are stacks consisting of a large number of convolutional layers, and…

Computer Vision and Pattern Recognition · Computer Science 2017-09-07 Dongyoon Han , Jiwhan Kim , Junmo Kim
‹ Prev 1 2 3 10 Next ›