English
Related papers

Related papers: Optimization Algorithm Inspired Deep Neural Networ…

200 papers

In comparison to classical shallow representation learning techniques, deep neural networks have achieved superior performance in nearly every application benchmark. But despite their clear empirical advantages, it is still not well…

Machine Learning · Computer Science 2022-01-11 Calvin Murdock , George Cazenavette , Simon Lucey

The effectiveness of shortcut/skip-connection has been widely verified, which inspires massive explorations on neural architecture design. This work attempts to find an effective way to design new network architectures. It is discovered…

Machine Learning · Computer Science 2021-08-20 Yilin Liao , Hao Wang , Zhaoran Liu , Haozhe Li , Xinggao Liu

A residual network (or ResNet) is a standard deep neural net architecture, with state-of-the-art performance across numerous applications. The main premise of ResNets is that they allow the training of each layer to focus on fitting just…

Machine Learning · Computer Science 2018-09-28 Ohad Shamir

We develop new theoretical results on matrix perturbation to shed light on the impact of architecture on the performance of a deep network. In particular, we explain analytically what deep learning practitioners have long observed…

Machine Learning · Computer Science 2022-12-06 Rudolf H. Riedi , Randall Balestriero , Richard G. Baraniuk

Neural networks are trained by choosing an architecture and training the parameters. The choice of architecture is often by trial and error or with Neural Architecture Search (NAS) methods. While NAS provides some automation, it often…

Machine Learning · Computer Science 2024-10-14 Zitong Huang , Mansooreh Montazerin , Ajitesh Srivastava

Deep neural network has been ensured as a key technology in the field of many challenging and vigorously researched computer vision tasks. Furthermore, classical ResNet is thought to be a state-of-the-art convolutional neural network (CNN)…

Computer Vision and Pattern Recognition · Computer Science 2024-09-02 Prathibha Varghese , G. Arockia Selva Saroja

Neural network optimization remains one of the most consequential yet poorly understood challenges in modern AI research, where improvements in training algorithms can lead to enhanced feature learning in foundation models,…

Machine Learning · Computer Science 2025-12-23 Ansh Nagwekar

This work attempts to interpret modern deep (convolutional) networks from the principles of rate reduction and (shift) invariant classification. We show that the basic iterative gradient ascent scheme for optimizing the rate reduction of…

Machine Learning · Computer Science 2020-10-30 Kwan Ho Ryan Chan , Yaodong Yu , Chong You , Haozhi Qi , John Wright , Yi Ma

Depth separation results propose a possible theoretical explanation for the benefits of deep neural networks over shallower architectures, establishing that the former possess superior approximation capabilities. However, there are no known…

Machine Learning · Computer Science 2023-02-03 Itay Safran , Jason D. Lee

Deep neural networks demonstrate to have a high performance on image classification tasks while being more difficult to train. Due to the complexity and vanishing gradient problem, it normally takes a lot of time and more computational…

Computer Vision and Pattern Recognition · Computer Science 2018-05-02 Mohammad Sadegh Ebrahimi , Hossein Karkeh Abadi

Optimizing a neural network's performance is a tedious and time taking process, this iterative process does not have any defined solution which can work for all the problems. Optimization can be roughly categorized into - Architecture and…

Machine Learning · Computer Science 2019-12-16 Siddhartha Dhar Choudhury , Shashank Pandey , Kunal Mehrotra

Augmenting neural networks with skip connections, as introduced in the so-called ResNet architecture, surprised the community by enabling the training of networks of more than 1,000 layers with significant performance gains. This paper…

Computer Vision and Pattern Recognition · Computer Science 2020-04-24 Alireza Zaeemzadeh , Nazanin Rahnavard , Mubarak Shah

In this effort, we propose a new deep architecture utilizing residual blocks inspired by implicit discretization schemes. As opposed to the standard feed-forward networks, the outputs of the proposed implicit residual blocks are defined as…

Machine Learning · Computer Science 2021-02-23 Viktor Reshniak , Clayton Webster

Various deep neural network architectures (DNNs) maintain massive vital records in computer vision. While drawing attention worldwide, the design of the overall structure lacks general guidance. Based on the relationship between DNN design…

Computer Vision and Pattern Recognition · Computer Science 2021-07-29 Zhengbo Luo , Zitang Sun , Weilian Zhou , Zizhang Wu , Sei-ichiro Kamata

Designing a high-efficiency and high-quality expressive network architecture has always been the most important research topic in the field of deep learning. Most of today's network design strategies focus on how to integrate features…

Computer Vision and Pattern Recognition · Computer Science 2022-11-10 Chien-Yao Wang , Hong-Yuan Mark Liao , I-Hau Yeh

Major winning Convolutional Neural Networks (CNNs), such as VGGNet, ResNet, DenseNet, \etc, include tens to hundreds of millions of parameters, which impose considerable computation and memory overheads. This limits their practical usage in…

Computer Vision and Pattern Recognition · Computer Science 2018-02-20 Seyyed Hossein Hasanpour , Mohammad Rouhani , Mohsen Fayyaz , Mohammad Sabokrou , Ehsan Adeli

Techniques for automatically designing deep neural network architectures such as reinforcement learning based approaches have recently shown promising results. However, their success is based on vast computational resources (e.g. hundreds…

Machine Learning · Computer Science 2017-11-22 Han Cai , Tianyao Chen , Weinan Zhang , Yong Yu , Jun Wang

Deep learning models have been widely used during the last decade due to their outstanding learning and abstraction capacities. However, one of the main challenges any scientist has to face using deep learning models is to establish the…

Machine Learning · Computer Science 2025-04-22 Luis Balderas , Miguel Lastra , José M. Benítez

The trend towards increasingly deep neural networks has been driven by a general observation that increasing depth increases the performance of a network. Recently, however, evidence has been amassing that simply increasing depth may not be…

Computer Vision and Pattern Recognition · Computer Science 2016-12-01 Zifeng Wu , Chunhua Shen , Anton van den Hengel

Past few years have witnessed exponential growth of interest in deep learning methodologies with rapidly improving accuracies and reduced computational complexity. In particular, architectures using Convolutional Neural Networks (CNNs) have…

Computer Vision and Pattern Recognition · Computer Science 2018-05-11 Sai Samarth R Phaye , Apoorva Sikka , Abhinav Dhall , Deepti Bathula
‹ Prev 1 2 3 10 Next ›