Related papers: Hyper-Connections

Frac-Connections: Fractional Extension of Hyper-Connections

Residual connections are central to modern deep learning architectures, enabling the training of very deep networks by mitigating gradient vanishing. Hyper-Connections recently generalized residual connections by introducing multiple…

Machine Learning · Computer Science 2025-03-19 Defa Zhu , Hongzhi Huang , Jundong Zhou , Zihao Huang , Yutao Zeng , Banggu Wu , Qiyang Min , Xun Zhou

Exploiting Nontrivial Connectivity for Automatic Speech Recognition

Nontrivial connectivity has allowed the training of very deep networks by addressing the problem of vanishing gradients and offering a more efficient method of reusing parameters. In this paper we make a comparison between residual…

Sound · Computer Science 2017-11-29 Marius Paraschiv , Lasse Borgholt , Tycho Max Sylvester Tax , Marco Singh , Lars Maaløe

Gradient Residual Connections

Existing work has linked properties of a function's gradient to the difficulty of function approximation. Motivated by these insights, we study how gradient information can be leveraged to improve neural network's ability to approximate…

Machine Learning · Computer Science 2026-02-11 Yangchen Pan , Qizhen Ying , Philip Torr , Bo Liu

ResidualDroppath: Enhancing Feature Reuse over Residual Connections

Residual connections are one of the most important components in neural network architectures for mitigating the vanishing gradient problem and facilitating the training of much deeper networks. One possible explanation for how residual…

Machine Learning · Computer Science 2024-11-15 Sejik Park

3D CNN with Localized Residual Connections for Hyperspectral Image Classification

In this paper we propose a novel 3D CNN network with localized residual connections for hyperspectral image classification. Our work chalks a comparative study with the existing methods employed for abstracting deeper features and propose a…

Computer Vision and Pattern Recognition · Computer Science 2019-12-09 Shivangi Dwivedi , Murari Mandal , Shekhar Yadav , Santosh Kumar Vipparthi

ResNets Are Deeper Than You Think

Residual connections remain ubiquitous in modern neural network architectures nearly a decade after their introduction. Their widespread adoption is often credited to their dramatically improved trainability: residual networks train faster,…

Machine Learning · Computer Science 2025-06-18 Christian H. X. Ali Mehmeti-Göpel , Michael Wand

IamNN: Iterative and Adaptive Mobile Neural Network for Efficient Image Classification

Deep residual networks (ResNets) made a recent breakthrough in deep learning. The core idea of ResNets is to have shortcut connections between layers that allow the network to be much deeper while still being easy to optimize avoiding…

Computer Vision and Pattern Recognition · Computer Science 2018-04-30 Sam Leroux , Pavlo Molchanov , Pieter Simoens , Bart Dhoedt , Thomas Breuel , Jan Kautz

Study of Residual Networks for Image Recognition

Deep neural networks demonstrate to have a high performance on image classification tasks while being more difficult to train. Due to the complexity and vanishing gradient problem, it normally takes a lot of time and more computational…

Computer Vision and Pattern Recognition · Computer Science 2018-05-02 Mohammad Sadegh Ebrahimi , Hossein Karkeh Abadi

Residual Connections Encourage Iterative Inference

Residual networks (Resnets) have become a prominent architecture in deep learning. However, a comprehensive understanding of Resnets is still a topic of ongoing research. A recent view argues that Resnets perform iterative refinement of…

Computer Vision and Pattern Recognition · Computer Science 2018-03-09 Stanisław Jastrzębski , Devansh Arpit , Nicolas Ballas , Vikas Verma , Tong Che , Yoshua Bengio

A Layer-Wise Information Reinforcement Approach to Improve Learning in Deep Belief Networks

With the advent of deep learning, the number of works proposing new methods or improving existent ones has grown exponentially in the last years. In this scenario, "very deep" models were emerging, once they were expected to extract more…

Artificial Intelligence · Computer Science 2021-01-19 Mateus Roder , Leandro A. Passos , Luiz Carlos Felix Ribeiro , Clayton Pereira , João Paulo Papa

Fading memory as inductive bias in residual recurrent networks

Residual connections have been proposed as an architecture-based inductive bias to mitigate the problem of exploding and vanishing gradients and increased task performance in both feed-forward and recurrent networks (RNNs) when trained with…

Machine Learning · Computer Science 2024-01-04 Igor Dubinin , Felix Effenberger

Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network

In recent years, deep learning methods have been successfully applied to single-image super-resolution tasks. Despite their great performances, deep learning methods cannot be easily applied to real-world applications due to the requirement…

Computer Vision and Pattern Recognition · Computer Science 2018-10-08 Namhyuk Ahn , Byungkon Kang , Kyung-Ah Sohn

Algebraic Representations for Faster Predictions in Convolutional Neural Networks

Convolutional neural networks (CNNs) are a popular choice of model for tasks in computer vision. When CNNs are made with many layers, resulting in a deep neural network, skip connections may be added to create an easier gradient…

Computer Vision and Pattern Recognition · Computer Science 2024-08-16 Johnny Joyce , Jan Verschelde

Residual Connections Harm Generative Representation Learning

We show that introducing a weighting factor to reduce the influence of identity shortcuts in residual networks significantly enhances semantic feature learning in generative representation learning frameworks, such as masked autoencoders…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Xiao Zhang , Ruoxi Jiang , William Gao , Rebecca Willett , Michael Maire

HyperNetworks

This work explores hypernetworks: an approach of using a one network, also known as a hypernetwork, to generate the weights for another network. Hypernetworks provide an abstraction that is similar to what is found in nature: the…

Machine Learning · Computer Science 2016-12-02 David Ha , Andrew Dai , Quoc V. Le

ResiDual: Transformer with Dual Residual Connections

Transformer networks have become the preferred architecture for many tasks due to their state-of-the-art performance. However, the optimal way to implement residual connections in Transformer, which are essential for effective training, is…

Computation and Language · Computer Science 2023-05-01 Shufang Xie , Huishuai Zhang , Junliang Guo , Xu Tan , Jiang Bian , Hany Hassan Awadalla , Arul Menezes , Tao Qin , Rui Yan

ResNet: Enabling Deep Convolutional Neural Networks through Residual Learning

Convolutional Neural Networks (CNNs) has revolutionized computer vision, but training very deep networks has been challenging due to the vanishing gradient problem. This paper explores Residual Networks (ResNet), introduced by He et al.…

Computer Vision and Pattern Recognition · Computer Science 2025-10-29 Xingyu Liu , Kun Ming Goh

Hierarchical Residual Attention Network for Single Image Super-Resolution

Convolutional neural networks are the most successful models in single image super-resolution. Deeper networks, residual connections, and attention mechanisms have further improved their performance. However, these strategies often improve…

Image and Video Processing · Electrical Eng. & Systems 2020-12-09 Parichehr Behjati , Pau Rodriguez , Armin Mehri , Isabelle Hupont , Carles Fernández Tena , Jordi Gonzalez

Residual Enhanced Multi-Hypergraph Neural Network

Hypergraphs are a generalized data structure of graphs to model higher-order correlations among entities, which have been successfully adopted into various research domains. Meanwhile, HyperGraph Neural Network (HGNN) is currently the…

Computer Vision and Pattern Recognition · Computer Science 2021-05-04 Jing Huang , Xiaolin Huang , Jie Yang

SIRe-Networks: Convolutional Neural Networks Architectural Extension for Information Preservation via Skip/Residual Connections and Interlaced Auto-Encoders

Improving existing neural network architectures can involve several design choices such as manipulating the loss functions, employing a diverse learning strategy, exploiting gradient evolution at training time, optimizing the network…

Computer Vision and Pattern Recognition · Computer Science 2022-10-27 Danilo Avola , Luigi Cinque , Alessio Fagioli , Gian Luca Foresti