Related papers: DyRep: Bootstrapping Training with Dynamic Re-para…

Efficient Re-parameterization Operations Search for Easy-to-Deploy Network Based on Directional Evolutionary Strategy

Structural re-parameterization (Rep) methods has achieved significant performance improvement on traditional convolutional network. Most current Rep methods rely on prior knowledge to select the reparameterization operations. However, the…

Artificial Intelligence · Computer Science 2022-07-05 Xinyi Yu , Xiaowei Wang , Jintao Rong , Mingyang Zhang , Linlin Ou

RepSR: Training Efficient VGG-style Super-Resolution Networks with Structural Re-Parameterization and Batch Normalization

This paper explores training efficient VGG-style super-resolution (SR) networks with the structural re-parameterization technique. The general pipeline of re-parameterization is to train networks with multi-branch topology first, and then…

Computer Vision and Pattern Recognition · Computer Science 2022-05-12 Xintao Wang , Chao Dong , Ying Shan

DL-Reg: A Deep Learning Regularization Technique using Linear Regression

Regularization plays a vital role in the context of deep learning by preventing deep neural networks from the danger of overfitting. This paper proposes a novel deep learning regularization method named as DL-Reg, which carefully reduces…

Machine Learning · Computer Science 2020-11-05 Maryam Dialameh , Ali Hamzeh , Hossein Rahmani

ASR: Attention-alike Structural Re-parameterization

The structural re-parameterization (SRP) technique is a novel deep learning technique that achieves interconversion between different network architectures through equivalent parameter transformations. This technique enables the mitigation…

Computer Vision and Pattern Recognition · Computer Science 2024-08-08 Shanshan Zhong , Zhongzhan Huang , Wushao Wen , Jinghui Qin , Liang Lin

Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training

In this paper, we introduce a new perspective on training deep neural networks capable of state-of-the-art performance without the need for the expensive over-parameterization by proposing the concept of In-Time Over-Parameterization (ITOP)…

Machine Learning · Computer Science 2021-06-16 Shiwei Liu , Lu Yin , Decebal Constantin Mocanu , Mykola Pechenizkiy

Online Convolutional Re-parameterization

Structural re-parameterization has drawn increasing attention in various computer vision tasks. It aims at improving the performance of deep models without introducing any inference-time cost. Though efficient during inference, such models…

Computer Vision and Pattern Recognition · Computer Science 2022-04-05 Mu Hu , Junyi Feng , Jiashen Hua , Baisheng Lai , Jianqiang Huang , Xiaojin Gong , Xiansheng Hua

Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning

We identify an implicit under-parameterization phenomenon in value-based deep RL methods that use bootstrapping: when value functions, approximated using deep neural networks, are trained with gradient descent using iterated regression onto…

Machine Learning · Computer Science 2021-10-26 Aviral Kumar , Rishabh Agarwal , Dibya Ghosh , Sergey Levine

CoopSubNet: Cooperating Subnetwork for Data-Driven Regularization of Deep Networks under Limited Training Budgets

Deep networks are an integral part of the current machine learning paradigm. Their inherent ability to learn complex functional mappings between data and various target variables, while discovering hidden, task-driven features, makes them a…

Computer Vision and Pattern Recognition · Computer Science 2019-06-14 Riddhish Bhalodia , Shireen Elhabian , Ladislav Kavan , Ross Whitaker

Dyn-Adapter: Towards Disentangled Representation for Efficient Visual Recognition

Parameter-efficient transfer learning (PETL) is a promising task, aiming to adapt the large-scale pre-trained model to downstream tasks with a relatively modest cost. However, current PETL methods struggle in compressing computational…

Computer Vision and Pattern Recognition · Computer Science 2024-07-24 Yurong Zhang , Honghao Chen , Xinyu Zhang , Xiangxiang Chu , Li Song

RepQ: Generalizing Quantization-Aware Training for Re-Parametrized Architectures

Existing neural networks are memory-consuming and computationally intensive, making deploying them challenging in resource-constrained environments. However, there are various methods to improve their efficiency. Two such methods are…

Machine Learning · Computer Science 2023-11-10 Anastasiia Prutianova , Alexey Zaytsev , Chung-Kuei Lee , Fengyu Sun , Ivan Koryakovskiy

Parameter Efficient Training of Deep Convolutional Neural Networks by Dynamic Sparse Reparameterization

Modern deep neural networks are typically highly overparameterized. Pruning techniques are able to remove a significant fraction of network parameters with little loss in accuracy. Recently, techniques based on dynamic reallocation of…

Machine Learning · Computer Science 2019-05-14 Hesham Mostafa , Xin Wang

ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting

We propose ResRep, a novel method for lossless channel pruning (a.k.a. filter pruning), which slims down a CNN by reducing the width (number of output channels) of convolutional layers. Inspired by the neurobiology research about the…

Machine Learning · Computer Science 2021-08-17 Xiaohan Ding , Tianxiang Hao , Jianchao Tan , Ji Liu , Jungong Han , Yuchen Guo , Guiguang Ding

Representation Learning over Dynamic Graphs

How can we effectively encode evolving information over dynamic graphs into low-dimensional representations? In this paper, we propose DyRep, an inductive deep representation learning framework that learns a set of functions to efficiently…

Machine Learning · Computer Science 2018-03-20 Rakshit Trivedi , Mehrdad Farajtabar , Prasenjeet Biswal , Hongyuan Zha

Rapid Structural Pruning of Neural Networks with Set-based Task-Adaptive Meta-Pruning

As deep neural networks are growing in size and being increasingly deployed to more resource-limited devices, there has been a recent surge of interest in network pruning methods, which aim to remove less important weights or activations of…

Machine Learning · Computer Science 2020-06-23 Minyoung Song , Jaehong Yoon , Eunho Yang , Sung Ju Hwang

Self-supervised Graph Representation Learning via Bootstrapping

Graph neural networks~(GNNs) apply deep learning techniques to graph-structured data and have achieved promising performance in graph representation learning. However, existing GNNs rely heavily on enough labels or well-designed negative…

Machine Learning · Computer Science 2020-11-13 Feihu Che , Guohua Yang , Dawei Zhang , Jianhua Tao , Pengpeng Shao , Tong Liu

Replacement Learning: Training Neural Networks with Fewer Parameters

End-to-end training with full-depth backpropagation remains the dominant paradigm for optimizing deep neural networks, but its efficiency deteriorates as models grow deeper. Since every block must be executed and differentiated under a…

Computer Vision and Pattern Recognition · Computer Science 2026-05-20 Yuming Zhang , Peizhe Wang , Tianyang Han , Hengyu Shi , Junhao Su , Dongzhi Guan , Jiabin Liu , Jiaji Wang

CS-Rep: Making Speaker Verification Networks Embracing Re-parameterization

Automatic speaker verification (ASV) systems, which determine whether two speeches are from the same speaker, mainly focus on verification accuracy while ignoring inference speed. However, in real applications, both inference speed and…

Sound · Computer Science 2022-04-05 Ruiteng Zhang , Jianguo Wei , Wenhuan Lu , Lin Zhang , Yantao Ji , Junhai Xu , Xugang Lu

A Simple Non-i.i.d. Sampling Approach for Efficient Training and Better Generalization

While training on samples drawn from independent and identical distribution has been a de facto paradigm for optimizing image classification networks, humans learn new concepts in an easy-to-hard manner and on the selected examples…

Computer Vision and Pattern Recognition · Computer Science 2020-10-16 Bowen Cheng , Yunchao Wei , Jiahui Yu , Shiyu Chang , Jinjun Xiong , Wen-Mei Hwu , Thomas S. Huang , Humphrey Shi

Dynamic Sparse Training of Diagonally Sparse Networks

Recent advances in Dynamic Sparse Training (DST) have pushed the frontier of sparse neural network training in structured and unstructured contexts, matching dense-model performance while drastically reducing parameter counts to facilitate…

Machine Learning · Computer Science 2025-06-16 Abhishek Tyagi , Arjun Iyer , William H Renninger , Christopher Kanan , Yuhao Zhu

RealRep: Generalized SDR-to-HDR Conversion via Attribute-Disentangled Representation Learning

High-Dynamic-Range Wide-Color-Gamut (HDR-WCG) technology is becoming increasingly widespread, driving a growing need for converting Standard Dynamic Range (SDR) content to HDR. Existing methods primarily rely on fixed tone mapping…

Computer Vision and Pattern Recognition · Computer Science 2026-05-18 Li Xu , Siqi Wang , Kepeng Xu , Gang He , Lin Zhang , Weiran Wang , Yu-Wing Tai