Related papers: Online Convolutional Re-parameterization

RePr: Improved Training of Convolutional Filters

A well-trained Convolutional Neural Network can easily be pruned without significant loss of performance. This is because of unnecessary overlap in the features captured by the network's filters. Innovations in network architecture such as…

Computer Vision and Pattern Recognition · Computer Science 2019-02-26 Aaditya Prakash , James Storer , Dinei Florencio , Cha Zhang

Deep Clustered Convolutional Kernels

Deep neural networks have recently achieved state of the art performance thanks to new training algorithms for rapid parameter estimation and new regularization methods to reduce overfitting. However, in practice the network architecture…

Machine Learning · Computer Science 2016-03-04 Minyoung Kim , Luca Rigazio

Boosting Pruned Networks with Linear Over-parameterization

Structured pruning compresses neural networks by reducing channels (filters) for fast inference and low footprint at run-time. To restore accuracy after pruning, fine-tuning is usually applied to pruned networks. However, too few remaining…

Computer Vision and Pattern Recognition · Computer Science 2024-01-01 Yu Qian , Jian Cao , Xiaoshuang Li , Jie Zhang , Hufei Li , Jue Chen

Efficient Re-parameterization Operations Search for Easy-to-Deploy Network Based on Directional Evolutionary Strategy

Structural re-parameterization (Rep) methods has achieved significant performance improvement on traditional convolutional network. Most current Rep methods rely on prior knowledge to select the reparameterization operations. However, the…

Artificial Intelligence · Computer Science 2022-07-05 Xinyi Yu , Xiaowei Wang , Jintao Rong , Mingyang Zhang , Linlin Ou

Boosting Neural Video Representation via Online Structural Reparameterization

Neural Video Representation~(NVR) is a promising paradigm for video compression, showing great potential in improving video storage and transmission efficiency. While recent advances have made efforts in architectural refinements to improve…

Image and Video Processing · Electrical Eng. & Systems 2025-11-17 Ziyi Li , Qingyu Mao , Shuai Liu , Qilei Li , Fanyang Meng , Yongsheng Liang

Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation

We present techniques for speeding up the test-time evaluation of large convolutional networks, designed for object recognition tasks. These models deliver impressive accuracy but each image evaluation requires millions of floating point…

Computer Vision and Pattern Recognition · Computer Science 2024-03-15 Remi Denton , Wojciech Zaremba , Joan Bruna , Yann LeCun , Rob Fergus

Efficient Hyperparameter Search for Non-Stationary Model Training

Online learning is the cornerstone of applications like recommendation and advertising systems, where models continuously adapt to shifting data distributions. Model training for such systems is remarkably expensive, a cost that multiplies…

Machine Learning · Computer Science 2025-12-02 Berivan Isik , Matthew Fahrbach , Dima Kuzmin , Nicolas Mayoraz , Emil Praun , Steffen Rendle , Raghavendra Vasudeva

Online hyperparameter optimization by real-time recurrent learning

Conventional hyperparameter optimization methods are computationally intensive and hard to generalize to scenarios that require dynamically adapting hyperparameters, such as life-long learning. Here, we propose an online hyperparameter…

Machine Learning · Computer Science 2021-04-09 Daniel Jiwoong Im , Cristina Savin , Kyunghyun Cho

COLORA: Efficient Fine-Tuning for Convolutional Models with a Study Case on Optical Coherence Tomography Image Classification

We introduce CoLoRA (Convolutional Low-Rank Adaptation), a parameter-efficient fine-tuning method for convolutional neural networks (CNNs). CoLoRA extends LoRA to convolutional layers by decomposing kernel updates into lightweight depthwise…

Computer Vision and Pattern Recognition · Computer Science 2025-10-22 Mariano Rivera , Angello Hoyos

Low-Cost Parameterizations of Deep Convolutional Neural Networks

Convolutional Neural Networks (CNNs) filter the input data using a series of spatial convolution operators with compactly supported stencils and point-wise nonlinearities. Commonly, the convolution operators couple features from all…

Numerical Analysis · Computer Science 2018-10-04 Eran Treister , Lars Ruthotto , Michal Sharoni , Sapir Zafrani , Eldad Haber

Orthogonal Convolutional Neural Networks

Deep convolutional neural networks are hindered by training instability and feature redundancy towards further performance improvement. A promising solution is to impose orthogonality on convolutional filters. We develop an efficient…

Computer Vision and Pattern Recognition · Computer Science 2020-04-09 Jiayun Wang , Yubei Chen , Rudrasis Chakraborty , Stella X. Yu

Learning Efficient Convolutional Networks through Irregular Convolutional Kernels

As deep neural networks are increasingly used in applications suited for low-power devices, a fundamental dilemma becomes apparent: the trend is to grow models to absorb increasing data that gives rise to memory intensive; however low-power…

Computer Vision and Pattern Recognition · Computer Science 2023-02-21 Weiyu Guo , Jiabin Ma , Liang Wang , Yongzhen Huang

DyRep: Bootstrapping Training with Dynamic Re-parameterization

Structural re-parameterization (Rep) methods achieve noticeable improvements on simple VGG-style networks. Despite the prevalence, current Rep methods simply re-parameterize all operations into an augmented network, including those that…

Computer Vision and Pattern Recognition · Computer Science 2022-03-25 Tao Huang , Shan You , Bohan Zhang , Yuxuan Du , Fei Wang , Chen Qian , Chang Xu

What is the Best Feature Learning Procedure in Hierarchical Recognition Architectures?

(This paper was written in November 2011 and never published. It is posted on arXiv.org in its original form in June 2016). Many recent object recognition systems have proposed using a two phase training procedure to learn sparse…

Computer Vision and Pattern Recognition · Computer Science 2016-06-07 Kevin Jarrett , Koray Kvukcuoglu , Karol Gregor , Yann LeCun

ResNet Structure Simplification with the Convolutional Kernel Redundancy Measure

Deep learning, especially convolutional neural networks, has triggered accelerated advancements in computer vision, bringing changes into our daily practice. Furthermore, the standardized deep learning modules (also known as backbone…

Computer Vision and Pattern Recognition · Computer Science 2022-12-02 Hongzhi Zhu , Robert Rohling , Septimiu Salcudean

Rethinking the Inception Architecture for Computer Vision

Convolutional networks are at the core of most state-of-the-art computer vision solutions for a wide variety of tasks. Since 2014 very deep convolutional networks started to become mainstream, yielding substantial gains in various…

Computer Vision and Pattern Recognition · Computer Science 2015-12-14 Christian Szegedy , Vincent Vanhoucke , Sergey Ioffe , Jonathon Shlens , Zbigniew Wojna

Optimizing Convolutional Neural Network Architecture

Convolutional Neural Networks (CNN) are widely used to face challenging tasks like speech recognition, natural language processing or computer vision. As CNN architectures get larger and more complex, their computational requirements…

Computer Vision and Pattern Recognition · Computer Science 2024-10-01 Luis Balderas , Miguel Lastra , José M. Benítez

An Energy-Efficient Edge Computing Paradigm for Convolution-based Image Upsampling

A novel energy-efficient edge computing paradigm is proposed for real-time deep learning-based image upsampling applications. State-of-the-art deep learning solutions for image upsampling are currently trained using either resize or…

Computer Vision and Pattern Recognition · Computer Science 2021-07-27 Ian Colbert , Ken Kreutz-Delgado , Srinjoy Das

PruneTrain: Fast Neural Network Training by Dynamic Sparse Model Reconfiguration

State-of-the-art convolutional neural networks (CNNs) used in vision applications have large models with numerous weights. Training these models is very compute- and memory-resource intensive. Much research has been done on pruning or…

Machine Learning · Computer Science 2019-12-10 Sangkug Lym , Esha Choukse , Siavash Zangeneh , Wei Wen , Sujay Sanghavi , Mattan Erez

A One-step Pruning-recovery Framework for Acceleration of Convolutional Neural Networks

Acceleration of convolutional neural network has received increasing attention during the past several years. Among various acceleration techniques, filter pruning has its inherent merit by effectively reducing the number of convolution…

Computer Vision and Pattern Recognition · Computer Science 2019-06-19 Dong Wang , Lei Zhou , Xiao Bai , Jun Zhou