Related papers: Architecture Compression

A Novel Architecture Slimming Method for Network Pruning and Knowledge Distillation

Network pruning and knowledge distillation are two widely-known model compression methods that efficiently reduce computation cost and model size. A common problem in both pruning and distillation is to determine compressed architecture,…

Computer Vision and Pattern Recognition · Computer Science 2022-02-23 Dongqi Wang , Shengyu Zhang , Zhipeng Di , Xin Lin , Weihua Zhou , Fei Wu

Neural Architecture Optimization

Automatic neural architecture design has shown its potential in discovering powerful neural network architectures. Existing methods, no matter based on reinforcement learning or evolutionary algorithms (EA), conduct architecture search in a…

Machine Learning · Computer Science 2019-09-05 Renqian Luo , Fei Tian , Tao Qin , Enhong Chen , Tie-Yan Liu

A Low Effort Approach to Structured CNN Design Using PCA

Deep learning models hold state of the art performance in many fields, yet their design is still based on heuristics or grid search methods that often result in overparametrized networks. This work proposes a method to analyze a trained…

Computer Vision and Pattern Recognition · Computer Science 2020-01-13 Isha Garg , Priyadarshini Panda , Kaushik Roy

Simultaneous Weight and Architecture Optimization for Neural Networks

Neural networks are trained by choosing an architecture and training the parameters. The choice of architecture is often by trial and error or with Neural Architecture Search (NAS) methods. While NAS provides some automation, it often…

Machine Learning · Computer Science 2024-10-14 Zitong Huang , Mansooreh Montazerin , Ajitesh Srivastava

Data-Driven Compression of Convolutional Neural Networks

Deploying trained convolutional neural networks (CNNs) to mobile devices is a challenging task because of the simultaneous requirements of the deployed model to be fast, lightweight and accurate. Designing and training a CNN architecture…

Machine Learning · Computer Science 2019-12-02 Ramit Pahwa , Manoj Ghuhan Arivazhagan , Ankur Garg , Siddarth Krishnamoorthy , Rohit Saxena , Sunav Choudhary

Learnable Embedding Space for Efficient Neural Architecture Compression

We propose a method to incrementally learn an embedding space over the domain of network architectures, to enable the careful selection of architectures for evaluation during compressed architecture search. Given a teacher network, we…

Computer Vision and Pattern Recognition · Computer Science 2019-04-26 Shengcao Cao , Xiaofang Wang , Kris M. Kitani

Toward Compact Parameter Representations for Architecture-Agnostic Neural Network Compression

This paper investigates deep neural network (DNN) compression from the perspective of compactly representing and storing trained parameters. We explore the previously overlooked opportunity of cross-layer architecture-agnostic…

Computer Vision and Pattern Recognition · Computer Science 2021-11-22 Yuezhou Sun , Wenlong Zhao , Lijun Zhang , Xiao Liu , Hui Guan , Matei Zaharia

Efficient Architecture Search by Network Transformation

Techniques for automatically designing deep neural network architectures such as reinforcement learning based approaches have recently shown promising results. However, their success is based on vast computational resources (e.g. hundreds…

Machine Learning · Computer Science 2017-11-22 Han Cai , Tianyao Chen , Weinan Zhang , Yong Yu , Jun Wang

DECORE: Deep Compression with Reinforcement Learning

Deep learning has become an increasingly popular and powerful methodology for modern pattern recognition systems. However, many deep neural networks have millions or billions of parameters, making them untenable for real-world applications…

Machine Learning · Computer Science 2022-02-14 Manoj Alwani , Yang Wang , Vashisht Madhavan

Efficient training for compact compression models via sequential distillation

Deep learning models for image compression often face practical limitations in hardware-constrained applications. Although these models achieve high-quality reconstructions, they are typically complex, heavyweight, and require substantial…

Computer Vision and Pattern Recognition · Computer Science 2026-05-21 Caroline Mazini Rodrigues , Nicolas Keriven , Thomas Maugey

Model Preserving Compression for Neural Networks

After training complex deep learning models, a common task is to compress the model to reduce compute and storage demands. When compressing, it is desirable to preserve the original model's per-example decisions (e.g., to go beyond top-1…

Machine Learning · Computer Science 2022-10-18 Jerry Chee , Megan Renz , Anil Damle , Christopher De Sa

Image Complexity Guided Network Compression for Biomedical Image Segmentation

Compression is a standard procedure for making convolutional neural networks (CNNs) adhere to some specific computing resource constraints. However, searching for a compressed architecture typically involves a series of time-consuming…

Image and Video Processing · Electrical Eng. & Systems 2021-07-08 Suraj Mishra , Danny Z. Chen , X. Sharon Hu

End-to-end Compression Towards Machine Vision: Network Architecture Design and Optimization

The visual signal compression is a long-standing problem. Fueled by the recent advances of deep learning, exciting progress has been made. Despite better compression performance, existing end-to-end compression algorithms are still designed…

Computer Vision and Pattern Recognition · Computer Science 2021-11-22 Shurun Wang , Zhao Wang , Shiqi Wang , Yan Ye

Architected Materials for Mechanical Compression: Design via Simulation, Deep Learning, and Experimentation

Architected materials can achieve enhanced properties compared to their plain counterparts. Specific architecting serves as a powerful design lever to achieve targeted behavior without changing the base material. Thus, the connection…

Materials Science · Physics 2023-02-14 Andrew J. Lew , Kai Jin , Markus J. Buehler

Compression of Deep Neural Networks on the Fly

Thanks to their state-of-the-art performance, deep neural networks are increasingly used for object recognition. To achieve these results, they use millions of parameters to be trained. However, when targeting embedded applications the size…

Machine Learning · Computer Science 2016-03-21 Guillaume Soulié , Vincent Gripon , Maëlys Robert

DARC: Differentiable ARchitecture Compression

In many learning situations, resources at inference time are significantly more constrained than resources at training time. This paper studies a general paradigm, called Differentiable ARchitecture Compression (DARC), that combines model…

Machine Learning · Computer Science 2019-05-21 Shashank Singh , Ashish Khetan , Zohar Karnin

Computer Vision Model Compression Techniques for Embedded Systems: A Survey

Deep neural networks have consistently represented the state of the art in most computer vision problems. In these scenarios, larger and more complex models have demonstrated superior performance to smaller architectures, especially when…

Computer Vision and Pattern Recognition · Computer Science 2024-08-16 Alexandre Lopes , Fernando Pereira dos Santos , Diulhio de Oliveira , Mauricio Schiezaro , Helio Pedrini

Content Adaptive Optimization for Neural Image Compression

The field of neural image compression has witnessed exciting progress as recently proposed architectures already surpass the established transform coding based approaches. While, so far, research has mainly focused on architecture and model…

Computer Vision and Pattern Recognition · Computer Science 2019-06-06 Joaquim Campos , Simon Meierhans , Abdelaziz Djelouah , Christopher Schroers

A Structurally Regularized CNN Architecture via Adaptive Subband Decomposition

We propose a generalized convolutional neural network (CNN) architecture that first decomposes the input signal into subbands by an adaptive filter bank structure, and then uses convolutional layers to extract features from each subband…

Image and Video Processing · Electrical Eng. & Systems 2023-06-30 Pavel Sinha , Ioannis Psaromiligkos , Zeljko Zilic

Channel Compression: Rethinking Information Redundancy among Channels in CNN Architecture

Model compression and acceleration are attracting increasing attentions due to the demand for embedded devices and mobile applications. Research on efficient convolutional neural networks (CNNs) aims at removing feature redundancy by…

Machine Learning · Computer Science 2020-08-21 Jinhua Liang , Tao Zhang , Guoqing Feng