Related papers: Dynamic Shuffle: An Efficient Channel Mixture Meth…

ShuffleBlock: Shuffle to Regularize Deep Convolutional Neural Networks

Deep neural networks have enormous representational power which leads them to overfit on most datasets. Thus, regularizing them is important in order to reduce overfitting and enhance their generalization capabilities. Recently, channel…

Computer Vision and Pattern Recognition · Computer Science 2021-06-18 Sudhakar Kumawat , Gagan Kanojia , Shanmuganathan Raman

AutoShuffleNet: Learning Permutation Matrices via an Exact Lipschitz Continuous Penalty in Deep Convolutional Neural Networks

ShuffleNet is a state-of-the-art light weight convolutional neural network architecture. Its basic operations include group, channel-wise convolution and channel shuffling. However, channel shuffling is manually designed empirically.…

Machine Learning · Computer Science 2019-01-28 Jiancheng Lyu , Shuai Zhang , Yingyong Qi , Jack Xin

Revisiting Dynamic Convolution via Matrix Decomposition

Recent research in dynamic convolution shows substantial performance boost for efficient CNNs, due to the adaptive aggregation of K static convolution kernels. It has two limitations: (a) it increases the number of convolutional weights by…

Computer Vision and Pattern Recognition · Computer Science 2021-03-17 Yunsheng Li , Yinpeng Chen , Xiyang Dai , Mengchen Liu , Dongdong Chen , Ye Yu , Lu Yuan , Zicheng Liu , Mei Chen , Nuno Vasconcelos

PatchShuffle Regularization

This paper focuses on regularizing the training of the convolutional neural network (CNN). We propose a new regularization approach named ``PatchShuffle`` that can be adopted in any classification-oriented CNN models. It is easy to…

Computer Vision and Pattern Recognition · Computer Science 2017-07-25 Guoliang Kang , Xuanyi Dong , Liang Zheng , Yi Yang

ShuffleMix: Improving Representations via Channel-Wise Shuffle of Interpolated Hidden States

Mixup style data augmentation algorithms have been widely adopted in various tasks as implicit network regularization on representation learning to improve model generalization, which can be achieved by a linear interpolation of labeled…

Computer Vision and Pattern Recognition · Computer Science 2023-05-31 Kangjun Liu , Ke Chen , Lihua Guo , Yaowei Wang , Kui Jia

Dynamic Sampling for Deep Metric Learning

Deep metric learning maps visually similar images onto nearby locations and visually dissimilar images apart from each other in an embedding manifold. The learning process is mainly based on the supplied image negative and positive training…

Computer Vision and Pattern Recognition · Computer Science 2020-09-14 Chang-Hui Liang , Wan-Lei Zhao , Run-Qing Chen

Matrix Shuffle-Exchange Networks for Hard 2D Tasks

Convolutional neural networks have become the main tools for processing two-dimensional data. They work well for images, yet convolutions have a limited receptive field that prevents its applications to more complex 2D tasks. We propose a…

Machine Learning · Computer Science 2020-10-06 Emīls Ozoliņš , Kārlis Freivalds , Agris Šostaks

Structured Sparsification with Joint Optimization of Group Convolution and Channel Shuffle

Recent advances in convolutional neural networks(CNNs) usually come with the expense of excessive computational overhead and memory footprint. Network compression aims to alleviate this issue by training compact models with comparable…

Computer Vision and Pattern Recognition · Computer Science 2021-05-17 Xin-Yu Zhang , Kai Zhao , Taihong Xiao , Ming-Ming Cheng , Ming-Hsuan Yang

Dynamic Neural Network Channel Execution for Efficient Training

Existing methods for reducing the computational burden of neural networks at run-time, such as parameter pruning or dynamic computational path selection, focus solely on improving computational efficiency during inference. On the other…

Machine Learning · Computer Science 2019-05-17 Simeon E. Spasov , Pietro Lio

Learning to Shuffle: Block Reshuffling and Reversal Schemes for Stochastic Optimization

Shuffling strategies for stochastic gradient descent (SGD), including incremental gradient, shuffle-once, and random reshuffling, are supported by rigorous convergence analyses for arbitrary within-epoch permutations. In particular, random…

Machine Learning · Computer Science 2026-04-02 Lam M. Nguyen , Dzung T. Phan , Jayant Kalagnanam

AugShuffleNet: Communicate More, Compute Less

As a remarkable compact model, ShuffleNetV2 offers a good example to design efficient ConvNets but its limit is rarely noticed. In this paper, we rethink the design pattern of ShuffleNetV2 and find that the channel-wise redundancy problem…

Computer Vision and Pattern Recognition · Computer Science 2022-08-23 Longqing Ye

Dynamic Group Convolution for Accelerating Convolutional Neural Networks

Replacing normal convolutions with group convolutions can significantly increase the computational efficiency of modern deep convolutional networks, which has been widely adopted in compact network architecture designs. However, existing…

Computer Vision and Pattern Recognition · Computer Science 2020-07-13 Zhuo Su , Linpu Fang , Wenxiong Kang , Dewen Hu , Matti Pietikäinen , Li Liu

Channel Compression: Rethinking Information Redundancy among Channels in CNN Architecture

Model compression and acceleration are attracting increasing attentions due to the demand for embedded devices and mobile applications. Research on efficient convolutional neural networks (CNNs) aims at removing feature redundancy by…

Machine Learning · Computer Science 2020-08-21 Jinhua Liang , Tao Zhang , Guoqing Feng

MergeShuffle: A Very Fast, Parallel Random Permutation Algorithm

This article introduces an algorithm, MergeShuffle, which is an extremely efficient algorithm to generate random permutations (or to randomly permute an existing array). It is easy to implement, runs in $n\log_2 n + O(1)$ time, is in-place,…

Data Structures and Algorithms · Computer Science 2015-08-14 Axel Bacher , Olivier Bodini , Alexandros Hollender , Jérémie Lumbroso

Efficient Dynamic Structured Sparse Training with Learned Shuffles

Structured sparsity accelerates training and inference on modern GPUs, yet it still trails unstructured dynamic sparse training (DST) in accuracy. The shortfall stems from a loss of expressivity: whereas a dense layer can realize every…

Machine Learning · Computer Science 2025-10-17 Abhishek Tyagi , Arjun Iyer , Liam Young , William H Renninger , Christopher Kanan , Yuhao Zhu

TransformMix: Learning Transformation and Mixing Strategies from Data

Data augmentation improves the generalization power of deep learning models by synthesizing more training samples. Sample-mixing is a popular data augmentation approach that creates additional data by combining existing samples. Recent…

Computer Vision and Pattern Recognition · Computer Science 2024-03-20 Tsz-Him Cheung , Dit-Yan Yeung

Efficient neural topology optimization via active learning for enhancing turbulent mass transfer in fluid channels

The design of fluid channel structures of reactors or separators of chemical processes is key to enhancing the mass transfer processes inside the devices. However, the systematic design of channel topological structures is difficult for…

Fluid Dynamics · Physics 2025-03-07 Chenhui Kou , Yuhui Yin , Min Zhu , Shengkun Jia , Yiqing Luo , Xigang Yuana , Lu Lu

Group and Shuffle: Efficient Structured Orthogonal Parametrization

The increasing size of neural networks has led to a growing demand for methods of efficient fine-tuning. Recently, an orthogonal fine-tuning paradigm was introduced that uses orthogonal matrices for adapting the weights of a pretrained…

Machine Learning · Computer Science 2024-06-17 Mikhail Gorbunov , Nikolay Yudin , Vera Soboleva , Aibek Alanov , Alexey Naumov , Maxim Rakhuba

Evolving Image Compositions for Feature Representation Learning

Convolutional neural networks for visual recognition require large amounts of training samples and usually benefit from data augmentation. This paper proposes PatchMix, a data augmentation method that creates new samples by composing…

Computer Vision and Pattern Recognition · Computer Science 2022-04-04 Paola Cascante-Bonilla , Arshdeep Sekhon , Yanjun Qi , Vicente Ordonez

HENet:A Highly Efficient Convolutional Neural Networks Optimized for Accuracy, Speed and Storage

In order to enhance the real-time performance of convolutional neural networks(CNNs), more and more researchers are focusing on improving the efficiency of CNN. Based on the analysis of some CNN architectures, such as ResNet, DenseNet,…

Computer Vision and Pattern Recognition · Computer Science 2018-03-16 Qiuyu Zhu , Ruixin Zhang