English
Related papers

Related papers: Improved Techniques for Training Adaptive Deep Net…

200 papers

Quantizing deep networks with adaptive bit-widths is a promising technique for efficient inference across many devices and resource constraints. In contrast to static methods that repeat the quantization process and train different models…

Computer Vision and Pattern Recognition · Computer Science 2021-09-20 Ximeng Sun , Rameswar Panda , Chun-Fu Chen , Naigang Wang , Bowen Pan , Kailash Gopalakrishnan , Aude Oliva , Rogerio Feris , Kate Saenko

Predictable adaptation of network depths can be an effective way to control inference latency and meet the resource condition of various devices. However, previous adaptive depth networks do not provide general principles and a formal…

Computer Vision and Pattern Recognition · Computer Science 2024-10-01 Woochul Kang , Hyungseop Lee

We present an approach to adaptively utilize deep neural networks in order to reduce the evaluation time on new examples without loss of accuracy. Rather than attempting to redesign or approximate existing networks, we propose two schemes…

Machine Learning · Computer Science 2017-09-20 Tolga Bolukbasi , Joseph Wang , Ofer Dekel , Venkatesh Saligrama

Deep learning models offer superior performance compared to other machine learning techniques for a variety of tasks and domains, but pose their own challenges. In particular, deep learning models require larger training times as the depth…

Machine Learning · Computer Science 2023-05-31 Sunitha Basodi , Krishna Pusuluri , Xueli Xiao , Yi Pan

We propose self-adaptive training -- a unified training algorithm that dynamically calibrates and enhances training processes by model predictions without incurring an extra computational cost -- to advance both supervised and…

Machine Learning · Computer Science 2022-10-17 Lang Huang , Chao Zhang , Hongyang Zhang

Dynamic DNN optimization techniques such as layer-skipping offer increased adaptability and efficiency gains but can lead to i) a larger memory footprint as in decision gates, ii) increased training complexity (e.g., with non-differentiable…

Machine Learning · Computer Science 2025-05-26 Guilherme Korol , Antonio Carlos Schneider Beck , Jeronimo Castrillon

Recent advancements in artificial intelligence, particularly deep neural networks, have pushed the boundaries of what is achievable in complex tasks. Traditional methods for training neural networks in classification problems often rely on…

Machine Learning · Computer Science 2024-09-10 Jaouad Dabounou

Federated Learning offers a way to train deep neural networks in a distributed fashion. While this addresses limitations related to distributed data, it incurs a communication overhead as the model parameters or gradients need to be…

Machine Learning · Computer Science 2023-05-26 Morten From Elvebakken , Alexandros Iosifidis , Lukas Esterle

Training deep networks that generalize to a wide range of variations in test data is essential to building accurate and robust image classifiers. One standard strategy is to apply data augmentation to synthetically enlarge the training set.…

Computer Vision and Pattern Recognition · Computer Science 2020-06-29 Yunhan Zhao , Ye Tian , Charless Fowlkes , Wei Shen , Alan Yuille

Handling varying computational resources is a critical issue in modern AI applications. Adaptive deep networks, featuring the dynamic employment of multiple classifier heads among different layers, have been proposed to address…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Xu Zhang , Zhipeng Xie , Haiyang Yu , Qitong Wang , Peng Wang , Wei Wang

A novel approach is suggested for improving the accuracy of fault detection in distribution networks. This technique combines adaptive probability learning and waveform decomposition to optimize the similarity of features. Its objective is…

Signal Processing · Electrical Eng. & Systems 2023-10-03 Xinliang Ma , Weihua Liu , Bingying Jin

Intermediate features at different layers of a deep neural network are known to be discriminative for visual patterns of different complexities. However, most existing works ignore such cross-layer heterogeneities when classifying samples…

Computer Vision and Pattern Recognition · Computer Science 2016-07-20 Xiaojie Jin , Yunpeng Chen , Jian Dong , Jiashi Feng , Shuicheng Yan

Many algorithms and applications involve repeatedly solving variations of the same inference problem; for example we may want to introduce new evidence to the model or perform updates to conditional dependencies. The goal of adaptive…

Data Structures and Algorithms · Computer Science 2012-06-18 Umut A. Acar , Alexander T. Ihler , Ramgopal Mettu , Ozgur Sumer

While depth tends to improve network performances, it also makes gradient-based training more difficult since deeper networks tend to be more non-linear. The recently proposed knowledge distillation approach is aimed at obtaining small and…

Machine Learning · Computer Science 2015-03-30 Adriana Romero , Nicolas Ballas , Samira Ebrahimi Kahou , Antoine Chassang , Carlo Gatta , Yoshua Bengio

We propose a new technique that boosts the convergence of training generative adversarial networks. Generally, the rate of training deep models reduces severely after multiple iterations. A key reason for this phenomenon is that a deep…

Machine Learning · Statistics 2018-06-15 Atsushi Nitanda , Taiji Suzuki

As deep learning models and datasets rapidly scale up, network training is extremely time-consuming and resource-costly. Instead of training on the entire dataset, learning with a small synthetic dataset becomes an efficient solution.…

Machine Learning · Computer Science 2022-08-02 Zixuan Jiang , Jiaqi Gu , Mingjie Liu , David Z. Pan

In the last decade, motivated by the success of Deep Learning, the scientific community proposed several approaches to make the learning procedure of Neural Networks more effective. When focussing on the way in which the training data are…

Machine Learning · Computer Science 2021-06-22 Simone Marullo , Matteo Tiezzi , Marco Gori , Stefano Melacci

Deep energy-based models are powerful, but pose challenges for learning and inference (Belanger and McCallum, 2016). Tu and Gimpel (2018) developed an efficient framework for energy-based models by training "inference networks" to…

Computation and Language · Computer Science 2020-10-13 Lifu Tu , Richard Yuanzhe Pang , Kevin Gimpel

Model compression becomes a recent trend due to the requirement of deploying neural networks on embedded and mobile devices. Hence, both accuracy and efficiency are of critical importance. To explore a balance between them, a knowledge…

Computer Vision and Pattern Recognition · Computer Science 2022-04-15 Zhiyuan Wu , Hong Qi , Yu Jiang , Minghao Zhao , Chupeng Cui , Zongmin Yang , Xinhui Xue

The increasing complexity of deep learning architectures is resulting in training time requiring weeks or even months. This slow training is due in part to vanishing gradients, in which the gradients used by back-propagation are extremely…

Computer Vision and Pattern Recognition · Computer Science 2015-10-16 Bharat Singh , Soham De , Yangmuzi Zhang , Thomas Goldstein , Gavin Taylor
‹ Prev 1 2 3 10 Next ›