Related papers: EfficientTrain: Exploring Generalized Curriculum L…

EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone Training

The superior performance of modern visual backbones usually comes with a costly training procedure. We contribute to this issue by generalizing the idea of curriculum learning beyond its original formulation, i.e., training models using…

Computer Vision and Pattern Recognition · Computer Science 2024-05-15 Yulin Wang , Yang Yue , Rui Lu , Yizeng Han , Shiji Song , Gao Huang

On The Power of Curriculum Learning in Training Deep Networks

Training neural networks is traditionally done by providing a sequence of random mini-batches sampled uniformly from the entire training data. In this work, we analyze the effect of curriculum learning, which involves the non-uniform…

Machine Learning · Computer Science 2020-12-03 Guy Hacohen , Daphna Weinshall

FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed

Large-scale vision foundation models such as DINOv2 boast impressive performances by leveraging massive architectures and training datasets. But numerous scenarios require practitioners to reproduce those pre-training solutions, such as on…

Computer Vision and Pattern Recognition · Computer Science 2026-01-30 Jiaqi Zhang , Juntuo Wang , Zhixin Sun , John Zou , Randall Balestriero

Curriculum Learning for Graph Neural Networks: A Multiview Competence-based Approach

A curriculum is a planned sequence of learning materials and an effective one can make learning efficient and effective for both humans and machines. Recent studies developed effective data-driven curriculum learning approaches for training…

Machine Learning · Computer Science 2023-07-19 Nidhi Vakil , Hadi Amiri

Dynamic Neural Network Channel Execution for Efficient Training

Existing methods for reducing the computational burden of neural networks at run-time, such as parameter pruning or dynamic computational path selection, focus solely on improving computational efficiency during inference. On the other…

Machine Learning · Computer Science 2019-05-17 Simeon E. Spasov , Pietro Lio

Efficient Conditional Pre-training for Transfer Learning

Almost all the state-of-the-art neural networks for computer vision tasks are trained by (1) pre-training on a large-scale dataset and (2) finetuning on the target dataset. This strategy helps reduce dependence on the target dataset and…

Computer Vision and Pattern Recognition · Computer Science 2021-11-22 Shuvam Chakraborty , Burak Uzkent , Kumar Ayush , Kumar Tanmay , Evan Sheehan , Stefano Ermon

Learning Efficient Convolutional Networks through Network Slimming

The deployment of deep convolutional neural networks (CNNs) in many real world applications is largely hindered by their high computational cost. In this paper, we propose a novel learning scheme for CNNs to simultaneously 1) reduce the…

Computer Vision and Pattern Recognition · Computer Science 2017-08-23 Zhuang Liu , Jianguo Li , Zhiqiang Shen , Gao Huang , Shoumeng Yan , Changshui Zhang

Computation-efficient Deep Learning for Computer Vision: A Survey

Over the past decade, deep learning models have exhibited considerable advancements, reaching or even exceeding human-level performance in a range of visual perception tasks. This remarkable progress has sparked interest in applying deep…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Yulin Wang , Yizeng Han , Chaofei Wang , Shiji Song , Qi Tian , Gao Huang

PruneTrain: Fast Neural Network Training by Dynamic Sparse Model Reconfiguration

State-of-the-art convolutional neural networks (CNNs) used in vision applications have large models with numerous weights. Training these models is very compute- and memory-resource intensive. Much research has been done on pruning or…

Machine Learning · Computer Science 2019-12-10 Sangkug Lym , Esha Choukse , Siavash Zangeneh , Wei Wen , Sujay Sanghavi , Mattan Erez

Spatial Transformer Networks for Curriculum Learning

Curriculum learning is a bio-inspired training technique that is widely adopted to machine learning for improved optimization and better training of neural networks regarding the convergence rate or obtained accuracy. The main concept in…

Computer Vision and Pattern Recognition · Computer Science 2021-08-24 Fatemeh Azimi , Jean-Francois Jacques Nicolas Nies , Sebastian Palacio , Federico Raue , Jörn Hees , Andreas Dengel

Curriculum Learning by Transfer Learning: Theory and Experiments with Deep Networks

We provide theoretical investigation of curriculum learning in the context of stochastic gradient descent when optimizing the convex linear regression loss. We prove that the rate of convergence of an ideal curriculum learning method is…

Machine Learning · Computer Science 2023-12-29 Daphna Weinshall , Gad Cohen , Dan Amir

An Analytical Theory of Curriculum Learning in Teacher-Student Networks

In humans and animals, curriculum learning -- presenting data in a curated order - is critical to rapid learning and effective pedagogy. Yet in machine learning, curricula are not widely used and empirically often yield only moderate…

Machine Learning · Computer Science 2022-12-07 Luca Saglietti , Stefano Sarao Mannelli , Andrew Saxe

Efficiently Robustify Pre-trained Models

A recent trend in deep learning algorithms has been towards training large scale models, having high parameter count and trained on big dataset. However, robustness of such large scale models towards real-world settings is still a…

Computer Vision and Pattern Recognition · Computer Science 2023-09-15 Nishant Jain , Harkirat Behl , Yogesh Singh Rawat , Vibhav Vineet

Training Efficiency and Robustness in Deep Learning

Deep Learning has revolutionized machine learning and artificial intelligence, achieving superhuman performance in several standard benchmarks. It is well-known that deep learning models are inefficient to train; they learn by processing…

Machine Learning · Computer Science 2021-12-03 Fartash Faghri

Deep Learning Models on CPUs: A Methodology for Efficient Training

GPUs have been favored for training deep learning models due to their highly parallelized architecture. As a result, most studies on training optimization focus on GPUs. There is often a trade-off, however, between cost and efficiency when…

Machine Learning · Computer Science 2023-06-21 Quchen Fu , Ramesh Chukka , Keith Achorn , Thomas Atta-fosu , Deepak R. Canchi , Zhongwei Teng , Jules White , Douglas C. Schmidt

ClickTrain: Efficient and Accurate End-to-End Deep Learning Training via Fine-Grained Architecture-Preserving Pruning

Convolutional neural networks (CNNs) are becoming increasingly deeper, wider, and non-linear because of the growing demand on prediction accuracy and analysis quality. The wide and deep CNNs, however, require a large amount of computing…

Computer Vision and Pattern Recognition · Computer Science 2021-05-04 Chengming Zhang , Geng Yuan , Wei Niu , Jiannan Tian , Sian Jin , Donglin Zhuang , Zhe Jiang , Yanzhi Wang , Bin Ren , Shuaiwen Leon Song , Dingwen Tao

EfficientNetV2: Smaller Models and Faster Training

This paper introduces EfficientNetV2, a new family of convolutional networks that have faster training speed and better parameter efficiency than previous models. To develop this family of models, we use a combination of training-aware…

Computer Vision and Pattern Recognition · Computer Science 2021-06-25 Mingxing Tan , Quoc V. Le

Continual Learning with Pretrained Backbones by Tuning in the Input Space

The intrinsic difficulty in adapting deep learning models to non-stationary environments limits the applicability of neural networks to real-world tasks. This issue is critical in practical supervised learning settings, such as the ones in…

Machine Learning · Computer Science 2023-06-09 Simone Marullo , Matteo Tiezzi , Marco Gori , Stefano Melacci , Tinne Tuytelaars

Making EfficientNet More Efficient: Exploring Batch-Independent Normalization, Group Convolutions and Reduced Resolution Training

Much recent research has been dedicated to improving the efficiency of training and inference for image classification. This effort has commonly focused on explicitly improving theoretical efficiency, often measured as ImageNet validation…

Machine Learning · Computer Science 2021-08-27 Dominic Masters , Antoine Labatie , Zach Eaton-Rosen , Carlo Luschi

Curriculum-Based Strategies for Efficient Cross-Domain Action Recognition

Despite significant progress in human action recognition, generalizing to diverse viewpoints remains a challenge. Most existing datasets are captured from ground-level perspectives, and models trained on them often struggle to transfer to…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Emily Kim , Allen Wu , Jessica Hodgins