Related papers: Using Small Proxy Datasets to Accelerate Hyperpara…

The Power of Proxy Data and Proxy Networks for Hyper-Parameter Optimization in Medical Image Segmentation

Deep learning models for medical image segmentation are primarily data-driven. Models trained with more data lead to improved performance and generalizability. However, training is a computationally expensive process because multiple…

Image and Video Processing · Electrical Eng. & Systems 2021-07-13 Vishwesh Nath, Dong Yang, Ali Hatamizadeh, Anas A. Abidin, Andriy Myronenko, Holger Roth, Daguang Xu

Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice

Data teams at frontier AI companies routinely train small proxy models to make critical decisions about pretraining data recipes for full-scale training runs. However, the community has a limited understanding of whether and when…

Machine Learning · Computer Science 2026-04-14 Jiachen T. Wang, Tong Wu, Kaifeng Lyu, James Zou, Dawn Song, Ruoxi Jia, Prateek Mittal

Dataset Pruning: Reducing Training Data by Examining Generalization Influence

The great success of deep learning heavily relies on increasingly larger training data, which comes at a price of huge computational and infrastructural costs. This poses crucial questions that, do all training data contribute to model's…

Machine Learning · Computer Science 2023-02-28 Shuo Yang, Zeke Xie, Hanyu Peng, Min Xu, Mingming Sun, Ping Li

Selection via Proxy: Efficient Data Selection for Deep Learning

Data selection methods, such as active learning and core-set selection, are useful tools for machine learning on large datasets. However, they can be prohibitively expensive to apply in deep learning because they depend on feature…

Machine Learning · Computer Science 2020-10-28 Cody Coleman, Christopher Yeh, Stephen Mussmann, Baharan Mirzasoleiman, Peter Bailis, Percy Liang, Jure Leskovec, Matei Zaharia

Data Proxy Generation for Fast and Efficient Neural Architecture Search

Due to the recent advances on Neural Architecture Search (NAS), it gains popularity in designing best networks for specific tasks. Although it shows promising results on many benchmarks and competitions, NAS still suffers from its demanding…

Machine Learning · Computer Science 2019-11-22 Minje Park

Efficient Neural Network Training via Subset Pretraining

In training neural networks, it is common practice to use partial gradients computed over batches, mostly very small subsets of the training set. This approach is motivated by the argument that such a partial gradient is close to the true…

Machine Learning · Computer Science 2024-11-25 Jan Spörer, Bernhard Bermeitinger, Tomas Hrycej, Niklas Limacher, Siegfried Handschuh

Less is More: Proxy Datasets in NAS approaches

Neural Architecture Search (NAS) defines the design of Neural Networks as a search problem. Unfortunately, NAS is computationally intensive because of various possibilities depending on the number of elements in the design and the possible…

Machine Learning · Computer Science 2023-11-20 Brian Moser, Federico Raue, Jörn Hees, Andreas Dengel

Zero-Cost Proxies for Lightweight NAS

Neural Architecture Search (NAS) is quickly becoming the standard methodology to design neural network models. However, NAS is typically compute-intensive because multiple models need to be evaluated before choosing the best one. To reduce…

Machine Learning · Computer Science 2021-03-22 Mohamed S. Abdelfattah, Abhinav Mehrotra, Łukasz Dudziak, Nicholas D. Lane

Efficient Conditional Pre-training for Transfer Learning

Almost all the state-of-the-art neural networks for computer vision tasks are trained by (1) pre-training on a large-scale dataset and (2) finetuning on the target dataset. This strategy helps reduce dependence on the target dataset and…

Computer Vision and Pattern Recognition · Computer Science 2021-11-22 Shuvam Chakraborty, Burak Uzkent, Kumar Ayush, Kumar Tanmay, Evan Sheehan, Stefano Ermon

Approximate Selection with Guarantees using Proxies

Due to the falling costs of data acquisition and storage, researchers and industry analysts often want to find all instances of rare events in large datasets. For instance, scientists can cheaply capture thousands of hours of video, but are…

Databases · Computer Science 2022-01-05 Daniel Kang, Edward Gan, Peter Bailis, Tatsunori Hashimoto, Matei Zaharia

These Are Not All the Features You Are Looking For: A Fundamental Bottleneck in Supervised Pretraining

Transfer learning is widely used to adapt large pretrained models to new tasks with only a small amount of new data. However, a challenge persists -- the features from the original task often do not fully cover what is needed for unseen…

Machine Learning · Computer Science 2026-02-10 Xingyu Alice Yang, Jianyu Zhang, Léon Bottou

A Proposal to Study "Is High Quality Data All We Need?"

Even though deep neural models have achieved superhuman performance on many popular benchmarks, they have failed to generalize to OOD or adversarial datasets. Conventional approaches aimed at increasing robustness include developing…

Machine Learning · Computer Science 2022-03-15 Swaroop Mishra, Anjana Arunkumar

Proxy Network for Few Shot Learning

The use of a few examples for each class to train a predictive model that can be generalized to novel classes is a crucial and valuable research direction in artificial intelligence. This work addresses this problem by proposing a few-shot…

Machine Learning · Computer Science 2020-09-10 Bin Xiao, Chien-Liang Liu, Wen-Hoar Hsaio

Subset Sampling For Progressive Neural Network Learning

Progressive Neural Network Learning is a class of algorithms that incrementally construct the network's topology and optimize its parameters based on the training data. While this approach exempts the users from the manual task of designing…

Machine Learning · Computer Science 2020-05-26 Dat Thanh Tran, Moncef Gabbouj, Alexandros Iosifidis

A Data-Centric Approach for Training Deep Neural Networks with Less Data

While the availability of large datasets is perceived to be a key requirement for training deep neural networks, it is possible to train such models with relatively little data. However, compensating for the absence of large datasets…

Artificial Intelligence · Computer Science 2021-11-02 Mohammad Motamedi, Nikolay Sakharnykh, Tim Kaldewey

Progressive Data Dropout: An Embarrassingly Simple Approach to Faster Training

The success of the machine learning field has reliably depended on training on large datasets. While effective, this trend comes at an extraordinary cost. This is due to two deeply intertwined factors: the size of models and the size of…

Computer Vision and Pattern Recognition · Computer Science 2025-10-27 Shriram M Sathiyanarayanan, Xinyue Hao, Shihao Hou, Yang Lu, Laura Sevilla-Lara, Anurag Arnab, Shreyank N Gowda

Smoothness and Structure Learning by Proxy

As data sets grow in size, the ability of learning methods to find structure in them is increasingly hampered by the time needed to search the large spaces of possibilities and generate a score for each that takes all of the observed data…

Machine Learning · Computer Science 2012-07-03 Benjamin Yackley, Terran Lane

Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype Prediction

Current machine learning has made great progress on computer vision and many other fields attributed to the large amount of high-quality training samples, while it does not work very well on genomic data analysis, since they are notoriously…

Machine Learning · Computer Science 2020-09-04 Ziyi Yang, Jun Shu, Yong Liang, Deyu Meng, Zongben Xu

A Linear Combination-based Method to Construct Proxy Benchmarks for Big Data Workloads

During early stages of CPU design, benchmarks can only run on simulators to evaluate CPU performance. However, most big data benchmarks are too huge at code size scale, which causes them to be unable to finish running on simulators at an…

Performance · Computer Science 2023-09-20 Yikang Yang, Lei Wang, Jianfeng Zhan

Feature Alignment: Rethinking Efficient Active Learning via Proxy in the Context of Pre-trained Models

Fine-tuning the pre-trained model with active learning holds promise for reducing annotation costs. However, this combination introduces significant computational costs, particularly with the growing scale of pre-trained models. Recent…

Machine Learning · Computer Science 2024-11-19 Ziting Wen, Oscar Pizarro, Stefan Williams