Related papers: Data Pruning via Moving-one-Sample-out

200 papers

Model pruning is a performance optimization technique for large language models like R1 or o3-mini. However, existing pruning methods often lead to significant performance degradation or require extensive retraining and fine-tuning. This…

Computation and Language · Computer Science 2025-05-21 Wei Jiang, Anying Fu, Youling Zhang

Training advanced machine learning models demands massive datasets, resulting in prohibitive computational costs. To address this challenge, data pruning techniques identify and remove redundant training samples while preserving model…

Machine Learning · Computer Science 2025-06-23 Sebastian Schmidt, Prasanga Dhungel, Christoffer Löffler, Björn Nieth, Stephan Günnemann, Leo Schwinn

In the era of exceptionally data-hungry models, careful selection of the training data is essential to mitigate the extensive costs of deep learning. Data pruning offers a solution by removing redundant or uninformative samples from the…

Machine Learning · Computer Science 2025-02-11 Artem Vysogorets, Kartik Ahuja, Julia Kempe

This paper improves upon existing data pruning methods for image classification by introducing a novel pruning metric and pruning procedure based on importance sampling. The proposed pruning metric explicitly accounts for data separability,…

Computer Vision and Pattern Recognition · Computer Science 2024-09-24 Steven Grosz, Rui Zhao, Rajeev Ranjan, Hongcheng Wang, Manoj Aggarwal, Gerard Medioni, Anil Jain

The great success of deep learning relies heavily on ever-larger training data, which comes at the price of huge computational and infrastructural costs. This poses a crucial question: do all training data contribute to the model's…

Machine Learning · Computer Science 2023-02-28 Shuo Yang, Zeke Xie, Hanyu Peng, Min Xu, Mingming Sun, Ping Li

Machine learning optimization often depends on stochastic gradient descent, where the precision of gradient estimation is vital for model performance. Gradients are calculated from mini-batches formed by uniformly selecting data samples…

Machine Learning · Computer Science 2025-01-29 Corentin Salaün, Xingchang Huang, Iliyan Georgiev, Niloy J. Mitra, Gurprit Singh

Pruning is a core technique for compressing neural networks to improve computational efficiency. This process is typically approached in two ways: one-shot pruning, which involves a single pass of training and pruning, and iterative…

Machine Learning · Computer Science 2025-08-20 Mikołaj Janusz, Tomasz Wojnar, Yawei Li, Luca Benini, Kamil Adamczewski

Previous studies have demonstrated that not every sample in a dataset is equally important during training. Data pruning aims to remove less important or less informative samples while still achieving results comparable to training on the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-16 Zi Yang, Haojin Yang, Soumajit Majumder, Jorge Cardoso, Guillermo Gallego

In this paper, we present InfoMax, a novel data pruning method, also known as coreset selection, designed to maximize the information content of selected samples while minimizing redundancy. By doing so, InfoMax enhances the overall…

Computer Vision and Pattern Recognition · Computer Science 2025-08-15 Haoru Tan, Sitong Wu, Wei Huang, Shizhen Zhao, Xiaojuan Qi

Uncertainty sampling, a popular active learning algorithm, is used to reduce the amount of data required to learn a classifier, but it has been observed in practice to converge to different parameters depending on the initialization and…

Machine Learning · Computer Science 2018-12-06 Stephen Mussmann, Percy Liang

Pruning neural networks at initialization would enable us to find sparse models that retain the accuracy of the original network while consuming fewer computational resources for training and inference. However, current methods are…

We introduce a pruning algorithm that provably sparsifies the parameters of a trained model in a way that approximately preserves the model's predictive accuracy. Our algorithm uses a small batch of input points to construct a data-informed…

Machine Learning · Computer Science 2021-03-16 Cenk Baykal, Lucas Liebenwein, Igor Gilitschenski, Dan Feldman, Daniela Rus

Compute-efficient training of language models has become an important issue. We consider data pruning for data-efficient training of LLMs. In this work, we consider a data pruning method based on information entropy. We propose that the…

Artificial Intelligence · Computer Science 2024-12-13 Minsang Kim, Seungjun Baek

Deep learning has been able to outperform humans in terms of classification accuracy in many tasks. However, to achieve robustness to adversarial perturbations, the best methodologies require performing adversarial training on a much larger…

Machine Learning · Computer Science 2024-05-13 Javier Maroto, Pascal Frossard

In image Super-Resolution (SR), relying on large datasets for training is a double-edged sword. While offering rich training material, they also demand substantial computational and storage resources. In this work, we analyze dataset…

Image and Video Processing · Electrical Eng. & Systems 2024-06-11 Brian B. Moser, Federico Raue, Andreas Dengel

Large-scale pre-trained models have been remarkably successful in resolving downstream tasks. Nonetheless, deploying these models on low-capability devices still requires an effective approach, such as model pruning. However, pruning the…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Haiyan Zhao, Guodong Long

Dataset pruning is the process of removing sub-optimal tuples from a dataset to improve the learning of a machine learning model. In this paper, we compared the performance of different algorithms, first on an unpruned dataset and then on…

Machine Learning · Computer Science 2019-01-31 Arun Thundyill Saseendran, Lovish Setia, Viren Chhabria, Debrup Chakraborty, Aneek Barman Roy

Sparsely activated Mixture-of-Experts (SMoE) has shown promise in scaling up the learning capacity of neural networks. However, vanilla SMoEs have issues such as expert redundancy and heavy memory requirements, making them inefficient and…

Machine Learning · Computer Science 2025-04-11 Ajay Jaiswal, Jianyu Wang, Yixiao Li, Pingzhi Li, Tianlong Chen, Zhangyang Wang, Chong Wang, Ruoming Pang, Xianzhi Du

The sparsely gated mixture of experts (MoE) architecture sends different inputs to different subnetworks, i.e., experts, through trainable routers. MoE reduces the training computation significantly for large models, but its deployment can…

Due to increasing privacy regulations and regulatory compliance, Machine Unlearning (MU) has become essential. The goal of unlearning is to remove information related to a specific class from a model. Traditional approaches achieve exact…

Machine Learning · Computer Science 2024-11-20 Atharv Mittal