Related papers: Data Pruning via Moving-one-Sample-out

Improved Methods for Model Pruning and Knowledge Distillation

Model pruning is a performance optimization technique for large language models like R1 or o3-mini. However, existing pruning methods often lead to significant performance degradation or require extensive retraining and fine-tuning. This…

Computation and Language · Computer Science 2025-05-21 Wei Jiang, Anying Fu, Youling Zhang

Effective Data Pruning through Score Extrapolation

Training advanced machine learning models demands massive datasets, resulting in prohibitive computational costs. To address this challenge, data pruning techniques identify and remove redundant training samples while preserving model…

Machine Learning · Computer Science 2025-06-23 Sebastian Schmidt, Prasanga Dhungel, Christoffer Löffler, Björn Nieth, Stephan Günnemann, Leo Schwinn

DRoP: Distributionally Robust Data Pruning

In the era of exceptionally data-hungry models, careful selection of the training data is essential to mitigate the extensive costs of deep learning. Data pruning offers a solution by removing redundant or uninformative samples from the…

Machine Learning · Computer Science 2025-02-11 Artem Vysogorets, Kartik Ahuja, Julia Kempe

Data Pruning via Separability, Integrity, and Model Uncertainty-Aware Importance Sampling

This paper improves upon existing data pruning methods for image classification by introducing a novel pruning metric and pruning procedure based on importance sampling. The proposed pruning metric explicitly accounts for data separability,…

Computer Vision and Pattern Recognition · Computer Science 2024-09-24 Steven Grosz, Rui Zhao, Rajeev Ranjan, Hongcheng Wang, Manoj Aggarwal, Gerard Medioni, Anil Jain

Dataset Pruning: Reducing Training Data by Examining Generalization Influence

The great success of deep learning heavily relies on increasingly larger training data, which comes at a price of huge computational and infrastructural costs. This poses crucial questions that, do all training data contribute to model's…

Machine Learning · Computer Science 2023-02-28 Shuo Yang, Zeke Xie, Hanyu Peng, Min Xu, Mingming Sun, Ping Li

Online Importance Sampling for Stochastic Gradient Optimization

Machine learning optimization often depends on stochastic gradient descent, where the precision of gradient estimation is vital for model performance. Gradients are calculated from mini-batches formed by uniformly selecting data samples…

Machine Learning · Computer Science 2025-01-29 Corentin Salaün, Xingchang Huang, Iliyan Georgiev, Niloy J. Mitra, Gurprit Singh

One Shot vs. Iterative: Rethinking Pruning Strategies for Model Compression

Pruning is a core technique for compressing neural networks to improve computational efficiency. This process is typically approached in two ways: one-shot pruning, which involves a single pass of training and pruning, and iterative…

Machine Learning · Computer Science 2025-08-20 Mikołaj Janusz, Tomasz Wojnar, Yawei Li, Luca Benini, Kamil Adamczewski

Data Pruning Can Do More: A Comprehensive Data Pruning Approach for Object Re-identification

Previous studies have demonstrated that not each sample in a dataset is of equal importance during training. Data pruning aims to remove less important or informative samples while still achieving comparable results as training on the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-16 Zi Yang, Haojin Yang, Soumajit Majumder, Jorge Cardoso, Guillermo Gallego

Data Pruning by Information Maximization

In this paper, we present InfoMax, a novel data pruning method, also known as coreset selection, designed to maximize the information content of selected samples while minimizing redundancy. By doing so, InfoMax enhances the overall…

Computer Vision and Pattern Recognition · Computer Science 2025-08-15 Haoru Tan, Sitong Wu, Wei Huang, Shizhen Zhao, Xiaojuan Qi

Uncertainty Sampling is Preconditioned Stochastic Gradient Descent on Zero-One Loss

Uncertainty sampling, a popular active learning algorithm, is used to reduce the amount of data required to learn a classifier, but it has been observed in practice to converge to different parameters depending on the initialization and…

Machine Learning · Computer Science 2018-12-06 Stephen Mussmann, Percy Liang

Prospect Pruning: Finding Trainable Weights at Initialization using Meta-Gradients

Pruning neural networks at initialization would enable us to find sparse models that retain the accuracy of the original network while consuming fewer computational resources for training and inference. However, current methods are…

Machine Learning · Computer Science 2022-04-06 Milad Alizadeh, Shyam A. Tailor, Luisa M Zintgraf, Joost van Amersfoort, Sebastian Farquhar, Nicholas Donald Lane, Yarin Gal

SiPPing Neural Networks: Sensitivity-informed Provable Pruning of Neural Networks

We introduce a pruning algorithm that provably sparsifies the parameters of a trained model in a way that approximately preserves the model's predictive accuracy. Our algorithm uses a small batch of input points to construct a data-informed…

Machine Learning · Computer Science 2021-03-16 Cenk Baykal, Lucas Liebenwein, Igor Gilitschenski, Dan Feldman, Daniela Rus

Measuring Sample Importance in Data Pruning for Language Models based on Information Entropy

Compute-efficient training of language models has become an important issue. We consider data pruning for data-efficient training of LLMs. In this work, we consider a data pruning method based on information entropy. We propose that the…

Artificial Intelligence · Computer Science 2024-12-13 Minsang Kim, Seungjun Baek

PUMA: margin-based data pruning

Deep learning has been able to outperform humans in terms of classification accuracy in many tasks. However, to achieve robustness to adversarial perturbations, the best methodologies require to perform adversarial training on a much larger…

Machine Learning · Computer Science 2024-05-13 Javier Maroto, Pascal Frossard

A Study in Dataset Pruning for Image Super-Resolution

In image Super-Resolution (SR), relying on large datasets for training is a double-edged sword. While offering rich training material, they also demand substantial computational and storage resources. In this work, we analyze dataset…

Image and Video Processing · Electrical Eng. & Systems 2024-06-11 Brian B. Moser, Federico Raue, Andreas Dengel

One-Shot Pruning for Fast-adapting Pre-trained Models on Devices

Large-scale pre-trained models have been remarkably successful in resolving downstream tasks. Nonetheless, deploying these models on low-capability devices still requires an effective approach, such as model pruning. However, pruning the…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Haiyan Zhao, Guodong Long

Impact of Data Pruning on Machine Learning Algorithm Performance

Dataset pruning is the process of removing sub-optimal tuples from a dataset to improve the learning of a machine learning model. In this paper, we compared the performance of different algorithms, first on an unpruned dataset and then on…

Machine Learning · Computer Science 2019-01-31 Arun Thundyill Saseendran, Lovish Setia, Viren Chhabria, Debrup Chakraborty, Aneek Barman Roy

Finding Fantastic Experts in MoEs: A Unified Study for Expert Dropping Strategies and Observations

Sparsely activated Mixture-of-Experts (SMoE) has shown promise in scaling up the learning capacity of neural networks. However, vanilla SMoEs have issues such as expert redundancy and heavy memory requirements, making them inefficient and…

Machine Learning · Computer Science 2025-04-11 Ajay Jaiswal, Jianyu Wang, Yixiao Li, Pingzhi Li, Tianlong Chen, Zhangyang Wang, Chong Wang, Ruoming Pang, Xianzhi Du

A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts

The sparsely gated mixture of experts (MoE) architecture sends different inputs to different subnetworks, i.e., experts, through trainable routers. MoE reduces the training computation significantly for large models, but its deployment can…

Machine Learning · Computer Science 2024-05-31 Mohammed Nowaz Rabbani Chowdhury, Meng Wang, Kaoutar El Maghraoui, Naigang Wang, Pin-Yu Chen, Christopher Carothers

LoRA Unlearns More and Retains More (Student Abstract)

Due to increasing privacy regulations and regulatory compliance, Machine Unlearning (MU) has become essential. The goal of unlearning is to remove information related to a specific class from a model. Traditional approaches achieve exact…

Machine Learning · Computer Science 2024-11-20 Atharv Mittal