Related papers: Optimizing Data Usage via Differentiable Rewards

Reinforced Data Sampling for Model Diversification

With the rising number of machine learning competitions, the world has witnessed an exciting race for the best algorithms. However, the involved data selection process may fundamentally suffer from evidence ambiguity and concept drift…

Machine Learning · Computer Science 2020-06-15 Hoang D. Nguyen , Xuan-Son Vu , Quoc-Tuan Truong , Duc-Trong Le

Data Agent: Learning to Select Data via End-to-End Dynamic Optimization

Dynamic Data selection aims to accelerate training by prioritizing informative samples during online training. However, existing methods typically rely on task-specific handcrafted metrics or static/snapshot-based criteria to estimate…

Machine Learning · Computer Science 2026-05-14 Suorong Yang , Fangjian Su , Hai Gan , Ziqi Ye , Jie Li , Baile Xu , Furao Shen , Soujanya Poria

ProDS: Preference-oriented Data Selection for Instruction Tuning

Instruction data selection aims to identify a high-quality subset from the training set that matches or exceeds the performance of the full dataset on target tasks. Existing methods focus on the instruction-to-response mapping, but neglect…

Machine Learning · Computer Science 2025-05-20 Wenya Guo , Zhengkun Zhang , Xumeng Liu , Ying Zhang , Ziyu Lu , Haoze Zhu , Xubo Liu , Ruxue Yan

Training With Data Dependent Dynamic Learning Rates

Recently many first and second order variants of SGD have been proposed to facilitate training of Deep Neural Networks (DNNs). A common limitation of these works stem from the fact that they use the same learning rate across all instances…

Machine Learning · Computer Science 2021-05-31 Shreyas Saxena , Nidhi Vyas , Dennis DeCoste

Learning What Data to Learn

Machine learning is essentially the sciences of playing with data. An adaptive data selection strategy, enabling to dynamically choose different data at various training stages, can reach a more effective model in a more efficient way. In…

Machine Learning · Computer Science 2017-03-01 Yang Fan , Fei Tian , Tao Qin , Jiang Bian , Tie-Yan Liu

Data Valuation using Reinforcement Learning

Quantifying the value of data is a fundamental problem in machine learning. Data valuation has multiple important use cases: (1) building insights about the learning task, (2) domain adaptation, (3) corrupted sample discovery, and (4)…

Machine Learning · Computer Science 2019-09-27 Jinsung Yoon , Sercan O. Arik , Tomas Pfister

Optimization Methods for Interpretable Differentiable Decision Trees in Reinforcement Learning

Decision trees are ubiquitous in machine learning for their ease of use and interpretability. Yet, these models are not typically employed in reinforcement learning as they cannot be updated online via stochastic gradient descent. We…

Machine Learning · Computer Science 2020-06-29 Andrew Silva , Taylor Killian , Ivan Dario Jimenez Rodriguez , Sung-Hyun Son , Matthew Gombolay

Learning to Maximize Mutual Information for Dynamic Feature Selection

Feature selection helps reduce data acquisition costs in ML, but the standard approach is to train models with static feature subsets. Here, we consider the dynamic feature selection (DFS) problem where a model sequentially queries features…

Machine Learning · Computer Science 2023-06-09 Ian Covert , Wei Qiu , Mingyu Lu , Nayoon Kim , Nathan White , Su-In Lee

Reinforcement Learning with a Focus on Adjusting Policies to Reach Targets

The objective of a reinforcement learning agent is to discover better actions through exploration. However, typical exploration techniques aim to maximize rewards, often incurring high costs in both exploration and learning processes. We…

Machine Learning · Computer Science 2024-12-24 Akane Tsuboya , Yu Kono , Tatsuji Takahashi

ADDS: Adaptive Differentiable Sampling for Robust Multi-Party Learning

Distributed multi-party learning provides an effective approach for training a joint model with scattered data under legal and practical constraints. However, due to the quagmire of a skewed distribution of data labels across participants…

Machine Learning · Computer Science 2021-11-01 Maoguo Gong , Yuan Gao , Yue Wu , A. K. Qin

Direct Differentiable Augmentation Search

Data augmentation has been an indispensable tool to improve the performance of deep neural networks, however the augmentation can hardly transfer among different tasks and datasets. Consequently, a recent trend is to adopt AutoML technique…

Computer Vision and Pattern Recognition · Computer Science 2021-10-13 Aoming Liu , Zehao Huang , Zhiwu Huang , Naiyan Wang

Learning Data Manipulation for Augmentation and Weighting

Manipulating data, such as weighting data examples or augmenting with new instances, has been increasingly used to improve model training. Previous work has studied various rule- or learning-based approaches designed for specific types of…

Machine Learning · Computer Science 2019-10-29 Zhiting Hu , Bowen Tan , Ruslan Salakhutdinov , Tom Mitchell , Eric P. Xing

Deep Reinforcement Learning for Resource Allocation in Business Processes

Assigning resources in business processes execution is a repetitive task that can be effectively automated. However, different automation methods may give varying results that may not be optimal. Proper resource allocation is crucial as it…

Machine Learning · Computer Science 2021-04-02 Kamil Żbikowski , Michał Ostapowicz , Piotr Gawrysiak

Friendly Training: Neural Networks Can Adapt Data To Make Learning Easier

In the last decade, motivated by the success of Deep Learning, the scientific community proposed several approaches to make the learning procedure of Neural Networks more effective. When focussing on the way in which the training data are…

Machine Learning · Computer Science 2021-06-22 Simone Marullo , Matteo Tiezzi , Marco Gori , Stefano Melacci

LLM Data Selection and Utilization via Dynamic Bi-level Optimization

While large-scale training data is fundamental for developing capable large language models (LLMs), strategically selecting high-quality data has emerged as a critical approach to enhance training efficiency and reduce computational costs.…

Machine Learning · Computer Science 2025-07-23 Yang Yu , Kai Han , Hang Zhou , Yehui Tang , Kaiqi Huang , Yunhe Wang , Dacheng Tao

Influence Guided Sampling for Domain Adaptation of Text Retrievers

General-purpose open-domain dense retrieval systems are usually trained with a large, eclectic mix of corpora and search tasks. How should these diverse corpora and tasks be sampled for training? Conventional approaches sample them…

Information Retrieval · Computer Science 2026-01-30 Meet Doshi , Vishwajeet Kumar , Yulong Li , Jaydeep Sen

Accelerating Deep Learning with Dynamic Data Pruning

Deep learning's success has been attributed to the training of large, overparameterized models on massive amounts of data. As this trend continues, model training has become prohibitively costly, requiring access to powerful computing…

Machine Learning · Computer Science 2021-11-25 Ravi S Raju , Kyle Daruwalla , Mikko Lipasti

Dynamic Memory for Interpretable Sequential Optimisation

Real-world applications of reinforcement learning for recommendation and experimentation faces a practical challenge: the relative reward of different bandit arms can evolve over the lifetime of the learning agent. To deal with these…

Machine Learning · Computer Science 2022-06-29 Srivas Chennu , Andrew Maher , Jamie Martin , Subash Prabanantham

Adaptive Data Dropout: Towards Self-Regulated Learning in Deep Neural Networks

Deep neural networks are typically trained by uniformly sampling large datasets across epochs, despite evidence that not all samples contribute equally throughout learning. Recent work shows that progressively reducing the amount of…

Machine Learning · Computer Science 2026-04-15 Amar Gahir , Varshil Patel , Shreyank N Gowda

Finding High-Value Training Data Subset through Differentiable Convex Programming

Finding valuable training data points for deep neural networks has been a core research challenge with many applications. In recent years, various techniques for calculating the "value" of individual training datapoints have been proposed…

Machine Learning · Computer Science 2021-04-29 Soumi Das , Arshdeep Singh , Saptarshi Chatterjee , Suparna Bhattacharya , Sourangshu Bhattacharya