English
Related papers

Related papers: Is Data Valuation Learnable and Interpretable?

200 papers

The proliferation of large models has intensified the need for efficient data valuation methods to quantify the contribution of individual data providers. Traditional approaches, such as game-theory-based Shapley value and…

Artificial Intelligence · Computer Science 2025-09-24 Le Ma , Shirao Yang , Zihao Wang , Yinggui Wang , Lei Wang , Tao Wei , Kejun Zhang

"How much is my data worth?" is an increasingly common question posed by organizations and individuals alike. An answer to this question could allow, for instance, fairly distributing profits among multiple data contributors and determining…

Machine Learning · Computer Science 2023-03-07 Ruoxi Jia , David Dao , Boxin Wang , Frances Ann Hubis , Nick Hynes , Nezihe Merve Gurel , Bo Li , Ce Zhang , Dawn Song , Costas Spanos

As data becomes the fuel driving technological and economic growth, a fundamental challenge is how to quantify the value of data in algorithmic predictions and decisions. For example, in healthcare and consumer markets, it has been…

Machine Learning · Statistics 2019-06-11 Amirata Ghorbani , James Zou

Data valuation, or the valuation of individual datum contributions, has seen growing interest in machine learning due to its demonstrable efficacy for tasks such as noisy label detection. In particular, due to the desirable axiomatic…

Machine Learning · Computer Science 2022-11-15 Stephanie Schoch , Haifeng Xu , Yangfeng Ji

Data valuation has become an increasingly significant discipline in data science due to the economic value of data. In the context of machine learning (ML), data valuation methods aim to equitably measure the contribution of each data point…

Machine Learning · Computer Science 2023-06-13 Xiang Li , Haocheng Xia , Jinfei Liu

Data is a critical asset for training large language models (LLMs), alongside compute resources and skilled workers. While some training data is publicly available, substantial investment is required to generate proprietary datasets, such…

Machine Learning · Computer Science 2026-01-27 Mélissa Tamine , Otmane Sakhi , Benjamin Heymann

Data valuation has found various applications in machine learning, such as data filtering, efficient learning and incentives for data sharing. The most popular current approach to data valuation is the Shapley value. While popular for its…

Machine Learning · Computer Science 2023-11-10 Lauren Watson , Zeno Kujawa , Rayna Andreeva , Hao-Tsung Yang , Tariq Elahi , Rik Sarkar

Shapley value is a classic notion from game theory, historically used to quantify the contributions of individuals within groups, and more recently applied to assign values to data points when training machine learning models. Despite its…

Machine Learning · Computer Science 2020-02-28 Amirata Ghorbani , Michael P. Kim , James Zou

Data valuation seeks to answer the important question, "How much is this data worth?" Existing data valuation methods have largely focused on discriminative models, primarily examining data value through the lens of its utility in training.…

Machine Learning · Computer Science 2024-10-17 Mohamad Rida Rammal , Ruida Zhou , Suhas Diggavi

Data valuation has garnered increasing attention in recent years, given the critical role of high-quality data in various applications. Among diverse data valuation approaches, Shapley value-based methods are predominant due to their strong…

Machine Learning · Computer Science 2025-11-27 Xiaoling Zhou , Ou Wu , Michael K. Ng , Hao Jiang

Distributional data Shapley value (DShapley) has recently been proposed as a principled framework to quantify the contribution of individual datum in machine learning. DShapley develops the foundational game theory concept of Shapley values…

Machine Learning · Statistics 2021-02-19 Yongchan Kwon , Manuel A. Rivas , James Zou

With the adoption of machine learning-based solutions in routine clinical practice, the need for reliable interpretability tools has become pressing. Shapley values provide local explanations. The method gained popularity in recent years.…

Methodology · Statistics 2023-06-27 Lucile Ter-Minassian , Sahra Ghalebikesabi , Karla Diaz-Ordaz , Chris Holmes

Shapley values are great analytical tools in game theory to measure the importance of a player in a game. Due to their axiomatic and desirable properties such as efficiency, they have become popular for feature importance analysis in data…

Machine Learning · Computer Science 2020-10-26 Ramin Okhrati , Aldo Lipani

The presence of artificial intelligence (AI) in our society is increasing, which brings with it the need to understand the behavior of AI mechanisms, including machine learning predictive algorithms fed with tabular data, text or images,…

Machine Learning · Statistics 2025-06-06 Pedro Delicado , Cristian Pachón-García

We study instancewise feature importance scoring as a method for model interpretation. Any such method yields, for each predicted instance, a vector of importance scores associated with the feature vector. Methods based on the Shapley score…

Machine Learning · Computer Science 2018-08-09 Jianbo Chen , Le Song , Martin J. Wainwright , Michael I. Jordan

Data valuation is an essential task in a data marketplace. It aims at fairly compensating data owners for their contribution. There is increasing recognition in the machine learning community that the Shapley value -- a foundational…

Cryptography and Security · Computer Science 2023-02-20 Zhihua Tian , Jian Liu , Jingyu Li , Xinle Cao , Ruoxi Jia , Jun Kong , Mengdi Liu , Kui Ren

Deep neural networks have demonstrated remarkable performance across various domains, yet their decision-making processes remain opaque. Although many explanation methods are dedicated to bringing the obscurity of DNNs to light, they…

Computer Vision and Pattern Recognition · Computer Science 2025-12-17 Kanglong Fan , Yunqiao Yang , Chen Ma

Data valuation -- quantifying the contribution of individual data sources to certain predictive behaviors of a model -- is of great importance to enhancing the transparency of machine learning and designing incentive systems for data…

Machine Learning · Computer Science 2023-07-28 Zhihong Liu , Hoang Anh Just , Xiangyu Chang , Xi Chen , Ruoxi Jia

Explainable AI (XAI) is widely used to analyze AI systems' decision-making, such as providing counterfactual explanations for recourse. When unexpected explanations occur, users may want to understand the training data properties shaping…

Machine Learning · Computer Science 2025-03-26 André Artelt , Barbara Hammer

The Shapley value provides a principled framework for fairly distributing rewards among participants according to their individual contributions. While prior work has applied this concept to data valuation in machine learning, existing…

Computer Science and Game Theory · Computer Science 2026-01-22 Zhuofan Jia , Jian Pei
‹ Prev 1 2 3 10 Next ›