English
Related papers

Related papers: EcoVal: An Efficient Data Valuation Framework for …

200 papers

As data becomes the fuel driving technological and economic growth, a fundamental challenge is how to quantify the value of data in algorithmic predictions and decisions. For example, in healthcare and consumer markets, it has been…

Machine Learning · Statistics 2019-06-11 Amirata Ghorbani , James Zou

"How much is my data worth?" is an increasingly common question posed by organizations and individuals alike. An answer to this question could allow, for instance, fairly distributing profits among multiple data contributors and determining…

Machine Learning · Computer Science 2023-03-07 Ruoxi Jia , David Dao , Boxin Wang , Frances Ann Hubis , Nick Hynes , Nezihe Merve Gurel , Bo Li , Ce Zhang , Dawn Song , Costas Spanos

Data valuation has found various applications in machine learning, such as data filtering, efficient learning and incentives for data sharing. The most popular current approach to data valuation is the Shapley value. While popular for its…

Machine Learning · Computer Science 2023-11-10 Lauren Watson , Zeno Kujawa , Rayna Andreeva , Hao-Tsung Yang , Tariq Elahi , Rik Sarkar

The proliferation of large models has intensified the need for efficient data valuation methods to quantify the contribution of individual data providers. Traditional approaches, such as game-theory-based Shapley value and…

Artificial Intelligence · Computer Science 2025-09-24 Le Ma , Shirao Yang , Zihao Wang , Yinggui Wang , Lei Wang , Tao Wei , Kejun Zhang

Assessing the importance of individual training samples is a key challenge in machine learning. Traditional approaches retrain models with and without specific samples, which is computationally expensive and ignores dependencies between…

Machine Learning · Computer Science 2024-12-18 Tim Wibiral , Mohamed Karim Belaid , Maximilian Rabus , Ansgar Scherp

The value and copyright of training data are crucial in the artificial intelligence industry. Service platforms should protect data providers' legitimate rights and fairly reward them for their contributions. Shapley value, a potent tool…

Machine Learning · Computer Science 2025-11-21 Haifeng Sun , Yu Xiong , Runze Wu , Xinyu Cai , Changjie Fan , Lan Zhang , Xiang-Yang Li

Shapley value is a classic notion from game theory, historically used to quantify the contributions of individuals within groups, and more recently applied to assign values to data points when training machine learning models. Despite its…

Machine Learning · Computer Science 2020-02-28 Amirata Ghorbani , Michael P. Kim , James Zou

Data valuation is an essential task in a data marketplace. It aims at fairly compensating data owners for their contribution. There is increasing recognition in the machine learning community that the Shapley value -- a foundational…

Cryptography and Security · Computer Science 2023-02-20 Zhihua Tian , Jian Liu , Jingyu Li , Xinle Cao , Ruoxi Jia , Jun Kong , Mengdi Liu , Kui Ren

Data Shapley has recently been proposed as a principled framework to quantify the contribution of individual datum in machine learning. It can effectively identify helpful or harmful data points for a learning algorithm. In this paper, we…

Machine Learning · Computer Science 2022-01-20 Yongchan Kwon , James Zou

Assessing the quality and impact of individual data points is critical for improving model performance and mitigating undesirable biases within the training dataset. Several data valuation algorithms have been proposed to quantify data…

Machine Learning · Computer Science 2023-10-16 Kevin Fu Jiang , Weixin Liang , James Zou , Yongchan Kwon

We consider the dataset valuation problem, that is, the problem of quantifying the incremental gain, to some relevant pre-defined utility of a machine learning task, of aggregating an individual dataset to others. The Shapley value is a…

Artificial Intelligence · Computer Science 2025-02-25 Felipe Garrido-Lucero , Benjamin Heymann , Maxime Vono , Patrick Loiseau , Vianney Perchet

As data emerges as a vital driver of technological and economic advancements, a key challenge is accurately quantifying its value in algorithmic decision-making. The Shapley value, a well-established concept from cooperative game theory,…

Computer Science and Game Theory · Computer Science 2025-11-20 Xi Zheng , Xiangyu Chang , Ruoxi Jia , Yong Tan

Distributional data Shapley value (DShapley) has recently been proposed as a principled framework to quantify the contribution of individual datum in machine learning. DShapley develops the foundational game theory concept of Shapley values…

Machine Learning · Statistics 2021-02-19 Yongchan Kwon , Manuel A. Rivas , James Zou

Data valuation has garnered increasing attention in recent years, given the critical role of high-quality data in various applications. Among diverse data valuation approaches, Shapley value-based methods are predominant due to their strong…

Machine Learning · Computer Science 2025-11-27 Xiaoling Zhou , Ou Wu , Michael K. Ng , Hao Jiang

Measuring the value of individual samples is critical for many data-driven tasks, e.g., the training of a deep learning model. Recent literature witnesses the substantial efforts in developing data valuation methods. The primary data…

Machine Learning · Computer Science 2024-06-06 Ou Wu , Weiyao Zhu , Mengyang Li

Understanding the decision-making process of machine learning models is crucial for ensuring trustworthy machine learning. Data Shapley, a landmark study on data valuation, advances this understanding by assessing the contribution of each…

Computer Science and Game Theory · Computer Science 2025-01-23 Huaiguang Cai

Data valuation is increasingly used in machine learning (ML) to decide the fair compensation for data owners and identify valuable or harmful data for improving ML models. Cooperative game theory-based data valuation, such as Data Shapley,…

Machine Learning · Computer Science 2025-07-09 Kieu Thao Nguyen Pham , Rachael Hwee Ling Sim , Quoc Phong Nguyen , See Kiong Ng , Bryan Kian Hsiang Low

Data valuation, or the valuation of individual datum contributions, has seen growing interest in machine learning due to its demonstrable efficacy for tasks such as noisy label detection. In particular, due to the desirable axiomatic…

Machine Learning · Computer Science 2022-11-15 Stephanie Schoch , Haifeng Xu , Yangfeng Ji

Data valuation -- quantifying the contribution of individual data sources to certain predictive behaviors of a model -- is of great importance to enhancing the transparency of machine learning and designing incentive systems for data…

Machine Learning · Computer Science 2023-07-28 Zhihong Liu , Hoang Anh Just , Xiangyu Chang , Xi Chen , Ruoxi Jia

Data valuation has become an increasingly significant discipline in data science due to the economic value of data. In the context of machine learning (ML), data valuation methods aim to equitably measure the contribution of each data point…

Machine Learning · Computer Science 2023-06-13 Xiang Li , Haocheng Xia , Jinfei Liu
‹ Prev 1 2 3 10 Next ›