Related papers: Anonymizing Machine Learning Models

Human-Centered Interactive Anonymization for Privacy-Preserving Machine Learning: A Case for Human-Guided k-Anonymity

Privacy-preserving machine learning (ML) seeks to balance data utility and privacy, especially as regulations like the GDPR mandate the anonymization of personal data for ML applications. Conventional anonymization approaches often reduce…

Cryptography and Security · Computer Science 2025-07-08 Sri Harsha Gajavalli

$k$-Anonymity in Practice: How Generalisation and Suppression Affect Machine Learning Classifiers

The protection of private information is a crucial issue in data-driven research and business contexts. Typically, techniques like anonymisation or (selective) deletion are introduced in order to allow data sharing, e. g. in the case of…

Machine Learning · Computer Science 2022-06-23 Djordje Slijepčević , Maximilian Henzl , Lukas Daniel Klausner , Tobias Dam , Peter Kieseberg , Matthias Zeppelzauer

Multi-Objective Optimization-Based Anonymization of Structured Data for Machine Learning Application

Organizations are collecting vast amounts of data, but they often lack the capabilities needed to fully extract insights. As a result, they increasingly share data with external experts, such as analysts or researchers, to gain value from…

Machine Learning · Computer Science 2025-05-16 Yusi Wei , Hande Y. Benson , Joseph K. Agor , Muge Capan

Comparison of machine learning models applied on anonymized data with different techniques

Anonymization techniques based on obfuscating the quasi-identifiers by means of value generalization hierarchies are widely used to achieve preset levels of privacy. To prevent different types of attacks against database privacy it is…

Machine Learning · Computer Science 2023-05-15 Judith Sáinz-Pardo Díaz , Álvaro López García

Anonymizing Data for Privacy-Preserving Federated Learning

Federated learning enables training a global machine learning model from data distributed across multiple sites, without having to move the data. This is particularly relevant in healthcare applications, where data is rife with personal,…

Cryptography and Security · Computer Science 2020-02-24 Olivia Choudhury , Aris Gkoulalas-Divanis , Theodoros Salonidis , Issa Sylla , Yoonyoung Park , Grace Hsu , Amar Das

AI-Driven Anonymization: Protecting Personal Data Privacy While Leveraging Machine Learning

The development of artificial intelligence has significantly transformed people's lives. However, it has also posed a significant threat to privacy and security, with numerous instances of personal information being exposed online and…

Cryptography and Security · Computer Science 2024-02-28 Le Yang , Miao Tian , Duan Xin , Qishuo Cheng , Jiajian Zheng

Learning Anonymized Representations with Adversarial Neural Networks

Statistical methods protecting sensitive information or the identity of the data owner have become critical to ensure privacy of individuals as well as of organizations. This paper investigates anonymization methods based on representation…

Machine Learning · Statistics 2018-02-27 Clément Feutry , Pablo Piantanida , Yoshua Bengio , Pierre Duhamel

Learning from Anonymized and Incomplete Tabular Data

User-driven privacy allows individuals to control whether and at what granularity their data is shared, leading to datasets that mix original, generalized, and missing values within the same records and attributes. While such…

Machine Learning · Computer Science 2026-02-03 Lucas Lange , Adrian Böttinger , Victor Christen , Anushka Vidanage , Peter Christen , Erhard Rahm

Data Minimization for GDPR Compliance in Machine Learning Models

The EU General Data Protection Regulation (GDPR) mandates the principle of data minimization, which requires that only data necessary to fulfill a certain purpose be collected. However, it can often be difficult to determine the minimal…

Machine Learning · Computer Science 2022-02-02 Abigail Goldsteen , Gilad Ezov , Ron Shmelkin , Micha Moffie , Ariel Farkash

Energy cost and machine learning accuracy impact of k-anonymisation and synthetic data techniques

To address increasing societal concerns regarding privacy and climate, the EU adopted the General Data Protection Regulation (GDPR) and committed to the Green Deal. Considerable research studied the energy efficiency of software and the…

Machine Learning · Computer Science 2023-10-31 Pepijn de Reus , Ana Oprescu , Koen van Elsen

Machine Learning with Membership Privacy using Adversarial Regularization

Machine learning models leak information about the datasets on which they are trained. An adversary can build an algorithm to trace the individual members of a model's training dataset. As a fundamental inference attack, he aims to…

Machine Learning · Statistics 2018-07-17 Milad Nasr , Reza Shokri , Amir Houmansadr

Rethinking Anonymity Claims in Synthetic Data Generation: A Model-Centric Privacy Attack Perspective

Training generative machine learning models to produce synthetic tabular data has become a popular approach for enhancing privacy in data sharing. As this typically involves processing sensitive personal information, releasing either the…

Cryptography and Security · Computer Science 2026-02-02 Georgi Ganev , Emiliano De Cristofaro

Can the Utility of Anonymized Data be used for Privacy Breaches?

Group based anonymization is the most widely studied approach for privacy preserving data publishing. This includes k-anonymity, l-diversity, and t-closeness, to name a few. The goal of this paper is to raise a fundamental issue on the…

Databases · Computer Science 2009-05-13 Raymond Chi-Wing Wong , Ada Wai-Chee Fu , Ke Wang , Yabo Xu , Philip S. Yu

Generalization Techniques Empirically Outperform Differential Privacy against Membership Inference

Differentially private training algorithms provide protection against one of the most popular attacks in machine learning: the membership inference attack. However, these privacy algorithms incur a loss of the model's classification…

Cryptography and Security · Computer Science 2021-10-13 Jiaxiang Liu , Simon Oya , Florian Kerschbaum

Approximate, Adapt, Anonymize (3A): a Framework for Privacy Preserving Training Data Release for Machine Learning

The availability of large amounts of informative data is crucial for successful machine learning. However, in domains with sensitive information, the release of high-utility data which protects the privacy of individuals has proven…

Machine Learning · Computer Science 2023-07-06 Tamas Madl , Weijie Xu , Olivia Choudhury , Matthew Howard

Augmenting Anonymized Data with AI: Exploring the Feasibility and Limitations of Large Language Models in Data Enrichment

Large Language Models (LLMs) have demonstrated advanced capabilities in both text generation and comprehension, and their application to data archives might facilitate the privatization of sensitive information about the data subjects. In…

Cryptography and Security · Computer Science 2025-04-08 Stefano Cirillo , Domenico Desiato , Giuseppe Polese , Monica Maria Lucia Sebillo , Giandomenico Solimando

A Review on Machine Unlearning

Recently, an increasing number of laws have governed the useability of users' privacy. For example, Article 17 of the General Data Protection Regulation (GDPR), the right to be forgotten, requires machine learning applications to remove a…

Machine Learning · Computer Science 2024-11-19 Haibo Zhang , Toru Nakamura , Takamasa Isohara , Kouichi Sakurai

Privacy-preserving Machine Learning through Data Obfuscation

As machine learning becomes a practice and commodity, numerous cloud-based services and frameworks are provided to help customers develop and deploy machine learning applications. While it is prevalent to outsource model training and…

Cryptography and Security · Computer Science 2018-07-16 Tianwei Zhang , Zecheng He , Ruby B. Lee

Towards more accurate and useful data anonymity vulnerability measures

The purpose of anonymizing structured data is to protect the privacy of individuals in the data while retaining the statistical properties of the data. There is a large body of work that examines anonymization vulnerabilities. Focusing on…

Cryptography and Security · Computer Science 2024-03-12 Paul Francis , David Wagner

When Machine Unlearning Jeopardizes Privacy

The right to be forgotten states that a data owner has the right to erase their data from an entity storing it. In the context of machine learning (ML), the right to be forgotten requires an ML model owner to remove the data owner's data…

Cryptography and Security · Computer Science 2021-09-15 Min Chen , Zhikun Zhang , Tianhao Wang , Michael Backes , Mathias Humbert , Yang Zhang