Related papers: Differentially Private Data Release over Multiple …

200 papers

A common goal of privacy research is to release synthetic data that satisfies a formal privacy guarantee and can be used by an analyst in place of the original data. To achieve reasonable accuracy, a synthetic data set must be tuned to…

Databases · Computer Science 2015-03-20 Chao Li, Gerome Miklau

In this paper we demonstrate that, ignoring computational constraints, it is possible to privately release synthetic databases that are useful for large classes of queries -- much larger in size than the database itself. Specifically, we…

Data Structures and Algorithms · Computer Science 2011-09-13 Avrim Blum, Katrina Ligett, Aaron Roth

We study the problem of differentially private synthetic data generation for hierarchical datasets in which individual data points are grouped together (e.g., people within households). In particular, to measure the similarity between the…

Machine Learning · Computer Science 2022-06-14 Terrance Liu, Zhiwei Steven Wu

We propose, implement, and evaluate a new algorithm for releasing answers to very large numbers of statistical queries like $k$-way marginals, subject to differential privacy. Our algorithm makes adaptive use of a continuous relaxation of…

Machine Learning · Computer Science 2021-06-24 Sergul Aydore, William Brown, Michael Kearns, Krishnaram Kenthapadi, Luca Melis, Aaron Roth, Ankit Siva

This work considers computationally efficient privacy-preserving data release. We study the task of analyzing a database containing sensitive information about individual participants. Given a set of statistical queries on the data, we want…

Computational Complexity · Computer Science 2011-07-14 Moritz Hardt, Guy N. Rothblum, Rocco A. Servedio

Motivated by privacy concerns in long-term longitudinal studies in medical and social science research, we study the problem of continually releasing differentially private synthetic data from longitudinal data collections. We introduce a…

Data Structures and Algorithms · Computer Science 2024-05-28 Mark Bun, Marco Gaboardi, Marcel Neunhoeffer, Wanrong Zhang

Existing differentially private (DP) synthetic data generation mechanisms typically assume a single-source table. In practice, data is often distributed across multiple tables with relationships across tables. In this paper, we introduce…

Machine Learning · Computer Science 2025-01-22 Kaveh Alimohammadi, Hao Wang, Ojas Gulati, Akash Srivastava, Navid Azizan

Privately generating synthetic data from a table is an important brick of a privacy-first world. We propose and investigate a simple approach of treating each row in a table as a sentence and training a language model with differential…

Machine Learning · Computer Science 2023-06-09 Alexandre Sablayrolles, Yue Wang, Brian Karrer

Differential privacy (DP) provides formal guarantees that the output of a database query does not reveal too much information about any individual present in the database. While many differentially private algorithms have been proposed in…

Cryptography and Security · Computer Science 2019-11-27 Royce J Wilson, Celia Yuxin Zhang, William Lam, Damien Desfontaines, Daniel Simmons-Marengo, Bryant Gipson

Differential privacy allows quantifying privacy loss resulting from accessing sensitive personal data. Repeated accesses to underlying data incur increasing loss. Releasing data as privacy-preserving synthetic data would avoid this…

Machine Learning · Statistics 2021-06-10 Joonas Jälkö, Eemil Lagerspetz, Jari Haukka, Sasu Tarkoma, Antti Honkela, Samuel Kaski

We consider the problem of differentially private query release through a synthetic database approach. Departing from the existing approaches that require the query set to be specified in advance, we advocate to devise query-set independent…

Cryptography and Security · Computer Science 2014-12-02 Weina Wang, Lei Ying, Junshan Zhang

The problem of privately releasing data is to provide a version of a dataset without revealing sensitive information about the individuals who contribute to the data. The model of differential privacy allows such private release while…

Databases · Computer Science 2011-03-07 Graham Cormode, Magda Procopiuc, Divesh Srivastava, Thanh T. L. Tran

While differentially private synthetic data generation has been explored extensively in the literature, how to update this data in the future if the underlying private data changes is much less understood. We propose an algorithmic…

Cryptography and Security · Computer Science 2024-09-04 Girish Kumar, Thomas Strohmer, Roman Vershynin

Data sharing is a prerequisite for collaborative innovation, enabling organizations to leverage diverse datasets for deeper insights. In real-world applications like FinTech and Smart Manufacturing, transactional data, often in tabular…

Cryptography and Security · Computer Science 2024-11-07 Mengmeng Yang, Chi-Hung Chi, Kwok-Yan Lam, Jie Feng, Taolin Guo, Wei Ni

Many data analysis operations can be expressed as a GROUP BY query on an unbounded set of partitions, followed by a per-partition aggregation. To make such a query differentially private, adding noise to each aggregation is not enough: we…

Cryptography and Security · Computer Science 2021-11-01 Damien Desfontaines, James Voss, Bryant Gipson, Chinmoy Mandayam

Differential privacy (DP) is increasingly used to protect the release of hierarchical, tabular population data, such as census data. A common approach for implementing DP in this setting is to release noisy responses to a predefined set of…

Cryptography and Security · Computer Science 2024-04-03 Aadyaa Maddi, Swadhin Routray, Alexander Goldberg, Giulia Fanti

Concern about how to aggregate sensitive user data without compromising individual privacy is a major barrier to greater availability of data. The model of differential privacy has emerged as an accepted model to release sensitive…

Databases · Computer Science 2017-10-03 Graham Cormode, Tejas Kulkarni, Divesh Srivastava

The need to analyze sensitive data, such as medical records or financial data, has created a critical research challenge in recent years. In this paper, we adopt the framework of differential privacy, and explore mechanisms for generating…

Cryptography and Security · Computer Science 2024-05-09 Nikolija Bojkovic, Po-Ling Loh

We present a practical, differentially private algorithm for answering a large number of queries on high dimensional datasets. Like all algorithms for this task, ours necessarily has worst-case complexity exponential in the dimension of the…

Data Structures and Algorithms · Computer Science 2018-03-16 Marco Gaboardi, Emilio Jesús Gallego Arias, Justin Hsu, Aaron Roth, Zhiwei Steven Wu

Existing studies on differential privacy mainly consider aggregation on data sets where each entry corresponds to a particular participant to be protected. In many situations, a user may pose a relational algebra query on a sensitive…

Databases · Computer Science 2013-04-18 Shixi Chen, Shuigeng Zhou