Related papers: An Algorithm for Streaming Differentially Private …

200 papers

While differentially private synthetic data generation has been explored extensively in the literature, how to update this data in the future if the underlying private data changes is much less understood. We propose an algorithmic…

Cryptography and Security · Computer Science 2024-09-04 Girish Kumar, Thomas Strohmer, Roman Vershynin

Differential privacy is a rigorous definition for privacy that guarantees that any analysis performed on a sensitive dataset leaks no information about the individuals whose data are contained therein. In this work, we develop new…

Cryptography and Security · Computer Science 2021-11-18 Vassilis Digalakis, George N. Karystinos, Minos N. Garofalakis

Differential privacy is the state-of-the-art definition for privacy, guaranteeing that any analysis performed on a sensitive dataset leaks no information about the individuals whose data are contained therein. In this thesis, we develop…

Machine Learning · Computer Science 2023-11-29 Vassilis Digalakis

The need to analyze sensitive data, such as medical records or financial data, has created a critical research challenge in recent years. In this paper, we adopt the framework of differential privacy, and explore mechanisms for generating…

Cryptography and Security · Computer Science 2024-05-09 Nikolija Bojkovic, Po-Ling Loh

Techniques to deliver privacy-preserving synthetic datasets take a sensitive dataset as input and produce a similar dataset as output while maintaining differential privacy. These approaches have the potential to improve data sharing and…

Databases · Computer Science 2018-08-24 Luke Rodriguez, Bill Howe

Artificial intelligence and data access are already mainstream. One of the main challenges when designing an artificial intelligence or disclosing content from a database is preserving the privacy of individuals who participate in the…

Cryptography and Security · Computer Science 2023-12-13 Clément Pierquin, Bastien Zimmermann, Matthieu Boussard

Estimating the quantiles of a large dataset is a fundamental problem in both the streaming algorithms literature and the differential privacy literature. However, all existing private mechanisms for distribution-independent quantile…

Data Structures and Algorithms · Computer Science 2022-01-11 Daniel Alabi, Omri Ben-Eliezer, Anamay Chaturvedi

Differentially private data generation techniques have become a promising solution to the data privacy challenge: they enable sharing of data while complying with rigorous privacy guarantees, which is essential for scientific progress in…

Cryptography and Security · Computer Science 2022-11-09 Dingfan Chen, Raouf Kerkouche, Mario Fritz

Creation of a synthetic dataset that faithfully represents the data distribution and simultaneously preserves privacy is a major research challenge. Many space partitioning based approaches have emerged in recent years for answering…

Cryptography and Security · Computer Science 2023-06-26 Eleonora Kreačić, Navid Nouri, Vamsi K. Potluru, Tucker Balch, Manuela Veloso

The literature on data sanitization aims to design algorithms that take an input dataset and produce a privacy-preserving version of it, that captures some of its statistical properties. In this note we study this question from a streaming…

Data Structures and Algorithms · Computer Science 2021-11-30 Haim Kaplan, Uri Stemmer

Privacy-preserving estimation of counts of items in streaming data finds applications in several real-world scenarios including word auto-correction and traffic management applications. Recent works of RAPPOR and Apple's count-mean sketch…

Data Structures and Algorithms · Computer Science 2022-12-01 Dinusha Vatsalan, Raghav Bhaskar, Mohamed Ali Kaafar

The streaming model of computation is a popular approach for working with large-scale data. In this setting, there is a stream of items and the goal is to compute the desired quantities (usually data statistics) while making a single pass…

Data Structures and Algorithms · Computer Science 2023-01-16 Alessandro Epasto, Jieming Mao, Andres Munoz Medina, Vahab Mirrokni, Sergei Vassilvitskii, Peilin Zhong

Private synthetic data sharing is preferred as it keeps the distribution and nuances of original data compared to summary statistics. The state-of-the-art methods adopt a select-measure-generate paradigm, but measuring large domain…

Cryptography and Security · Computer Science 2023-10-11 Meifan Zhang, Dihang Deng, Lihua Yin

Existing differentially private (DP) synthetic data generation mechanisms typically assume a single-source table. In practice, data is often distributed across multiple tables with relationships across tables. In this paper, we introduce…

Machine Learning · Computer Science 2025-01-22 Kaveh Alimohammadi, Hao Wang, Ojas Gulati, Akash Srivastava, Navid Azizan

While power systems research relies on the availability of real-world network datasets, data owners (e.g., system operators) are hesitant to share data due to security and privacy risks. To control these risks, we develop privacy-preserving…

Cryptography and Security · Computer Science 2023-03-21 Vladimir Dvorkin, Audun Botterud

In a world where artificial intelligence and data science become omnipresent, data sharing is increasingly locking horns with data-privacy concerns. Differential privacy has emerged as a rigorous framework for protecting individual privacy…

Cryptography and Security · Computer Science 2022-06-06 March Boedihardjo, Thomas Strohmer, Roman Vershynin

Differential privacy is a mathematical concept that provides an information-theoretic security guarantee. While differential privacy has emerged as a de facto standard for guaranteeing privacy in data sharing, the known mechanisms to…

Cryptography and Security · Computer Science 2024-03-26 March Boedihardjo, Thomas Strohmer, Roman Vershynin

Motivated by privacy concerns in long-term longitudinal studies in medical and social science research, we study the problem of continually releasing differentially private synthetic data from longitudinal data collections. We introduce a…

Data Structures and Algorithms · Computer Science 2024-05-28 Mark Bun, Marco Gaboardi, Marcel Neunhoeffer, Wanrong Zhang

Online learning has long been in the spotlight of the machine learning community. To handle massive data in the Big Data era, a single learner could never efficiently finish this heavy task. Hence, in this paper, we propose a novel…

Machine Learning · Computer Science 2015-06-24 Chencheng Li, Pan Zhou

Differential privacy allows quantifying privacy loss resulting from accessing sensitive personal data. Repeated accesses to underlying data incur increasing loss. Releasing data as privacy-preserving synthetic data would avoid this…

Machine Learning · Statistics 2021-06-10 Joonas Jälkö, Eemil Lagerspetz, Jari Haukka, Sasu Tarkoma, Antti Honkela, Samuel Kaski