English
Related papers

Related papers: Differentially Private Weighted Sampling

200 papers

Many machine learning applications are based on data collected from people, such as their tastes and behaviour as well as biological traits and genetic data. Regardless of how important the application might be, one has to make sure…

Machine Learning · Statistics 2017-04-11 Joonas Jälkö , Onur Dikmen , Antti Honkela

Increasing interest in privacy-preserving machine learning has led to new and evolved approaches for generating private synthetic data from undisclosed real data. However, mechanisms of privacy preservation can significantly reduce the…

Machine Learning · Statistics 2022-05-23 Sahra Ghalebikesabi , Harrison Wilde , Jack Jewson , Arnaud Doucet , Sebastian Vollmer , Chris Holmes

In a world where artificial intelligence and data science become omnipresent, data sharing is increasingly locking horns with data-privacy concerns. Differential privacy has emerged as a rigorous framework for protecting individual privacy…

Cryptography and Security · Computer Science 2022-06-06 March Boedihardjo , Thomas Strohmer , Roman Vershynin

For scalable machine learning on large data sets, subsampling a representative subset is a common approach for efficient model training. This is often achieved through importance sampling, whereby informative data points are sampled more…

Cryptography and Security · Computer Science 2025-03-31 Dominik Fay , Sebastian Mair , Jens Sjölund

Differential privacy comes equipped with multiple analytical tools for the design of private data analyses. One important tool is the so-called "privacy amplification by subsampling" principle, which ensures that a differentially private…

Machine Learning · Computer Science 2018-11-26 Borja Balle , Gilles Barthe , Marco Gaboardi

In order to remain competitive, Internet companies collect and analyse user data for the purpose of improving user experiences. Frequency estimation is a widely used statistical tool which could potentially conflict with the relevant…

Cryptography and Security · Computer Science 2021-04-14 Mengmeng Yang , Ivan Tjuawinata , Kwok-Yan Lam , Tianqing Zhu , Jun Zhao

In general, it is challenging to release differentially private versions of survey-weighted statistics with low error for acceptable privacy loss. This is because weighted statistics from complex sample survey data can be more sensitive to…

Cryptography and Security · Computer Science 2024-11-08 Jeremy Seeman , Yajuan Si , Jerome P Reiter

As data-privacy requirements are becoming increasingly stringent and statistical models based on sensitive data are being deployed and used more routinely, protecting data-privacy becomes pivotal. Partial Least Squares (PLS) regression is…

Machine Learning · Statistics 2024-12-13 Ramin Nikzad-Langerodi , Mohit Kumar , Du Nguyen Duy , Mahtab Alghasi

We consider the privacy amplification properties of a sampling scheme in which a user's data is used in $k$ steps chosen randomly and uniformly from a sequence (or set) of $t$ steps. This sampling scheme has been recently applied in the…

Machine Learning · Computer Science 2026-02-20 Vitaly Feldman , Moshe Shenfeld

With the recent bloom of data, there is a huge surge in threats against individuals' private information. Various techniques for optimizing privacy-preserving data analysis are at the focus of research in the recent years. In this paper, we…

Cryptography and Security · Computer Science 2022-11-11 Sayan Biswas , Graham Cormode , Carsten Maple

The ability to quantify information transmission is crucial for the analysis and design of natural and engineered systems. The information transmission rate is the fundamental measure for systems with time-varying signals, yet computing it…

Biological Physics · Physics 2025-09-26 Manuel Reinhardt , Gašper Tkačik , Pieter Rein ten Wolde

Differential privacy is becoming a gold standard for privacy research; it offers a guaranteed bound on loss of privacy due to release of query results, even under worst-case assumptions. The theory of differential privacy is an active…

The sliding window model of computation captures scenarios in which data are continually arriving in the form of a stream, and only the most recent $w$ items are used for analysis. In this setting, an algorithm needs to accurately track…

Cryptography and Security · Computer Science 2024-06-13 Yiping Wang , Yanhao Wang , Cen Chen

Differential privacy guarantees allow the results of a statistical analysis involving sensitive data to be released without compromising the privacy of any individual taking part. Achieving such guarantees generally requires the injection…

Machine Learning · Statistics 2023-10-31 Jack Jewson , Sahra Ghalebikesabi , Chris Holmes

Traditional statistical methods for confidentiality protection of statistical databases do not scale well to deal with GWAS (genome-wide association studies) databases especially in terms of guarantees regarding protection from linkage to…

Methodology · Statistics 2012-05-04 Caroline Uhler , Aleksandra B. Slavkovic , Stephen E. Fienberg

Differential privacy is the leading mathematical framework for privacy protection, providing a probabilistic guarantee that safeguards individuals' private information when publishing statistics from a dataset. This guarantee is achieved by…

Methodology · Statistics 2025-08-19 Yuki Ohnishi , Jordan Awan

Differential privacy (DP) considers a scenario, where an adversary has almost complete information about the entries of a database This worst-case assumption is likely to overestimate the privacy thread for an individual in real life.…

Cryptography and Security · Computer Science 2025-04-16 Dennis Breutigam , Rüdiger Reischuk

Differentially private data generation techniques have become a promising solution to the data privacy challenge -- it enables sharing of data while complying with rigorous privacy guarantees, which is essential for scientific progress in…

Cryptography and Security · Computer Science 2022-11-09 Dingfan Chen , Raouf Kerkouche , Mario Fritz

This study presents Weighted Sampled Split Learning (WSSL), an innovative framework tailored to bolster privacy, robustness, and fairness in distributed machine learning systems. Unlike traditional approaches, WSSL disperses the learning…

Machine Learning · Computer Science 2023-10-31 Manish Osti , Aashray Thakuri , Basheer Qolomany , Aos Mulahuwaish

Confidence intervals are a fundamental tool for quantifying the uncertainty of parameters of interest. With the increase of data privacy awareness, developing a private version of confidence intervals has gained growing attention from both…

Methodology · Statistics 2024-04-12 Shurong Lin , Mark Bun , Marco Gaboardi , Eric D. Kolaczyk , Adam Smith
‹ Prev 1 2 3 10 Next ›