Related papers: Differentially Private Kernel Density Estimation

Efficiently Computing Similarities to Private Datasets

Many methods in differentially private model training rely on computing the similarity between a query point (such as public or synthetic data) and private data. We abstract out this common subroutine and study the following fundamental…

Cryptography and Security · Computer Science 2024-03-15 Arturs Backurs, Zinan Lin, Sepideh Mahabadi, Sandeep Silwal, Jakub Tarnawski

Fast Private Kernel Density Estimation via Locality Sensitive Quantization

We study efficient mechanisms for differentially private kernel density estimation (DP-KDE). Prior work for the Gaussian kernel described algorithms that run in time exponential in the number of dimensions $d$. This paper breaks the…

Data Structures and Algorithms · Computer Science 2023-07-06 Tal Wagner, Yonatan Naamad, Nina Mishra

On Differentially Private Counting on Trees

We study the problem of performing counting queries at different levels in hierarchical structures while preserving individuals' privacy. Motivated by applications, we propose a new error measure for this problem by considering a…

Data Structures and Algorithms · Computer Science 2023-04-28 Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Kewen Wu

A One-Pass Private Sketch for Most Machine Learning Tasks

Differential privacy (DP) is a compelling privacy definition that explains the privacy-utility tradeoff via formal, provable guarantees. Inspired by recent progress toward general-purpose data release algorithms, we propose a private…

Data Structures and Algorithms · Computer Science 2020-06-17 Benjamin Coleman, Anshumali Shrivastava

Numerical Composition of Differential Privacy

We give a fast algorithm to optimally compose privacy guarantees of differentially private (DP) algorithms to arbitrary accuracy. Our method is based on the notion of privacy loss random variables to quantify the privacy loss of DP…

Data Structures and Algorithms · Computer Science 2021-10-27 Sivakanth Gopi, Yin Tat Lee, Lukas Wutschitz

Differentially Private Synthetic Data Using KD-Trees

Creation of a synthetic dataset that faithfully represents the data distribution and simultaneously preserves privacy is a major research challenge. Many space partitioning based approaches have emerged in recent years for answering…

Cryptography and Security · Computer Science 2023-06-26 Eleonora Kreačić, Navid Nouri, Vamsi K. Potluru, Tucker Balch, Manuela Veloso

Auditing Differential Privacy in High Dimensions with the Kernel Quantum Rényi Divergence

Differential privacy (DP) is the de facto standard for private data release and private machine learning. Auditing black-box DP algorithms and mechanisms to certify whether they satisfy a certain DP guarantee is challenging, especially in…

Machine Learning · Computer Science 2022-05-30 Carles Domingo-Enrich, Youssef Mroueh

k-Means SubClustering: A Differentially Private Algorithm with Improved Clustering Quality

In today's data-driven world, the sensitivity of information has been a significant concern. With this data and additional information on the person's background, one can easily infer an individual's private data. Many differentially…

Machine Learning · Computer Science 2023-01-10 Devvrat Joshi, Janvi Thakkar

In the kernel density estimation (KDE) problem, we are given a set $X$ of data points in $\mathbb{R}^d$, a kernel function $k: \mathbb{R}^d \times \mathbb{R}^d \rightarrow \mathbb{R}$, and a query point $\mathbf{q} \in \mathbb{R}^d$, and…

Data Structures and Algorithms · Computer Science 2025-07-03 Steinar Laenen, Peter Macgregor, He Sun

Double-Private Distributed Estimation Algorithm Using Differential Privacy and a Key-Like Proportionate Matrix with Its Performance Analysis

In this brief, we present an enhanced privacy-preserving distributed estimation algorithm, referred to as the "Double-Private Algorithm," which combines the principles of both differential privacy (DP) and cryptography. The proposed…

Signal Processing · Electrical Eng. & Systems 2024-03-19 Mehdi Korki, Fatemehsadat Hosseiniamin, Hadi Zayyani, Mehdi Bekrani

Differentially Private Optimization on Large Model at Small Cost

Differentially private (DP) optimization is the standard paradigm to learn large neural networks that are accurate and privacy-preserving. The computational cost for DP deep learning, however, is notoriously heavy due to the per-sample…

Machine Learning · Computer Science 2023-09-20 Zhiqi Bu, Yu-Xiang Wang, Sheng Zha, George Karypis

Differentially Private Random Feature Model

Designing privacy-preserving machine learning algorithms has received great attention in recent years, especially in the setting when the data contains sensitive information. Differential privacy (DP) is a widely used mechanism for data…

Machine Learning · Computer Science 2025-09-11 Chunyang Liao, Deanna Needell, Hayden Schaeffer, Alexander Xue

Provable Differentially Private Computation of the Cross-Attention Mechanism

Cross-attention has emerged as a cornerstone module in modern artificial intelligence, underpinning critical applications such as retrieval-augmented generation (RAG), system prompting, and guided stable diffusion. However, this is a rising…

Machine Learning · Computer Science 2026-01-26 Yekun Ke, Yingyu Liang, Zhenmei Shi, Zhao Song, Jiahao Zhang

Optimal partition selection with Rényi differential privacy

A common problem in private data analysis is the partition selection problem, where each user holds a set of partitions (e.g. keys in a GROUP BY operation) from a possibly unbounded set. The challenge here is in maximizing the set of…

Cryptography and Security · Computer Science 2026-03-12 Charlie Harrison, Pasin Manurangsi

Distributed Differentially Private Computation of Functions with Correlated Noise

Many applications of machine learning, such as human health research, involve processing private or sensitive information. Privacy concerns may impose significant hurdles to collaboration in scenarios where there are multiple sites holding…

Machine Learning · Computer Science 2021-02-24 Hafiz Imtiaz, Jafar Mohammadi, Anand D. Sarwate

FastLloyd: Federated, Accurate, Secure, and Tunable $k$-Means Clustering with Differential Privacy

We study the problem of privacy-preserving $k$-means clustering in the horizontally federated setting. Existing federated approaches using secure computation suffer from substantial overheads and do not offer output privacy. At the same…

Cryptography and Security · Computer Science 2025-06-12 Abdulrahman Diaa, Thomas Humphries, Florian Kerschbaum

A Neural Database for Differentially Private Spatial Range Queries

Mobile apps and location-based services generate large amounts of location data that can benefit research on traffic optimization, context-aware notifications and public health (e.g., spread of contagious diseases). To preserve individual…

Databases · Computer Science 2021-08-04 Sepanta Zeighami, Ritesh Ahuja, Gabriel Ghinita, Cyrus Shahabi

Differentially Private Estimation and Inference in High-Dimensional Regression with FDR Control

This paper proposes new methodologies for conducting practical differentially private (DP) estimation and inference in high-dimensional linear regression. We first introduce a DP Bayesian Information Criterion (DP-BIC) for selecting the…

Methodology · Statistics 2026-04-13 Zhanrui Cai, Sai Li, Xintao Xia, Linjun Zhang

Differential Privacy in Kernelized Contextual Bandits via Random Projections

We consider the problem of contextual kernel bandits with stochastic contexts, where the underlying reward function belongs to a known Reproducing Kernel Hilbert Space. We study this problem under an additional constraint of Differential…

Machine Learning · Statistics 2025-07-21 Nikola Pavlovic, Sudeep Salgia, Qing Zhao

Fast Kernel Density Estimation with Density Matrices and Random Fourier Features

Kernel density estimation (KDE) is one of the most widely used nonparametric density estimation methods. The fact that it is a memory-based method, i.e., it uses the entire training data set for prediction, makes it unsuitable for most…

Machine Learning · Computer Science 2022-08-08 Joseph A. Gallego, Juan F. Osorio, Fabio A. González