Related papers: Distribution-Agnostic Database De-Anonymization Un…

Distribution-Agnostic Database De-Anonymization Under Obfuscation And Synchronization Errors

Database de-anonymization typically involves matching an anonymized database with correlated publicly available data. Existing research focuses either on practical aspects without requiring knowledge of the data distribution yet provides…

Information Theory · Computer Science 2024-04-03 Serhat Bakirtas , Elza Erkip

Seeded Database Matching Under Noisy Column Repetitions

The re-identification or de-anonymization of users from anonymized data through matching with publicly-available correlated user data has raised privacy concerns, leading to the complementary measure of obfuscation in addition to…

Information Theory · Computer Science 2022-09-16 Serhat Bakirtas , Elza Erkip

Database Matching Under Column Deletions

De-anonymizing user identities by matching various forms of user data available on the internet raises privacy concerns. A fundamental understanding of the privacy leakage in such scenarios requires a careful study of conditions under which…

Information Theory · Computer Science 2021-05-21 Serhat Bakirtas , Elza Erkip

Database Matching Under Noisy Synchronization Errors

The re-identification or de-anonymization of users from anonymized data through matching with publicly available correlated user data has raised privacy concerns, leading to the complementary measure of obfuscation in addition to…

Information Theory · Computer Science 2023-10-26 Serhat Bakirtas , Elza Erkip

Database Matching Under Adversarial Column Deletions

The de-anonymization of users from anonymized microdata through matching or aligning with publicly-available correlated databases has been of scientific interest recently. While most of the rigorous analyses of database matching have…

Information Theory · Computer Science 2023-09-06 Serhat Bakirtas , Elza Erkip

Blind De-anonymization Attacks using Social Networks

It is important to study the risks of publishing privacy-sensitive data. Even if sensitive identities (e.g., name, social security number) were removed and advanced data perturbation techniques were applied, several de-anonymization attacks…

Social and Information Networks · Computer Science 2018-01-18 Wei-Han Lee , Changchang Liu , Shouling Ji , Prateek Mittal , Ruby Lee

De-anonymizing scale-free social networks by percolation graph matching

We address the problem of social network de-anonymization when relationships between people are described by scale-free graphs. In particular, we propose a rigorous, asymptotic mathematical analysis of the network de-anonymization problem…

Social and Information Networks · Computer Science 2014-11-27 Carla Chiasserini , Michele Garetto , Emilio Leonardi

A New Algorithm for Distributed Nonparametric Sequential Detection

We consider nonparametric sequential hypothesis testing problem when the distribution under the null hypothesis is fully known but the alternate hypothesis corresponds to some other unknown distribution with some loose constraints. We…

Information Theory · Computer Science 2013-11-15 Shouvik Ganguly , K Sahasranand , Vinod Sharma

A Concentration of Measure Approach to Database De-anonymization

In this paper, matching of correlated high-dimensional databases is investigated. A stochastic database model is considered where the correlation among the database entries is governed by an arbitrary joint distribution. Concentration of…

Databases · Computer Science 2019-05-06 Farhad Shirani , Siddharth Garg , Elza Erkip

Impact of Clustering on the Performance of Network De-anonymization

Recently, graph matching algorithms have been successfully applied to the problem of network de-anonymization, in which nodes (users) participating to more than one social network are identified only by means of the structure of their links…

Social and Information Networks · Computer Science 2015-08-11 C. F Chiasserini , M. Garetto , E. Leonardi

Anonymization with Worst-Case Distribution-Based Background Knowledge

Background knowledge is an important factor in privacy preserving data publishing. Distribution-based background knowledge is one of the well studied background knowledge. However, to the best of our knowledge, there is no existing work…

Databases · Computer Science 2009-09-08 Raymond Chi-Wing Wong , Ada Wai-Chee Fu , Ke Wang , Yabo Xu , Jian Pei , Philip S. Yu

Distributed Binary Detection with Lossy Data Compression

Consider the problem where a statistician in a two-node system receives rate-limited information from a transmitter about marginal observations of a memoryless process generated from two possible distributions. Using its own observations,…

Information Theory · Computer Science 2017-03-02 Gil Katz , Pablo Piantanida , Mérouane Debbah

On the Simultaneous Preservation of Privacy and Community Structure in Anonymized Networks

We consider the problem of performing community detection on a network, while maintaining privacy, assuming that the adversary has access to an auxiliary correlated network. We ask the question "Does there exist a regime where the network…

Machine Learning · Computer Science 2016-03-29 Daniel Cullina , Kushagra Singhal , Negar Kiyavash , Prateek Mittal

An Automated Social Graph De-anonymization Technique

We present a generic and automated approach to re-identifying nodes in anonymized social networks which enables novel anonymization techniques to be quickly evaluated. It uses machine learning (decision forests) to matching pairs of nodes…

Cryptography and Security · Computer Science 2014-08-08 Kumar Sharad , George Danezis

Adaptive Image Denoising by Targeted Databases

We propose a data-dependent denoising procedure to restore noisy images. Different from existing denoising algorithms which search for patches from either the noisy image or a generic database, the new algorithm finds patches from a…

Computer Vision and Pattern Recognition · Computer Science 2015-06-22 Enming Luo , Stanley H. Chan , Truong Q. Nguyen

The Sufficiency Principle for Decentralized Data Reduction

This paper develops the sufficiency principle suitable for data reduction in decentralized inference systems. Both parallel and tandem networks are studied and we focus on the cases where observations at decentralized nodes are…

Information Theory · Computer Science 2012-07-16 Ge Xu , Biao Chen

Privacy-Preserving Distributed Optimisation using Stochastic PDMM

Privacy-preserving distributed processing has received considerable attention recently. The main purpose of these algorithms is to solve certain signal processing tasks over a network in a decentralised fashion without revealing…

Signal Processing · Electrical Eng. & Systems 2023-12-14 Sebastian O. Jordan , Qiongxiu Li , Richard Heusdens

Distributed Machine Learning with Sparse Heterogeneous Data

Motivated by distributed machine learning settings such as Federated Learning, we consider the problem of fitting a statistical model across a distributed collection of heterogeneous data sets whose similarity structure is encoded by a…

Statistics Theory · Mathematics 2021-11-30 Dominic Richards , Sahand N. Negahban , Patrick Rebeschini

A Multi-Objective Degree-Based Network Anonymization Approach

Enormous amounts of data collected from social networks or other online platforms are being published for the sake of statistics, marketing, and research, among other objectives. The consequent privacy and data security concerns have…

Cryptography and Security · Computer Science 2021-12-24 Ola N. Halawi , Faisal N. Abu-Khzam

Finding the Sweet Spot for Data Anonymization: A Mechanism Design Perspective

Data sharing between different organizations is an essential process in today's connected world. However, recently there were many concerns about data sharing as sharing sensitive information can jeopardize users' privacy. To preserve the…

Computer Science and Game Theory · Computer Science 2021-02-01 Abdelrahman Eldosouky , Tapadhir Das , Anuraag Kotra , Shamik Sengupta