English
Related papers

Related papers: The 2020 Census Disclosure Avoidance System TopDow…

200 papers

The 2020 Decennial Census will be released with a new disclosure avoidance system in place, putting differential privacy in the spotlight for a wide range of data users. We consider several key applications of Census data in redistricting,…

Computers and Society · Computer Science 2022-03-11 Aloni Cohen , Moon Duchin , JN Matthews , Bhushan Suwal

In "The 2020 Census Disclosure Avoidance System TopDown Algorithm," Abowd et al. (2022) describe the concepts and methods used by the Disclosure Avoidance System (DAS) to produce formally private output in support of the 2020 Census…

Cryptography and Security · Computer Science 2025-08-12 Ryan Cumings-Menon , Robert Ashmead , Daniel Kifer , Philip Leclerc , Matthew Spence , Pavel Zhuravlev , John M. Abowd

The US Census Bureau Disclosure Avoidance System (DAS) balances confidentiality and utility requirements for the decennial US Census (Abowd et al., 2022). The DAS was used in the 2020 Census to produce demographic datasets critically used…

Machine Learning · Computer Science 2026-03-12 Badih Ghazi , Pritish Kamath , Ravi Kumar , Pasin Manurangsi , Adam Sealfon

To protect the confidentiality of the 2020 Census, the U.S. Census Bureau adopted a statistical disclosure limitation framework based on the principles of differential privacy. A key component was the TopDown Algorithm, which applied…

Methodology · Statistics 2025-03-26 Robert Ashmead , Michael B. Hawes , Mary Pritts , Pavel Zhuravlev , Sallie Ann Keller

This paper extends $\texttt{InfTDA}$, a mechanism proposed in (Boninsegna, Silvestri, PETS 2025) for mobility datasets with origin and destination trips, in a general setting. The algorithm presented in this paper works for any dataset of…

Data Structures and Algorithms · Computer Science 2025-05-09 Fabrizio Boninsegna

Protecting an individual's privacy when releasing their data is inherently an exercise in relativity, regardless of how privacy is qualified or quantified. This is because we can only limit the gain in information about an individual…

Cryptography and Security · Computer Science 2026-02-26 James Bailie , Ruobin Gong , Xiao-Li Meng

This article describes the disclosure avoidance algorithm that the U.S. Census Bureau used to protect the Detailed Demographic and Housing Characteristics File A (Detailed DHC-A) of the 2020 Census. The tabulations contain statistics…

In 2018, the US Census Bureau designed a new data reconstruction and re-identification attack and tested it against their 2010 data release. The specific attack executed by the Bureau allows an attacker to infer the race and ethnicity of…

Cryptography and Security · Computer Science 2022-09-27 Paul Francis

The United States Census Bureau faces a difficult trade-off between the accuracy of Census statistics and the protection of individual information. We conduct the first independent evaluation of bias and noise induced by the Bureau's two…

Computers and Society · Computer Science 2024-05-03 Christopher T. Kenny , Cory McCartan , Shiro Kuriwaki , Tyler Simko , Kosuke Imai

This article describes the disclosure avoidance algorithm that the U.S. Census Bureau used to protect the 2020 Census Supplemental Demographic and Housing Characteristics File (S-DHC). The tabulations contain statistics of counts of U.S.…

Differential privacy (DP) is increasingly used to protect the release of hierarchical, tabular population data, such as census data. A common approach for implementing DP in this setting is to release noisy responses to a predefined set of…

Cryptography and Security · Computer Science 2024-04-03 Aadyaa Maddi , Swadhin Routray , Alexander Goldberg , Giulia Fanti

To meet its dual burdens of providing useful statistics and ensuring privacy of individual respondents, the US Census Bureau has for decades introduced some form of "noise" into published statistics. Initially, they used a method known as…

Computers and Society · Computer Science 2025-02-11 Maria Ballesteros , Cynthia Dwork , Gary King , Conlan Olson , Manish Raghavan

The US Census Bureau plans to protect the privacy of 2020 Census respondents through its Disclosure Avoidance System (DAS), which attempts to achieve differential privacy guarantees by adding noise to the Census microdata. By applying…

Applications · Statistics 2021-11-12 Christopher T. Kenny , Shiro Kuriwaki , Cory McCartan , Evan Rosenman , Tyler Simko , Kosuke Imai

This article describes SafeTab-H, a disclosure avoidance algorithm applied to the release of the U.S. Census Bureau's Detailed Demographic and Housing Characteristics File B (Detailed DHC-B) as part of the 2020 Census. The tabulations…

This article describes a proposed differentially private (DP) algorithms that the US Census Bureau is considering to release the Detailed Demographic and Housing Characteristics (DHC) Race & Ethnicity tabulations as part of the 2020 Census.…

Cryptography and Security · Computer Science 2021-07-23 Sam Haney , William Sexton , Ashwin Machanavajjhala , Michael Hay , Gerome Miklau

In early 2021, the US Census Bureau will begin releasing statistical tables based on the decennial census conducted in 2020. Because of significant changes in the data landscape, the Census Bureau is changing its approach to disclosure…

Computers and Society · Computer Science 2019-07-09 danah boyd

Disclosure avoidance (DA) systems are used to safeguard the confidentiality of data while allowing it to be analyzed and disseminated for analytic purposes. These methods, e.g., cell suppression, swapping, and k-anonymity, are commonly…

Cryptography and Security · Computer Science 2023-01-31 Keyu Zhu , Ferdinando Fioretto , Pascal Van Hentenryck , Saswat Das , Christine Task

Data from the Decennial Census is published only after applying a disclosure avoidance system (DAS). Data users were shaken by the adoption of differential privacy in the 2020 DAS, a radical departure from past methods. The goal of this…

Computers and Society · Computer Science 2026-02-23 Christian Cianfarani , Aloni Cohen

Latent Dirichlet Allocation (LDA) is a popular topic modeling technique for discovery of hidden semantic architecture of text datasets, and plays a fundamental role in many machine learning applications. However, like many other machine…

Machine Learning · Computer Science 2019-07-02 Fangyuan Zhao , Xuebin Ren , Shusen Yang , Xinyu Yang

In 2017, the United States Census Bureau announced that because of high disclosure risk in the methodology (data swapping) used to produce tabular data for the 2010 census, a different protection mechanism based on differential privacy…

Databases · Computer Science 2024-07-24 Krish Muralidhar , Steven Ruggles
‹ Prev 1 2 3 10 Next ›