Related papers: Reconstructing training data from document underst…

Reconstructing Training Data with Informed Adversaries

Given access to a machine learning model, can an adversary reconstruct the model's training data? This work studies this question from the lens of a powerful informed adversary who knows all the training data points except one. By…

Cryptography and Security · Computer Science 2022-04-26 Borja Balle , Giovanni Cherubin , Jamie Hayes

Model Inversion Robustness: Can Transfer Learning Help?

Model Inversion (MI) attacks aim to reconstruct private training data by abusing access to machine learning models. Contemporary MI attacks have achieved impressive attack performance, posing serious threats to privacy. Meanwhile, all…

Machine Learning · Computer Science 2024-05-10 Sy-Tuyen Ho , Koh Jun Hao , Keshigeyan Chandrasegaran , Ngoc-Bao Nguyen , Ngai-Man Cheung

Improving Information Extraction on Business Documents with Specific Pre-Training Tasks

Transformer-based Language Models are widely used in Natural Language Processing related tasks. Thanks to their pre-training, they have been successfully adapted to Information Extraction in business documents. However, most pre-training…

Computation and Language · Computer Science 2023-09-12 Thibault Douzon , Stefan Duffner , Christophe Garcia , Jérémy Espinas

Cutting Through Privacy: A Hyperplane-Based Data Reconstruction Attack in Federated Learning

Federated Learning (FL) enables collaborative training of machine learning models across distributed clients without sharing raw data, ostensibly preserving data privacy. Nevertheless, recent studies have revealed critical vulnerabilities…

Machine Learning · Computer Science 2025-09-08 Francesco Diana , André Nusser , Chuan Xu , Giovanni Neglia

Model Inversion Attack via Dynamic Memory Learning

Model Inversion (MI) attacks aim to recover the private training data from the target model, which has raised security concerns about the deployment of DNNs in practice. Recent advances in generative adversarial models have rendered them…

Computer Vision and Pattern Recognition · Computer Science 2023-09-04 Gege Qi , YueFeng Chen , Xiaofeng Mao , Binyuan Hui , Xiaodan Li , Rong Zhang , Hui Xue

Deconstructing Classifiers: Towards A Data Reconstruction Attack Against Text Classification Models

Natural language processing (NLP) models have become increasingly popular in real-world applications, such as text classification. However, they are vulnerable to privacy attacks, including data reconstruction attacks that aim to extract…

Computation and Language · Computer Science 2023-06-27 Adel Elmahdy , Ahmed Salem

Reconstruction Attacks on Machine Unlearning: Simple Models are Vulnerable

Machine unlearning is motivated by desire for data autonomy: a person can request to have their data's influence removed from deployed models, and those models should be updated as if they were retrained without the person's data. We show…

Machine Learning · Computer Science 2024-05-31 Martin Bertran , Shuai Tang , Michael Kearns , Jamie Morgenstern , Aaron Roth , Zhiwei Steven Wu

Privacy-preserving Collaborative Learning with Automatic Transformation Search

Collaborative learning has gained great popularity due to its benefit of data privacy protection: participants can jointly train a Deep Learning model without sharing their training sets. However, recent works discovered that an adversary…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Wei Gao , Shangwei Guo , Tianwei Zhang , Han Qiu , Yonggang Wen , Yang Liu

Machine Unlearning for Document Classification

Document understanding models have recently demonstrated remarkable performance by leveraging extensive collections of user documents. However, since documents often contain large amounts of personal data, their usage can pose a threat to…

Computer Vision and Pattern Recognition · Computer Science 2024-05-01 Lei Kang , Mohamed Ali Souibgui , Fei Yang , Lluis Gomez , Ernest Valveny , Dimosthenis Karatzas

DocMIA: Document-Level Membership Inference Attacks against DocVQA Models

Document Visual Question Answering (DocVQA) has introduced a new paradigm for end-to-end document understanding, and quickly became one of the standard benchmarks for multimodal LLMs. Automating document processing workflows, driven by…

Machine Learning · Computer Science 2025-02-07 Khanh Nguyen , Raouf Kerkouche , Mario Fritz , Dimosthenis Karatzas

Be Kind, Rewrite: Benign Projections via Rewriting Defend Against LLM Data Poisoning Attacks

Large language models (LLMs) are highly susceptible to backdoor attacks (BAs), wherein training samples are poisoned using trigger-based harmful content. Furthermore, existing defenses have proven ineffective when extensively tested across…

Cryptography and Security · Computer Science 2026-05-20 John T. Halloran , Noopur S. Bhatt

Robust LLM Unlearning Against Relearning Attacks: The Minor Components in Representations Matter

Large language model (LLM) unlearning aims to remove specific data influences from pre-trained model without costly retraining, addressing privacy, copyright, and safety concerns. However, recent studies reveal a critical vulnerability:…

Computation and Language · Computer Science 2026-05-13 Zeguan Xiao , Xuanzhe Xu , Yun Chen , Yong Wang , Jian Yang , Yanqing Hu , Guanhua Chen

Layout-Aware Representation Learning for Open-Set ID Fraud Discovery

Identity-document fraud detection is not a stationary binary classification problem. Adaptive attackers modify templates and fabrication pipelines, making historical fraud labels stale, and successful forgeries recur at scale as coherent…

Computer Vision and Pattern Recognition · Computer Science 2026-05-08 Jinxing Li , Nicholas Ren , Cathy Chang , Hongkai Pan , Daniel George

SoK: Data Reconstruction Attacks Against Machine Learning Models: Definition, Metrics, and Benchmark

Data reconstruction attacks, which aim to recover the training dataset of a target model with limited access, have gained increasing attention in recent years. However, there is currently no consensus on a formal definition of data…

Cryptography and Security · Computer Science 2025-06-10 Rui Wen , Yiyong Liu , Michael Backes , Yang Zhang

Knowledge-Enriched Distributional Model Inversion Attacks

Model inversion (MI) attacks are aimed at reconstructing training data from model parameters. Such attacks have triggered increasing concerns about privacy, especially given a growing number of online model repositories. However, existing…

Machine Learning · Computer Science 2021-08-20 Si Chen , Mostafa Kahla , Ruoxi Jia , Guo-Jun Qi

Deep Learning Model Inversion Attacks and Defenses: A Comprehensive Survey

The rapid adoption of deep learning in sensitive domains has brought tremendous benefits. However, this widespread adoption has also given rise to serious vulnerabilities, particularly model inversion (MI) attacks, posing a significant…

Cryptography and Security · Computer Science 2025-05-01 Wencheng Yang , Song Wang , Di Wu , Taotao Cai , Yanming Zhu , Shicheng Wei , Yiying Zhang , Xu Yang , Zhaohui Tang , Yan Li

Amplifying Training Data Exposure through Fine-Tuning with Pseudo-Labeled Memberships

Neural language models (LMs) are vulnerable to training data extraction attacks due to data memorization. This paper introduces a novel attack scenario wherein an attacker adversarially fine-tunes pre-trained LMs to amplify the exposure of…

Computation and Language · Computer Science 2024-09-04 Myung Gyo Oh , Hong Eun Ahn , Leo Hyun Park , Taekyoung Kwon

Evaluation of Inference Attack Models for Deep Learning on Medical Data

Deep learning has attracted broad interest in healthcare and medical communities. However, there has been little research into the privacy issues created by deep networks trained for medical applications. Recently developed inference attack…

Machine Learning · Computer Science 2020-11-03 Maoqiang Wu , Xinyue Zhang , Jiahao Ding , Hien Nguyen , Rong Yu , Miao Pan , Stephen T. Wong

Rectifying Privacy and Efficacy Measurements in Machine Unlearning: A New Inference Attack Perspective

Machine unlearning focuses on efficiently removing specific data from trained models, addressing privacy and compliance concerns with reasonable costs. Although exact unlearning ensures complete data removal equivalent to retraining, it is…

Cryptography and Security · Computer Science 2025-06-17 Nima Naderloui , Shenao Yan , Binghui Wang , Jie Fu , Wendy Hui Wang , Weiran Liu , Yuan Hong

Unveiling the Vulnerability of Private Fine-Tuning in Split-Based Frameworks for Large Language Models: A Bidirectionally Enhanced Attack

Recent advancements in pre-trained large language models (LLMs) have significantly influenced various domains. Adapting these models for specific tasks often involves fine-tuning (FT) with private, domain-specific data. However, privacy…

Cryptography and Security · Computer Science 2024-12-11 Guanzhong Chen , Zhenghan Qin , Mingxin Yang , Yajie Zhou , Tao Fan , Tianyu Du , Zenglin Xu