English

Passive Inference Attacks on Split Learning via Adversarial Regularization

Cryptography and Security 2025-03-25 v6 Machine Learning

Abstract

Split Learning (SL) has emerged as a practical and efficient alternative to traditional federated learning. While previous attempts to attack SL have often relied on overly strong assumptions or targeted easily exploitable models, we seek to develop more capable attacks. We introduce SDAR, a novel attack framework against SL with an honest-but-curious server. SDAR leverages auxiliary data and adversarial regularization to learn a decodable simulator of the client's private model, which can effectively infer the client's private features under the vanilla SL, and both features and labels under the U-shaped SL. We perform extensive experiments in both configurations to validate the effectiveness of our proposed attacks. Notably, in challenging scenarios where existing passive attacks struggle to reconstruct the client's private data effectively, SDAR consistently achieves significantly superior attack performance, even comparable to active attacks. On CIFAR-10, at the deep split level of 7, SDAR achieves private feature reconstruction with less than 0.025 mean squared error in both the vanilla and the U-shaped SL, and attains a label inference accuracy of over 98% in the U-shaped setting, while existing attacks fail to produce non-trivial results.

Keywords

Cite

@article{arxiv.2310.10483,
  title  = {Passive Inference Attacks on Split Learning via Adversarial Regularization},
  author = {Xiaochen Zhu and Xinjian Luo and Yuncheng Wu and Yangfan Jiang and Xiaokui Xiao and Beng Chin Ooi},
  journal= {arXiv preprint arXiv:2310.10483},
  year   = {2025}
}

Comments

NDSS 2025; 25 pages, 27 figures; Fixed typos

R2 v1 2026-06-28T12:52:10.947Z