Related papers: PrivQuant: Communication-Efficient Private Inferen…

CoPriv: Network/Protocol Co-Optimization for Communication-Efficient Private Inference

Deep neural network (DNN) inference based on secure 2-party computation (2PC) can offer cryptographically-secure privacy protection but suffers from orders of magnitude latency overhead due to enormous communication. Previous works heavily…

Cryptography and Security · Computer Science 2023-11-06 Wenxuan Zeng, Meng Li, Haichuan Yang, Wen-jie Lu, Runsheng Wang, Ru Huang

HEQuant: Marrying Homomorphic Encryption and Quantization for Communication-Efficient Private Inference

Secure two-party computation with homomorphic encryption (HE) protects data privacy with a formal security guarantee but suffers from high communication overhead. While previous works, e.g., Cheetah, Iron, etc., have proposed efficient…

Cryptography and Security · Computer Science 2024-02-01 Tianshi Xu, Meng Li, Runsheng Wang

PrivCirNet: Efficient Private Inference via Block Circulant Transformation

Homomorphic encryption (HE)-based deep neural network (DNN) inference protects data and model privacy but suffers from significant computation overhead. We observe transforming the DNN weights into circulant matrices converts general…

Cryptography and Security · Computer Science 2024-10-30 Tianshi Xu, Lemeng Wu, Runsheng Wang, Meng Li

EQO: Exploring Ultra-Efficient Private Inference with Winograd-Based Protocol and Quantization Co-Optimization

Private convolutional neural network (CNN) inference based on secure two-party computation (2PC) suffers from high communication and latency overhead, especially from convolution layers. In this paper, we propose EQO, a quantized 2PC…

Cryptography and Security · Computer Science 2024-04-16 Wenxuan Zeng, Tianshi Xu, Meng Li, Runsheng Wang

Practical Two-party Privacy-preserving Neural Network Based on Secret Sharing

Neural networks, with the capability to provide efficient predictive models, have been widely used in medical, financial, and other fields, bringing great convenience to our lives. However, the high accuracy of the model requires a large…

Cryptography and Security · Computer Science 2021-04-13 Zhengqiang Ge, Zhipeng Zhou, Dong Guo, Qiang Li

Privacy-Preserving Inference for Quantized BERT Models

With the increasing deployment of generative machine learning models in privacy-sensitive domains such as healthcare and personalized services, ensuring secure inference has become a critical challenge. Secure multi-party computation (MPC)…

Machine Learning · Computer Science 2025-08-05 Tianpei Lu, Bingsheng Zhang, Lekun Peng, Bowen Zheng, Lichun Li, Kui Ren

Bits for Privacy: Evaluating Post-Training Quantization via Membership Inference

Deep neural networks are widely deployed with quantization techniques to reduce memory and computational costs by lowering the numerical precision of their parameters. While quantization alters model parameters and their outputs, existing…

Machine Learning · Computer Science 2025-12-18 Chenxiang Zhang, Tongxi Qu, Zhong Li, Tian Zhang, Jun Pang, Sjouke Mauw

PASNet: Polynomial Architecture Search Framework for Two-party Computation-based Secure Neural Network Deployment

Two-party computation (2PC) is promising to enable privacy-preserving deep learning (DL). However, the 2PC-based privacy-preserving DL implementation comes with high comparison protocol overhead from the non-linear operators. This work…

Cryptography and Security · Computer Science 2023-06-28 Hongwu Peng, Shanglin Zhou, Yukui Luo, Nuo Xu, Shijin Duan, Ran Ran, Jiahui Zhao, Chenghong Wang, Tong Geng, Wujie Wen, Xiaolin Xu, Caiwen Ding

UFO: Unlocking Ultra-Efficient Quantized Private Inference with Protocol and Algorithm Co-Optimization

Private convolutional neural network (CNN) inference based on secure two-party computation (2PC) suffers from high communication and latency overhead, especially from convolution layers. In this paper, we propose UFO, a quantized 2PC…

Cryptography and Security · Computer Science 2026-02-24 Wenxuan Zeng, Chao Yang, Tianshi Xu, Bo Zhang, Changrui Ren, Jin Dong, Meng Li

PrivTune: Efficient and Privacy-Preserving Fine-Tuning of Large Language Models via Device-Cloud Collaboration

With the rise of large language models, service providers offer language models as a service, enabling users to fine-tune customized models via uploaded private datasets. However, this raises concerns about sensitive data leakage. Prior…

Cryptography and Security · Computer Science 2026-01-22 Yi Liu, Weixiang Han, Chengjun Cai, Xingliang Yuan, Cong Wang

Low Latency Privacy Preserving Inference

When applying machine learning to sensitive data, one has to find a balance between accuracy, information security, and computational complexity. Recent studies combined Homomorphic Encryption with neural networks to make inferences while…

Machine Learning · Computer Science 2019-06-07 Alon Brutzkus, Oren Elisha, Ran Gilad-Bachrach

DPQuant: Efficient and Differentially-Private Model Training via Dynamic Quantization Scheduling

Differentially-Private SGD (DP-SGD) and its adaptive variant DP-Adam are powerful techniques to protect user privacy when using sensitive data to train neural networks. During training, converting model weights and activations into…

Machine Learning · Computer Science 2026-04-17 Yubo Gao, Renbo Tu, Gennady Pekhimenko, Nandita Vijaykumar

Dual Precision Quantization for Efficient and Accurate Deep Neural Networks Inference

Deep neural networks have achieved state-of-the-art results in a wide range of applications, from natural language processing and computer vision to speech recognition. However, as tasks become increasingly complex, model sizes continue to…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Tomer Gafni, Asaf Karnieli, Yair Hanani

QUOTIENT: Two-Party Secure Neural Network Training and Prediction

Recently, there has been a wealth of effort devoted to the design of secure protocols for machine learning tasks. Much of this is aimed at enabling secure prediction from highly-accurate Deep Neural Networks (DNNs). However, as DNNs are…

Cryptography and Security · Computer Science 2019-07-09 Nitin Agrawal, Ali Shahin Shamsabadi, Matt J. Kusner, Adrià Gascón

C2PI: An Efficient Crypto-Clear Two-Party Neural Network Private Inference

Recently, private inference (PI) has addressed the rising concern over data and model privacy in machine learning inference as a service. However, existing PI frameworks suffer from high computational and communication costs due to the…

Cryptography and Security · Computer Science 2023-04-27 Yuke Zhang, Dake Chen, Souvik Kundu, Haomei Liu, Ruiheng Peng, Peter A. Beerel

RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference

The proliferation of deep learning (DL) has led to the emergence of privacy and security concerns. To address these issues, secure Two-party computation (2PC) has been proposed as a means of enabling privacy-preserving DL computation.…

Cryptography and Security · Computer Science 2023-02-24 Hongwu Peng, Shanglin Zhou, Yukui Luo, Nuo Xu, Shijin Duan, Ran Ran, Jiahui Zhao, Shaoyi Huang, Xi Xie, Chenghong Wang, Tong Geng, Wujie Wen, Xiaolin Xu, Caiwen Ding

DistPrivacy: Privacy-Aware Distributed Deep Neural Networks in IoT surveillance systems

With the emergence of smart cities, Internet of Things (IoT) devices as well as deep learning technologies have witnessed an increasing adoption. To support the requirements of such paradigm in terms of memory and computation, joint and…

Networking and Internet Architecture · Computer Science 2020-10-27 Emna Baccour, Aiman Erbad, Amr Mohamed, Mounir Hamdi, Mohsen Guizani

PrivyNet: A Flexible Framework for Privacy-Preserving Deep Neural Network Training

Massive data exist among user local platforms that usually cannot support deep neural network (DNN) training due to computation and storage resource constraints. Cloud-based training schemes provide beneficial services but suffer from…

Machine Learning · Computer Science 2018-01-15 Meng Li, Liangzhen Lai, Naveen Suda, Vikas Chandra, David Z. Pan

SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation

Quantization of deep neural networks (DNN) has been proven effective for compressing and accelerating DNN models. Data-free quantization (DFQ) is a promising approach without the original datasets under privacy-sensitive and confidential…

Machine Learning · Computer Science 2022-02-16 Cong Guo, Yuxian Qiu, Jingwen Leng, Xiaotian Gao, Chen Zhang, Yunxin Liu, Fan Yang, Yuhao Zhu, Minyi Guo

A Differentially Private Quantum Neural Network for Probabilistic Optimal Power Flow

The stochastic nature of renewable energy and load demand requires efficient and accurate solutions for probabilistic optimal power flow (OPF). Quantum neural networks (QNNs), which combine quantum computing and machine learning, offer…

Systems and Control · Electrical Eng. & Systems 2024-12-17 Yuji Cao, Yue Chen, Yan Xu