Related papers: Selective Network Linearization for Efficient Priv…

DeepReDuce: ReLU Reduction for Fast Private Inference

The recent rise of privacy concerns has led researchers to devise methods for private neural inference -- where inferences are made directly on encrypted data, never seeing inputs. The primary challenge facing private inference is that…

Machine Learning · Computer Science 2021-06-23 Nandan Kumar Jha , Zahra Ghodsi , Siddharth Garg , Brandon Reagen

Learning to Linearize Deep Neural Networks for Secure and Efficient Private Inference

The large number of ReLU non-linearity operations in existing deep neural networks makes them ill-suited for latency-efficient private inference (PI). Existing techniques to reduce ReLU operations often involve manual effort and sacrifice…

Computer Vision and Pattern Recognition · Computer Science 2023-01-24 Souvik Kundu , Shunlin Lu , Yuke Zhang , Jacqueline Liu , Peter A. Beerel

Linearizing Models for Efficient yet Robust Private Inference

The growing concern about data privacy has led to the development of private inference (PI) frameworks in client-server applications which protects both data privacy and model IP. However, the cryptographic primitives required yield…

Machine Learning · Computer Science 2024-02-09 Sreetama Sarkar , Souvik Kundu , Peter A. Beerel

DeepReShape: Redesigning Neural Networks for Efficient Private Inference

Prior work on Private Inference (PI) -- inferences performed directly on encrypted input -- has focused on minimizing a network's ReLUs, which have been assumed to dominate PI latency rather than FLOPs. Recent work has shown that FLOPs for…

Cryptography and Security · Computer Science 2024-06-25 Nandan Kumar Jha , Brandon Reagen

Sphynx: ReLU-Efficient Network Design for Private Inference

The emergence of deep learning has been accompanied by privacy concerns surrounding users' data and service providers' models. We focus on private inference (PI), where the goal is to perform inference on a user's data sample using a…

Cryptography and Security · Computer Science 2022-11-08 Minsu Cho , Zahra Ghodsi , Brandon Reagen , Siddharth Garg , Chinmay Hegde

CryptoNAS: Private Inference on a ReLU Budget

Machine learning as a service has given raise to privacy concerns surrounding clients' data and providers' models and has catalyzed research in private inference (PI): methods to process inferences without disclosing inputs. Recently,…

Machine Learning · Computer Science 2021-05-14 Zahra Ghodsi , Akshaj Veldanda , Brandon Reagen , Siddharth Garg

Disparate Impact on Group Accuracy of Linearization for Private Inference

Ensuring privacy-preserving inference on cryptographically secure data is a well-known computational challenge. To alleviate the bottleneck of costly cryptographic computations in non-linear activations, recent methods have suggested…

Machine Learning · Computer Science 2024-08-21 Saswat Das , Marco Romanelli , Ferdinando Fioretto

DeepShare: Sharing ReLU Across Channels and Layers for Efficient Private Inference

Private Inference (PI) uses cryptographic primitives to perform privacy preserving machine learning. In this setting, the owner of the network runs inference on the data of the client without learning anything about the data and without…

Machine Learning · Computer Science 2025-12-22 Yonathan Bornfeld , Shai Avidan

Circa: Stochastic ReLUs for Private Deep Learning

The simultaneous rise of machine learning as a service and concerns over user privacy have increasingly motivated the need for private inference (PI). While recent work demonstrates PI is possible using cryptographic primitives, the…

Machine Learning · Computer Science 2021-06-17 Zahra Ghodsi , Nandan Kumar Jha , Brandon Reagen , Siddharth Garg

AutoReP: Automatic ReLU Replacement for Fast Private Network Inference

The growth of the Machine-Learning-As-A-Service (MLaaS) market has highlighted clients' data privacy and security issues. Private inference (PI) techniques using cryptographic primitives offer a solution but often have high computation and…

Cryptography and Security · Computer Science 2023-08-22 Hongwu Peng , Shaoyi Huang , Tong Zhou , Yukui Luo , Chenghong Wang , Zigeng Wang , Jiahui Zhao , Xi Xie , Ang Li , Tony Geng , Kaleel Mahmood , Wujie Wen , Xiaolin Xu , Caiwen Ding

TruncFormer: Private LLM Inference Using Only Truncations

Private inference (PI) serves an important role in guaranteeing the privacy of user data when interfacing with proprietary machine learning models such as LLMs. However, PI remains practically intractable due to the massive latency costs…

Cryptography and Security · Computer Science 2024-12-03 Patrick Yubeaton , Jianqiao Cambridge Mo , Karthik Garimella , Nandan Kumar Jha , Brandon Reagen , Chinmay Hegde , Siddharth Garg

Sisyphus: A Cautionary Tale of Using Low-Degree Polynomial Activations in Privacy-Preserving Deep Learning

Privacy concerns in client-server machine learning have given rise to private inference (PI), where neural inference occurs directly on encrypted inputs. PI protects clients' personal data and the server's intellectual property. A common…

Machine Learning · Computer Science 2021-11-04 Karthik Garimella , Nandan Kumar Jha , Brandon Reagen

Characterizing and Optimizing End-to-End Systems for Private Inference

In two-party machine learning prediction services, the client's goal is to query a remote server's trained machine learning model to perform neural network inference in some application domain. However, sensitive information can be obtained…

Cryptography and Security · Computer Science 2023-02-20 Karthik Garimella , Zahra Ghodsi , Nandan Kumar Jha , Siddharth Garg , Brandon Reagen

xMLP: Revolutionizing Private Inference with Exclusive Square Activation

Private Inference (PI) enables deep neural networks (DNNs) to work on private data without leaking sensitive information by exploiting cryptographic primitives such as multi-party computation (MPC) and homomorphic encryption (HE). However,…

Machine Learning · Computer Science 2024-03-14 Jiajie Li , Jinjun Xiong

Regularized PolyKervNets: Optimizing Expressiveness and Efficiency for Private Inference in Deep Neural Networks

Private computation of nonlinear functions, such as Rectified Linear Units (ReLUs) and max-pooling operations, in deep neural networks (DNNs) poses significant challenges in terms of storage, bandwidth, and time consumption. To address…

Machine Learning · Computer Science 2023-12-27 Toluwani Aremu

Flash: A Hybrid Private Inference Protocol for Deep CNNs with High Accuracy and Low Latency on CPU

This paper presents Flash, an optimized private inference (PI) hybrid protocol utilizing both homomorphic encryption (HE) and secure two-party computation (2PC), which can reduce the end-to-end PI latency for deep CNN models less than 1…

Cryptography and Security · Computer Science 2025-01-20 Hyeri Roh , Jinsu Yeo , Yeongil Ko , Gu-Yeon Wei , David Brooks , Woo-Seok Choi

Making Models Shallow Again: Jointly Learning to Reduce Non-Linearity and Depth for Latency-Efficient Private Inference

Large number of ReLU and MAC operations of Deep neural networks make them ill-suited for latency and compute-efficient private inference. In this paper, we present a model optimization method that allows a model to learn to be shallow. In…

Machine Learning · Computer Science 2023-04-27 Souvik Kundu , Yuke Zhang , Dake Chen , Peter A. Beerel

Low Latency Privacy Preserving Inference

When applying machine learning to sensitive data, one has to find a balance between accuracy, information security, and computational-complexity. Recent studies combined Homomorphic Encryption with neural networks to make inferences while…

Machine Learning · Computer Science 2019-06-07 Alon Brutzkus , Oren Elisha , Ran Gilad-Bachrach

CryptoNite: Revealing the Pitfalls of End-to-End Private Inference at Scale

The privacy concerns of providing deep learning inference as a service have underscored the need for private inference (PI) protocols that protect users' data and the service provider's model using cryptographic methods. Recently proposed…

Cryptography and Security · Computer Science 2022-07-19 Karthik Garimella , Nandan Kumar Jha , Zahra Ghodsi , Siddharth Garg , Brandon Reagen

Entropy-Guided Attention for Private LLMs

The pervasiveness of proprietary language models has raised critical privacy concerns, necessitating advancements in private inference (PI), where computations are performed directly on encrypted data without revealing users' sensitive…

Machine Learning · Computer Science 2025-01-10 Nandan Kumar Jha , Brandon Reagen