Related papers: Model Extraction Attack against Self-supervised Sp…

On the Difficulty of Defending Self-Supervised Learning against Model Extraction

Self-Supervised Learning (SSL) is an increasingly popular ML paradigm that trains models to transform complex inputs into representations without relying on explicit labels. These representations encode similarity structures that enable…

Machine Learning · Computer Science 2022-06-30 Adam Dziedzic , Nikita Dhawan , Muhammad Ahmad Kaleem , Jonas Guan , Nicolas Papernot

Probing Self-supervised Learning Models with Target Speech Extraction

Large-scale pre-trained self-supervised learning (SSL) models have shown remarkable advancements in speech-related tasks. However, the utilization of these models in complex multi-talker scenarios, such as extracting a target speaker in a…

Audio and Speech Processing · Electrical Eng. & Systems 2024-02-21 Junyi Peng , Marc Delcroix , Tsubasa Ochiai , Oldrich Plchot , Takanori Ashihara , Shoko Araki , Jan Cernocky

Target Speech Extraction with Pre-trained Self-supervised Learning Models

Pre-trained self-supervised learning (SSL) models have achieved remarkable success in various speech tasks. However, their potential in target speech extraction (TSE) has not been fully exploited. TSE aims to extract the speech of a target…

Audio and Speech Processing · Electrical Eng. & Systems 2024-02-21 Junyi Peng , Marc Delcroix , Tsubasa Ochiai , Oldrich Plchot , Shoko Araki , Jan Cernocky

A Systematic Survey of Model Extraction Attacks and Defenses: State-of-the-Art and Perspectives

Machine learning (ML) models have significantly grown in complexity and utility, driving advances across multiple domains. However, substantial computational resources and specialized expertise have historically restricted their wide…

Cryptography and Security · Computer Science 2025-08-28 Kaixiang Zhao , Lincan Li , Kaize Ding , Neil Zhenqiang Gong , Yue Zhao , Yushun Dong

Precise Extraction of Deep Learning Models via Side-Channel Attacks on Edge/Endpoint Devices

With growing popularity, deep learning (DL) models are becoming larger-scale, and only the companies with vast training datasets and immense computing power can manage their business serving such large models. Most of those DL models are…

Artificial Intelligence · Computer Science 2024-03-06 Younghan Lee , Sohee Jun , Yungi Cho , Woorim Han , Hyungon Moon , Yunheung Paek

Membership Inference Attacks Against Self-supervised Speech Models

Recently, adapting the idea of self-supervised learning (SSL) on continuous speech has started gaining attention. SSL models pre-trained on a huge amount of unlabeled audio can generate general-purpose representations that benefit a wide…

Cryptography and Security · Computer Science 2022-08-16 Wei-Cheng Tseng , Wei-Tsung Kao , Hung-yi Lee

Model Extraction Attacks Revisited

Model extraction (ME) attacks represent one major threat to Machine-Learning-as-a-Service (MLaaS) platforms by ``stealing'' the functionality of confidential machine-learning models through querying black-box APIs. Over seven years have…

Machine Learning · Computer Science 2025-10-01 Jiacheng Liang , Ren Pang , Changjiang Li , Ting Wang

Semi-supervised learning using teacher-student models for vocal melody extraction

The lack of labeled data is a major obstacle in many music information retrieval tasks such as melody extraction, where labeling is extremely laborious or costly. Semi-supervised learning (SSL) provides a solution to alleviate the issue by…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-17 Sangeun Kum , Jing-Hua Lin , Li Su , Juhan Nam

Beyond Labeling Oracles: What does it mean to steal ML models?

Model extraction attacks are designed to steal trained models with only query access, as is often provided through APIs that ML-as-a-Service providers offer. Machine Learning (ML) models are expensive to train, in part because data is hard…

Machine Learning · Computer Science 2024-06-14 Avital Shafran , Ilia Shumailov , Murat A. Erdogdu , Nicolas Papernot

Generative Extraction of Audio Classifiers for Speaker Identification

It is perhaps no longer surprising that machine learning models, especially deep neural networks, are particularly vulnerable to attacks. One such vulnerability that has been well studied is model extraction: a phenomenon in which the…

Cryptography and Security · Computer Science 2022-07-27 Tejumade Afonja , Lucas Bourtoule , Varun Chandrasekaran , Sageev Oore , Nicolas Papernot

Exploring Connections Between Active Learning and Model Extraction

Machine learning is being increasingly used by individuals, research institutions, and corporations. This has resulted in the surge of Machine Learning-as-a-Service (MLaaS) - cloud services that provide (a) tools and resources to learn the…

Machine Learning · Computer Science 2019-11-21 Varun Chandrasekaran , Kamalika Chaudhuri , Irene Giacomelli , Somesh Jha , Songbai Yan

Semi-Leak: Membership Inference Attacks Against Semi-supervised Learning

Semi-supervised learning (SSL) leverages both labeled and unlabeled data to train machine learning (ML) models. State-of-the-art SSL methods can achieve comparable performance to supervised learning by leveraging much fewer labeled data.…

Cryptography and Security · Computer Science 2022-07-27 Xinlei He , Hongbin Liu , Neil Zhenqiang Gong , Yang Zhang

MeaeQ: Mount Model Extraction Attacks with Efficient Queries

We study model extraction attacks in natural language processing (NLP) where attackers aim to steal victim models by repeatedly querying the open Application Programming Interfaces (APIs). Recent works focus on limited-query budget settings…

Computation and Language · Computer Science 2023-10-24 Chengwei Dai , Minxuan Lv , Kun Li , Wei Zhou

Self-training through Classifier Disagreement for Cross-Domain Opinion Target Extraction

Opinion target extraction (OTE) or aspect extraction (AE) is a fundamental task in opinion mining that aims to extract the targets (or aspects) on which opinions have been expressed. Recent work focus on cross-domain OTE, which is typically…

Computation and Language · Computer Science 2023-03-01 Kai Sun , Richong Zhang , Samuel Mensah , Nikolaos Aletras , Yongyi Mao , Xudong Liu

Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation

Self-Supervised Learning (SSL) models have been successfully applied in various deep learning-based speech tasks, particularly those with a limited amount of data. However, the quality of SSL representations depends highly on the…

Computation and Language · Computer Science 2022-04-20 Dan Berrebbi , Jiatong Shi , Brian Yan , Osbel Lopez-Francisco , Jonathan D. Amith , Shinji Watanabe

Match to Win: Analysing Sequences Lengths for Efficient Self-supervised Learning in Speech and Audio

Self-supervised learning (SSL) has proven vital in speech and audio-related applications. The paradigm trains a general model on unlabeled data that can later be used to solve specific downstream tasks. This type of model is costly to train…

Sound · Computer Science 2022-11-23 Yan Gao , Javier Fernandez-Marques , Titouan Parcollet , Pedro P. B. de Gusmao , Nicholas D. Lane

MMM: Multi-Layer Multi-Residual Multi-Stream Discrete Speech Representation from Self-supervised Learning Model

Speech discrete representation has proven effective in various downstream applications due to its superior compression rate of the waveform, fast convergence during training, and compatibility with other modalities. Discrete units extracted…

Sound · Computer Science 2024-06-17 Jiatong Shi , Xutai Ma , Hirofumi Inaguma , Anna Sun , Shinji Watanabe

Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss

Self-supervised learning (SSL) is the latest breakthrough in speech processing, especially for label-scarce downstream tasks by leveraging massive unlabeled audio data. The noise robustness of the SSL is one of the important challenges to…

Audio and Speech Processing · Electrical Eng. & Systems 2023-05-25 Hiroshi Sato , Ryo Masumura , Tsubasa Ochiai , Marc Delcroix , Takafumi Moriya , Takanori Ashihara , Kentaro Shinayama , Saki Mizuno , Mana Ihori , Tomohiro Tanaka , Nobukatsu Hojo

MEGEX: Data-Free Model Extraction Attack against Gradient-Based Explainable AI

The advance of explainable artificial intelligence, which provides reasons for its predictions, is expected to accelerate the use of deep neural networks in the real world like Machine Learning as a Service (MLaaS) that returns predictions…

Cryptography and Security · Computer Science 2021-07-20 Takayuki Miura , Satoshi Hasegawa , Toshiki Shibahara

On the social bias of speech self-supervised models

Self-supervised learning (SSL) speech models have achieved remarkable performance in various tasks, yet the biased outcomes, especially affecting marginalized groups, raise significant concerns. Social bias refers to the phenomenon where…

Audio and Speech Processing · Electrical Eng. & Systems 2026-01-06 Yi-Cheng Lin , Tzu-Quan Lin , Hsi-Che Lin , Andy T. Liu , Hung-yi Lee