Related papers: Reprogramming Self-supervised Learning-based Speec…

Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models

Speaker anonymization aims to protect the privacy of speakers while preserving spoken linguistic information from speech. Current mainstream neural network speaker anonymization systems are complicated, containing an F0 extractor, speaker…

Sound · Computer Science 2022-04-28 Xiaoxiao Miao , Xin Wang , Erica Cooper , Junichi Yamagishi , Natalia Tomashenko

Eta-WavLM: Efficient Speaker Identity Removal in Self-Supervised Speech Representations Using a Simple Linear Equation

Self-supervised learning (SSL) has reduced the reliance on expensive labeling in speech technologies by learning meaningful representations from unannotated data. Since most SSL-based downstream tasks prioritize content information in…

Sound · Computer Science 2025-05-27 Giuseppe Ruggiero , Matteo Testa , Jurgen Van de Walle , Luigi Di Caro

Towards Supervised Performance on Speaker Verification with Self-Supervised Learning by Leveraging Large-Scale ASR Models

Recent advancements in Self-Supervised Learning (SSL) have shown promising results in Speaker Verification (SV). However, narrowing the performance gap with supervised systems remains an ongoing challenge. Several studies have observed that…

Audio and Speech Processing · Electrical Eng. & Systems 2025-06-25 Victor Miara , Theo Lepage , Reda Dehak

Distinctive and Natural Speaker Anonymization via Singular Value Transformation-assisted Matrix

Speaker anonymization is an effective privacy protection solution that aims to conceal the speaker's identity while preserving the naturalness and distinctiveness of the original speech. Mainstream approaches use an utterance-level vector…

Audio and Speech Processing · Electrical Eng. & Systems 2024-05-20 Jixun Yao , Qing Wang , Pengcheng Guo , Ziqian Ning , Lei Xie

Privacy-preserving Representation Learning for Speech Understanding

Existing privacy-preserving speech representation learning methods target a single application domain. In this paper, we present a novel framework to anonymize utterance-level speech embeddings generated by pre-trained encoders and show its…

Audio and Speech Processing · Electrical Eng. & Systems 2023-10-27 Minh Tran , Mohammad Soleymani

End-to-end streaming model for low-latency speech anonymization

Speaker anonymization aims to conceal cues to speaker identity while preserving linguistic content. Current machine learning based approaches require substantial computational resources, hindering real-time streaming applications. To…

Audio and Speech Processing · Electrical Eng. & Systems 2024-11-04 Waris Quamer , Ricardo Gutierrez-Osuna

Speaker Anonymization with Phonetic Intermediate Representations

In this work, we propose a speaker anonymization pipeline that leverages high quality automatic speech recognition and synthesis systems to generate speech conditioned on phonetic transcriptions and anonymized speaker embeddings. Using…

Sound · Computer Science 2022-07-12 Sarina Meyer , Florian Lux , Pavel Denisov , Julia Koch , Pascal Tilli , Ngoc Thang Vu

Towards End-to-End Private Automatic Speaker Recognition

The development of privacy-preserving automatic speaker verification systems has been the focus of a number of studies with the intent of allowing users to authenticate themselves without risking the privacy of their voice. However, current…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-28 Francisco Teixeira , Alberto Abad , Bhiksha Raj , Isabel Trancoso

Differentially Private Speaker Anonymization

Sharing real-world speech utterances is key to the training and deployment of voice-based services. However, it also raises privacy risks as speech contains a wealth of personal data. Speaker anonymization aims to remove speaker information…

Sound · Computer Science 2022-10-07 Ali Shahin Shamsabadi , Brij Mohan Lal Srivastava , Aurélien Bellet , Nathalie Vauquier , Emmanuel Vincent , Mohamed Maouche , Marc Tommasi , Nicolas Papernot

On-Device Speaker Anonymization of Acoustic Embeddings for ASR based onFlexible Location Gradient Reversal Layer

Smart devices serviced by large-scale AI models necessitates user data transfer to the cloud for inference. For speech applications, this means transferring private user information, e.g., speaker identity. Our paper proposes a…

Audio and Speech Processing · Electrical Eng. & Systems 2023-07-26 Md Asif Jalal , Pablo Peso Parada , Jisi Zhang , Karthikeyan Saravanan , Mete Ozay , Myoungji Han , Jung In Lee , Seokyeong Jung

SEF-MK: Speaker-Embedding-Free Voice Anonymization through Multi-k-means Quantization

Voice anonymization protects speaker privacy by concealing identity while preserving linguistic and paralinguistic content. Self-supervised learning (SSL) representations encode linguistic features but preserve speaker traits. We propose a…

Sound · Computer Science 2025-08-19 Beilong Tang , Xiaoxiao Miao , Xin Wang , Ming Li

Are disentangled representations all you need to build speaker anonymization systems?

Speech signals contain a lot of sensitive information, such as the speaker's identity, which raises privacy concerns when speech data get collected. Speaker anonymization aims to transform a speech signal to remove the source speaker's…

Sound · Computer Science 2023-01-16 Pierre Champion , Denis Jouvet , Anthony Larcher

Privacy-Preserving Speech Representation Learning using Vector Quantization

With the popularity of virtual assistants (e.g., Siri, Alexa), the use of speech recognition is now becoming more and more widespread.However, speech signals contain a lot of sensitive information, such as the speaker's identity, which…

Audio and Speech Processing · Electrical Eng. & Systems 2022-03-21 Pierre Champion , Denis Jouvet , Anthony Larcher

Speaker Anonymization Using X-vector and Neural Waveform Models

The social media revolution has produced a plethora of web services to which users can easily upload and share multimedia documents. Despite the popularity and convenience of such services, the sharing of such inherently personal data,…

Audio and Speech Processing · Electrical Eng. & Systems 2019-06-03 Fuming Fang , Xin Wang , Junichi Yamagishi , Isao Echizen , Massimiliano Todisco , Nicholas Evans , Jean-Francois Bonastre

VoicePAT: An Efficient Open-source Evaluation Toolkit for Voice Privacy Research

Speaker anonymization is the task of modifying a speech recording such that the original speaker cannot be identified anymore. Since the first Voice Privacy Challenge in 2020, along with the release of a framework, the popularity of this…

Sound · Computer Science 2023-12-25 Sarina Meyer , Xiaoxiao Miao , Ngoc Thang Vu

UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training

Self-supervised learning (SSL) is a long-standing goal for speech processing, since it utilizes large-scale unlabeled data and avoids extensive human labeling. Recent years witness great successes in applying self-supervised learning in…

Computation and Language · Computer Science 2021-10-13 Sanyuan Chen , Yu Wu , Chengyi Wang , Zhengyang Chen , Zhuo Chen , Shujie Liu , Jian Wu , Yao Qian , Furu Wei , Jinyu Li , Xiangzhan Yu

Exploring Efficient-tuning Methods in Self-supervised Speech Models

In this study, we aim to explore efficient tuning methods for speech self-supervised learning. Recent studies show that self-supervised learning (SSL) can learn powerful representations for different speech tasks. However, fine-tuning…

Audio and Speech Processing · Electrical Eng. & Systems 2023-01-31 Zih-Ching Chen , Chin-Lun Fu , Chih-Ying Liu , Shang-Wen Li , Hung-yi Lee

Anonymizing Speech: Evaluating and Designing Speaker Anonymization Techniques

The growing use of voice user interfaces has led to a surge in the collection and storage of speech data. While data collection allows for the development of efficient tools powering most speech services, it also poses serious privacy…

Cryptography and Security · Computer Science 2024-03-04 Pierre Champion

SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation

Speaker anonymization aims to conceal a speaker's identity without degrading speech quality and intelligibility. Most speaker anonymization systems disentangle the speaker representation from the original speech and achieve anonymization by…

Sound · Computer Science 2023-10-10 Yuanjun Lv , Jixun Yao , Peikun Chen , Hongbin Zhou , Heng Lu , Lei Xie

Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding

Voice anonymization has been developed as a technique for preserving privacy by replacing the speaker's voice in a speech signal with that of a pseudo-speaker, thereby obscuring the original voice attributes from machine recognition and…

Sound · Computer Science 2024-11-13 Rui Wang , Liping Chen , Kong AiK Lee , Zhen-Hua Ling