Reprogramming Self-supervised Learning-based Speech Representations for Speaker Anonymization

Xiaojiao Chen; Sheng Li; Jiyi Li; Hao Huang; Yang Cao; Liang He

Audio and Speech Processing 2023-11-20 v1

Abstract

Current speaker anonymization methods, especially those built on self-supervised learning (SSL) models, require massive computational resources to hide speaker identity. This paper proposes an effective and parameter-efficient speaker anonymization method based on recent end-to-end model reprogramming technology. To improve anonymization performance, we first extract speaker representations from large SSL models to serve as the speaker identity. To hide that identity, we reprogram the speaker representation by adapting the speaker to a pseudo domain. Extensive experiments on the VoicePrivacy Challenge (VPC) 2022 datasets demonstrate the effectiveness of the proposed parameter-efficient anonymization method. Additionally, while achieving performance comparable to the strong VPC 2022 baseline 1.b, our approach consumes fewer computational resources during anonymization.

Keywords

speaker recognition and verification; speaker verification and anti-spoofing; self-supervised speech learning

Cite

@article{arxiv.2311.10664,
  title  = {Reprogramming Self-supervised Learning-based Speech Representations for Speaker Anonymization},
  author = {Xiaojiao Chen and Sheng Li and Jiyi Li and Hao Huang and Yang Cao and Liang He},
  journal= {arXiv preprint arXiv:2311.10664},
  year   = {2023}
}

Comments

Accepted at ACM Multimedia Asia 2023
