English

Reprogramming Self-supervised Learning-based Speech Representations for Speaker Anonymization

Audio and Speech Processing 2023-11-20 v1

Abstract

Current speaker anonymization methods, especially with self-supervised learning (SSL) models, require massive computational resources when hiding speaker identity. This paper proposes an effective and parameter-efficient speaker anonymization method based on recent End-to-End model reprogramming technology. To improve the anonymization performance, we first extract speaker representation from large SSL models as the speaker identifies. To hide the speaker's identity, we reprogram the speaker representation by adapting the speaker to a pseudo domain. Extensive experiments are carried out on the VoicePrivacy Challenge (VPC) 2022 datasets to demonstrate the effectiveness of our proposed parameter-efficient learning anonymization methods. Additionally, while achieving comparable performance with the VPC 2022 strong baseline 1.b, our approach consumes less computational resources during anonymization.

Keywords

Cite

@article{arxiv.2311.10664,
  title  = {Reprogramming Self-supervised Learning-based Speech Representations for Speaker Anonymization},
  author = {Xiaojiao Chen and Sheng Li and Jiyi Li and Hao Huang and Yang Cao and Liang He},
  journal= {arXiv preprint arXiv:2311.10664},
  year   = {2023}
}

Comments

accepted in ACM Multimedia Asia2023

R2 v1 2026-06-28T13:24:26.701Z