English
Related papers

Related papers: DiffAttack: Diffusion-based Timbre-reserved Advers…

200 papers

As a type of biometric identification, a speaker identification (SID) system is confronted with various kinds of attacks. The spoofing attacks typically imitate the timbre of the target speakers, while the adversarial attacks confuse the…

Sound · Computer Science 2023-09-06 Qing Wang , Jixun Yao , Li Zhang , Pengcheng Guo , Lei Xie

In this study, we propose a timbre-reserved adversarial attack approach for speaker identification (SID) to not only exploit the weakness of the SID model but also preserve the timbre of the target speaker in a black-box attack setting.…

Sound · Computer Science 2023-05-31 Qing Wang , Jixun Yao , Ziqian Wang , Pengcheng Guo , Lei Xie

Recently, automatic speaker verification (ASV) based on deep learning is easily contaminated by adversarial attacks, which is a new type of attack that injects imperceptible perturbations to audio signals so as to make ASV produce wrong…

Audio and Speech Processing · Electrical Eng. & Systems 2024-07-10 Yibo Bai , Xiao-Lei Zhang , Xuelong Li

Adversarial attacks involve adding perturbations to the source image to cause misclassification by the target model, which demonstrates the potential of attacking face recognition models. Existing adversarial face image generation methods…

Computer Vision and Pattern Recognition · Computer Science 2023-12-29 Decheng Liu , Xijun Wang , Chunlei Peng , Nannan Wang , Ruiming Hu , Xinbo Gao

A primary challenge when deploying speaker recognition systems in real-world applications is performance degradation caused by environmental mismatch. We propose a diffusion-based method that takes speaker embeddings extracted from a…

Audio and Speech Processing · Electrical Eng. & Systems 2025-05-23 KiHyun Nam , Jungwoo Heo , Jee-weon Jung , Gangin Park , Chaeyoung Jung , Ha-Jin Yu , Joon Son Chung

The strategy of combining diffusion-based generative models with classifiers continues to demonstrate state-of-the-art performance on adversarial robustness benchmarks. Known as adversarial purification, this exploits a diffusion model's…

Cryptography and Security · Computer Science 2026-01-06 David D. Nguyen , The-Anh Ta , Yansong Gao , Alsharif Abuadbba

Existing audio-text retrieval (ATR) methods are essentially discriminative models that aim to maximize the conditional likelihood, represented as p(candidates|query). Nevertheless, this methodology fails to consider the intrinsic data…

Sound · Computer Science 2024-10-18 Yifei Xin , Xuxin Cheng , Zhihong Zhu , Xusheng Yang , Yuexian Zou

Automatic speech recognition (ASR) systems are known to be vulnerable to adversarial attacks. This paper addresses detection and defence against targeted white-box attacks on speech signals for ASR systems. While existing work has utilised…

Audio and Speech Processing · Electrical Eng. & Systems 2024-09-13 Nikolai L. Kühne , Astrid H. F. Kitchen , Marie S. Jensen , Mikkel S. L. Brøndt , Martin Gonzalez , Christophe Biscio , Zheng-Hua Tan

Adversarial attacks, particularly patch attacks, pose significant threats to the robustness and reliability of deep learning models. Developing reliable defenses against patch attacks is crucial for real-world applications. This paper…

Computer Vision and Pattern Recognition · Computer Science 2024-07-18 Caixin Kang , Yinpeng Dong , Zhengyi Wang , Shouwei Ruan , Yubo Chen , Hang Su , Xingxing Wei

With the development of diffusion-based customization methods like DreamBooth, individuals now have access to train the models that can generate their personalized images. Despite the convenience, malicious users have misused these…

Computer Vision and Pattern Recognition · Computer Science 2024-07-29 Yisu Liu , Jinyang An , Wanqian Zhang , Dayan Wu , Jingzi Gu , Zheng Lin , Weiping Wang

Advancements in artificial intelligence and machine learning have significantly improved synthetic speech generation. This paper explores diffusion models, a novel method for creating realistic synthetic speech. We create a diffusion…

Cryptography and Security · Computer Science 2025-01-15 Anton Firc , Kamil Malinka , Petr Hanáček

Despite their immense popularity, deep learning-based acoustic systems are inherently vulnerable to adversarial attacks, wherein maliciously crafted audios trigger target systems to misbehave. In this paper, we present SirenAttack, a new…

Cryptography and Security · Computer Science 2019-07-25 Tianyu Du , Shouling Ji , Jinfeng Li , Qinchen Gu , Ting Wang , Raheem Beyah

Diffusion-based purification defenses leverage diffusion models to remove crafted perturbations of adversarial examples and achieve state-of-the-art robustness. Recent studies show that even advanced attacks cannot break such defenses…

Cryptography and Security · Computer Science 2024-01-05 Mintong Kang , Dawn Song , Bo Li

Speaker verification systems are increasingly deployed in security-sensitive applications but remain highly vulnerable to adversarial perturbations. In this work, we propose the Mask Diffusion Detector (MDD), a novel adversarial detection…

Audio and Speech Processing · Electrical Eng. & Systems 2025-08-27 Yibo Bai , Sizhou Chen , Michele Panariello , Xiao-Lei Zhang , Massimiliano Todisco , Nicholas Evans

Speaker recognition is a popular topic in biometric authentication and many deep learning approaches have achieved extraordinary performances. However, it has been shown in both image and speech applications that deep neural networks are…

Sound · Computer Science 2020-05-25 Qing Wang , Pengcheng Guo , Lei Xie

Dysarthric speech reconstruction (DSR) aims to convert dysarthric speech into comprehensible speech while maintaining the speaker's identity. Despite significant advancements, existing methods often struggle with low speech intelligibility…

Sound · Computer Science 2025-06-03 Xueyuan Chen , Dongchao Yang , Wenxuan Wu , Minglin Wu , Jing Xu , Xixin Wu , Zhiyong Wu , Helen Meng

The diffusion model is capable of generating high-quality data through a probabilistic approach. However, it suffers from the drawback of slow generation speed due to the requirement of a large number of time steps. To address this…

Sound · Computer Science 2024-04-30 Myeongjin Ko , Yong-Hoon Choi

Animating virtual avatars to make co-speech gestures facilitates various applications in human-machine interaction. The existing methods mainly rely on generative adversarial networks (GANs), which typically suffer from notorious mode…

Computer Vision and Pattern Recognition · Computer Science 2023-03-21 Lingting Zhu , Xian Liu , Xuanyu Liu , Rui Qian , Ziwei Liu , Lequan Yu

Voice biometric systems based on automatic speaker verification (ASV) are exposed to \textit{spoofing} attacks which may compromise their security. To increase the robustness against such attacks, anti-spoofing or presentation attack…

Audio and Speech Processing · Electrical Eng. & Systems 2022-01-05 Alejandro Gomez-Alanis , Jose A. Gonzalez-Lopez , Antonio M. Peinado

Deep neural network based speaker recognition systems can easily be deceived by an adversary using minuscule imperceptible perturbations to the input speech samples. These adversarial attacks pose serious security threats to the speaker…

Audio and Speech Processing · Electrical Eng. & Systems 2020-11-02 Monisankha Pal , Arindam Jati , Raghuveer Peri , Chin-Cheng Hsu , Wael AbdAlmageed , Shrikanth Narayanan
‹ Prev 1 2 3 10 Next ›