Related papers: Characterizing Audio Adversarial Examples Using Te…

Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition

Adversarial examples are inputs to machine learning models designed by an adversary to cause an incorrect output. So far, adversarial examples have been studied most extensively in the image domain. In this domain, adversarial examples can…

Audio and Speech Processing · Electrical Eng. & Systems 2019-06-10 Yao Qin, Nicholas Carlini, Ian Goodfellow, Garrison Cottrell, Colin Raffel

Detecting Adversarial Attacks On Audiovisual Speech Recognition

Adversarial attacks pose a threat to deep learning models. However, research on adversarial detection methods, especially in the multi-modal domain, is very limited. In this work, we propose an efficient and straightforward detection method…

Computer Vision and Pattern Recognition · Computer Science 2021-02-15 Pingchuan Ma, Stavros Petridis, Maja Pantic

Weighted-Sampling Audio Adversarial Example Attack

Recent studies have highlighted audio adversarial examples as a ubiquitous threat to state-of-the-art automatic speech recognition systems. Thorough studies on how to effectively generate adversarial examples are essential to prevent…

Audio and Speech Processing · Electrical Eng. & Systems 2024-03-13 Xiaolei Liu, Xiaosong Zhang, Kun Wan, Qingxin Zhu, Yufei Ding

Perceptual Based Adversarial Audio Attacks

Recent work has shown the possibility of adversarial attacks on automatic speech recognition (ASR) systems. However, in the vast majority of work in this area, the attacks have been executed only in the digital space, or have involved short…

Audio and Speech Processing · Electrical Eng. & Systems 2019-06-18 Joseph Szurley, J. Zico Kolter

Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization

With the widespread application of automatic speech recognition (ASR) systems, their vulnerability to adversarial attacks has been extensively studied. However, most existing adversarial examples are generated on specific individual models,…

Sound · Computer Science 2025-03-26 Weifei Jin, Junjie Su, Hejia Wang, Yulin Ye, Jie Hao

Robustifying automatic speech recognition by extracting slowly varying features

In the past few years, it has been shown that deep learning systems are highly vulnerable under attacks with adversarial examples. Neural-network-based automatic speech recognition (ASR) systems are no exception. Targeted and untargeted…

Audio and Speech Processing · Electrical Eng. & Systems 2024-11-07 Matías Pizarro, Dorothea Kolossa, Asja Fischer

Adversarial Example Devastation and Detection on Speech Recognition System by Adding Random Noise

An automatic speech recognition (ASR) system based on a deep neural network is vulnerable to attack by an adversarial example, especially if the command-dependent ASR fails. A defense method against adversarial examples is proposed to…

Sound · Computer Science 2021-10-19 Mingyu Dong, Diqun Yan, Yongkang Gong, Rangding Wang

Dompteur: Taming Audio Adversarial Examples

Adversarial examples seem to be inevitable. These specifically crafted inputs allow attackers to arbitrarily manipulate machine learning systems. Even worse, they often seem harmless to human observers. In our digital society, this poses a…

Cryptography and Security · Computer Science 2021-06-04 Thorsten Eisenhofer, Lea Schönherr, Joel Frank, Lars Speckemeier, Dorothea Kolossa, Thorsten Holz

An Integrated Algorithm for Robust and Imperceptible Audio Adversarial Examples

Audio adversarial examples are audio files that have been manipulated to fool an automatic speech recognition (ASR) system, while still sounding benign to a human listener. Most methods to generate such samples are based on a two-step…

Sound · Computer Science 2023-10-06 Armin Ettenhofer, Jan-Philipp Schulze, Karla Pizzi

Training Augmentation with Adversarial Examples for Robust Speech Recognition

This paper explores the use of adversarial examples in training speech recognition systems to increase robustness of deep neural network acoustic models. During training, the fast gradient sign method is used to generate adversarial…

Computation and Language · Computer Science 2018-06-19 Sining Sun, Ching-Feng Yeh, Mari Ostendorf, Mei-Yuh Hwang, Lei Xie

Language Dependencies in Adversarial Attacks on Speech Recognition Systems

Automatic speech recognition (ASR) systems are ubiquitously present in our daily devices. They are vulnerable to adversarial attacks, where manipulated input samples fool the ASR system's recognition. While adversarial examples for various…

Computation and Language · Computer Science 2022-02-03 Karla Markert, Donika Mirdita, Konstantin Böttinger

Robust Audio Adversarial Example for a Physical Attack

We propose a method to generate audio adversarial examples that can attack a state-of-the-art speech recognition model in the physical world. Previous work assumes that generated adversarial examples are directly fed to the recognition…

Machine Learning · Computer Science 2019-08-20 Hiromu Yakura, Jun Sakuma

Adversarial Learning of Raw Speech Features for Domain Invariant Speech Recognition

Recent advances in neural network based acoustic modelling have shown significant improvements in automatic speech recognition (ASR) performance. In order for acoustic models to be able to handle large acoustic variability, large amounts of…

Audio and Speech Processing · Electrical Eng. & Systems 2018-05-23 Aditay Tripathi, Aanchan Mohan, Saket Anand, Maneesh Singh

Towards Resistant Audio Adversarial Examples

Adversarial examples tremendously threaten the availability and integrity of machine learning-based systems. While the feasibility of such attacks has been observed first in the domain of image processing, recent research shows that speech…

Sound · Computer Science 2020-10-15 Tom Dörr, Karla Markert, Nicolas M. Müller, Konstantin Böttinger

WaveGuard: Understanding and Mitigating Audio Adversarial Examples

There has been a recent surge in adversarial attacks on deep learning based automatic speech recognition (ASR) systems. These attacks pose new challenges to deep learning security and have raised significant concerns in deploying ASR…

Cryptography and Security · Computer Science 2021-03-08 Shehzeen Hussain, Paarth Neekhara, Shlomo Dubnov, Julian McAuley, Farinaz Koushanfar

Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement

Recent studies have highlighted adversarial examples as ubiquitous threats to the deep neural network (DNN) based speech recognition systems. In this work, we present a U-Net based attention model, U-Net$_{At}$, to enhance adversarial…

Audio and Speech Processing · Electrical Eng. & Systems 2022-01-04 Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Xiaoli Ma, Chin-Hui Lee

Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise

In recent years, significant progress has been made in deep model-based automatic speech recognition (ASR), leading to its widespread deployment in the real world. At the same time, adversarial attacks against deep ASR systems are highly…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-04 Christian Heider Nielsen, Zheng-Hua Tan

Modeling Adversarial Noise for Adversarial Training

Deep neural networks have been demonstrated to be vulnerable to adversarial noise, promoting the development of defense against adversarial attacks. Motivated by the fact that adversarial noise contains well-generalizing features and that…

Machine Learning · Computer Science 2022-07-19 Dawei Zhou, Nannan Wang, Bo Han, Tongliang Liu

Invariant Representations for Noisy Speech Recognition

Modern automatic speech recognition (ASR) systems need to be robust under acoustic variability arising from environmental, speaker, channel, and recording conditions. Ensuring such robustness to variability is a challenge in modern day…

Computation and Language · Computer Science 2016-12-07 Dmitriy Serdyuk, Kartik Audhkhasi, Philémon Brakel, Bhuvana Ramabhadran, Samuel Thomas, Yoshua Bengio

Exploiting Context-dependent Duration Features for Voice Anonymization Attack Systems

The temporal dynamics of speech, encompassing variations in rhythm, intonation, and speaking rate, contain important and unique information about speaker identity. This paper proposes a new method for representing speaker characteristics by…

Sound · Computer Science 2025-07-22 Natalia Tomashenko, Emmanuel Vincent, Marc Tommasi