Related papers: Codebook Design Method for Noise Robust Speaker Id…

Improvement of Text Dependent Speaker Identification System Using Neuro-Genetic Hybrid Algorithm in Office Environmental Conditions

In this paper, an improved strategy for automated text dependent speaker identification system has been proposed in noisy environment. The identification process incorporates the Neuro-Genetic hybrid algorithm with cepstral based features.…

Sound · Computer Science 2009-09-15 Md. Rabiul Islam, Md. Fayzur Rahman

Robust Speaker Recognition Using Speech Enhancement And Attention Model

In this paper, a novel architecture for speaker recognition is proposed by cascading speech enhancement and speaker processing. Its aim is to improve speaker recognition performance when speech signals are corrupted by noise. Instead of…

Computation and Language · Computer Science 2020-05-25 Yanpei Shi, Qiang Huang, Thomas Hain

Improving Noise Robustness In Speaker Identification Using A Two-Stage Attention Model

While the use of deep neural networks has significantly boosted speaker recognition performance, it is still challenging to separate speakers in poor acoustic environments. To improve robustness of speaker recognition system performance in…

Audio and Speech Processing · Electrical Eng. & Systems 2020-05-19 Yanpei Shi, Qiang Huang, Thomas Hain

Robust parameter design for Wiener-based binaural noise reduction methods in hearing aids

This work presents a method for designing the weighting parameter required by Wiener-based binaural noise reduction methods. This parameter establishes the desired tradeoff between noise reduction and binaural cue preservation in hearing…

Audio and Speech Processing · Electrical Eng. & Systems 2021-04-21 Diego M. Carmo, Ricardo Borsoi, Márcio H. Costa

Neural Network Based Speaker Classification and Verification Systems with Enhanced Features

This work presents a novel framework based on feed-forward neural network for text-independent speaker classification and verification, two related systems of speaker recognition. With optimized features and model training, it achieves 100%…

Sound · Computer Science 2017-03-20 Zhenhao Ge, Ananth N. Iyer, Srinath Cheluvaraja, Ram Sundaram, Aravind Ganapathiraju

Optimization of data-driven filterbank for automatic speaker verification

Most of the speech processing applications use triangular filters spaced in mel-scale for feature extraction. In this paper, we propose a new data-driven filter design method which optimizes filter parameters from a given speech data.…

Audio and Speech Processing · Electrical Eng. & Systems 2020-07-22 Susanta Sarangi, Md Sahidullah, Goutam Saha

Speaker Re-identification with Speaker Dependent Speech Enhancement

While the use of deep neural networks has significantly boosted speaker recognition performance, it is still challenging to separate speakers in poor acoustic environments. Here speech enhancement methods have traditionally allowed improved…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-28 Yanpei Shi, Qiang Huang, Thomas Hain

Robust Training for Speaker Verification against Noisy Labels

The deep learning models used for speaker verification rely heavily on large amounts of data and correct labeling. However, noisy (incorrect) labels often occur, which degrades the performance of the system. In this paper, we propose a…

Sound · Computer Science 2026-04-29 Zhihua Fang, Liang He, Hanhan Ma, Xiaochen Guo, Lin Li

Enhancing Automatic Speech Recognition Through Integrated Noise Detection Architecture

This research presents a novel approach to enhancing automatic speech recognition systems by integrating noise detection capabilities directly into the recognition architecture. Building upon the wav2vec2 framework, the proposed method…

Sound · Computer Science 2025-12-11 Karamvir Singh

Adversarial Network Bottleneck Features for Noise Robust Speaker Verification

In this paper, we propose a noise robust bottleneck feature representation which is generated by an adversarial network (AN). The AN includes two cascade connected networks, an encoding network (EN) and a discriminative network (DN).…

Sound · Computer Science 2017-06-13 Hong Yu, Zheng-Hua Tan, Zhanyu Ma, Jun Guo

Enhancing Speaker Verification with Whispered Speech via Post-Processing

Speaker verification is a task of confirming an individual's identity through the analysis of their voice. Whispered speech differs from phonated speech in acoustic characteristics, which degrades the performance of speaker verification…

Sound · Computer Science 2026-05-08 Magdalena Gołębiowska, Piotr Syga

Speaker recognition with two-step multi-modal deep cleansing

Neural network-based speaker recognition has achieved significant improvement in recent years. A robust speaker representation learns meaningful knowledge from both hard and easy samples in the training set to achieve good performance.…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-31 Ruijie Tao, Kong Aik Lee, Zhan Shi, Haizhou Li

Speech denoising by parametric resynthesis

This work proposes the use of clean speech vocoder parameters as the target for a neural network performing speech enhancement. These parameters have been designed for text-to-speech synthesis so that they both produce high-quality…

Audio and Speech Processing · Electrical Eng. & Systems 2019-04-03 Soumi Maiti, Michael I Mandel

Source-Aware Neural Speech Coding for Noisy Speech Compression

This paper introduces a novel neural network-based speech coding system that can process noisy speech effectively. The proposed source-aware neural audio coding (SANAC) system harmonizes a deep autoencoder-based source separation model and…

Audio and Speech Processing · Electrical Eng. & Systems 2020-11-11 Haici Yang, Kai Zhen, Seungkwon Beack, Minje Kim

A Robust Frame-based Nonlinear Prediction System for Automatic Speech Coding

In this paper, we propose a neural-based coding scheme in which an artificial neural network is exploited to automatically compress and decompress speech signals by a trainable approach. Having a two-stage training phase, the system can be…

Sound · Computer Science 2016-01-25 Mahmood Yousefi-Azar, Farbod Razzazi

Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder

Recent neural networks such as WaveNet and sampleRNN that learn directly from speech waveform samples have achieved very high-quality synthetic speech in terms of both naturalness and speaker similarity even in multi-speaker text-to-speech…

Audio and Speech Processing · Electrical Eng. & Systems 2018-08-01 Yi Zhao, Shinji Takaki, Hieu-Thi Luong, Junichi Yamagishi, Daisuke Saito, Nobuaki Minematsu

Boosting Noise Robustness of Acoustic Model via Deep Adversarial Training

In realistic environments, speech is usually interfered by various noise and reverberation, which dramatically degrades the performance of automatic speech recognition (ASR) systems. To alleviate this issue, the commonest way is to use a…

Sound · Computer Science 2018-05-04 Bin Liu, Shuai Nie, Yaping Zhang, Dengfeng Ke, Shan Liang, Wenju Liu

A New Nonlinear speaker parameterization algorithm for speaker identification

In this paper we propose a new parameterization algorithm based on nonlinear prediction, which is an extension of the classical LPC parameters. The parameters performances are estimated by two different methods: the Arithmetic-Harmonic…

Sound · Computer Science 2022-04-07 Mohamed Chetouani, Marcos Faundez-Zanuy, Bruno Gas, Jean-Luc Zarader

Speaker recognition with a MLP classifier and LPCC codebook

This paper improves the speaker recognition rates of a MLP classifier and LPCC codebook alone, using a linear combination between both methods. In simulations we have obtained an improvement of 4.7% over a LPCC codebook of 32 vectors and…

Sound · Computer Science 2022-03-23 Daniel Rodriguez-Porcheron, Marcos Faundez-Zanuy

Noise-Conditioned Mixture-of-Experts Framework for Robust Speaker Verification

Robust speaker verification under noisy conditions remains an open challenge. Conventional deep learning methods learn a robust unified speaker representation space against diverse background noise and achieve significant improvement. In…

Sound · Computer Science 2026-03-11 Bin Gu, Haitao Zhao, Jibo Wei