Related papers: Speaker Identification using MFCC-Domain Support V…

A text-independent speaker verification model: A comparative analysis

The most pressing challenge in the field of voice biometrics is selecting the most efficient technique of speaker recognition. Every individual's voice is peculiar, factors like physical differences in vocal organs, accent and pronunciation…

Sound · Computer Science 2017-12-05 Rishi Charan , Manisha. A , Karthik. R , Rajesh Kumar M

Text Independent Speaker Identification System for Access Control

Even human intelligence system fails to offer 100% accuracy in identifying speeches from a specific individual. Machine intelligence is trying to mimic humans in speaker identification problems through various approaches to speech feature…

Audio and Speech Processing · Electrical Eng. & Systems 2022-09-30 Oluyemi E. Adetoyi

Robust Support Vector Machines for Speaker Verification Task

An important step in speaker verification is extracting features that best characterize the speaker voice. This paper investigates a front-end processing that aims at improving the performance of speaker verification based on the SVMs…

Machine Learning · Computer Science 2013-06-13 Kawthar Yasmine Zergat , Abderrahmane Amrouche

Speaker Verification Using Simple Temporal Features and Pitch Synchronous Cepstral Coefficients

Speaker verification is the process by which a speakers claim of identity is tested against a claimed speaker by his or her voice. Speaker verification is done by the use of some parameters (features) from the speakers voice which can be…

Sound · Computer Science 2019-08-16 Bhavana V. S , Pradip K. Das

A Novel Windowing Technique for Efficient Computation of MFCC for Speaker Recognition

In this paper, we propose a novel family of windowing technique to compute Mel Frequency Cepstral Coefficient (MFCC) for automatic speaker recognition from speech. The proposed method is based on fundamental property of discrete time…

Computer Vision and Pattern Recognition · Computer Science 2015-06-05 Md. Sahidullah , Goutam Saha

Wavelet-Based Mel-Frequency Cepstral Coefficients for Speaker Identification using Hidden Markov Models

To improve the performance of speaker identification systems, an effective and robust method is proposed to extract speech features, capable of operating in noisy environment. Based on the time-frequency multi-resolution property of wavelet…

Sound · Computer Science 2010-03-31 Mahmoud I. Abdalla , Hanaa S. Ali

Speaker Recognition using Deep Belief Networks

Short time spectral features such as mel frequency cepstral coefficients(MFCCs) have been previously deployed in state of the art speaker recognition systems, however lesser heed has been paid to short term spectral features that can be…

Audio and Speech Processing · Electrical Eng. & Systems 2018-05-24 Adrish Banerjee , Akash Dubey , Abhishek Menon , Shubham Nanda , Gora Chand Nandi

Histogram Transform-based Speaker Identification

A novel text-independent speaker identification (SI) method is proposed. This method uses the Mel-frequency Cepstral coefficients (MFCCs) and the dynamic information among adjacent frames as feature sets to capture speaker's…

Sound · Computer Science 2020-02-04 Zhanyu Ma , Hong Yu

Speech Emotion Recognition Using MFCC Features and LSTM-Based Deep Learning Model

Speech Emotion Recognition (SER) is the use of machines to detect the emotional state of humans based on the speech, which is gaining importance in natural human-computer interaction. Speech is a very valuable source of information, as…

Sound · Computer Science 2026-04-30 Adelekun Oluwademilade , Ademola Adedamola , Abiola Abdulhakeem , Akinpelu Azeezat , Eraiyetan Israel , Omotosho Oluwadunsin , Ibenye Ikechukwu , Ayuba Muhammad , Olusanya Olamide , Kamorudeen Amuda

A Multi Level Data Fusion Approach for Speaker Identification on Telephone Speech

Several speaker identification systems are giving good performance with clean speech but are affected by the degradations introduced by noisy audio conditions. To deal with this problem, we investigate the use of complementary information…

Sound · Computer Science 2014-07-03 Imen Trabelsi , Dorra Ben Ayed

Gender Identification using MFCC for Telephone Applications - A Comparative Study

Gender recognition is an essential component of automatic speech recognition and interactive voice response systems. Determining gender of the speaker reduces the computational burden of such systems for any further processing. Typical…

Sound · Computer Science 2016-01-08 Jamil Ahmad , Mustansar Fiaz , Soon-il Kwon , Maleerat Sodanil , Bay Vo , Sung Wook Baik

Learnable MFCCs for Speaker Verification

We propose a learnable mel-frequency cepstral coefficient (MFCC) frontend architecture for deep neural network (DNN) based automatic speaker verification. Our architecture retains the simplicity and interpretability of MFCC-based features…

Sound · Computer Science 2021-02-23 Xuechen Liu , Md Sahidullah , Tomi Kinnunen

Enhancement of a Text-Independent Speaker Verification System by using Feature Combination and Parallel-Structure Classifiers

Speaker Verification (SV) systems involve mainly two individual stages: feature extraction and classification. In this paper, we explore these two modules with the aim of improving the performance of a speaker verification system under…

Audio and Speech Processing · Electrical Eng. & Systems 2024-02-06 Kerlos Atia Abdalmalak , Ascensión Gallardo-Antol'in

On the Use of Different Feature Extraction Methods for Linear and Non Linear kernels

The speech feature extraction has been a key focus in robust speech recognition research; it significantly affects the recognition performance. In this paper, we first study a set of different features extraction methods such as linear…

Computation and Language · Computer Science 2014-07-01 Imen Trabelsi , Dorra Ben Ayed

A Comparison of Classifiers in Performing Speaker Accent Recognition Using MFCCs

An algorithm involving Mel-Frequency Cepstral Coefficients (MFCCs) is provided to perform signal feature extraction for the task of speaker accent recognition. Then different classifiers are compared based on the MFCC feature. For each…

Sound · Computer Science 2015-02-02 Zichen Ma , Ernest Fokoue

Improvement of Text Dependent Speaker Identification System Using Neuro-Genetic Hybrid Algorithm in Office Environmental Conditions

In this paper, an improved strategy for automated text dependent speaker identification system has been proposed in noisy environment. The identification process incorporates the Neuro- Genetic hybrid algorithm with cepstral based features.…

Sound · Computer Science 2009-09-15 Md. Rabiul Islam , Md. Fayzur Rahman

Pitch-synchronous DCT features: A pilot study on speaker identification

We propose a new feature, namely, pitchsynchronous discrete cosine transform (PS-DCT), for the task of speaker identification. These features are obtained directly from the voiced segments of the speech signal, without any preemphasis or…

Audio and Speech Processing · Electrical Eng. & Systems 2018-12-07 Amit Meghanani , A G Ramakrishnan

Speaker Re-identification with Speaker Dependent Speech Enhancement

While the use of deep neural networks has significantly boosted speaker recognition performance, it is still challenging to separate speakers in poor acoustic environments. Here speech enhancement methods have traditionally allowed improved…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-28 Yanpei Shi , Qiang Huang , Thomas Hain

A Novel Minimum Divergence Approach to Robust Speaker Identification

In this work, a novel solution to the speaker identification problem is proposed through minimization of statistical divergences between the probability distribution (g). of feature vectors from the test utterance and the probability…

Machine Learning · Statistics 2015-12-17 Ayanendranath Basu , Smarajit Bose , Amita Pal , Anish Mukherjee , Debasmita Das

Speaker Identification in each of the Neutral and Shouted Talking Environments based on Gender-Dependent Approach Using SPHMMs

It is well known that speaker identification performs extremely well in the neutral talking environments; however, the identification performance is declined sharply in the shouted talking environments. This work aims at proposing,…

Artificial Intelligence · Computer Science 2017-06-30 Ismail Shahin