Related papers: Histogram Transform-based Speaker Identification

Wavelet-Based Mel-Frequency Cepstral Coefficients for Speaker Identification using Hidden Markov Models

To improve the performance of speaker identification systems, an effective and robust method is proposed to extract speech features, capable of operating in noisy environment. Based on the time-frequency multi-resolution property of wavelet…

Sound · Computer Science 2010-03-31 Mahmoud I. Abdalla , Hanaa S. Ali

A Novel Windowing Technique for Efficient Computation of MFCC for Speaker Recognition

In this paper, we propose a novel family of windowing technique to compute Mel Frequency Cepstral Coefficient (MFCC) for automatic speaker recognition from speech. The proposed method is based on fundamental property of discrete time…

Computer Vision and Pattern Recognition · Computer Science 2015-06-05 Md. Sahidullah , Goutam Saha

Speaker Recognition -- Wavelet Packet Based Multiresolution Feature Extraction Approach

This paper proposes a novel Wavelet Packet based feature extraction approach for the task of text independent speaker recognition. The features are extracted by using the combination of Mel Frequency Cepstral Coefficient (MFCC) and Wavelet…

Sound · Computer Science 2025-12-25 Saurabh Bhardwaj , Smriti Srivastava , Abhishek Bhandari , Krit Gupta , Hitesh Bahl , J. R. P. Gupta

Speaker Identification using MFCC-Domain Support Vector Machine

Speech recognition and speaker identification are important for authentication and verification in security purpose, but they are difficult to achieve. Speaker identification methods can be divided into text-independent and text-dependent.…

Machine Learning · Computer Science 2010-09-28 S. M. Kamruzzaman , A. N. M. Rezaul Karim , Md. Saiful Islam , Md. Emdadul Haque

Pitch-synchronous DCT features: A pilot study on speaker identification

We propose a new feature, namely, pitchsynchronous discrete cosine transform (PS-DCT), for the task of speaker identification. These features are obtained directly from the voiced segments of the speech signal, without any preemphasis or…

Audio and Speech Processing · Electrical Eng. & Systems 2018-12-07 Amit Meghanani , A G Ramakrishnan

Text Independent Speaker Identification System for Access Control

Even human intelligence system fails to offer 100% accuracy in identifying speeches from a specific individual. Machine intelligence is trying to mimic humans in speaker identification problems through various approaches to speech feature…

Audio and Speech Processing · Electrical Eng. & Systems 2022-09-30 Oluyemi E. Adetoyi

The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices

Due to improvements in artificial intelligence, speaker identification (SI) technologies have brought a great direction and are now widely used in a variety of sectors. One of the most important components of SI is feature extraction, which…

Sound · Computer Science 2021-12-16 Noor Ahmad Al Hindawi , Ismail Shahin , Ali Bou Nassif

Speaker Verification Using Simple Temporal Features and Pitch Synchronous Cepstral Coefficients

Speaker verification is the process by which a speakers claim of identity is tested against a claimed speaker by his or her voice. Speaker verification is done by the use of some parameters (features) from the speakers voice which can be…

Sound · Computer Science 2019-08-16 Bhavana V. S , Pradip K. Das

Improving Performance of Speaker Identification System Using Complementary Information Fusion

Feature extraction plays an important role as a front-end processing block in speaker identification (SI) process. Most of the SI systems utilize like Mel-Frequency Cepstral Coefficients (MFCC), Perceptual Linear Prediction (PLP), Linear…

Sound · Computer Science 2015-03-19 Md. Sahidullah , Sandipan Chakroborty , Goutam Saha

Performance Evaluation of Statistical Approaches for Text Independent Speaker Recognition Using Source Feature

This paper introduces the performance evaluation of statistical approaches for TextIndependent speaker recognition system using source feature. Linear prediction LP residual is used as a representation of excitation information in speech.…

Computation and Language · Computer Science 2011-04-26 R. Rajeswara Rao , V. Kamakshi Prasad , A. Nagesh

Spoken Language Identification Using Hybrid Feature Extraction Methods

This paper introduces and motivates the use of hybrid robust feature extraction technique for spoken language identification (LID) system. The speech recognizers use a parametric form of a signal to get the most important distinguishable…

Sound · Computer Science 2010-03-31 Pawan Kumar , Astik Biswas , A . N. Mishra , Mahesh Chandra

Speaker Recognition with Random Digit Strings Using Uncertainty Normalized HMM-based i-vectors

In this paper, we combine Hidden Markov Models (HMMs) with i-vector extractors to address the problem of text-dependent speaker recognition with random digit strings. We employ digit-specific HMMs to segment the utterances into digits, to…

Audio and Speech Processing · Electrical Eng. & Systems 2019-07-16 Nooshin Maghsoodi , Hossein Sameti , Hossein Zeinali , Themos~Stafylakis

Automatic Speech Recognition Using Template Model for Man-Machine Interface

Speech is a natural form of communication for human beings, and computers with the ability to understand speech and speak with a human voice are expected to contribute to the development of more natural man-machine interfaces. Computers…

Sound · Computer Science 2013-05-15 Neema Mishra , Urmila Shrawankar , V M Thakare

A text-independent speaker verification model: A comparative analysis

The most pressing challenge in the field of voice biometrics is selecting the most efficient technique of speaker recognition. Every individual's voice is peculiar, factors like physical differences in vocal organs, accent and pronunciation…

Sound · Computer Science 2017-12-05 Rishi Charan , Manisha. A , Karthik. R , Rajesh Kumar M

Speech Emotion Recognition Using MFCC Features and LSTM-Based Deep Learning Model

Speech Emotion Recognition (SER) is the use of machines to detect the emotional state of humans based on the speech, which is gaining importance in natural human-computer interaction. Speech is a very valuable source of information, as…

Sound · Computer Science 2026-04-30 Adelekun Oluwademilade , Ademola Adedamola , Abiola Abdulhakeem , Akinpelu Azeezat , Eraiyetan Israel , Omotosho Oluwadunsin , Ibenye Ikechukwu , Ayuba Muhammad , Olusanya Olamide , Kamorudeen Amuda

The Secret Source : Incorporating Source Features to Improve Acoustic-to-Articulatory Speech Inversion

In this work, we incorporated acoustically derived source features, aperiodicity, periodicity and pitch as additional targets to an acoustic-to-articulatory speech inversion (SI) system. We also propose a Temporal Convolution based SI…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-01 Yashish M. Siriwardena , Carol Espy-Wilson

Learnable MFCCs for Speaker Verification

We propose a learnable mel-frequency cepstral coefficient (MFCC) frontend architecture for deep neural network (DNN) based automatic speaker verification. Our architecture retains the simplicity and interpretability of MFCC-based features…

Sound · Computer Science 2021-02-23 Xuechen Liu , Md Sahidullah , Tomi Kinnunen

Audio Signal Processing Using Time Domain Mel-Frequency Wavelet Coefficient

Extracting features from the speech is the most critical process in speech signal processing. Mel Frequency Cepstral Coefficients (MFCC) are the most widely used features in the majority of the speaker and speech recognition applications,…

Sound · Computer Science 2025-10-31 Rinku Sebastian , Simon O'Keefe , Martin Trefzer

Adaptive Frequency Cepstral Coefficients for Word Mispronunciation Detection

Systems based on automatic speech recognition (ASR) technology can provide important functionality in computer assisted language learning applications. This is a young but growing area of research motivated by the large number of students…

Sound · Computer Science 2016-02-29 Zhenhao Ge , Sudhendu R. Sharma , Mark J. T. Smith

Improvement of Text Dependent Speaker Identification System Using Neuro-Genetic Hybrid Algorithm in Office Environmental Conditions

In this paper, an improved strategy for automated text dependent speaker identification system has been proposed in noisy environment. The identification process incorporates the Neuro- Genetic hybrid algorithm with cepstral based features.…

Sound · Computer Science 2009-09-15 Md. Rabiul Islam , Md. Fayzur Rahman