Related papers: Explainable Attribute-Based Speaker Verification

Study on the Fairness of Speaker Verification Systems on Underrepresented Accents in English

Speaker verification (SV) systems are currently being used to make sensitive decisions like giving access to bank accounts or deciding whether the voice of a suspect coincides with that of the perpetrator of a crime. Ensuring that these…

Audio and Speech Processing · Electrical Eng. & Systems 2025-11-18 Mariel Estevez , Luciana Ferrer

Vo-Ve: An Explainable Voice-Vector for Speaker Identity Evaluation

In this paper, we propose Vo-Ve, a novel voice-vector embedding that captures speaker identity. Unlike conventional speaker embeddings, Vo-Ve is explainable, as it contains the probabilities of explicit voice attribute classes. Through…

Sound · Computer Science 2025-06-25 Jaejun Lee , Kyogu Lee

Design Guidelines for Inclusive Speaker Verification Evaluation Datasets

Speaker verification (SV) provides billions of voice-enabled devices with access control, and ensures the security of voice-driven technologies. As a type of biometrics, it is necessary that SV is unbiased, with consistent and reliable…

Audio and Speech Processing · Electrical Eng. & Systems 2022-09-14 Wiebke Toussaint Hutiri , Lauriane Gorce , Aaron Yi Ding

An Age-Agnostic System for Robust Speaker Verification

In speaker verification (SV), the acoustic mismatch between children's and adults' speech leads to suboptimal performance when adult-trained SV systems are applied to children's speaker verification (C-SV). While domain adaptation…

Audio and Speech Processing · Electrical Eng. & Systems 2025-08-05 Jiusi Zheng , Vishwas Shetty , Natarajan Balaji Shankar , Abeer Alwan

Speaker Verification Using Simple Temporal Features and Pitch Synchronous Cepstral Coefficients

Speaker verification is the process by which a speakers claim of identity is tested against a claimed speaker by his or her voice. Speaker verification is done by the use of some parameters (features) from the speakers voice which can be…

Sound · Computer Science 2019-08-16 Bhavana V. S , Pradip K. Das

Speaker Verification in Multi-Speaker Environments Using Temporal Feature Fusion

Verifying the identity of a speaker is crucial in modern human-machine interfaces, e.g., to ensure privacy protection or to enable biometric authentication. Classical speaker verification (SV) approaches estimate a fixed-dimensional…

Audio and Speech Processing · Electrical Eng. & Systems 2022-06-29 Ahmad Aloradi , Wolfgang Mack , Mohamed Elminshawi , Emanuël A. P. Habets

Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation

In speech technologies, speaker's voice representation is used in many applications such as speech recognition, voice conversion, speech synthesis and, obviously, user authentication. Modern vocal representations of the speaker are based on…

Audio and Speech Processing · Electrical Eng. & Systems 2021-06-17 Paul-Gauthier Noé , Mohammad Mohammadamini , Driss Matrouf , Titouan Parcollet , Andreas Nautsch , Jean-François Bonastre

Towards an Interpretable Representation of Speaker Identity via Perceptual Voice Qualities

Unlike other data modalities such as text and vision, speech does not lend itself to easy interpretation. While lay people can understand how to describe an image or sentence via perception, non-expert descriptions of speech often end at…

Sound · Computer Science 2023-10-05 Robin Netzorg , Bohan Yu , Andrea Guzman , Peter Wu , Luna McNulty , Gopala Anumanchipalli

Explaining Speaker and Spoof Embeddings via Probing

This study investigates the explainability of embedding representations, specifically those used in modern audio spoofing detection systems based on deep neural networks, known as spoof embeddings. Building on established work in speaker…

Sound · Computer Science 2024-12-25 Xuechen Liu , Junichi Yamagishi , Md Sahidullah , Tomi kinnunen

ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification

In speaker verification, we use computational method to verify if an utterance matches the identity of an enrolled speaker. This task is similar to the manual task of forensic voice comparison, where linguistic analysis is combined with…

Sound · Computer Science 2025-01-15 Yi Ma , Shuai Wang , Tianchi Liu , Haizhou Li

PhiNet: Speaker Verification with Phonetic Interpretability

Despite remarkable progress, automatic speaker verification (ASV) systems typically lack the transparency required for high-accountability applications. Motivated by how human experts perform forensic speaker comparison (FSC), we propose a…

Audio and Speech Processing · Electrical Eng. & Systems 2026-04-07 Yi Ma , Shuai Wang , Tianchi Liu , Haizhou Li

Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?

Recently, self-supervised learning (SSL) has demonstrated strong performance in speaker recognition, even if the pre-training objective is designed for speech recognition. In this paper, we study which factor leads to the success of…

Computation and Language · Computer Science 2022-06-28 Sanyuan Chen , Yu Wu , Chengyi Wang , Shujie Liu , Zhuo Chen , Peidong Wang , Gang Liu , Jinyu Li , Jian Wu , Xiangzhan Yu , Furu Wei

Who is Speaking Actually? Robust and Versatile Speaker Traceability for Voice Conversion

Voice conversion (VC), as a voice style transfer technology, is becoming increasingly prevalent while raising serious concerns about its illegal use. Proactively tracing the origins of VC-generated speeches, i.e., speaker traceability, can…

Sound · Computer Science 2023-07-27 Yanzhen Ren , Hongcheng Zhu , Liming Zhai , Zongkun Sun , Rubing Shen , Lina Wang

An Explainable Probabilistic Attribute Embedding Approach for Spoofed Speech Characterization

We propose a novel approach for spoofed speech characterization through explainable probabilistic attribute embeddings. In contrast to high-dimensional raw embeddings extracted from a spoofing countermeasure (CM) whose dimensions are not…

Audio and Speech Processing · Electrical Eng. & Systems 2024-09-18 Manasi Chhibber , Jagabandhu Mishra , Hyejin Shim , Tomi H. Kinnunen

Improving speaker verification robustness with synthetic emotional utterances

A speaker verification (SV) system offers an authentication service designed to confirm whether a given speech sample originates from a specific speaker. This technology has paved the way for various personalized applications that cater to…

Sound · Computer Science 2024-12-03 Nikhil Kumar Koditala , Chelsea Jui-Ting Ju , Ruirui Li , Minho Jin , Aman Chadha , Andreas Stolcke

SEEF-ALDR: A Speaker Embedding Enhancement Framework via Adversarial Learning based Disentangled Representation

Speaker verification, as a biometric authentication mechanism, has been widely used due to the pervasiveness of voice control on smart devices. However, the task of "in-the-wild" speaker verification is still challenging, considering the…

Audio and Speech Processing · Electrical Eng. & Systems 2020-10-27 Jianwei Tai , Xiaoqi Jia , Qingjia Huang , Weijuan Zhang , Haichao Du , Shengzhi Zhang

Bias in Automated Speaker Recognition

Automated speaker recognition uses data processing to identify speakers by their voice. Today, automated speaker recognition is deployed on billions of smart devices and in services such as call centres. Despite their wide-scale deployment…

Sound · Computer Science 2022-06-22 Wiebke Toussaint Hutiri , Aaron Ding

Speaker Verification using Convolutional Neural Networks

In this paper, a novel Convolutional Neural Network architecture has been developed for speaker verification in order to simultaneously capture and discard speaker and non-speaker information, respectively. In training phase, the network is…

Audio and Speech Processing · Electrical Eng. & Systems 2018-08-13 Hossein Salehghaffari

Speaker Re-identification with Speaker Dependent Speech Enhancement

While the use of deep neural networks has significantly boosted speaker recognition performance, it is still challenging to separate speakers in poor acoustic environments. Here speech enhancement methods have traditionally allowed improved…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-28 Yanpei Shi , Qiang Huang , Thomas Hain

SpeakerLLM: A Speaker-Specialized Audio-LLM for Speaker Understanding and Verification Reasoning

As audio-first agents become increasingly common in physical AI, conversational robots, and screenless wearables, audio large language models (audio-LLMs) must integrate speaker-specific understanding to support user authorization,…

Sound · Computer Science 2026-05-15 KiHyun Nam , Jungwoo Heo , Siu Bae , Ha-Jin Yu , Joon Son Chung