English
Related papers

Related papers: Explainable Attribute-Based Speaker Verification

200 papers

Speaker verification (SV) systems are currently being used to make sensitive decisions like giving access to bank accounts or deciding whether the voice of a suspect coincides with that of the perpetrator of a crime. Ensuring that these…

Audio and Speech Processing · Electrical Eng. & Systems 2025-11-18 Mariel Estevez , Luciana Ferrer

In this paper, we propose Vo-Ve, a novel voice-vector embedding that captures speaker identity. Unlike conventional speaker embeddings, Vo-Ve is explainable, as it contains the probabilities of explicit voice attribute classes. Through…

Sound · Computer Science 2025-06-25 Jaejun Lee , Kyogu Lee

Speaker verification (SV) provides billions of voice-enabled devices with access control, and ensures the security of voice-driven technologies. As a type of biometrics, it is necessary that SV is unbiased, with consistent and reliable…

Audio and Speech Processing · Electrical Eng. & Systems 2022-09-14 Wiebke Toussaint Hutiri , Lauriane Gorce , Aaron Yi Ding

In speaker verification (SV), the acoustic mismatch between children's and adults' speech leads to suboptimal performance when adult-trained SV systems are applied to children's speaker verification (C-SV). While domain adaptation…

Audio and Speech Processing · Electrical Eng. & Systems 2025-08-05 Jiusi Zheng , Vishwas Shetty , Natarajan Balaji Shankar , Abeer Alwan

Speaker verification is the process by which a speakers claim of identity is tested against a claimed speaker by his or her voice. Speaker verification is done by the use of some parameters (features) from the speakers voice which can be…

Sound · Computer Science 2019-08-16 Bhavana V. S , Pradip K. Das

Verifying the identity of a speaker is crucial in modern human-machine interfaces, e.g., to ensure privacy protection or to enable biometric authentication. Classical speaker verification (SV) approaches estimate a fixed-dimensional…

Audio and Speech Processing · Electrical Eng. & Systems 2022-06-29 Ahmad Aloradi , Wolfgang Mack , Mohamed Elminshawi , Emanuël A. P. Habets

In speech technologies, speaker's voice representation is used in many applications such as speech recognition, voice conversion, speech synthesis and, obviously, user authentication. Modern vocal representations of the speaker are based on…

Audio and Speech Processing · Electrical Eng. & Systems 2021-06-17 Paul-Gauthier Noé , Mohammad Mohammadamini , Driss Matrouf , Titouan Parcollet , Andreas Nautsch , Jean-François Bonastre

Unlike other data modalities such as text and vision, speech does not lend itself to easy interpretation. While lay people can understand how to describe an image or sentence via perception, non-expert descriptions of speech often end at…

Sound · Computer Science 2023-10-05 Robin Netzorg , Bohan Yu , Andrea Guzman , Peter Wu , Luna McNulty , Gopala Anumanchipalli

This study investigates the explainability of embedding representations, specifically those used in modern audio spoofing detection systems based on deep neural networks, known as spoof embeddings. Building on established work in speaker…

Sound · Computer Science 2024-12-25 Xuechen Liu , Junichi Yamagishi , Md Sahidullah , Tomi kinnunen

In speaker verification, we use computational method to verify if an utterance matches the identity of an enrolled speaker. This task is similar to the manual task of forensic voice comparison, where linguistic analysis is combined with…

Sound · Computer Science 2025-01-15 Yi Ma , Shuai Wang , Tianchi Liu , Haizhou Li

Despite remarkable progress, automatic speaker verification (ASV) systems typically lack the transparency required for high-accountability applications. Motivated by how human experts perform forensic speaker comparison (FSC), we propose a…

Audio and Speech Processing · Electrical Eng. & Systems 2026-04-07 Yi Ma , Shuai Wang , Tianchi Liu , Haizhou Li

Recently, self-supervised learning (SSL) has demonstrated strong performance in speaker recognition, even if the pre-training objective is designed for speech recognition. In this paper, we study which factor leads to the success of…

Computation and Language · Computer Science 2022-06-28 Sanyuan Chen , Yu Wu , Chengyi Wang , Shujie Liu , Zhuo Chen , Peidong Wang , Gang Liu , Jinyu Li , Jian Wu , Xiangzhan Yu , Furu Wei

Voice conversion (VC), as a voice style transfer technology, is becoming increasingly prevalent while raising serious concerns about its illegal use. Proactively tracing the origins of VC-generated speeches, i.e., speaker traceability, can…

Sound · Computer Science 2023-07-27 Yanzhen Ren , Hongcheng Zhu , Liming Zhai , Zongkun Sun , Rubing Shen , Lina Wang

We propose a novel approach for spoofed speech characterization through explainable probabilistic attribute embeddings. In contrast to high-dimensional raw embeddings extracted from a spoofing countermeasure (CM) whose dimensions are not…

Audio and Speech Processing · Electrical Eng. & Systems 2024-09-18 Manasi Chhibber , Jagabandhu Mishra , Hyejin Shim , Tomi H. Kinnunen

A speaker verification (SV) system offers an authentication service designed to confirm whether a given speech sample originates from a specific speaker. This technology has paved the way for various personalized applications that cater to…

Speaker verification, as a biometric authentication mechanism, has been widely used due to the pervasiveness of voice control on smart devices. However, the task of "in-the-wild" speaker verification is still challenging, considering the…

Audio and Speech Processing · Electrical Eng. & Systems 2020-10-27 Jianwei Tai , Xiaoqi Jia , Qingjia Huang , Weijuan Zhang , Haichao Du , Shengzhi Zhang

Automated speaker recognition uses data processing to identify speakers by their voice. Today, automated speaker recognition is deployed on billions of smart devices and in services such as call centres. Despite their wide-scale deployment…

Sound · Computer Science 2022-06-22 Wiebke Toussaint Hutiri , Aaron Ding

In this paper, a novel Convolutional Neural Network architecture has been developed for speaker verification in order to simultaneously capture and discard speaker and non-speaker information, respectively. In training phase, the network is…

Audio and Speech Processing · Electrical Eng. & Systems 2018-08-13 Hossein Salehghaffari

While the use of deep neural networks has significantly boosted speaker recognition performance, it is still challenging to separate speakers in poor acoustic environments. Here speech enhancement methods have traditionally allowed improved…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-28 Yanpei Shi , Qiang Huang , Thomas Hain

As audio-first agents become increasingly common in physical AI, conversational robots, and screenless wearables, audio large language models (audio-LLMs) must integrate speaker-specific understanding to support user authorization,…

Sound · Computer Science 2026-05-15 KiHyun Nam , Jungwoo Heo , Siu Bae , Ha-Jin Yu , Joon Son Chung
‹ Prev 1 2 3 10 Next ›