Related papers: Weakly Supervised PLDA Training

Local Training for PLDA in Speaker Verification

PLDA is a popular normalization approach for the i-vector model, and it has delivered state-of-the-art performance in speaker verification. However, PLDA training requires a large amount of labeled development data, which is highly…

Sound · Computer Science 2016-09-28 Chenghui Zhao , Lantian Li , Dong Wang , April Pu

Weaker Than You Think: A Critical Look at Weakly Supervised Learning

Weakly supervised learning is a popular approach for training machine learning models in low-resource settings. Instead of requesting high-quality yet costly human annotations, it allows training models with noisy annotations obtained from…

Computation and Language · Computer Science 2023-09-19 Dawei Zhu , Xiaoyu Shen , Marius Mosbach , Andreas Stephan , Dietrich Klakow

A Speaker Verification Backend with Robust Performance across Conditions

In this paper, we address the problem of speaker verification in conditions unseen or unknown during development. A standard method for speaker verification consists of extracting speaker embeddings with a deep neural network and processing…

Sound · Computer Science 2021-08-18 Luciana Ferrer , Mitchell McLaren , Niko Brummer

Multiobjective Optimization Training of PLDA for Speaker Verification

Most current state-of-the-art text-independent speaker verification systems take probabilistic linear discriminant analysis (PLDA) as their backend classifiers. The parameters of PLDA are often estimated by maximizing the objective…

Sound · Computer Science 2018-11-13 Liang He , Xianhong Chen , Can Xu , Jia Liu

Blind score normalization method for PLDA based speaker recognition

Probabilistic Linear Discriminant Analysis (PLDA) has become state-of-the-art method for modeling $i$-vector space in speaker recognition task. However the performance degradation is observed if enrollment data size differs from one speaker…

Computation and Language · Computer Science 2016-02-24 Danila Doroshin , Nikolay Lubimov , Marina Nastasenko , Mikhail Kotov

Data Consistency for Weakly Supervised Learning

In many applications, training machine learning models involves using large amounts of human-annotated data. Obtaining precise labels for the data is expensive. Instead, training with weak supervision provides a low-cost alternative. We…

Machine Learning · Computer Science 2022-02-09 Chidubem Arachie , Bert Huang

Weakly Supervised Label Learning Flows

Supervised learning usually requires a large amount of labelled data. However, attaining ground-truth labels is costly for many tasks. Alternatively, weakly supervised methods learn with cheap weak signals that only approximately label some…

Machine Learning · Computer Science 2024-11-26 You Lu , Wenzhuo Song , Chidubem Arachie , Bert Huang

Pairwise Discriminative Neural PLDA for Speaker Verification

The state-of-art approach to speaker verification involves the extraction of discriminative embeddings like x-vectors followed by a generative model back-end using a probabilistic linear discriminant analysis (PLDA). In this paper, we…

Audio and Speech Processing · Electrical Eng. & Systems 2020-02-10 Shreyas Ramoji , Prashant Krishnan , Prachi Singh , Sriram Ganapathy

A Study on Decoupled Probabilistic Linear Discriminant Analysis

Probabilistic linear discriminant analysis (PLDA) has broad application in open-set verification tasks, such as speaker verification. A key concern for PLDA is that the model is too simple (linear Gaussian) to deal with complicated data;…

Sound · Computer Science 2021-11-25 Di Wang , Lantian Li , Hongzhi Yu , Dong Wang

Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels

Iterative self-training, or iterative pseudo-labeling (IPL) -- using an improved model from the current iteration to provide pseudo-labels for the next iteration -- has proven to be a powerful approach to enhance the quality of speaker…

Audio and Speech Processing · Electrical Eng. & Systems 2025-01-22 Zakaria Aldeneh , Takuya Higuchi , Jee-weon Jung , Li-Wei Chen , Stephen Shum , Ahmed Hussen Abdelaziz , Shinji Watanabe , Tatiana Likhomanenko , Barry-John Theobald

PLDA-Based Diarization of Telephone Conversations

This paper investigates the application of the probabilistic linear discriminant analysis (PLDA) to speaker diarization of telephone conversations. We introduce using a variational Bayes (VB) approach for inference under a PLDA model for…

Audio and Speech Processing · Electrical Eng. & Systems 2017-10-03 Ahmet E. Bulut , Hakan Demir , Yusuf Ziya Isik , Hakan Erdogan

Weakly Supervised Training of Speaker Identification Models

We propose an approach for training speaker identification models in a weakly supervised manner. We concentrate on the setting where the training data consists of a set of audio recordings and the speaker annotation is provided only at the…

Sound · Computer Science 2018-06-25 Martin Karu , Tanel Alumäe

Domain adaptation based Speaker Recognition on Short Utterances

This paper explores how the in- and out-domain probabilistic linear discriminant analysis (PLDA) speaker verification behave when enrolment and verification lengths are reduced. Experiment studies have found that when full-length utterance…

Sound · Computer Science 2016-10-12 Ahilan Kanagasundaram , David Dean , Sridha Sridharan , Clinton Fookes

Reward Modeling with Weak Supervision for Language Models

Recent advancements in large language models (LLMs) have led to their increased application across various tasks, with reinforcement learning from human feedback (RLHF) being a crucial part of their training to align responses with user…

Computation and Language · Computer Science 2024-10-29 Ben Hauptvogel , Malte Ostendorff , Georg Rehm , Sebastian Möller

The CORAL+ Algorithm for Unsupervised Domain Adaptation of PLDA

State-of-the-art speaker recognition systems comprise an x-vector (or i-vector) speaker embedding front-end followed by a probabilistic linear discriminant analysis (PLDA) backend. The effectiveness of these components relies on the…

Machine Learning · Computer Science 2020-04-22 Kong Aik Lee , Qiongqiong Wang , Takafumi Koshinaka

Self-Training with Weak Supervision

State-of-the-art deep neural networks require large-scale labeled training data that is often expensive to obtain or not available for many tasks. Weak supervision in the form of domain-specific rules has been shown to be useful in such…

Computation and Language · Computer Science 2021-04-13 Giannis Karamanolakis , Subhabrata Mukherjee , Guoqing Zheng , Ahmed Hassan Awadallah

Unsupervised Adaptation of SPLDA

State-of-the-art speaker recognition relays on models that need a large amount of training data. This models are successful in tasks like NIST SRE because there is sufficient data available. However, in real applications, we usually do not…

Machine Learning · Statistics 2015-11-25 Jesús Villalba

Label-Efficient Self-Supervised Speaker Verification With Information Maximization and Contrastive Learning

State-of-the-art speaker verification systems are inherently dependent on some kind of human supervision as they are trained on massive amounts of labeled data. However, manually annotating utterances is slow, expensive and not scalable to…

Audio and Speech Processing · Electrical Eng. & Systems 2025-06-25 Théo Lepage , Réda Dehak

Joint Probabilistic Linear Discriminant Analysis

Standard probabilistic linear discriminant analysis (PLDA) for speaker recognition assumes that the sample's features (usually, i-vectors) are given by a sum of three terms: a term that depends on the speaker identity, a term that models…

Machine Learning · Computer Science 2018-01-17 Luciana Ferrer

Limitations of weak labels for embedding and tagging

Many datasets and approaches in ambient sound analysis use weakly labeled data.Weak labels are employed because annotating every data sample with a strong label is too expensive.Yet, their impact on the performance in comparison to strong…

Sound · Computer Science 2020-12-08 Nicolas Turpault , Romain Serizel , Emmanuel Vincent