Related papers: Learning Repeatable Speech Embeddings Using An Int…

200 papers

Noise robustness in keyword spotting remains a challenge as many models fail to overcome the heavy influence of noises, causing the deterioration of the quality of feature embeddings. We proposed a contrastive regularization method called…

Sound · Computer Science 2022-09-15 Dianwen Ng , Jia Qi Yip , Tanmay Surana , Zhao Yang , Chong Zhang , Yukun Ma , Chongjia Ni , Eng Siong Chng , Bin Ma

Over the recent years, various deep learning-based methods were proposed for extracting a fixed-dimensional embedding vector from speech signals. Although the deep learning-based embedding extraction methods have shown good performance in…

Audio and Speech Processing · Electrical Eng. & Systems 2021-12-08 Woo Hyun Kang , Jahangir Alam , Abderrahim Fathan

Contrastive learning-based methods, such as unsup-SimCSE, have achieved state-of-the-art (SOTA) performances in learning unsupervised sentence embeddings. However, in previous studies, each embedding used for contrastive learning only…

Computation and Language · Computer Science 2023-05-19 Hongliang He , Junlei Zhang , Zhenzhong Lan , Yue Zhang

Despite rapid advances in speech recognition, current models remain brittle to superficial perturbations to their inputs. Small amounts of noise can destroy the performance of an otherwise state-of-the-art model. To harden models against…

Audio and Speech Processing · Electrical Eng. & Systems 2018-07-19 Davis Liang , Zhiheng Huang , Zachary C. Lipton

Consistency regularization is a commonly-used technique for semi-supervised and self-supervised learning. It is an auxiliary objective function that encourages the prediction of the network to be similar in the vicinity of the observed…

Machine Learning · Computer Science 2021-10-05 Erik Englesson , Hossein Azizpour

Continual learning of deep neural networks is a key requirement for scaling them up to more complex applicative scenarios and for achieving real lifelong learning of these architectures. Previous approaches to the problem have considered…

Machine Learning · Computer Science 2020-06-25 Jary Pomponi , Simone Scardapane , Vincenzo Lomonaco , Aurelio Uncini

In this paper, we propose an effective training strategy to extract robust speaker representations from a speech signal. One of the key challenges in speaker recognition tasks is to learn latent representations or embeddings containing…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-05 Yoohwan Kwon , Soo-Whan Chung , Hong-Goo Kang

Deep neural networks suffer from catastrophic forgetting, where performance on previous tasks degrades after training on a new task. This issue arises due to the model's tendency to overwrite previously acquired knowledge with new…

Machine Learning · Computer Science 2025-12-02 Lama Alssum , Hasan Abed Al Kader Hammoud , Motasem Alfarra , Juan C Leon Alcazar , Bernard Ghanem

Word embeddings have been shown to benefit from ensembling several word embedding sources, often carried out using straightforward mathematical operations over the set of word vectors. More recently, self-supervised learning has been used…

Computation and Language · Computer Science 2020-01-27 James O'Neill , Danushka Bollegala

To extract robust deep representations from long sequential modeling of speech data, we propose a self-supervised learning approach, namely Contrastive Separative Coding (CSC). Our key finding is to learn such representations by separating…

Audio and Speech Processing · Electrical Eng. & Systems 2021-03-02 Jun Wang , Max W. Y. Lam , Dan Su , Dong Yu

Contrastive Learning (CL) has been proved to be a powerful self-supervised approach for a wide range of domains, including computer vision and graph representation learning. However, the incremental learning issue of CL has rarely been…

Machine Learning · Computer Science 2023-01-31 Cheng Ji , Jianxin Li , Hao Peng , Jia Wu , Xingcheng Fu , Qingyun Sun , Phillip S. Yu

While promising performance for speaker verification has been achieved by deep speaker embeddings, the advantage would reduce in the case of speaking-style variability. Speaking rate mismatch is often observed in practical speaker…

Audio and Speech Processing · Electrical Eng. & Systems 2022-05-31 Fuchuan Tong , Siqi Zheng , Haodong Zhou , Xingjia Xie , Qingyang Hong , Lin Li

In forecasting multiple time series, accounting for the individual features of each sequence can be challenging. To address this, modern deep learning methods for time series analysis combine a shared (global) model with local layers,…

Machine Learning · Computer Science 2025-02-14 Luca Butera , Giovanni De Felice , Andrea Cini , Cesare Alippi

Data augmentation is used in machine learning to make the classifier invariant to label-preserving transformations. Usually this invariance is only encouraged implicitly by including a single augmented input during training. However,…

Machine Learning · Computer Science 2022-03-08 Aleksander Botev , Matthias Bauer , Soham De

In this paper, we address the problem of speaker recognition in challenging acoustic conditions using a novel method to extract robust speaker-discriminative speech representations. We adopt a recently proposed unsupervised adversarial…

Audio and Speech Processing · Electrical Eng. & Systems 2019-11-05 Raghuveer Peri , Monisankha Pal , Arindam Jati , Krishna Somandepalli , Shrikanth Narayanan

Self-supervised methods such as Contrastive Predictive Coding (CPC) have greatly improved the quality of the unsupervised representations. These representations significantly reduce the amount of labeled data needed for downstream task…

Audio and Speech Processing · Electrical Eng. & Systems 2023-04-27 Saurabhchand Bhati , Jesús Villalba , Piotr Żelasko , Laureano Moro-Velazquez , Najim Dehak

Consistency regularization (CR), which enforces agreement between model predictions on augmented views, has found recent benefits in automatic speech recognition [1]. In this paper, we propose the use of consistency regularization for audio…

Sound · Computer Science 2025-09-15 Shanmuka Sadhu , Weiran Wang

Consistency regularization is a commonly used practice to encourage the model to generate consistent representation from distorted input features and improve model generalization. It shows significant improvement on various speech…

Computation and Language · Computer Science 2024-11-12 Cindy Tseng , Yun Tang , Vijendra Raj Apsingekar

Contrastive self-supervised learning (CSL) with a prototypical regularization has been introduced in learning meaningful representations for downstream tasks that require strong semantic information. However, to optimize CSL with a loss…

Computer Vision and Pattern Recognition · Computer Science 2022-10-20 Shentong Mo , Zhun Sun , Chao Li

We employ triplet loss as a feature embedding regularizer to boost classification performance. Standard architectures, like ResNet and Inception, are extended to support both losses with minimal hyper-parameter tuning. This promotes…

Computer Vision and Pattern Recognition · Computer Science 2020-03-03 Ahmed Taha , Yi-Ting Chen , Teruhisa Misu , Abhinav Shrivastava , Larry Davis