English
Related papers

Related papers: Segmental Contrastive Predictive Coding for Unsupe…

200 papers

Typically, unsupervised segmentation of speech into the phone and word-like units are treated as separate tasks and are often done via different methods which do not fully leverage the inter-dependence of the two tasks. Here, we unify them…

Audio and Speech Processing · Electrical Eng. & Systems 2021-10-12 Saurabhchand Bhati , Jesús Villalba , Piotr Żelasko , Laureano Moro-Velazquez , Najim Dehak

We investigate the performance on phoneme categorization and phoneme and word segmentation of several self-supervised learning (SSL) methods based on Contrastive Predictive Coding (CPC). Our experiments show that with the existing…

Phoneme boundary detection plays an essential first step for a variety of speech processing applications such as speaker diarization, speech science, keyword spotting, etc. In this work, we propose a neural architecture coupled with a…

Audio and Speech Processing · Electrical Eng. & Systems 2020-02-18 Felix Kreuk , Yaniv Sheena , Joseph Keshet , Yossi Adi

To extract robust deep representations from long sequential modeling of speech data, we propose a self-supervised learning approach, namely Contrastive Separative Coding (CSC). Our key finding is to learn such representations by separating…

Audio and Speech Processing · Electrical Eng. & Systems 2021-03-02 Jun Wang , Max W. Y. Lam , Dan Su , Dong Yu

Many speech processing tasks involve measuring the acoustic similarity between speech segments. Acoustic word embeddings (AWE) allow for efficient comparisons by mapping speech segments of arbitrary duration to fixed-dimensional vectors.…

Computation and Language · Computer Science 2020-12-15 Lisa van Staden , Herman Kamper

Neural network models using predictive coding are interesting from the viewpoint of computational modelling of human language acquisition, where the objective is to understand how linguistic units could be learned from speech without any…

Computation and Language · Computer Science 2020-07-09 María Andrea Cruz Blandón , Okko Räsänen

This thesis describes our ongoing work on Contrastive Predictive Coding (CPC) features for speaker verification. CPC is a recently proposed representation learning framework based on predictive coding and noise contrastive estimation. We…

Computation and Language · Computer Science 2019-04-04 Cheng-I Lai

We propose a self-supervised representation learning model for the task of unsupervised phoneme boundary detection. The model is a convolutional neural network that operates directly on the raw waveform. It is optimized to identify spectral…

Audio and Speech Processing · Electrical Eng. & Systems 2020-08-07 Felix Kreuk , Joseph Keshet , Yossi Adi

Phonemic segmentation of speech is a critical step of speech recognition systems. We propose a novel unsupervised algorithm based on sequence prediction models such as Markov chains and recurrent neural network. Our approach consists in…

Computation and Language · Computer Science 2017-05-30 Paul Michel , Okko Räsänen , Roland Thiollière , Emmanuel Dupoux

Segments that span contiguous parts of inputs, such as phonemes in speech, named entities in sentences, actions in videos, occur frequently in sequence prediction problems. Segmental models, a class of models that explicitly hypothesizes…

Computation and Language · Computer Science 2018-06-14 Hao Tang

Speech segmentation at both word and phoneme levels is crucial for various speech processing tasks. It significantly aids in extracting meaningful units from an utterance, thus enabling the generation of discrete elements. In this work we…

Machine Learning · Computer Science 2024-11-18 Simone Carnemolla , Salvatore Calcagno , Simone Palazzo , Daniela Giordano

We investigate the possibility of forcing a self-supervised model trained using a contrastive predictive loss to extract slowly varying latent representations. Rather than producing individual predictions for each of the future…

To overcome the data-hungry challenge, we have proposed a semi-supervised contrastive learning framework for the task of class-imbalanced semantic segmentation. First and foremost, to make the model operate in a semi-supervised manner, we…

Computer Vision and Pattern Recognition · Computer Science 2022-11-29 Kangcheng Liu

The success of deep learning comes from its ability to capture the hierarchical structure of data by learning high-level representations defined in terms of low-level ones. In this paper we explore self-supervised learning of hierarchical…

Unsupervised models of representations based on Contrastive Predictive Coding (CPC)[1] are primarily used in spoken language modelling in that they encode phonetic information. In this study, we ask what other types of information are…

Audio and Speech Processing · Electrical Eng. & Systems 2023-06-02 Maureen de Seyssel , Marvin Lavechin , Yossi Adi , Emmanuel Dupoux , Guillaume Wisniewski

We look at the long-standing problem of segmenting unlabeled speech into word-like segments and clustering these into a lexicon. Several previous methods use a scoring model coupled with dynamic programming to find an optimal segmentation.…

Audio and Speech Processing · Electrical Eng. & Systems 2025-01-14 Simon Malan , Benjamin van Niekerk , Herman Kamper

Contrastive predictive coding (CPC) aims to learn representations of speech by distinguishing future observations from a set of negative examples. Previous work has shown that linear classifiers trained on CPC features can accurately…

Audio and Speech Processing · Electrical Eng. & Systems 2021-08-03 Benjamin van Niekerk , Leanne Nortje , Matthew Baas , Herman Kamper

Weakly Supervised Semantic Segmentation (WSSS) with image-level labels has gained attention for its cost-effectiveness. Most existing methods emphasize inter-class separation, often neglecting the shared semantics among related categories…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Wangyu Wu , Zhenhong Chen , Xiaowen Ma , Wenqiao Zhang , Xianglin Qiu , Siqi Song , Xiaowei Huang , Fei Ma , Jimin Xiao

This work presents a novel approach for semi-supervised semantic segmentation. The key element of this approach is our contrastive learning module that enforces the segmentation network to yield similar pixel-level feature representations…

Computer Vision and Pattern Recognition · Computer Science 2021-08-09 Inigo Alonso , Alberto Sabater , David Ferstl , Luis Montesano , Ana C. Murillo

Contrastive Predictive Coding (CPC) is a representation learning method that maximizes the mutual information between intermediate latent representations and the output of a given model. It can be used to effectively initialize the encoder…

Computation and Language · Computer Science 2023-02-06 Aparna Khare , Minhua Wu , Saurabhchand Bhati , Jasha Droppo , Roland Maas
‹ Prev 1 2 3 10 Next ›