English
Related papers

Related papers: Unsupervised Word Segmentation Using Temporal Grad…

200 papers

In settings where only unlabelled speech data is available, speech technology needs to be developed without transcriptions, pronunciation dictionaries, or language modelling text. A similar problem is faced when modelling infant language…

Computation and Language · Computer Science 2016-03-10 Herman Kamper , Aren Jansen , Sharon Goldwater

Semantic segmentation is an important and popular research area in computer vision that focuses on classifying pixels in an image based on their semantics. However, supervised deep learning requires large amounts of data to train models and…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Lingyan Ran , Yali Li , Guoqiang Liang , Yanning Zhang

The crux of semi-supervised semantic segmentation is to assign adequate pseudo-labels to the pixels of unlabeled images. A common practice is to select the highly confident predictions as the pseudo ground-truth, but it leads to a problem…

Computer Vision and Pattern Recognition · Computer Science 2022-03-15 Yuchao Wang , Haochen Wang , Yujun Shen , Jingjing Fei , Wei Li , Guoqiang Jin , Liwei Wu , Rui Zhao , Xinyi Le

We propose the application of a semi-supervised learning method to improve the performance of acoustic modelling for automatic speech recognition based on deep neural net- works. As opposed to unsupervised initialisation followed by…

Machine Learning · Statistics 2016-10-04 Akash Kumar Dhaka , Giampiero Salvi

The lack of labeled data is a common challenge in speech classification tasks, particularly those requiring extensive subjective assessment, such as cognitive state classification. In this work, we propose a Semi-Supervised Learning (SSL)…

Audio and Speech Processing · Electrical Eng. & Systems 2025-05-01 Yuanchao Li , Zixing Zhang , Jing Han , Peter Bell , Catherine Lai

In the recent trend of semi-supervised speech recognition, both self-supervised representation learning and pseudo-labeling have shown promising results. In this paper, we propose a novel approach to combine their ideas for end-to-end…

Audio and Speech Processing · Electrical Eng. & Systems 2021-10-11 Shaoshi Ling , Chen Shen , Meng Cai , Zejun Ma

The Audio-Visual Video Parsing task aims to identify and temporally localize the events that occur in either or both the audio and visual streams of audible videos. It often performs in a weakly-supervised manner, where only video event…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Jinxing Zhou , Dan Guo , Yiran Zhong , Meng Wang

Semantic segmentation is a key computer vision task that has been actively researched for decades. In recent years, supervised methods have reached unprecedented accuracy, however they require many pixel-level annotations for every new…

Computer Vision and Pattern Recognition · Computer Science 2021-12-07 Nir Zabari , Yedid Hoshen

Being able to segment unseen classes not observed during training is an important technical challenge in deep learning, because of its potential to reduce the expensive annotation required for semantic segmentation. Prior zero-label…

Computer Vision and Pattern Recognition · Computer Science 2021-04-26 Giuseppe Pastore , Fabio Cermelli , Yongqin Xian , Massimiliano Mancini , Zeynep Akata , Barbara Caputo

Existing weakly or semi-supervised semantic segmentation methods utilize image or box-level supervision to generate pseudo-labels for weakly labeled images. However, due to the lack of strong supervision, the generated pseudo-labels are…

Computer Vision and Pattern Recognition · Computer Science 2021-10-22 Md Amirul Islam , Matthew Kowal , Sen Jia , Konstantinos G. Derpanis , Neil D. B. Bruce

In conventional supervised pattern recognition tasks, model selection is typically accomplished by minimizing the classification error rate on a set of so-called development data, subject to ground-truth labeling by human experts or some…

Machine Learning · Statistics 2011-08-25 Christopher M. White , Sanjeev P. Khudanpur , Patrick J. Wolfe

Automatic detection of speaker confidence is critical for adaptive computing but remains constrained by limited labelled data and the subjectivity of paralinguistic annotations. This paper proposes a semi-supervised hybrid framework that…

Sound · Computer Science 2026-05-13 Adam Wynn , Jingyun Wang

We revisit a self-supervised method that segments unlabelled speech into word-like segments. We start from the two-stage duration-penalised dynamic programming method that performs zero-resource segmentation without learning an explicit…

Audio and Speech Processing · Electrical Eng. & Systems 2024-02-01 Herman Kamper , Benjamin van Niekerk

Deep learning usually achieves the best results with complete supervision. In the case of semantic segmentation, this means that large amounts of pixelwise annotations are required to learn accurate models. In this paper, we show that we…

Computer Vision and Pattern Recognition · Computer Science 2020-05-07 Yi Zhu , Zhongyue Zhang , Chongruo Wu , Zhi Zhang , Tong He , Hang Zhang , R. Manmatha , Mu Li , Alexander Smola

Recent advancements in open vocabulary models, like CLIP, have notably advanced zero-shot classification and segmentation by utilizing natural language for class-specific embeddings. However, most research has focused on improving model…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Wenfang Sun , Yingjun Du , Gaowen Liu , Ramana Kompella , Cees G. M. Snoek

Speech segmentation at both word and phoneme levels is crucial for various speech processing tasks. It significantly aids in extracting meaningful units from an utterance, thus enabling the generation of discrete elements. In this work we…

Machine Learning · Computer Science 2024-11-18 Simone Carnemolla , Salvatore Calcagno , Simone Palazzo , Daniela Giordano

Recent studies have shown that the benefits provided by self-supervised pre-training and self-training (pseudo-labeling) are complementary. Semi-supervised fine-tuning strategies under the pre-training framework, however, remain…

Sound · Computer Science 2022-06-28 Bowen Zhang , Songjun Cao , Xiaoming Zhang , Yike Zhang , Long Ma , Takahiro Shinozaki

We bring a new perspective to semi-supervised semantic segmentation by providing an analysis on the labeled and unlabeled distributions in training datasets. We first figure out that the distribution gap between labeled and unlabeled…

Computer Vision and Pattern Recognition · Computer Science 2023-12-01 Daoan Zhang , Yunhao Luo , Jianguo Zhang

Semi-supervised learning leverages unlabeled data to enhance model performance, addressing the limitations of fully supervised approaches. Among its strategies, pseudo-supervision has proven highly effective, typically relying on one or…

Computer Vision and Pattern Recognition · Computer Science 2025-05-13 Negin Ghamsarian , Sahar Nasirihaghighi , Klaus Schoeffmann , Raphael Sznitman

Despite the remarkable performance of supervised medical image segmentation models, relying on a large amount of labeled data is impractical in real-world situations. Semi-supervised learning approaches aim to alleviate this challenge using…

Computer Vision and Pattern Recognition · Computer Science 2025-09-17 Yunyao Lu , Yihang Wu , Ahmad Chaddad , Tareef Daqqaq , Reem Kateb
‹ Prev 1 2 3 10 Next ›