Related papers: Semi-supervised Sequence Learning

Semi-Supervised Text Classification via Self-Pretraining

We present a neural semi-supervised learning model termed Self-Pretraining. Our model is inspired by the classic self-training algorithm. However, as opposed to self-training, Self-Pretraining is threshold-free, it can potentially update…

Computation and Language · Computer Science 2021-10-01 Payam Karisani, Negin Karisani

A Survey on Self-supervised Pre-training for Sequential Transfer Learning in Neural Networks

Deep neural networks are typically trained under a supervised learning framework where a model learns a single task using labeled data. Instead of relying solely on labeled data, practitioners can harness unlabeled or related data to…

Machine Learning · Computer Science 2020-07-03 Huanru Henry Mao

Semi-supervised sequence tagging with bidirectional language models

Pre-trained word embeddings learned from unlabeled text have become a standard component of neural network architectures for NLP tasks. However, in most cases, the recurrent network that operates on word-level representations to produce…

Computation and Language · Computer Science 2017-05-02 Matthew E. Peters, Waleed Ammar, Chandra Bhagavatula, Russell Power

Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training

Recent studies have shown that the benefits provided by self-supervised pre-training and self-training (pseudo-labeling) are complementary. Semi-supervised fine-tuning strategies under the pre-training framework, however, remain…

Sound · Computer Science 2022-06-28 Bowen Zhang, Songjun Cao, Xiaoming Zhang, Yike Zhang, Long Ma, Takahiro Shinozaki

Big Self-Supervised Models are Strong Semi-Supervised Learners

One paradigm for learning from few labeled examples while making best use of a large amount of unlabeled data is unsupervised pretraining followed by supervised fine-tuning. Although this paradigm uses unlabeled data in a task-agnostic way,…

Machine Learning · Computer Science 2020-10-27 Ting Chen, Simon Kornblith, Kevin Swersky, Mohammad Norouzi, Geoffrey Hinton

Scheduled Sampling for Sequence Prediction with Recurrent Neural Networks

Recurrent Neural Networks can be trained to produce sequences of tokens given some input, as exemplified by recent results in machine translation and image captioning. The current approach to training them consists of maximizing the…

Machine Learning · Computer Science 2015-09-24 Samy Bengio, Oriol Vinyals, Navdeep Jaitly, Noam Shazeer

Semi-supervised sequence classification through change point detection

Sequential sensor data is generated in a wide variety of practical applications. A fundamental challenge involves learning effective classifiers for such sequential data. While deep learning has led to impressive performance gains in recent…

Machine Learning · Computer Science 2020-10-07 Nauman Ahad, Mark A. Davenport

Neural Semi-supervised Learning for Text Classification Under Large-Scale Pretraining

The goal of semi-supervised learning is to utilize the unlabeled, in-domain dataset U to improve models trained on the labeled dataset D. Under the context of large-scale language-model (LM) pretraining, how we can make the best use of U is…

Computation and Language · Computer Science 2020-11-20 Zijun Sun, Chun Fan, Xiaofei Sun, Yuxian Meng, Fei Wu, Jiwei Li

Avoiding Your Teacher's Mistakes: Training Neural Networks with Controlled Weak Supervision

Training deep neural networks requires massive amounts of training data, but for many tasks only limited labeled data is available. This makes weak supervision attractive, using weak or noisy signals like the output of heuristic methods or…

Machine Learning · Computer Science 2017-12-08 Mostafa Dehghani, Aliaksei Severyn, Sascha Rothe, Jaap Kamps

Self-training Improves Pre-training for Natural Language Understanding

Unsupervised pre-training has led to much recent progress in natural language understanding. In this paper, we study self-training as another way to leverage unlabeled data through semi-supervised learning. To obtain additional data for a…

Computation and Language · Computer Science 2020-10-06 Jingfei Du, Edouard Grave, Beliz Gunel, Vishrav Chaudhary, Onur Celebi, Michael Auli, Ves Stoyanov, Alexis Conneau

Learning to Predict Gradients for Semi-Supervised Continual Learning

A key challenge for machine intelligence is to learn new visual concepts without forgetting the previously acquired knowledge. Continual learning is aimed towards addressing this challenge. However, there is a gap between existing…

Machine Learning · Computer Science 2024-02-01 Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao

Revisiting Self-Training for Neural Sequence Generation

Self-training is one of the earliest and simplest semi-supervised methods. The key idea is to augment the original labeled dataset with unlabeled data paired with the model's prediction (i.e. the pseudo-parallel data). While self-training…

Machine Learning · Computer Science 2020-10-20 Junxian He, Jiatao Gu, Jiajun Shen, Marc'Aurelio Ranzato

Predictions For Pre-training Language Models

Language model pre-training has proven to be useful in many language understanding tasks. In this paper, we investigate whether it is still helpful to add the self-training method in the pre-training step and the fine-tuning step. Towards…

Computation and Language · Computer Science 2023-02-17 Tong Guo

Regularization With Stochastic Transformations and Perturbations for Deep Semi-Supervised Learning

Effective convolutional neural networks are trained on large sets of labeled data. However, creating large labeled datasets is a very costly and time-consuming task. Semi-supervised learning uses unlabeled data to train a model with higher…

Computer Vision and Pattern Recognition · Computer Science 2016-06-16 Mehdi Sajjadi, Mehran Javanmardi, Tolga Tasdizen

Heterogeneous Self-Supervised Acoustic Pre-Training with Local Constraints

Self-supervised pre-training using unlabeled data is widely used in automatic speech recognition. In this paper, we propose a new self-supervised pre-training approach to dealing with heterogeneous data. Instead of mixing all the data and…

Machine Learning · Computer Science 2025-09-10 Xiaodong Cui, A F M Saif, Brian Kingsbury, Tianyi Chen

Self-Training: A Survey

Semi-supervised algorithms aim to learn prediction functions from a small set of labeled observations and a large set of unlabeled observations. Because this framework is relevant in many applications, they have received a lot of interest…

Machine Learning · Computer Science 2025-02-17 Massih-Reza Amini, Vasilii Feofanov, Loic Pauletto, Lies Hadjadj, Emilie Devijver, Yury Maximov

Unsupervised Learning of Sequence Representations by Autoencoders

Sequence data is challenging for machine learning approaches, because the lengths of the sequences may vary between samples. In this paper, we present an unsupervised learning model for sequence data, called the Integrated Sequence…

Computer Vision and Pattern Recognition · Computer Science 2018-04-30 Wenjie Pei, David M. J. Tax

Learning Deep Representations Using Convolutional Auto-encoders with Symmetric Skip Connections

Unsupervised pre-training was a critical technique for training deep neural networks years ago. With sufficient labeled data and modern training techniques, it is possible to train very deep neural networks from scratch in a purely…

Computer Vision and Pattern Recognition · Computer Science 2017-03-29 Jianfeng Dong, Xiao-Jiao Mao, Chunhua Shen, Yu-Bin Yang

Meta Co-Training: Two Views are Better than One

In many critical computer vision scenarios unlabeled data is plentiful, but labels are scarce and difficult to obtain. As a result, semi-supervised learning which leverages unlabeled data to boost the performance of supervised classifiers…

Computer Vision and Pattern Recognition · Computer Science 2025-05-29 Jay C. Rothenberger, Dimitrios I. Diochnos

Semi-Supervised Learning using Siamese Networks

Neural networks have been successfully used as classification models yielding state-of-the-art results when trained on a large number of labeled samples. These models, however, are more difficult to train successfully for semi-supervised…

Machine Learning · Computer Science 2021-09-13 Attaullah Sahito, Eibe Frank, Bernhard Pfahringer