Related papers: Self-Supervised Test-Time Learning for Reading Com…

Bridging the Gap between Language Model and Reading Comprehension: Unsupervised MRC via Self-Supervision

Despite recent success in machine reading comprehension (MRC), learning high-quality MRC models still requires large-scale labeled training data, even using strong pre-trained language models (PLMs). The pre-training tasks for PLMs are not…

Computation and Language · Computer Science 2021-07-20 Ning Bian , Xianpei Han , Bo Chen , Hongyu Lin , Ben He , Le Sun

Improving Cross-Lingual Reading Comprehension with Self-Training

Substantial improvements have been made in machine reading comprehension, where the machine answers questions based on a given context. Current state-of-the-art models even surpass human performance on several benchmarks. However, their…

Computation and Language · Computer Science 2021-05-11 Wei-Cheng Huang , Chien-yu Huang , Hung-yi Lee

Test-Time Learning for Large Language Models

While Large Language Models (LLMs) have exhibited remarkable emergent capabilities through extensive pre-training, they still face critical limitations in generalizing to specialized domains and handling diverse linguistic variations, known…

Computation and Language · Computer Science 2025-05-28 Jinwu Hu , Zhitian Zhang , Guohao Chen , Xutao Wen , Chao Shuai , Wei Luo , Bin Xiao , Yuanqing Li , Mingkui Tan

Transfer Learning or Self-supervised Learning? A Tale of Two Pretraining Paradigms

Pretraining has become a standard technique in computer vision and natural language processing, which usually helps to improve performance substantially. Previously, the most dominant pretraining method is transfer learning (TL), which uses…

Computer Vision and Pattern Recognition · Computer Science 2020-07-09 Xingyi Yang , Xuehai He , Yuxiao Liang , Yue Yang , Shanghang Zhang , Pengtao Xie

Self-Augmented In-Context Learning for Unsupervised Word Translation

Recent work has shown that, while large language models (LLMs) demonstrate strong word translation or bilingual lexicon induction (BLI) capabilities in few-shot setups, they still cannot match the performance of 'traditional' mapping-based…

Computation and Language · Computer Science 2024-06-06 Yaoyiran Li , Anna Korhonen , Ivan Vulić

Learning to Read by Spelling: Towards Unsupervised Text Recognition

This work presents a method for visual text recognition without using any paired supervisory data. We formulate the text recognition task as one of aligning the conditional distribution of strings predicted from given text images, with…

Computer Vision and Pattern Recognition · Computer Science 2018-12-11 Ankush Gupta , Andrea Vedaldi , Andrew Zisserman

ClusT3: Information Invariant Test-Time Training

Deep Learning models have shown remarkable performance in a broad range of vision tasks. However, they are often vulnerable against domain shifts at test-time. Test-time training (TTT) methods have been developed in an attempt to mitigate…

Computer Vision and Pattern Recognition · Computer Science 2023-10-20 Gustavo A. Vargas Hakim , David Osowiechi , Mehrdad Noori , Milad Cheraghalikhani , Ismail Ben Ayed , Christian Desrosiers

Improving Context-Based Meta-Reinforcement Learning with Self-Supervised Trajectory Contrastive Learning

Meta-reinforcement learning typically requires orders of magnitude more samples than single task reinforcement learning methods. This is because meta-training needs to deal with more diverse distributions and train extra components such as…

Machine Learning · Computer Science 2021-03-12 Bernie Wang , Simon Xu , Kurt Keutzer , Yang Gao , Bichen Wu

Task-adaptive Pre-training and Self-training are Complementary for Natural Language Understanding

Task-adaptive pre-training (TAPT) and Self-training (ST) have emerged as the major semi-supervised approaches to improve natural language understanding (NLU) tasks with massive amount of unlabeled data. However, it's unclear whether they…

Computation and Language · Computer Science 2023-02-21 Shiyang Li , Semih Yavuz , Wenhu Chen , Xifeng Yan

Effective Self-Mining of In-Context Examples for Unsupervised Machine Translation with LLMs

Large Language Models (LLMs) have demonstrated impressive performance on a wide range of natural language processing (NLP) tasks, primarily through in-context learning (ICL). In ICL, the LLM is provided with examples that represent a given…

Computation and Language · Computer Science 2025-02-19 Abdellah El Mekki , Muhammad Abdul-Mageed

Step-level Verifier-guided Hybrid Test-Time Scaling for Large Language Models

Test-Time Scaling (TTS) is a promising approach to progressively elicit the model's intelligence during inference. Recently, training-based TTS methods, such as continued reinforcement learning (RL), have further surged in popularity, while…

Computation and Language · Computer Science 2025-09-10 Kaiyan Chang , Yonghao Shi , Chenglong Wang , Hang Zhou , Chi Hu , Xiaoqian Liu , Yingfeng Luo , Yuan Ge , Tong Xiao , Jingbo Zhu

Unsupervised Question Answering by Cloze Translation

Obtaining training data for Question Answering (QA) is time-consuming and resource-intensive, and existing QA datasets are only available for limited domains and languages. In this work, we explore to what extent high quality training data…

Computation and Language · Computer Science 2020-05-05 Patrick Lewis , Ludovic Denoyer , Sebastian Riedel

TTRL: Test-Time Reinforcement Learning

This paper investigates Reinforcement Learning (RL) on data without explicit labels for reasoning tasks in Large Language Models (LLMs). The core challenge of the problem is reward estimation during inference while not having access to…

Computation and Language · Computer Science 2025-07-01 Yuxin Zuo , Kaiyan Zhang , Li Sheng , Shang Qu , Ganqu Cui , Xuekai Zhu , Haozhan Li , Yuchen Zhang , Xinwei Long , Ermo Hua , Biqing Qi , Youbang Sun , Zhiyuan Ma , Lifan Yuan , Ning Ding , Bowen Zhou

Text2Token: Unsupervised Text Representation Learning with Token Target Prediction

Unsupervised text representation learning (TRL) is a fundamental task in natural language processing, which is beneficial for improving search and recommendations with the web's unlabeled texts. A recent empirical study finds that the…

Computation and Language · Computer Science 2025-10-14 Ruize An , Richong Zhang , Zhijie Nie , Zhanyu Wu , Yanzhao Zhang , Dingkun Long

Evaluating Self-Supervised Speech Models via Text-Based LLMS

Self-Supervised Learning (SSL) has gained traction for its ability to learn rich representations with low labeling costs, applicable across diverse downstream tasks. However, assessing the downstream-task performance remains challenging due…

Sound · Computer Science 2025-10-07 Takashi Maekaku , Keita Goto , Jinchuan Tian , Yusuke Shinohara , Shinji Watanabe

Self-supervised Regularization for Text Classification

Text classification is a widely studied problem and has broad applications. In many real-world problems, the number of texts for training classification models is limited, which renders these models prone to overfitting. To address this…

Computation and Language · Computer Science 2021-03-25 Meng Zhou , Zechen Li , Pengtao Xie

Neural Skill Transfer from Supervised Language Tasks to Reading Comprehension

Reading comprehension is a challenging task in natural language processing and requires a set of skills to be solved. While current approaches focus on solving the task as a whole, in this paper, we propose to use a neural network `skill'…

Computation and Language · Computer Science 2017-11-13 Todor Mihaylov , Zornitsa Kozareva , Anette Frank

An Efficient Self-Supervised Cross-View Training For Sentence Embedding

Self-supervised sentence representation learning is the task of constructing an embedding space for sentences without relying on human annotation efforts. One straightforward approach is to finetune a pretrained language model (PLM) with a…

Computation and Language · Computer Science 2023-11-07 Peerat Limkonchotiwat , Wuttikorn Ponwitayarat , Lalita Lowphansirikul , Can Udomcharoenchaikit , Ekapol Chuangsuwanich , Sarana Nutanong

Test-Time Training with Self-Supervision for Generalization under Distribution Shifts

In this paper, we propose Test-Time Training, a general approach for improving the performance of predictive models when training and test data come from different distributions. We turn a single unlabeled test sample into a self-supervised…

Machine Learning · Computer Science 2020-07-03 Yu Sun , Xiaolong Wang , Zhuang Liu , John Miller , Alexei A. Efros , Moritz Hardt

Unsupervised Meta-Learning via In-Context Learning

Unsupervised meta-learning aims to learn feature representations from unsupervised datasets that can transfer to downstream tasks with limited labeled data. In this paper, we propose a novel approach to unsupervised meta-learning that…

Machine Learning · Computer Science 2025-02-11 Anna Vettoruzzo , Lorenzo Braccaioli , Joaquin Vanschoren , Marlena Nowaczyk