English
Related papers

Related papers: Self-Supervised Test-Time Learning for Reading Com…

200 papers

Despite recent success in machine reading comprehension (MRC), learning high-quality MRC models still requires large-scale labeled training data, even using strong pre-trained language models (PLMs). The pre-training tasks for PLMs are not…

Computation and Language · Computer Science 2021-07-20 Ning Bian , Xianpei Han , Bo Chen , Hongyu Lin , Ben He , Le Sun

Substantial improvements have been made in machine reading comprehension, where the machine answers questions based on a given context. Current state-of-the-art models even surpass human performance on several benchmarks. However, their…

Computation and Language · Computer Science 2021-05-11 Wei-Cheng Huang , Chien-yu Huang , Hung-yi Lee

While Large Language Models (LLMs) have exhibited remarkable emergent capabilities through extensive pre-training, they still face critical limitations in generalizing to specialized domains and handling diverse linguistic variations, known…

Computation and Language · Computer Science 2025-05-28 Jinwu Hu , Zhitian Zhang , Guohao Chen , Xutao Wen , Chao Shuai , Wei Luo , Bin Xiao , Yuanqing Li , Mingkui Tan

Pretraining has become a standard technique in computer vision and natural language processing, which usually helps to improve performance substantially. Previously, the most dominant pretraining method is transfer learning (TL), which uses…

Computer Vision and Pattern Recognition · Computer Science 2020-07-09 Xingyi Yang , Xuehai He , Yuxiao Liang , Yue Yang , Shanghang Zhang , Pengtao Xie

Recent work has shown that, while large language models (LLMs) demonstrate strong word translation or bilingual lexicon induction (BLI) capabilities in few-shot setups, they still cannot match the performance of 'traditional' mapping-based…

Computation and Language · Computer Science 2024-06-06 Yaoyiran Li , Anna Korhonen , Ivan Vulić

This work presents a method for visual text recognition without using any paired supervisory data. We formulate the text recognition task as one of aligning the conditional distribution of strings predicted from given text images, with…

Computer Vision and Pattern Recognition · Computer Science 2018-12-11 Ankush Gupta , Andrea Vedaldi , Andrew Zisserman

Deep Learning models have shown remarkable performance in a broad range of vision tasks. However, they are often vulnerable against domain shifts at test-time. Test-time training (TTT) methods have been developed in an attempt to mitigate…

Computer Vision and Pattern Recognition · Computer Science 2023-10-20 Gustavo A. Vargas Hakim , David Osowiechi , Mehrdad Noori , Milad Cheraghalikhani , Ismail Ben Ayed , Christian Desrosiers

Meta-reinforcement learning typically requires orders of magnitude more samples than single task reinforcement learning methods. This is because meta-training needs to deal with more diverse distributions and train extra components such as…

Machine Learning · Computer Science 2021-03-12 Bernie Wang , Simon Xu , Kurt Keutzer , Yang Gao , Bichen Wu

Task-adaptive pre-training (TAPT) and Self-training (ST) have emerged as the major semi-supervised approaches to improve natural language understanding (NLU) tasks with massive amount of unlabeled data. However, it's unclear whether they…

Computation and Language · Computer Science 2023-02-21 Shiyang Li , Semih Yavuz , Wenhu Chen , Xifeng Yan

Large Language Models (LLMs) have demonstrated impressive performance on a wide range of natural language processing (NLP) tasks, primarily through in-context learning (ICL). In ICL, the LLM is provided with examples that represent a given…

Computation and Language · Computer Science 2025-02-19 Abdellah El Mekki , Muhammad Abdul-Mageed

Test-Time Scaling (TTS) is a promising approach to progressively elicit the model's intelligence during inference. Recently, training-based TTS methods, such as continued reinforcement learning (RL), have further surged in popularity, while…

Computation and Language · Computer Science 2025-09-10 Kaiyan Chang , Yonghao Shi , Chenglong Wang , Hang Zhou , Chi Hu , Xiaoqian Liu , Yingfeng Luo , Yuan Ge , Tong Xiao , Jingbo Zhu

Obtaining training data for Question Answering (QA) is time-consuming and resource-intensive, and existing QA datasets are only available for limited domains and languages. In this work, we explore to what extent high quality training data…

Computation and Language · Computer Science 2020-05-05 Patrick Lewis , Ludovic Denoyer , Sebastian Riedel

This paper investigates Reinforcement Learning (RL) on data without explicit labels for reasoning tasks in Large Language Models (LLMs). The core challenge of the problem is reward estimation during inference while not having access to…

Unsupervised text representation learning (TRL) is a fundamental task in natural language processing, which is beneficial for improving search and recommendations with the web's unlabeled texts. A recent empirical study finds that the…

Computation and Language · Computer Science 2025-10-14 Ruize An , Richong Zhang , Zhijie Nie , Zhanyu Wu , Yanzhao Zhang , Dingkun Long

Self-Supervised Learning (SSL) has gained traction for its ability to learn rich representations with low labeling costs, applicable across diverse downstream tasks. However, assessing the downstream-task performance remains challenging due…

Sound · Computer Science 2025-10-07 Takashi Maekaku , Keita Goto , Jinchuan Tian , Yusuke Shinohara , Shinji Watanabe

Text classification is a widely studied problem and has broad applications. In many real-world problems, the number of texts for training classification models is limited, which renders these models prone to overfitting. To address this…

Computation and Language · Computer Science 2021-03-25 Meng Zhou , Zechen Li , Pengtao Xie

Reading comprehension is a challenging task in natural language processing and requires a set of skills to be solved. While current approaches focus on solving the task as a whole, in this paper, we propose to use a neural network `skill'…

Computation and Language · Computer Science 2017-11-13 Todor Mihaylov , Zornitsa Kozareva , Anette Frank

Self-supervised sentence representation learning is the task of constructing an embedding space for sentences without relying on human annotation efforts. One straightforward approach is to finetune a pretrained language model (PLM) with a…

In this paper, we propose Test-Time Training, a general approach for improving the performance of predictive models when training and test data come from different distributions. We turn a single unlabeled test sample into a self-supervised…

Machine Learning · Computer Science 2020-07-03 Yu Sun , Xiaolong Wang , Zhuang Liu , John Miller , Alexei A. Efros , Moritz Hardt

Unsupervised meta-learning aims to learn feature representations from unsupervised datasets that can transfer to downstream tasks with limited labeled data. In this paper, we propose a novel approach to unsupervised meta-learning that…

Machine Learning · Computer Science 2025-02-11 Anna Vettoruzzo , Lorenzo Braccaioli , Joaquin Vanschoren , Marlena Nowaczyk
‹ Prev 1 2 3 10 Next ›