Related papers: Training with Streaming Annotation

Iterative autoregression: a novel trick to improve your low-latency speech enhancement model

Streaming models are an essential component of real-time speech enhancement tools. The streaming regime constrains speech enhancement models to use only a tiny context of future information. As a result, the low-latency streaming setup is…

Sound · Computer Science 2023-12-06 Pavel Andreev , Nicholas Babaev , Azat Saginbaev , Ivan Shchekotov , Aibek Alanov

Distill and Collect for Semi-Supervised Temporal Action Segmentation

Recent temporal action segmentation approaches need frame annotations during training to be effective. These annotations are very expensive and time-consuming to obtain. This limits their performances when only limited annotated data is…

Computer Vision and Pattern Recognition · Computer Science 2022-11-04 Sovan Biswas , Anthony Rhodes , Ramesh Manuvinakurike , Giuseppe Raffa , Richard Beckwith

ProtoDA: Efficient Transfer Learning for Few-Shot Intent Classification

Practical sequence classification tasks in natural language processing often suffer from low training data availability for target classes. Recent works towards mitigating this problem have focused on transfer learning using embeddings…

Computation and Language · Computer Science 2021-01-29 Manoj Kumar , Varun Kumar , Hadrien Glaude , Cyprien delichy , Aman Alok , Rahul Gupta

Partial Rewriting for Multi-Stage ASR

For many streaming automatic speech recognition tasks, it is important to provide timely intermediate streaming results, while refining a high quality final result. This can be done using a multi-stage architecture, where a small…

Computation and Language · Computer Science 2023-12-18 Antoine Bruguier , David Qiu , Yanzhang He

Test-Time Training on Video Streams

Prior work has established Test-Time Training (TTT) as a general framework to further improve a trained model at test time. Before making a prediction on each test instance, the model is first trained on the same instance using a…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Renhao Wang , Yu Sun , Arnuv Tandon , Yossi Gandelsman , Xinlei Chen , Alexei A. Efros , Xiaolong Wang

Memory-efficient training with streaming dimensionality reduction

The movement of large quantities of data during the training of a Deep Neural Network presents immense challenges for machine learning workloads. To minimize this overhead, especially on the movement and calculation of gradient information,…

Machine Learning · Computer Science 2020-04-28 Siyuan Huang , Brian D. Hoskins , Matthew W. Daniels , Mark D. Stiles , Gina C. Adam

Few-Shot Adaptation of Pre-Trained Networks for Domain Shift

Deep networks are prone to performance degradation when there is a domain shift between the source (training) data and target (test) data. Recent test-time adaptation methods update batch normalization layers of pre-trained source models…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Wenyu Zhang , Li Shen , Wanyue Zhang , Chuan-Sheng Foo

Track Anything Annotate: Video annotation and dataset generation of computer vision models

Modern machine learning methods require significant amounts of labelled data, making the preparation process time-consuming and resource-intensive. In this paper, we propose to consider the process of prototyping a tool for annotating and…

Computer Vision and Pattern Recognition · Computer Science 2025-05-26 Nikita Ivanov , Mark Klimov , Dmitry Glukhikh , Tatiana Chernysheva , Igor Glukhikh

Optimizing Active Learning for Low Annotation Budgets

When we can not assume a large amount of annotated data , active learning is a good strategy. It consists in learning a model on a small amount of annotated data (annotation budget) and in choosing the best set of points to annotate in…

Computer Vision and Pattern Recognition · Computer Science 2022-01-19 Umang Aggarwal , Adrian Popescu , Céline Hudelot

Training neural audio classifiers with few data

We investigate supervised learning strategies that improve the training of neural network audio classifiers on small annotated collections. In particular, we study whether (i) a naive regularization of the solution space, (ii) prototypical…

Sound · Computer Science 2018-11-07 Jordi Pons , Joan Serrà , Xavier Serra

Sample Efficient Text Summarization Using a Single Pre-Trained Transformer

Language model (LM) pre-training has resulted in impressive performance and sample efficiency on a variety of language understanding tasks. However, it remains unclear how to best use pre-trained LMs for generation tasks such as abstractive…

Computation and Language · Computer Science 2019-05-23 Urvashi Khandelwal , Kevin Clark , Dan Jurafsky , Lukasz Kaiser

Annotation Techniques for Judo Combat Phase Classification from Tournament Footage

This paper presents a semi-supervised approach to extracting and analyzing combat phases in judo tournaments using live-streamed footage. The objective is to automate the annotation and summarization of live streamed judo matches. We train…

Computer Vision and Pattern Recognition · Computer Science 2024-12-11 Anthony Miyaguchi , Jed Moutahir , Tanmay Sutar

A Comprehensive Comparison of Pre-training Language Models

Recently, the development of pre-trained language models has brought natural language processing (NLP) tasks to the new state-of-the-art. In this paper we explore the efficiency of various pre-trained language models. We pre-train a list of…

Computation and Language · Computer Science 2023-07-27 Tong Guo

Few-Shot Learning of an Interleaved Text Summarization Model by Pretraining with Synthetic Data

Interleaved texts, where posts belonging to different threads occur in a sequence, commonly occur in online chat posts, so that it can be time-consuming to quickly obtain an overview of the discussions. Existing systems first disentangle…

Computation and Language · Computer Science 2021-03-10 Sanjeev Kumar Karn , Francine Chen , Yan-Ying Chen , Ulli Waltinger , Hinrich Schuetze

Teaching Arithmetic to Small Transformers

Large language models like GPT-4 exhibit emergent capabilities across general-purpose tasks, such as basic arithmetic, when trained on extensive text data, even though these tasks are not explicitly encoded by the unsupervised, next-token…

Machine Learning · Computer Science 2023-07-10 Nayoung Lee , Kartik Sreenivasan , Jason D. Lee , Kangwook Lee , Dimitris Papailiopoulos

Is More Data Better? Re-thinking the Importance of Efficiency in Abusive Language Detection with Transformers-Based Active Learning

Annotating abusive language is expensive, logistically complex and creates a risk of psychological harm. However, most machine learning research has prioritized maximizing effectiveness (i.e., F1 or accuracy score) rather than data…

Computation and Language · Computer Science 2022-09-22 Hannah Rose Kirk , Bertie Vidgen , Scott A. Hale

Meta-Learning Transformers to Improve In-Context Generalization

In-context learning enables transformer models to generalize to new tasks based solely on input prompts, without any need for weight updates. However, existing training paradigms typically rely on large, unstructured datasets that are…

Machine Learning · Computer Science 2025-07-08 Lorenzo Braccaioli , Anna Vettoruzzo , Prabhant Singh , Joaquin Vanschoren , Mohamed-Rafik Bouguelia , Nicola Conci

Scale Up Event Extraction Learning via Automatic Training Data Generation

The task of event extraction has long been investigated in a supervised learning paradigm, which is bound by the number and the quality of the training instances. Existing training data must be manually generated through a combination of…

Computation and Language · Computer Science 2017-12-12 Ying Zeng , Yansong Feng , Rong Ma , Zheng Wang , Rui Yan , Chongde Shi , Dongyan Zhao

Leveraging AI Predicted and Expert Revised Annotations in Interactive Segmentation: Continual Tuning or Full Training?

Interactive segmentation, an integration of AI algorithms and human expertise, premises to improve the accuracy and efficiency of curating large-scale, detailed-annotated datasets in healthcare. Human experts revise the annotations…

Computer Vision and Pattern Recognition · Computer Science 2024-03-01 Tiezheng Zhang , Xiaoxi Chen , Chongyu Qu , Alan Yuille , Zongwei Zhou

SOTASTREAM: A Streaming Approach to Machine Translation Training

Many machine translation toolkits make use of a data preparation step wherein raw data is transformed into a tensor format that can be used directly by the trainer. This preparation step is increasingly at odds with modern research and…

Computation and Language · Computer Science 2023-08-16 Matt Post , Thamme Gowda , Roman Grundkiewicz , Huda Khayrallah , Rohit Jain , Marcin Junczys-Dowmunt