Related papers: Small-Text: Active Learning for Text Classificatio…

Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models

Active learning is an iterative labeling process that is used to obtain a small labeled subset, despite the absence of labeled data, thereby enabling to train a model for supervised tasks such as text classification. While active learning…

Computation and Language · Computer Science 2024-10-07 Christopher Schröder , Gerhard Heyer

When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes

We present FastFit, a method, and a Python package design to provide fast and accurate few-shot classification, especially for scenarios with many semantically similar classes. FastFit utilizes a novel approach integrating batch contrastive…

Computation and Language · Computer Science 2024-04-19 Asaf Yehudai , Elron Bendel

Active Few-Shot Learning for Text Classification

The rise of Large Language Models (LLMs) has boosted the use of Few-Shot Learning (FSL) methods in natural language processing, achieving acceptable performance even when working with limited training data. The goal of FSL is to effectively…

Computation and Language · Computer Science 2025-02-27 Saeed Ahmadnia , Arash Yousefi Jordehi , Mahsa Hosseini Khasheh Heyran , Seyed Abolghasem Mirroshandel , Owen Rambow , Cornelia Caragea

Improving Probabilistic Models in Text Classification via Active Learning

Social scientists often classify text documents to use the resulting labels as an outcome or a predictor in empirical research. Automated text classification has become a standard tool, since it requires less human coding. However, scholars…

Computation and Language · Computer Science 2025-05-14 Mitchell Bosley , Saki Kuzushima , Ted Enamorado , Yuki Shiraito

Towards Agile Text Classifiers for Everyone

Text-based safety classifiers are widely used for content moderation and increasingly to tune generative language model behavior - a topic of growing concern for the safety of digital assistants and chatbots. However, different policies…

Computation and Language · Computer Science 2023-10-24 Maximilian Mozes , Jessica Hoffmann , Katrin Tomanek , Muhamed Kouate , Nithum Thain , Ann Yuan , Tolga Bolukbasi , Lucas Dixon

FineText: Text Classification via Attention-based Language Model Fine-tuning

Training deep neural networks from scratch on natural language processing (NLP) tasks requires significant amount of manually labeled text corpus and substantial time to converge, which usually cannot be satisfied by the customers. In this…

Computation and Language · Computer Science 2019-10-29 Yunzhe Tao , Saurabh Gupta , Satyapriya Krishna , Xiong Zhou , Orchid Majumder , Vineet Khare

pyGSL: A Graph Structure Learning Toolkit

We introduce pyGSL, a Python library that provides efficient implementations of state-of-the-art graph structure learning models along with diverse datasets to evaluate them on. The implementations are written in GPU-friendly ways, allowing…

Machine Learning · Computer Science 2022-11-08 Max Wasserman , Gonzalo Mateos

LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models

Vision-language pre-training like CLIP has shown promising performance on various downstream tasks such as zero-shot image classification and image-text retrieval. Most of the existing CLIP-alike works usually adopt relatively large image…

Computer Vision and Pattern Recognition · Computer Science 2023-12-04 Ying Nie , Wei He , Kai Han , Yehui Tang , Tianyu Guo , Fanyi Du , Yunhe Wang

Deep Fast Vision: A Python Library for Accelerated Deep Transfer Learning Vision Prototyping

Deep learning-based vision is characterized by intricate frameworks that often necessitate a profound understanding, presenting a barrier to newcomers and limiting broad adoption. With many researchers grappling with the constraints of…

Computer Vision and Pattern Recognition · Computer Science 2023-11-13 Fabi Prezja

TransformerRanker: A Tool for Efficiently Finding the Best-Suited Language Models for Downstream Classification Tasks

Classification tasks in NLP are typically addressed by selecting a pre-trained language model (PLM) from a model hub, and fine-tuning it for the task at hand. However, given the very large number of PLMs that are currently available, a…

Computation and Language · Computer Science 2024-09-11 Lukas Garbas , Max Ploner , Alan Akbik

Small Language Models in the Real World: Insights from Industrial Text Classification

With the emergence of ChatGPT, Transformer models have significantly advanced text classification and related tasks. Decoder-only models such as Llama exhibit strong performance and flexibility, yet they suffer from inefficiency on…

Computation and Language · Computer Science 2025-06-25 Lujun Li , Lama Sleem , Niccolo' Gentile , Geoffrey Nichil , Radu State

Applying LLMs to Active Learning: Towards Cost-Efficient Cross-Task Text Classification without Manually Labeled Data

Machine learning-based classifiers have been used for text classification, such as sentiment analysis, news classification, and toxic comment classification. However, supervised machine learning models often require large amounts of labeled…

Computation and Language · Computer Science 2025-05-06 Yejian Zhang , Shingo Takada

Efficient Layer-wise LLM Fine-tuning for Revision Intention Prediction

Large Language Models (LLMs) have shown extraordinary success across various text generation tasks; however, their potential for simple yet essential text classification remains underexplored, as LLM pre-training tends to emphasize…

Computation and Language · Computer Science 2025-10-02 Zhexiong Liu , Diane Litman

A scikit-based Python environment for performing multi-label classification

scikit-multilearn is a Python library for performing multi-label classification. The library is compatible with the scikit/scipy ecosystem and uses sparse matrices for all internal operations. It provides native Python implementations of…

Machine Learning · Computer Science 2018-12-11 Piotr Szymański , Tomasz Kajdanowicz

AutoTrain: No-code training for state-of-the-art models

With the advancements in open-source models, training (or finetuning) models on custom datasets has become a crucial part of developing solutions which are tailored to specific industrial or open-source applications. Yet, there is no single…

Artificial Intelligence · Computer Science 2024-11-15 Abhishek Thakur

Improve Meta-learning for Few-Shot Text Classification with All You Can Acquire from the Tasks

Meta-learning has emerged as a prominent technology for few-shot text classification and has achieved promising performance. However, existing methods often encounter difficulties in drawing accurate class prototypes from support set…

Computer Vision and Pattern Recognition · Computer Science 2024-10-15 Xinyue Liu , Yunlong Gao , Linlin Zong , Bo Xu

Integrated Image-Text Based on Semi-supervised Learning for Small Sample Instance Segmentation

Small sample instance segmentation is a very challenging task, and many existing methods follow the training strategy of meta-learning which pre-train models on support set and fine-tune on query set. The pre-training phase, which is highly…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Ruting Chi , Zhiyi Huang , Yuexing Han

The Application of Active Query K-Means in Text Classification

Active learning is a state-of-art machine learning approach to deal with an abundance of unlabeled data. In the field of Natural Language Processing, typically it is costly and time-consuming to have all the data annotated. This…

Computation and Language · Computer Science 2021-07-19 Yukun Jiang

PrismSSL: One Interface, Many Modalities; A Single-Interface Library for Multimodal Self-Supervised Learning

We present PrismSSL, a Python library that unifies state-of-the-art self-supervised learning (SSL) methods across audio, vision, graphs, and cross-modal settings in a single, modular codebase. The goal of the demo is to show how researchers…

Machine Learning · Computer Science 2025-11-25 Melika Shirian , Kianoosh Vadaei , Kian Majlessi , Audrina Ebrahimi , Arshia Hemmat , Peyman Adibi , Hossein Karshenas

pySigLib -- Fast Signature-Based Computations on CPU and GPU

Signature-based methods have recently gained significant traction in machine learning for sequential data. In particular, signature kernels have emerged as powerful discriminators and training losses for generative models on time-series,…

Machine Learning · Computer Science 2025-09-16 Daniil Shmelev , Cristopher Salvi