Related papers: Large Language Model-guided Document Selection

Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking

In the field of information retrieval, Query Likelihood Models (QLMs) rank documents based on the probability of generating the query given the content of a document. Recently, advanced large language models (LLMs) have emerged as effective…

Information Retrieval · Computer Science · 2023-10-23 · Shengyao Zhuang, Bing Liu, Bevan Koopman, Guido Zuccon

Large Language Models Are Zero-Shot Text Classifiers

Pretrained large language models (LLMs) have become extensively used across various sub-disciplines of natural language processing (NLP). In NLP, text classification problems have garnered considerable focus, but still faced with some…

Computation and Language · Computer Science · 2023-12-05 · Zhiqiang Wang, Yiran Pang, Yanbin Lin

Zero-Shot Prompting and Few-Shot Fine-Tuning: Revisiting Document Image Classification Using Large Language Models

Classifying scanned documents is a challenging problem that involves image, layout, and text analysis for document understanding. Nevertheless, for certain benchmark datasets, notably RVL-CDIP, the state of the art is closing in to…

Computer Vision and Pattern Recognition · Computer Science · 2024-12-19 · Anna Scius-Bertrand, Michael Jungo, Lars Vögtlin, Jean-Marc Spat, Andreas Fischer

Large Language Models Make Sample-Efficient Recommender Systems

Large language models (LLMs) have achieved remarkable progress in the field of natural language processing (NLP), demonstrating remarkable abilities in producing text that resembles human language for various tasks. This opens up new…

Information Retrieval · Computer Science · 2024-06-05 · Jianghao Lin, Xinyi Dai, Rong Shan, Bo Chen, Ruiming Tang, Yong Yu, Weinan Zhang

Making Large Language Models Better Data Creators

Although large language models (LLMs) have advanced the state-of-the-art in NLP significantly, deploying them for downstream applications is still challenging due to cost, responsiveness, control, or concerns around privacy and security. As…

Computation and Language · Computer Science · 2023-11-01 · Dong-Ho Lee, Jay Pujara, Mohit Sewak, Ryen W. White, Sujay Kumar Jauhar

Self-training Large Language Models through Knowledge Detection

Large language models (LLMs) often necessitate extensive labeled datasets and training compute to achieve impressive performance across downstream tasks. This paper explores a self-training paradigm, where the LLM autonomously curates its…

Computation and Language · Computer Science · 2024-11-13 · Wei Jie Yeo, Teddy Ferdinan, Przemyslaw Kazienko, Ranjan Satapathy, Erik Cambria

Language Models for Text Classification: Is In-Context Learning Enough?

Recent foundational language models have shown state-of-the-art performance in many NLP tasks in zero- and few-shot settings. An advantage of these models over more standard approaches based on fine-tuning is the ability to understand…

Computation and Language · Computer Science · 2024-04-16 · Aleksandra Edwards, Jose Camacho-Collados

Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data

In real-world NLP applications, Large Language Models (LLMs) offer promising solutions due to their extensive training on vast datasets. However, the large size and high computation demands of LLMs limit their practicality in many…

Artificial Intelligence · Computer Science · 2025-04-01 · Juanhui Li, Sreyashi Nag, Hui Liu, Xianfeng Tang, Sheikh Sarwar, Limeng Cui, Hansu Gu, Suhang Wang, Qi He, Jiliang Tang

Exploring Large Language Models for Feature Selection: A Data-centric Perspective

The rapid advancement of Large Language Models (LLMs) has significantly influenced various domains, leveraging their exceptional few-shot and zero-shot learning capabilities. In this work, we aim to explore and understand the LLMs-based…

Artificial Intelligence · Computer Science · 2024-10-24 · Dawei Li, Zhen Tan, Huan Liu

Using General Large Language Models to Classify Mathematical Documents

In this article we report on an initial exploration to assess the viability of using the general large language models (LLMs), recently made public, to classify mathematical documents. Automated classification would be useful from the…

Information Retrieval · Computer Science · 2024-06-18 · Patrick D. F. Ion, Stephen M. Watt

Procedural Text Mining with Large Language Models

Recent advancements in the field of Natural Language Processing, particularly the development of large-scale language models that are pretrained on vast amounts of knowledge, are creating novel opportunities within the realm of Knowledge…

Computation and Language · Computer Science · 2023-10-06 · Anisa Rula, Jennifer D'Souza

Text Classification in the LLM Era -- Where do we stand?

Large Language Models revolutionized NLP and showed dramatic performance improvements across several tasks. In this paper, we investigated the role of such language models in text classification and how they compare with other approaches…

Computation and Language · Computer Science · 2025-02-21 · Sowmya Vajjala, Shwetali Shimangaud

In-context Pretraining: Language Modeling Beyond Document Boundaries

Large language models (LMs) are currently trained to predict tokens given document prefixes, enabling them to directly perform long-form generation and prompting-style tasks which can be reduced to document completion. Existing pretraining…

Computation and Language · Computer Science · 2024-06-25 · Weijia Shi, Sewon Min, Maria Lomeli, Chunting Zhou, Margaret Li, Gergely Szilvasy, Rich James, Xi Victoria Lin, Noah A. Smith, Luke Zettlemoyer, Scott Yih, Mike Lewis

Boosting Zero-Shot Crosslingual Performance using LLM-Based Augmentations with Effective Data Selection

Large language models (LLMs) are very proficient text generators. We leverage this capability of LLMs to generate task-specific data via zero-shot prompting and promote cross-lingual transfer for low-resource target languages. Given…

Computation and Language · Computer Science · 2024-07-16 · Barah Fazili, Ashish Sunil Agrawal, Preethi Jyothi

Exploring Large Language Models for Code Explanation

Automating code documentation through explanatory text can prove highly beneficial in code understanding. Large Language Models (LLMs) have made remarkable strides in Natural Language Processing, especially within software engineering tasks…

Software Engineering · Computer Science · 2023-10-26 · Paheli Bhattacharya, Manojit Chakraborty, Kartheek N S N Palepu, Vikas Pandey, Ishan Dindorkar, Rakesh Rajpurohit, Rishabh Gupta

Drilling Down into the Discourse Structure with LLMs for Long Document Question Answering

We address the task of evidence retrieval for long document question answering, which involves locating relevant paragraphs within a document to answer a question. We aim to assess the applicability of large language models (LLMs) in the…

Computation and Language · Computer Science · 2023-11-23 · Inderjeet Nair, Shwetha Somasundaram, Apoorv Saxena, Koustava Goswami

Large Language Models are Good Prompt Learners for Low-Shot Image Classification

Low-shot image classification, where training images are limited or inaccessible, has benefited from recent progress on pre-trained vision-language (VL) models with strong generalizability, e.g. CLIP. Prompt learning methods built with VL…

Computer Vision and Pattern Recognition · Computer Science · 2024-04-04 · Zhaoheng Zheng, Jingmin Wei, Xuefeng Hu, Haidong Zhu, Ram Nevatia

Using Large Language Models to Enrich the Documentation of Datasets for Machine Learning

Recent regulatory initiatives like the European AI Act and relevant voices in the Machine Learning (ML) community stress the need to describe datasets along several key dimensions for trustworthy AI, such as the provenance processes and…

Digital Libraries · Computer Science · 2024-05-27 · Joan Giner-Miguelez, Abel Gómez, Jordi Cabot

Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers

Recent studies have demonstrated the great potential of Large Language Models (LLMs) serving as zero-shot relevance rankers. The typical approach involves making comparisons between pairs or lists of documents. Although effective, these…

Information Retrieval · Computer Science · 2023-11-06 · Weiwei Sun, Zheng Chen, Xinyu Ma, Lingyong Yan, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, Zhaochun Ren

D4: Improving LLM Pretraining via Document De-Duplication and Diversification

Over recent years, an increasing amount of compute and data has been poured into training large language models (LLMs), usually by doing one-pass learning on as many tokens as possible randomly selected from large-scale web corpora. While…

Computation and Language · Computer Science · 2023-08-24 · Kushal Tirumala, Daniel Simig, Armen Aghajanyan, Ari S. Morcos