Related papers: Language models are weak learners

BoostLLM: Boosting-inspired LLM Fine-tuning for Few-shot Tabular Classification

Large language models (LLMs) have recently been adapted to tabular prediction by serializing structured features into natural language, but their performance in low-data regimes remains limited compared to gradient-boosted decision trees…

Machine Learning · Computer Science 2026-05-12 Yi-Siang Wang , Kuan-Yu Chen , Yu-Chen Den , Darby Tien-Hao Chang

Large Language Models Can be Lazy Learners: Analyze Shortcuts in In-Context Learning

Large language models (LLMs) have recently shown great potential for in-context learning, where LLMs learn a new task simply by conditioning on a few input-label pairs (prompts). Despite their potential, our understanding of the factors…

Computation and Language · Computer Science 2023-09-12 Ruixiang Tang , Dehan Kong , Longtao Huang , Hui Xue

Escaping Collapse: The Strength of Weak Data for Large Language Model Training

Synthetically-generated data plays an increasingly larger role in training large language models. However, while synthetic data has been found to be useful, studies have also shown that without proper curation it can cause LLM performance…

Machine Learning · Computer Science 2025-12-02 Kareem Amin , Sara Babakniya , Alex Bie , Weiwei Kong , Umar Syed , Sergei Vassilvitskii

Dictionary-based Phrase-level Prompting of Large Language Models for Machine Translation

Large language models (LLMs) demonstrate remarkable machine translation (MT) abilities via prompting, even though they were not explicitly trained for this task. However, even given the incredible quantities of data they are trained on,…

Computation and Language · Computer Science 2023-02-16 Marjan Ghazvininejad , Hila Gonen , Luke Zettlemoyer

How well do Large Language Models Recognize Instructional Moves? Establishing Baselines for Foundation Models in Educational Discourse

Large language models (LLMs) are increasingly adopted in educational technologies for a variety of tasks, from generating instructional materials and assisting with assessment design to tutoring. While prior work has investigated how models…

Computation and Language · Computer Science 2025-12-24 Kirk Vanacore , Rene F. Kizilcec

Gradient Boosting Trees and Large Language Models for Tabular Data Few-Shot Learning

Large Language Models (LLM) have brought numerous of new applications to Machine Learning (ML). In the context of tabular data (TD), recent studies show that TabLLM is a very powerful mechanism for few-shot-learning (FSL) applications, even…

Machine Learning · Computer Science 2024-11-08 Carlos Huertas

A Note on Statistically Accurate Tabular Data Generation Using Large Language Models

Large language models (LLMs) have shown promise in synthetic tabular data generation, yet existing methods struggle to preserve complex feature dependencies, particularly among categorical variables. This work introduces a…

Machine Learning · Computer Science 2025-05-07 Andrey Sidorenko

Large Language Models as Universal Predictors? An Empirical Study on Small Tabular Datasets

Large Language Models (LLMs), originally developed for natural language processing (NLP), have demonstrated the potential to generalize across modalities and domains. With their in-context learning (ICL) capabilities, LLMs can perform…

Artificial Intelligence · Computer Science 2025-08-26 Nikolaos Pavlidis , Vasilis Perifanis , Symeon Symeonidis , Pavlos S. Efraimidis

Pronunciation Assessment with Multi-modal Large Language Models

Large language models (LLMs), renowned for their powerful conversational abilities, are widely recognized as exceptional tools in the field of education, particularly in the context of automated intelligent instruction systems for language…

Computation and Language · Computer Science 2024-07-19 Kaiqi Fu , Linkai Peng , Nan Yang , Shuran Zhou

Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!

Large Language Models (LLMs) have made remarkable strides in various tasks. Whether LLMs are competitive few-shot solvers for information extraction (IE) tasks, however, remains an open problem. In this work, we aim to provide a thorough…

Computation and Language · Computer Science 2024-04-15 Yubo Ma , Yixin Cao , YongChing Hong , Aixin Sun

Towards Modeling Learner Performance with Large Language Models

Recent work exploring the capabilities of pre-trained large language models (LLMs) has demonstrated their ability to act as general pattern machines by completing complex token sequences representing a wide array of tasks, including…

Computers and Society · Computer Science 2024-03-25 Seyed Parsa Neshaei , Richard Lee Davis , Adam Hazimeh , Bojan Lazarevski , Pierre Dillenbourg , Tanja Käser

Use Me Wisely: AI-Driven Assessment for LLM Prompting Skills Development

The use of large language model (LLM)-powered chatbots, such as ChatGPT, has become popular across various domains, supporting a range of tasks and processes. However, due to the intrinsic complexity of LLMs, effective prompting is more…

Computers and Society · Computer Science 2025-03-05 Dimitri Ognibene , Gregor Donabauer , Emily Theophilou , Cansu Koyuturk , Mona Yavari , Sathya Bursic , Alessia Telari , Alessia Testa , Raffaele Boiano , Davide Taibi , Davinia Hernandez-Leo , Udo Kruschwitz , Martin Ruskov

Large Language Models are Good Prompt Learners for Low-Shot Image Classification

Low-shot image classification, where training images are limited or inaccessible, has benefited from recent progress on pre-trained vision-language (VL) models with strong generalizability, e.g. CLIP. Prompt learning methods built with VL…

Computer Vision and Pattern Recognition · Computer Science 2024-04-04 Zhaoheng Zheng , Jingmin Wei , Xuefeng Hu , Haidong Zhu , Ram Nevatia

Large Language Models are Strong Zero-Shot Retriever

In this work, we propose a simple method that applies a large language model (LLM) to large-scale retrieval in zero-shot scenarios. Our method, the Language language model as Retriever (LameR), is built upon no other neural models but an…

Computation and Language · Computer Science 2023-08-03 Tao Shen , Guodong Long , Xiubo Geng , Chongyang Tao , Tianyi Zhou , Daxin Jiang

Small LLMs Are Weak Tool Learners: A Multi-LLM Agent

Large Language Model (LLM) agents significantly extend the capabilities of standalone LLMs, empowering them to interact with external tools (e.g., APIs, functions) and complete various tasks in a self-directed fashion. The challenge of tool…

Artificial Intelligence · Computer Science 2024-02-19 Weizhou Shen , Chenliang Li , Hongzhan Chen , Ming Yan , Xiaojun Quan , Hehong Chen , Ji Zhang , Fei Huang

Large Language Model Enhanced Machine Learning Estimators for Classification

Pre-trained large language models (LLM) have emerged as a powerful tool for simulating various scenarios and generating output given specific instructions and multimodal input. In this work, we analyze the specific use of LLM to enhance a…

Machine Learning · Computer Science 2024-05-10 Yuhang Wu , Yingfei Wang , Chu Wang , Zeyu Zheng

Large Language Models Are Zero-Shot Text Classifiers

Retrained large language models (LLMs) have become extensively used across various sub-disciplines of natural language processing (NLP). In NLP, text classification problems have garnered considerable focus, but still faced with some…

Computation and Language · Computer Science 2023-12-05 Zhiqiang Wang , Yiran Pang , Yanbin Lin

Helping Language Models Learn More: Multi-dimensional Task Prompt for Few-shot Tuning

Large language models (LLMs) can be used as accessible and intelligent chatbots by constructing natural language queries and directly inputting the prompt into the large language model. However, different prompt' constructions often lead to…

Computation and Language · Computer Science 2023-12-14 Jinta Weng , Jiarui Zhang , Yue Hu , Daidong Fa , Xiaofeng Xuand , Heyan Huang

Large Language Models Can Automatically Engineer Features for Few-Shot Tabular Learning

Large Language Models (LLMs), with their remarkable ability to tackle challenging and unseen reasoning problems, hold immense potential for tabular learning, that is vital for many real-world applications. In this paper, we propose a novel…

Machine Learning · Computer Science 2024-05-07 Sungwon Han , Jinsung Yoon , Sercan O Arik , Tomas Pfister

Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting

We explore the ability of large language models (LLMs) to act as speech recognition post-processors that perform rescoring and error correction. Our first focus is on instruction prompting to let LLMs perform these task without fine-tuning,…

Computation and Language · Computer Science 2024-01-29 Chao-Han Huck Yang , Yile Gu , Yi-Chieh Liu , Shalini Ghosh , Ivan Bulyko , Andreas Stolcke