Related papers: Error-driven Data-efficient Large Multimodal Model…

Maybe Only 0.5% Data is Needed: A Preliminary Exploration of Low Training Data Instruction Tuning

Instruction tuning for large language models (LLMs) has gained attention from researchers due to its ability to unlock the potential of LLMs in following instructions. While instruction tuning offers advantages for facilitating the…

Artificial Intelligence · Computer Science · 2023-05-17 · Hao Chen, Yiming Zhang, Qi Zhang, Hantao Yang, Xiaomeng Hu, Xuetao Ma, Yifan Yanggong, Junbo Zhao

Exploring the Benefits of Training Expert Language Models over Instruction Tuning

Recently, Language Models (LMs) instruction-tuned on multiple tasks, also known as multitask-prompted fine-tuning (MT), have shown the capability to generalize to unseen tasks. Previous work has shown that scaling the number of training…

Computation and Language · Computer Science · 2023-02-10 · Joel Jang, Seungone Kim, Seonghyeon Ye, Doyoung Kim, Lajanugen Logeswaran, Moontae Lee, Kyungjae Lee, Minjoon Seo

Continual Instruction Tuning for Large Multimodal Models

Instruction tuning is now a widely adopted approach to aligning large multimodal models (LMMs) to follow human intent. It unifies the data format of vision-language tasks, enabling multi-task joint training. However, vision-language tasks…

Machine Learning · Computer Science · 2023-11-29 · Jinghan He, Haiyun Guo, Ming Tang, Jinqiao Wang

Fine-Tuned In-Context Learners for Efficient Adaptation

When adapting large language models (LLMs) to a specific downstream task, two primary approaches are commonly employed: (1) prompt engineering, often with in-context few-shot learning, leveraging the model's inherent generalization…

Machine Learning · Computer Science · 2025-12-24 · Jorg Bornschein, Clare Lyle, Yazhe Li, Amal Rannen-Triki, Xu Owen He, Razvan Pascanu

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Pretrained large language models (LLMs) are currently state-of-the-art for solving the vast majority of natural language processing tasks. While many real-world applications still require fine-tuning to reach satisfactory levels of…

Computation and Language · Computer Science · 2024-07-16 · Nicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipalli, Michael W. Mahoney, Kurt Keutzer, Amir Gholami

Efficiently Estimating Data Efficiency for Language Model Fine-tuning

While large language models (LLMs) demonstrate reasonable zero-shot capability across many downstream tasks, fine-tuning is a common practice to improve their performance. However, a task's data efficiency--i.e., the number of fine-tuning…

Machine Learning · Computer Science · 2026-01-01 · Gyung Hyun Je, Colin Raffel

Towards Robust Instruction Tuning on Multimodal Large Language Models

Fine-tuning large language models (LLMs) on multi-task instruction-following data has been proven to be a powerful learning paradigm for improving their zero-shot capabilities on new tasks. Recent works about high-quality…

Computation and Language · Computer Science · 2024-06-17 · Wei Han, Hui Chen, Soujanya Poria

IterSelectTune: An Iterative Training Framework for Efficient Instruction-Tuning Data Selection

As large language models (LLMs) continue to advance, instruction tuning has become critical for improving their ability to generate accurate and contextually appropriate responses. Although numerous instruction-tuning datasets have been…

Computation and Language · Computer Science · 2024-10-18 · Jielin Song, Siyu Liu, Bin Zhu, Yanghui Rao

How to Teach Large Multimodal Models New Skills

How can we teach large multimodal models (LMMs) new skills without erasing prior abilities? We study sequential fine-tuning on five target skills while monitoring general ability on eight held-out benchmarks across three model families.…

Artificial Intelligence · Computer Science · 2026-04-22 · Zhen Zhu, Yiming Gong, Yao Xiao, Yaoyao Liu, Derek Hoiem

RL-Guided Data Selection for Language Model Finetuning

Data selection for finetuning Large Language Models (LLMs) can be framed as a budget-constrained optimization problem: maximizing a model's downstream performance under a strict training data budget. Solving this problem is generally…

Machine Learning · Computer Science · 2025-10-01 · Animesh Jha, Harshit Gupta, Ananjan Nandi

LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms

Large Language Models are traditionally finetuned on large instruction datasets. However, recent studies suggest that small, high-quality datasets can suffice for general purpose instruction following. This lack of consensus surrounding…

Machine Learning · Computer Science · 2023-12-29 · Aditi Jha, Sam Havens, Jeremy Dohmann, Alex Trott, Jacob Portes

Crafting Efficient Fine-Tuning Strategies for Large Language Models

This paper addresses the challenges of efficiently fine-tuning large language models (LLMs) by exploring data efficiency and hyperparameter optimization. We investigate the minimum data required for effective fine-tuning and propose a novel…

Computation and Language · Computer Science · 2024-07-22 · Michael Oliver, Guan Wang

TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use

Large language models (LLMs) achieve remarkable advancements by leveraging tools to interact with environments, a critical step toward generalized AI. However, the standard supervised fine-tuning (SFT) approach, which relies on large-scale…

Computation and Language · Computer Science · 2025-08-27 · Junjie Ye, Yilong Wu, Sixian Li, Yuming Yang, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang, Peng Wang, Zhongchao Shi, Jianping Fan, Zhengyin Du

Synthetic Data Generation in Low-Resource Settings via Fine-Tuning of Large Language Models

The in-context learning ability of large language models (LLMs) enables them to generalize to novel downstream tasks with relatively few labeled examples. However, they require enormous computational resources to be deployed. Alternatively,…

Computation and Language · Computer Science · 2024-01-09 · Jean Kaddour, Qi Liu

On the Performance of Multimodal Language Models

Instruction-tuned large language models (LLMs) have demonstrated promising zero-shot generalization capabilities across various downstream tasks. Recent research has introduced multimodal capabilities to LLMs by integrating independently…

Computation and Language · Computer Science · 2023-11-29 · Utsav Garg, Erhan Bas

Multi-Task Instruction Tuning of LLaMa for Specific Scenarios: A Preliminary Study on Writing Assistance

Proprietary Large Language Models (LLMs), such as ChatGPT, have garnered significant attention due to their exceptional capabilities in handling a diverse range of tasks. Recent studies demonstrate that open-sourced smaller foundational…

Computation and Language · Computer Science · 2023-10-10 · Yue Zhang, Leyang Cui, Deng Cai, Xinting Huang, Tao Fang, Wei Bi

Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Instruction tuning is critical to improve LLMs but usually suffers from low-quality and redundant data. Data filtering for instruction tuning has proved important in improving both the efficiency and performance of the tuning process. But…

Computation and Language · Computer Science · 2024-06-11 · Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou

Fine-tuning Large Language Models with Limited Data: A Survey and Practical Guide

Fine-tuning large language models (LLMs) with limited data poses a practical challenge in low-resource languages, specialized domains, and constrained deployment settings. While pre-trained LLMs provide strong foundations, effective…

Computation and Language · Computer Science · 2025-10-29 · Marton Szep, Daniel Rueckert, Rüdiger von Eisenhart-Rothe, Florian Hinterwimmer

Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model

Multi-modal Large Language Models (MLLMs) integrate visual and linguistic reasoning to address complex tasks such as image captioning and visual question answering. While MLLMs demonstrate remarkable versatility, MLLMs appear limited…

Computation and Language · Computer Science · 2025-03-07 · Wenke Huang, Jian Liang, Xianda Guo, Yiyang Fang, Guancheng Wan, Xuankun Rong, Chi Wen, Zekun Shi, Qingyun Li, Didi Zhu, Yanbiao Ma, Ke Liang, Bin Yang, He Li, Jiawei Shao, Mang Ye, Bo Du

Towards Alignment-Centric Paradigm: A Survey of Instruction Tuning in Large Language Models

Instruction tuning is a pivotal technique for aligning large language models (LLMs) with human intentions, safety constraints, and domain-specific requirements. This survey provides a comprehensive overview of the full pipeline,…

Computation and Language · Computer Science · 2025-11-20 · Xudong Han, Junjie Yang, Tianyang Wang, Ziqian Bi, Xinyuan Song, Junfeng Hao, Junhao Song