Related papers: Differentiable Instruction Optimization for Cross-…

Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks

Instruction tuning (IT) achieves impressive zero-shot generalization results by training large language models (LLMs) on a massive amount of diverse tasks with instructions. However, how to select new tasks to improve the performance and…

Computation and Language · Computer Science · 2023-11-02 · Po-Nien Kung, Fan Yin, Di Wu, Kai-Wei Chang, Nanyun Peng

Instruction Diversity Drives Generalization To Unseen Tasks

Instruction tuning -- fine-tuning a large language model (LLM) on pairs of instructions and desired outcomes -- is an approach that enables pre-trained language models to perform real-world tasks and follow human instructions. Its practical…

Computation and Language · Computer Science · 2024-02-19 · Dylan Zhang, Justin Wang, Francois Charton

Improving Cross-Task Generalization with Step-by-Step Instructions

Instruction tuning has been shown to be able to improve cross-task generalization of language models. However, it is still challenging for language models to complete the target tasks following the instructions, as the instructions are…

Computation and Language · Computer Science · 2023-05-09 · Yang Wu, Yanyan Zhao, Zhongyang Li, Bing Qin, Kai Xiong

From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers

Instruction tuning -- tuning large language models on instruction-output pairs -- is a promising technique for making models better adapted to the real world. Yet, the key factors driving the model's capability to understand and follow…

Computation and Language · Computer Science · 2024-06-03 · Dylan Zhang, Justin Wang, Francois Charton

Specialist or Generalist? Instruction Tuning for Specific NLP Tasks

The potential of large language models (LLMs) to simultaneously perform a wide range of natural language processing (NLP) tasks has been the subject of extensive research. Although instruction tuning has proven to be a data-efficient method…

Computation and Language · Computer Science · 2023-10-25 · Chufan Shi, Yixuan Su, Cheng Yang, Yujiu Yang, Deng Cai

Instruction Matters: A Simple yet Effective Task Selection for Optimized Instruction Tuning of Specific Tasks

Instruction tuning has been proven effective in enhancing zero-shot generalization across various tasks and in improving the performance of specific tasks. For task-specific improvements, strategically selecting and training on related…

Computation and Language · Computer Science · 2024-10-18 · Changho Lee, Janghoon Han, Seonghyeon Ye, Stanley Jungkyu Choi, Honglak Lee, Kyunghoon Bae

In Defense of the Unitary Scalarization for Deep Multi-Task Learning

Recent multi-task learning research argues against unitary scalarization, where training simply minimizes the sum of the task losses. Several ad-hoc multi-task optimization algorithms have instead been proposed, inspired by various…

Machine Learning · Computer Science · 2023-03-10 · Vitaly Kurin, Alessandro De Palma, Ilya Kostrikov, Shimon Whiteson, M. Pawan Kumar

Robustness of Learning from Task Instructions

Traditional supervised learning mostly works on individual tasks and requires training on a large set of task-specific examples. This paradigm seriously hinders the development of task generalization since preparing a task-specific example…

Computation and Language · Computer Science · 2023-05-24 · Jiasheng Gu, Hongyu Zhao, Hanzi Xu, Liangyu Nie, Hongyuan Mei, Wenpeng Yin

Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning

Instruction tuning has emerged as a powerful technique, significantly boosting zero-shot performance on unseen tasks. While recent work has explored cross-lingual generalization by applying instruction tuning to multilingual models,…

Computation and Language · Computer Science · 2024-06-14 · Janghoon Han, Changho Lee, Joongbo Shin, Stanley Jungkyu Choi, Honglak Lee, Kyunghoon Bae

Curriculum Learning of Multiple Tasks

Sharing information between multiple tasks enables algorithms to achieve good generalization performance even from small amounts of training data. However, in a realistic scenario of multi-task learning not all tasks are equally related to…

Machine Learning · Statistics · 2014-12-04 · Anastasia Pentina, Viktoriia Sharmanska, Christoph H. Lampert

MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning

Instruction tuning, a new learning paradigm that fine-tunes pre-trained language models on tasks specified through instructions, has shown promising zero-shot performance on various natural language processing tasks. However, it has yet to…

Computation and Language · Computer Science · 2023-06-13 · Zhiyang Xu, Ying Shen, Lifu Huang

Leveraging convergence behavior to balance conflicting tasks in multi-task learning

Multi-Task Learning is a learning paradigm that uses correlated tasks to improve performance generalization. A common way to learn multiple tasks is through the hard parameter sharing approach, in which a single architecture is used to…

Machine Learning · Computer Science · 2022-04-15 · Angelica Tiemi Mizuno Nakamura, Denis Fernando Wolf, Valdir Grassi

Cross-Task Generalization via Natural Language Crowdsourcing Instructions

Humans (e.g., crowdworkers) have a remarkable ability in solving different tasks, by simply reading textual instructions that define them and looking at a few examples. Despite the success of the conventional supervised learning on…

Computation and Language · Computer Science · 2022-03-15 · Swaroop Mishra, Daniel Khashabi, Chitta Baral, Hannaneh Hajishirzi

InstructEdit: Instruction-based Knowledge Editing for Large Language Models

Knowledge editing for large language models can offer an efficient solution to alter a model's behavior without negatively impacting the overall performance. However, the current approaches encounter issues with limited generalizability…

Computation and Language · Computer Science · 2024-04-30 · Ningyu Zhang, Bozhong Tian, Siyuan Cheng, Xiaozhuan Liang, Yi Hu, Kouying Xue, Yanjie Gou, Xi Chen, Huajun Chen

Tasks, stability, architecture, and compute: Training more effective learned optimizers, and using them to train themselves

Much as replacing hand-designed features with learned functions has revolutionized how we solve perceptual tasks, we believe learned algorithms will transform how we train models. In this work we focus on general-purpose learned optimizers…

Machine Learning · Computer Science · 2020-09-24 · Luke Metz, Niru Maheswaranathan, C. Daniel Freeman, Ben Poole, Jascha Sohl-Dickstein

Demystifying Instruction Mixing for Fine-tuning Large Language Models

Instruction tuning significantly enhances the performance of large language models (LLMs) across various tasks. However, the procedure for optimizing the mixing of instruction datasets for LLM fine-tuning is still poorly understood. This…

Computation and Language · Computer Science · 2024-02-20 · Renxi Wang, Haonan Li, Minghao Wu, Yuxia Wang, Xudong Han, Chiyu Zhang, Timothy Baldwin

Exploring the Benefits of Training Expert Language Models over Instruction Tuning

Recently, Language Models (LMs) instruction-tuned on multiple tasks, also known as multitask-prompted fine-tuning (MT), have shown the capability to generalize to unseen tasks. Previous work has shown that scaling the number of training…

Computation and Language · Computer Science · 2023-02-10 · Joel Jang, Seungone Kim, Seonghyeon Ye, Doyoung Kim, Lajanugen Logeswaran, Moontae Lee, Kyungjae Lee, Minjoon Seo

Linguistically-Informed Multilingual Instruction Tuning: Is There an Optimal Set of Languages to Tune?

Multilingual language models often perform unevenly across different languages due to limited generalization capabilities for some languages. This issue is significant because of the growing interest in making universal language models that…

Computation and Language · Computer Science · 2024-10-11 · Gürkan Soykan, Gözde Gül Şahin

Continual Instruction Tuning for Large Multimodal Models

Instruction tuning is now a widely adopted approach to aligning large multimodal models (LMMs) to follow human intent. It unifies the data format of vision-language tasks, enabling multi-task joint training. However, vision-language tasks…

Machine Learning · Computer Science · 2023-11-29 · Jinghan He, Haiyun Guo, Ming Tang, Jinqiao Wang

Visual Instruction Tuning towards General-Purpose Multimodal Model: A Survey

Traditional computer vision generally solves each task independently with a dedicated model, with the task instruction implicitly designed into the model architecture. This gives rise to two limitations: (1) it leads to task-specific models, which…

Computer Vision and Pattern Recognition · Computer Science · 2023-12-29 · Jiaxing Huang, Jingyi Zhang, Kai Jiang, Han Qiu, Shijian Lu