Related papers: AIR: Complex Instruction Generation via Automatic …

Automated Instruction Revision (AIR): A Structured Comparison of Task Adaptation Strategies for LLM

This paper studies Automated Instruction Revision (AIR), a rule-induction-based method for adapting large language models (LLMs) to downstream tasks using limited task-specific examples. We position AIR within the broader landscape of…

Computation and Language · Computer Science 2026-04-13 Solomiia Bilyk , Volodymyr Getmanskyi , Taras Firman

Automated Image Recognition Framework

While the efficacy of deep learning models heavily relies on data, gathering and annotating data for specific tasks, particularly when addressing novel or sensitive subjects lacking relevant datasets, poses significant time and resource…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Quang-Binh Nguyen , Trong-Vu Hoang , Ngoc-Do Tran , Tam V. Nguyen , Minh-Triet Tran , Trung-Nghia Le

Self-Review Framework for Enhancing Instruction Following Capability of LLM

Various techniques have been proposed to improve large language models (LLMs) adherence to formatting and instruction constraints. One of the most effective approaches involves utilizing high-quality data generated by powerful models.…

Computation and Language · Computer Science 2025-07-09 Sihyun Park

FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions

Modern Language Models (LMs) are capable of following long and complex instructions that enable a large and diverse set of user requests. While Information Retrieval (IR) models use these LMs as the backbone of their architectures,…

Information Retrieval · Computer Science 2024-05-08 Orion Weller , Benjamin Chang , Sean MacAvaney , Kyle Lo , Arman Cohan , Benjamin Van Durme , Dawn Lawrie , Luca Soldaini

Improved Iterative Refinement for Chart-to-Code Generation via Structured Instruction

Recently, multimodal large language models (MLLMs) have attracted increasing research attention due to their powerful visual understanding capabilities. While they have achieved impressive results on various vision tasks, their performance…

Computer Vision and Pattern Recognition · Computer Science 2026-03-18 Chengzhi Xu , Yuyang Wang , Lai Wei , Lichao Sun , Weiran Huang

Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning

Continual instruction tuning enables large language models (LLMs) to learn incrementally while retaining past knowledge, whereas existing methods primarily focus on how to retain old knowledge rather than on selecting which new knowledge to…

Computation and Language · Computer Science 2025-03-21 Peiyi Lin , Fukai Zhang , Kai Niu , Hao Fu

LLM-driven Constrained Copy Generation through Iterative Refinement

Crafting a marketing message (copy), or copywriting is a challenging generation task, as the copy must adhere to various constraints. Copy creation is inherently iterative for humans, starting with an initial draft followed by successive…

Computation and Language · Computer Science 2025-04-15 Varun Vasudevan , Faezeh Akhavizadegan , Abhinav Prakash , Yokila Arora , Jason Cho , Tanya Mendiratta , Sushant Kumar , Kannan Achan

From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models

It is imperative for Large language models (LLMs) to follow instructions with elaborate requirements (i.e. Complex Instructions Following). Yet, it remains under-explored how to enhance the ability of LLMs to follow complex instructions…

Computation and Language · Computer Science 2024-06-19 Qianyu He , Jie Zeng , Qianxi He , Jiaqing Liang , Yanghua Xiao

Improving the Robustness of Large Language Models via Consistency Alignment

Large language models (LLMs) have shown tremendous success in following user instructions and generating helpful responses. Nevertheless, their robustness is still far from optimal, as they may generate significantly inconsistent responses…

Computation and Language · Computer Science 2024-03-25 Yukun Zhao , Lingyong Yan , Weiwei Sun , Guoliang Xing , Shuaiqiang Wang , Chong Meng , Zhicong Cheng , Zhaochun Ren , Dawei Yin

A New Pipeline For Generating Instruction Dataset via RAG and Self Fine-Tuning

With the rapid development of large language models in recent years, there has been an increasing demand for domain-specific Agents that can cater to the unique needs of enterprises and organizations. Unlike general models, which strive for…

Computation and Language · Computer Science 2024-08-13 Chih-Wei Song , Yu-Kai Lee , Yin-Te Tsai

Toward General Instruction-Following Alignment for Retrieval-Augmented Generation

Following natural instructions is crucial for the effective application of Retrieval-Augmented Generation (RAG) systems. Despite recent advancements in Large Language Models (LLMs), research on assessing and improving instruction-following…

Computation and Language · Computer Science 2024-10-15 Guanting Dong , Xiaoshuai Song , Yutao Zhu , Runqi Qiao , Zhicheng Dou , Ji-Rong Wen

PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation

Pre-trained Transformers have enabled impressive breakthroughs in generating long and fluent text, yet their outputs are often "rambling" without coherently arranged content. In this work, we present a novel content-controlled text…

Computation and Language · Computer Science 2020-10-07 Xinyu Hua , Lu Wang

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

Preference learning is critical for aligning large language models (LLMs) with human values, yet its success hinges on high-quality datasets comprising three core components: Preference \textbf{A}nnotations, \textbf{I}nstructions, and…

Computation and Language · Computer Science 2025-09-03 Bingxiang He , Wenbin Zhang , Jiaxi Song , Cheng Qian , Zixuan Fu , Bowen Sun , Ning Ding , Haiwen Hong , Longtao Huang , Hui Xue , Ganqu Cui , Wanxiang Che , Zhiyuan Liu , Maosong Sun

ConAIR:Consistency-Augmented Iterative Interaction Framework to Enhance the Reliability of Code Generation

Code generation techniques generate code snippets automatically based on the problem requirements in natural language. Recently, large language models (LLMs) achieve the SOTA performance on code generation. However, LLMs still struggle at…

Software Engineering · Computer Science 2024-11-26 Jinhao Dong , Jun Sun , Wenjie Zhang , Jin Song Dong , Dan Hao

IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Experts

Large language model (LLM) agents have emerged as a promising solution to automate the workflow of machine learning, but most existing methods share a common limitation: they attempt to optimize entire pipelines in a single step before…

Computer Vision and Pattern Recognition · Computer Science 2025-09-17 Eric Xue , Ke Chen , Zeyi Huang , Yuyang Ji , Haohan Wang

Constraint Back-translation Improves Complex Instruction Following of Large Language Models

Large language models (LLMs) struggle to follow instructions with complex constraints in format, length, etc. Following the conventional instruction-tuning practice, previous works conduct post-training on complex instruction-response pairs…

Computation and Language · Computer Science 2025-04-30 Yunjia Qi , Hao Peng , Xiaozhi Wang , Bin Xu , Lei Hou , Juanzi Li

Automated Data Curation for Robust Language Model Fine-Tuning

Large Language Models have become the de facto approach to sequence-to-sequence text generation tasks, but for specialized tasks/domains, a pretrained LLM lacks specific capabilities to produce accurate or well-formatted responses.…

Computation and Language · Computer Science 2024-03-20 Jiuhai Chen , Jonas Mueller

Ada-Instruct: Adapting Instruction Generators for Complex Reasoning

Instructions augmentation is a crucial step for unleashing the full potential of large language models (LLMs) in downstream tasks. Existing Self-Instruct methods primarily simulate new instructions from a few initial instructions with…

Computation and Language · Computer Science 2024-10-04 Wanyun Cui , Qianle Wang

EasyInstruct: An Easy-to-use Instruction Processing Framework for Large Language Models

In recent years, instruction tuning has gained increasing attention and emerged as a crucial technique to enhance the capabilities of Large Language Models (LLMs). To construct high-quality instruction datasets, many instruction processing…

Computation and Language · Computer Science 2024-06-25 Yixin Ou , Ningyu Zhang , Honghao Gui , Ziwen Xu , Shuofei Qiao , Yida Xue , Runnan Fang , Kangwei Liu , Lei Li , Zhen Bi , Guozhou Zheng , Huajun Chen

Retrieval-Augmented Instruction Tuning for Automated Process Engineering Calculations : A Tool-Chaining Problem-Solving Framework with Attributable Reflection

The current technology landscape lacks a foundational AI model for solving process engineering calculations. In this work, we introduce a novel autonomous agent framework leveraging Retrieval-Augmented Instruction-Tuning (RAIT) to enhance…

Software Engineering · Computer Science 2024-08-29 Sagar Srinivas Sakhinana , Geethan Sannidhi , Venkataramana Runkana