Related papers: Functionality learning through specification instr…

Large Language Model Instruction Following: A Survey of Progresses and Challenges

Task semantics can be expressed by a set of input-output examples or a piece of textual instruction. Conventional machine learning approaches for natural language processing (NLP) mainly rely on the availability of large-scale sets of…

Computation and Language · Computer Science 2024-05-28 Renze Lou , Kai Zhang , Wenpeng Yin

Instruction Induction: From Few Examples to Natural Language Task Descriptions

Large language models are able to perform a task by conditioning on a few input-output demonstrations - a paradigm known as in-context learning. We show that language models can explicitly infer an underlying task from a few demonstrations…

Computation and Language · Computer Science 2022-05-24 Or Honovich , Uri Shaham , Samuel R. Bowman , Omer Levy

Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning

Large language models (LLMs) have shown impressive performance in following natural language instructions to solve unseen tasks. However, it remains unclear whether models truly understand task definitions and whether the human-written…

Computation and Language · Computer Science 2023-06-05 Fan Yin , Jesse Vig , Philippe Laban , Shafiq Joty , Caiming Xiong , Chien-Sheng Jason Wu

Third-Party Language Model Performance Prediction from Instruction

Language model-based instruction-following systems have lately shown increasing performance on many benchmark tasks, demonstrating the capability of adapting to a broad variety of instructions. However, such systems are often not designed…

Computation and Language · Computer Science 2024-03-20 Rahul Nadkarni , Yizhong Wang , Noah A. Smith

Guiding Policies with Language via Meta-Learning

Behavioral skills or policies for autonomous agents are conventionally learned from reward functions, via reinforcement learning, or from demonstrations, via imitation learning. However, both modes of task specification have their…

Machine Learning · Computer Science 2019-01-30 John D. Co-Reyes , Abhishek Gupta , Suvansh Sanjeev , Nick Altieri , Jacob Andreas , John DeNero , Pieter Abbeel , Sergey Levine

Recommendation as Instruction Following: A Large Language Model Empowered Recommendation Approach

In the past decades, recommender systems have attracted much attention in both research and industry communities, and a large number of studies have been devoted to developing effective recommendation models. Basically speaking, these…

Information Retrieval · Computer Science 2023-05-12 Junjie Zhang , Ruobing Xie , Yupeng Hou , Wayne Xin Zhao , Leyu Lin , Ji-Rong Wen

Open (Clinical) LLMs are Sensitive to Instruction Phrasings

Instruction-tuned Large Language Models (LLMs) can perform a wide range of tasks given natural language instructions to do so, but they are sensitive to how such instructions are phrased. This issue is especially concerning in healthcare,…

Computation and Language · Computer Science 2024-07-15 Alberto Mario Ceballos Arroyo , Monica Munnangi , Jiuding Sun , Karen Y. C. Zhang , Denis Jered McInerney , Byron C. Wallace , Silvio Amir

Semantic Guidance and Feedback for the Construction of Specifications and Implementations

The problem of writing a specification which accurately reflects the intent of the developer has long been recognized as fundamental. We propose a method and a supporting tool to write and check a specification and an implementation using a…

Software Engineering · Computer Science 2013-05-20 Paul C Attie , Fadi A Zaraket , Mohammad Fawaz , Mohammad Noureddine

Evaluating the Zero-shot Robustness of Instruction-tuned Language Models

Instruction fine-tuning has recently emerged as a promising approach for improving the zero-shot capabilities of Large Language Models (LLMs) on new tasks. This technique has shown particular strength in improving the performance of…

Computation and Language · Computer Science 2023-07-13 Jiuding Sun , Chantal Shaib , Byron C. Wallace

Can Large Language Models Understand Real-World Complex Instructions?

Large language models (LLMs) can understand human instructions, showing their potential for pragmatic applications beyond traditional NLP tasks. However, they still struggle with complex instructions, which can be either complex task…

Computation and Language · Computer Science 2024-01-09 Qianyu He , Jie Zeng , Wenhao Huang , Lina Chen , Jin Xiao , Qianxi He , Xunzhe Zhou , Lida Chen , Xintao Wang , Yuncheng Huang , Haoning Ye , Zihan Li , Shisong Chen , Yikai Zhang , Zhouhong Gu , Jiaqing Liang , Yanghua Xiao

Your Instructions Are Not Always Helpful: Assessing the Efficacy of Instruction Fine-tuning for Software Vulnerability Detection

Software, while beneficial, poses potential cybersecurity risks due to inherent vulnerabilities. Detecting these vulnerabilities is crucial, and deep learning has shown promise as an effective tool for this task due to its ability to…

Software Engineering · Computer Science 2024-01-17 Imam Nur Bani Yusuf , Lingxiao Jiang

The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models

Following multiple instructions is a crucial ability for large language models (LLMs). Evaluating this ability comes with significant challenges: (i) limited coherence between multiple instructions, (ii) positional bias where the order of…

Computation and Language · Computer Science 2025-12-12 Xinyi Chen , Baohao Liao , Jirui Qi , Panagiotis Eustratiadis , Christof Monz , Arianna Bisazza , Maarten de Rijke

Improving students' code correctness and test completeness by informal specifications

The quality of software produced by students is often poor. How to teach students to develop good quality software has long been a topic in computer science education and research. We must conclude that we still do not have a good answer to…

Software Engineering · Computer Science 2023-09-06 Arno Broeders , Ruud Hermans , Sylvia Stuurman , Lex Bijlsma , Harrie Passier

SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning

Large language models (LLMs) hold the promise of solving diverse tasks when provided with appropriate natural language prompts. However, prompting often leads models to make predictions with lower accuracy compared to finetuning a model…

Computation and Language · Computer Science 2024-08-13 Chenyang Zhao , Xueying Jia , Vijay Viswanathan , Tongshuang Wu , Graham Neubig

Task--Specificity Score: Measuring How Much Instructions Really Matter for Supervision

Instruction tuning is now the default way to train and adapt large language models, but many instruction--input--output pairs are only weakly specified: for a given input, the same output can remain plausible under several alternative…

Computation and Language · Computer Science 2026-02-04 Pritam Kadasi , Abhishek Upperwal , Mayank Singh

Capability Instruction Tuning: A New Paradigm for Dynamic LLM Routing

Large Language Models (LLMs) have demonstrated human-like instruction-following abilities, particularly those exceeding 100 billion parameters. The combined capability of some smaller, resource-friendly LLMs can address most of the…

Computation and Language · Computer Science 2025-02-25 Yi-Kai Zhang , De-Chuan Zhan , Han-Jia Ye

Prompting for Performance: Exploring LLMs for Configuring Software

Software systems usually provide numerous configuration options that can affect performance metrics such as execution time, memory usage, binary size, or bitrate. On the one hand, making informed decisions is challenging and requires domain…

Software Engineering · Computer Science 2025-09-24 Helge Spieker , Théo Matricon , Nassim Belmecheri , Jørn Eirik Betten , Gauthier Le Bartz Lyan , Heraldo Borges , Quentin Mazouni , Dennis Gross , Arnaud Gotlieb , Mathieu Acher

From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models

It is imperative for Large language models (LLMs) to follow instructions with elaborate requirements (i.e. Complex Instructions Following). Yet, it remains under-explored how to enhance the ability of LLMs to follow complex instructions…

Computation and Language · Computer Science 2024-06-19 Qianyu He , Jie Zeng , Qianxi He , Jiaqing Liang , Yanghua Xiao

When Prompts Go Wrong: Evaluating Code Model Robustness to Ambiguous, Contradictory, and Incomplete Task Descriptions

Large Language Models (LLMs) have demonstrated impressive performance in code generation tasks under idealized conditions, where task descriptions are clear and precise. However, in practice, task descriptions frequently exhibit ambiguity,…

Software Engineering · Computer Science 2025-07-29 Maya Larbi , Amal Akli , Mike Papadakis , Rihab Bouyousfi , Maxime Cordy , Federica Sarro , Yves Le Traon

A Composable Specification Language for Reinforcement Learning Tasks

Reinforcement learning is a promising approach for learning control policies for robot tasks. However, specifying complex tasks (e.g., with multiple objectives and safety constraints) can be challenging, since the user must design a reward…

Machine Learning · Computer Science 2020-10-30 Kishor Jothimurugan , Rajeev Alur , Osbert Bastani