Related papers: InstructZero: Efficient Instruction Optimization f…

Evaluating Instruction-Tuned Large Language Models on Code Comprehension and Generation

In this work, we evaluate 10 open-source instructed LLMs on four representative code comprehension and generation tasks. We have the following main findings. First, for the zero-shot setting, instructed LLMs are very competitive on code…

Computation and Language · Computer Science 2023-08-03 Zhiqiang Yuan , Junwei Liu , Qiancheng Zi , Mingwei Liu , Xin Peng , Yiling Lou

Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models

Large language models (LLMs) can perform a wide range of tasks by following natural language instructions, without the necessity of task-specific fine-tuning. Unfortunately, the performance of LLMs is greatly influenced by the quality of…

Computation and Language · Computer Science 2023-10-23 Zhihan Zhang , Shuohang Wang , Wenhao Yu , Yichong Xu , Dan Iter , Qingkai Zeng , Yang Liu , Chenguang Zhu , Meng Jiang

Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers

Recent studies have demonstrated the great potential of Large Language Models (LLMs) serving as zero-shot relevance rankers. The typical approach involves making comparisons between pairs or lists of documents. Although effective, these…

Information Retrieval · Computer Science 2023-11-06 Weiwei Sun , Zheng Chen , Xinyu Ma , Lingyong Yan , Shuaiqiang Wang , Pengjie Ren , Zhumin Chen , Dawei Yin , Zhaochun Ren

Black-Box Prompt Optimization: Aligning Large Language Models without Model Training

Large language models (LLMs) have shown impressive success in various applications. However, these models are often not well aligned with human intents, which calls for additional treatments on them; that is, the alignment problem. To make…

Computation and Language · Computer Science 2024-06-24 Jiale Cheng , Xiao Liu , Kehan Zheng , Pei Ke , Hongning Wang , Yuxiao Dong , Jie Tang , Minlie Huang

PRESTO: Preimage-Informed Instruction Optimization for Prompting Black-Box LLMs

Large language models (LLMs) have achieved remarkable success across diverse domains, due to their strong instruction-following capabilities. This has led to increasing interest in optimizing instructions for black-box LLMs, whose internal…

Machine Learning · Computer Science 2025-10-31 Jaewon Chu , Seunghun Lee , Hyunwoo J. Kim

Evaluating the Zero-shot Robustness of Instruction-tuned Language Models

Instruction fine-tuning has recently emerged as a promising approach for improving the zero-shot capabilities of Large Language Models (LLMs) on new tasks. This technique has shown particular strength in improving the performance of…

Computation and Language · Computer Science 2023-07-13 Jiuding Sun , Chantal Shaib , Byron C. Wallace

VisLingInstruct: Elevating Zero-Shot Learning in Multi-Modal Language Models with Autonomous Instruction Optimization

This paper presents VisLingInstruct, a novel approach to advancing Multi-Modal Language Models (MMLMs) in zero-shot learning. Current MMLMs show impressive zero-shot abilities in multi-modal tasks, but their performance depends heavily on…

Artificial Intelligence · Computer Science 2024-06-21 Dongsheng Zhu , Xunzhu Tang , Weidong Han , Jinghui Lu , Yukun Zhao , Guoliang Xing , Junfeng Wang , Dawei Yin

InstructCMP: Length Control in Sentence Compression through Instruction-based Large Language Models

Extractive summarization can produce faithful summaries but often requires additional constraints such as a desired summary length. Traditional sentence compression models do not typically consider the constraints because of their…

Computation and Language · Computer Science 2024-06-21 Juseon-Do , Jingun Kwon , Hidetaka Kamigaito , Manabu Okumura

A Practical Survey on Zero-shot Prompt Design for In-context Learning

The remarkable advancements in large language models (LLMs) have brought about significant improvements in Natural Language Processing(NLP) tasks. This paper presents a comprehensive review of in-context learning techniques, focusing on…

Computation and Language · Computer Science 2023-09-26 Yinheng Li

DRO-InstructZero: Distributionally Robust Prompt Optimization for Large Language Models

Large language models are highly sensitive to prompt wording. However, popular automatic prompt search methods, including InstructZero, often degrade under distribution shift and adversarial evaluation because they optimize expected…

Machine Learning · Computer Science 2025-10-20 Yangyang Li

LLaMoCo: Instruction Tuning of Large Language Models for Optimization Code Generation

Recent research explores optimization using large language models (LLMs) by either iteratively seeking next-step solutions from LLMs or directly prompting LLMs for an optimizer. However, these approaches exhibit inherent limitations,…

Optimization and Control · Mathematics 2024-03-06 Zeyuan Ma , Hongshu Guo , Jiacheng Chen , Guojun Peng , Zhiguang Cao , Yining Ma , Yue-Jiao Gong

Large Language Models Are Zero-Shot Text Classifiers

Retrained large language models (LLMs) have become extensively used across various sub-disciplines of natural language processing (NLP). In NLP, text classification problems have garnered considerable focus, but still faced with some…

Computation and Language · Computer Science 2023-12-05 Zhiqiang Wang , Yiran Pang , Yanbin Lin

Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance

Large language models (LLMs) have revolutionized zero-shot task performance, mitigating the need for task-specific annotations while enhancing task generalizability. Despite its advancements, current methods using trigger phrases such as…

Computation and Language · Computer Science 2024-06-13 Saurabh Srivastava , Chengyue Huang , Weiguo Fan , Ziyu Yao

InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators

Instruction-based language modeling has received significant attention in pretrained language models. However, the efficiency of instruction engineering remains low and hinders the development of instruction studies. Recent studies have…

Computation and Language · Computer Science 2023-10-27 Heng Yang , Ke Li

Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives

Large language models (LLMs) have shown increasing power on various natural language processing (NLP) tasks. However, tuning these models for downstream tasks usually needs exorbitant costs or is unavailable due to commercial…

Computation and Language · Computer Science 2024-05-07 Qiushi Sun , Chengcheng Han , Nuo Chen , Renyu Zhu , Jingyang Gong , Xiang Li , Ming Gao

Large Language Models As Evolution Strategies

Large Transformer models are capable of implementing a plethora of so-called in-context learning algorithms. These include gradient descent, classification, sequence completion, transformation, and improvement. In this work, we investigate…

Artificial Intelligence · Computer Science 2024-02-29 Robert Tjarko Lange , Yingtao Tian , Yujin Tang

Instruction Learning Paradigms: A Dual Perspective on White-box and Black-box LLMs

Optimizing instructions for large language models (LLMs) is critical for harnessing their full potential in complex and diverse tasks. However, relying solely on white-box approaches demands extensive computational resources and offers…

Computation and Language · Computer Science 2025-06-30 Yanwei Ren , Liu Liu , Baosheng Yu , Jiayan Qiu , Quan Chen

Studying and Benchmarking Large Language Models For Log Level Suggestion

Large Language Models (LLMs) have become a focal point of research across various domains, including software engineering, where their capabilities are increasingly leveraged. Recent studies have explored the integration of LLMs into…

Software Engineering · Computer Science 2024-10-14 Yi Wen Heng , Zeyang Ma , Zhenhao Li , Dong Jae Kim , Tse-Hsun , Chen

Language Models as Black-Box Optimizers for Vision-Language Models

Vision-language models (VLMs) pre-trained on web-scale datasets have demonstrated remarkable capabilities on downstream tasks when fine-tuned with minimal data. However, many VLMs rely on proprietary data and are not open-source, which…

Computation and Language · Computer Science 2024-05-15 Shihong Liu , Zhiqiu Lin , Samuel Yu , Ryan Lee , Tiffany Ling , Deepak Pathak , Deva Ramanan

Curate-Train-Refine: A Closed-Loop Agentic Framework for Zero Shot Classification

Large language models (LLMs) and high-capacity encoders have advanced zero and few-shot classification, but their inference cost and latency limit practical deployment. We propose training lightweight text classifiers using dynamically…

Computation and Language · Computer Science 2026-01-26 Gaurav Maheshwari , Kevin El Haddad