English
Related papers

Related papers: InstructZero: Efficient Instruction Optimization f…

200 papers

In this work, we evaluate 10 open-source instructed LLMs on four representative code comprehension and generation tasks. We have the following main findings. First, for the zero-shot setting, instructed LLMs are very competitive on code…

Computation and Language · Computer Science 2023-08-03 Zhiqiang Yuan , Junwei Liu , Qiancheng Zi , Mingwei Liu , Xin Peng , Yiling Lou

Large language models (LLMs) can perform a wide range of tasks by following natural language instructions, without the necessity of task-specific fine-tuning. Unfortunately, the performance of LLMs is greatly influenced by the quality of…

Computation and Language · Computer Science 2023-10-23 Zhihan Zhang , Shuohang Wang , Wenhao Yu , Yichong Xu , Dan Iter , Qingkai Zeng , Yang Liu , Chenguang Zhu , Meng Jiang

Recent studies have demonstrated the great potential of Large Language Models (LLMs) serving as zero-shot relevance rankers. The typical approach involves making comparisons between pairs or lists of documents. Although effective, these…

Information Retrieval · Computer Science 2023-11-06 Weiwei Sun , Zheng Chen , Xinyu Ma , Lingyong Yan , Shuaiqiang Wang , Pengjie Ren , Zhumin Chen , Dawei Yin , Zhaochun Ren

Large language models (LLMs) have shown impressive success in various applications. However, these models are often not well aligned with human intents, which calls for additional treatments on them; that is, the alignment problem. To make…

Computation and Language · Computer Science 2024-06-24 Jiale Cheng , Xiao Liu , Kehan Zheng , Pei Ke , Hongning Wang , Yuxiao Dong , Jie Tang , Minlie Huang

Large language models (LLMs) have achieved remarkable success across diverse domains, due to their strong instruction-following capabilities. This has led to increasing interest in optimizing instructions for black-box LLMs, whose internal…

Machine Learning · Computer Science 2025-10-31 Jaewon Chu , Seunghun Lee , Hyunwoo J. Kim

Instruction fine-tuning has recently emerged as a promising approach for improving the zero-shot capabilities of Large Language Models (LLMs) on new tasks. This technique has shown particular strength in improving the performance of…

Computation and Language · Computer Science 2023-07-13 Jiuding Sun , Chantal Shaib , Byron C. Wallace

This paper presents VisLingInstruct, a novel approach to advancing Multi-Modal Language Models (MMLMs) in zero-shot learning. Current MMLMs show impressive zero-shot abilities in multi-modal tasks, but their performance depends heavily on…

Artificial Intelligence · Computer Science 2024-06-21 Dongsheng Zhu , Xunzhu Tang , Weidong Han , Jinghui Lu , Yukun Zhao , Guoliang Xing , Junfeng Wang , Dawei Yin

Extractive summarization can produce faithful summaries but often requires additional constraints such as a desired summary length. Traditional sentence compression models do not typically consider the constraints because of their…

Computation and Language · Computer Science 2024-06-21 Juseon-Do , Jingun Kwon , Hidetaka Kamigaito , Manabu Okumura

The remarkable advancements in large language models (LLMs) have brought about significant improvements in Natural Language Processing(NLP) tasks. This paper presents a comprehensive review of in-context learning techniques, focusing on…

Computation and Language · Computer Science 2023-09-26 Yinheng Li

Large language models are highly sensitive to prompt wording. However, popular automatic prompt search methods, including InstructZero, often degrade under distribution shift and adversarial evaluation because they optimize expected…

Machine Learning · Computer Science 2025-10-20 Yangyang Li

Recent research explores optimization using large language models (LLMs) by either iteratively seeking next-step solutions from LLMs or directly prompting LLMs for an optimizer. However, these approaches exhibit inherent limitations,…

Optimization and Control · Mathematics 2024-03-06 Zeyuan Ma , Hongshu Guo , Jiacheng Chen , Guojun Peng , Zhiguang Cao , Yining Ma , Yue-Jiao Gong

Retrained large language models (LLMs) have become extensively used across various sub-disciplines of natural language processing (NLP). In NLP, text classification problems have garnered considerable focus, but still faced with some…

Computation and Language · Computer Science 2023-12-05 Zhiqiang Wang , Yiran Pang , Yanbin Lin

Large language models (LLMs) have revolutionized zero-shot task performance, mitigating the need for task-specific annotations while enhancing task generalizability. Despite its advancements, current methods using trigger phrases such as…

Computation and Language · Computer Science 2024-06-13 Saurabh Srivastava , Chengyue Huang , Weiguo Fan , Ziyu Yao

Instruction-based language modeling has received significant attention in pretrained language models. However, the efficiency of instruction engineering remains low and hinders the development of instruction studies. Recent studies have…

Computation and Language · Computer Science 2023-10-27 Heng Yang , Ke Li

Large language models (LLMs) have shown increasing power on various natural language processing (NLP) tasks. However, tuning these models for downstream tasks usually needs exorbitant costs or is unavailable due to commercial…

Computation and Language · Computer Science 2024-05-07 Qiushi Sun , Chengcheng Han , Nuo Chen , Renyu Zhu , Jingyang Gong , Xiang Li , Ming Gao

Large Transformer models are capable of implementing a plethora of so-called in-context learning algorithms. These include gradient descent, classification, sequence completion, transformation, and improvement. In this work, we investigate…

Artificial Intelligence · Computer Science 2024-02-29 Robert Tjarko Lange , Yingtao Tian , Yujin Tang

Optimizing instructions for large language models (LLMs) is critical for harnessing their full potential in complex and diverse tasks. However, relying solely on white-box approaches demands extensive computational resources and offers…

Computation and Language · Computer Science 2025-06-30 Yanwei Ren , Liu Liu , Baosheng Yu , Jiayan Qiu , Quan Chen

Large Language Models (LLMs) have become a focal point of research across various domains, including software engineering, where their capabilities are increasingly leveraged. Recent studies have explored the integration of LLMs into…

Software Engineering · Computer Science 2024-10-14 Yi Wen Heng , Zeyang Ma , Zhenhao Li , Dong Jae Kim , Tse-Hsun , Chen

Vision-language models (VLMs) pre-trained on web-scale datasets have demonstrated remarkable capabilities on downstream tasks when fine-tuned with minimal data. However, many VLMs rely on proprietary data and are not open-source, which…

Computation and Language · Computer Science 2024-05-15 Shihong Liu , Zhiqiu Lin , Samuel Yu , Ryan Lee , Tiffany Ling , Deepak Pathak , Deva Ramanan

Large language models (LLMs) and high-capacity encoders have advanced zero and few-shot classification, but their inference cost and latency limit practical deployment. We propose training lightweight text classifiers using dynamically…

Computation and Language · Computer Science 2026-01-26 Gaurav Maheshwari , Kevin El Haddad
‹ Prev 1 2 3 10 Next ›