Related papers: Not All Tokens Matter: Data-Centric Optimization f…

200 papers

Instruction-tuned Language Models (ILMs) have become essential components of modern AI systems, demonstrating exceptional versatility across natural language and reasoning tasks. Among their most impactful applications is code generation,…

Software Engineering · Computer Science 2026-02-18 Zaiyu Cheng, Antonio Mastropaolo

Software documentation is essential for program comprehension, developer onboarding, code review, and long-term maintenance. Yet producing quality documentation manually is time-consuming and frequently yields incomplete or inconsistent…

Software Engineering · Computer Science 2026-04-20 Afia Farjana, Zaiyu Cheng, Antonio Mastropaolo

Prompting language models (LMs) with training examples and task descriptions has been seen as critical to recent successes in few-shot learning. In this work, we show that finetuning LMs in the few-shot setting can considerably reduce the…

Computation and Language · Computer Science 2021-07-02 Robert L. Logan, Ivana Balažević, Eric Wallace, Fabio Petroni, Sameer Singh, Sebastian Riedel

Large Language Models (LLMs) have shown remarkable capabilities in manipulating natural language across multiple applications, but their ability to handle simple reasoning tasks is often questioned. In this work, we aim to provide a…

Computation and Language · Computer Science 2025-05-05 Alessandro Raganato, Rafael Peñaloza, Marco Viviani, Gabriella Pasi

Large language models (LLMs) have made significant strides at code generation through improved model design, training, and chain-of-thought. However, prompt-level optimizations remain an important yet under-explored aspect of LLMs for…

Software Engineering · Computer Science 2024-12-05 Derek Xu, Tong Xie, Botao Xia, Haoyu Li, Yunsheng Bai, Yizhou Sun, Wei Wang

Large Language Models (LLMs) are increasingly used by software engineers for code generation. However, limitations of LLMs such as irrelevant or incorrect code have highlighted the need for prompt programming (or prompt engineering) where…

Software Engineering · Computer Science 2025-07-09 Ranim Khojah, Francisco Gomes de Oliveira Neto, Mazen Mohamad, Philipp Leitner

Requirements classification assigns natural language requirements to predefined classes, such as functional and non-functional. Accurate classification reduces risk and improves software quality. Most existing models rely on supervised…

Software Engineering · Computer Science 2025-09-18 Manal Binkhonain, Reem Alfayaz

Large language models (LLMs) are increasingly adopted in educational technologies for a variety of tasks, from generating instructional materials and assisting with assessment design to tutoring. While prior work has investigated how models…

Computation and Language · Computer Science 2025-12-24 Kirk Vanacore, Rene F. Kizilcec

Real-world applications of large language models (LLMs) in computational social science (CSS) tasks primarily depend on the effectiveness of instruction tuning (IT) or in-context learning (ICL). While IT has been shown to be highly effective at…

Computation and Language · Computer Science 2024-09-24 Taihang Wang, Xiaoman Xu, Yimin Wang, Ye Jiang

The capabilities of Large Language Models (LLMs) in code generation have been extensively studied, particularly for implementing target functionalities from natural-language descriptions. Alternatively, input-output (I/O) examples provide…

Software Engineering · Computer Science 2025-05-13 Yingjie Fu, Bozhou Li, Linyi Li, Wentao Zhang, Tao Xie

Due to their architecture and vast pre-training data, large language models (LLMs) demonstrate strong text classification performance. However, LLM output - here, the category assigned to a text - depends heavily on the wording of the…

Computation and Language · Computer Science 2025-12-04 Kylie L. Anglin, Stephanie Milan, Brittney Hernandez, Claudia Ventura

Pre-trained Language Models (PLMs) have achieved remarkable performance for various language understanding tasks in IR systems, which require the fine-tuning process based on labeled training data. For low-resource scenarios, prompt-based…

Computation and Language · Computer Science 2022-04-04 Ziyun Xu, Chengyu Wang, Minghui Qiu, Fuli Luo, Runxin Xu, Songfang Huang, Jun Huang

Large language models (LLMs) are widely used as zero-shot and few-shot classifiers, where task behaviour is largely controlled through prompting. A growing number of works have observed that LLMs are sensitive to prompt variations, with…

Computation and Language · Computer Science 2026-02-05 Branislav Pecher, Michal Spiegel, Robert Belanec, Jan Cegin

Large Language Models (LLMs) are nowadays extensively used for various types of software engineering tasks, primarily code generation. Previous research has shown how suitable prompt engineering could help developers in improving their code…

Code snippet adaptation is a fundamental activity in the software development process. Unlike code generation, code snippet adaptation is not a "free creation", which requires developers to tailor a given code snippet in order to fit…

Software Engineering · Computer Science 2024-11-26 Tanghaoran Zhang, Yue Yu, Xinjun Mao, Shangwen Wang, Kang Yang, Yao Lu, Zhang Zhang, Yuxin Zhao

The use of Large Language Models (LLMs) for program code generation has gained substantial attention, but their biases and limitations with non-English prompts challenge global inclusivity. This paper investigates the complexities of…

Computation and Language · Computer Science 2025-05-13 Mingda Li, Abhijit Mishra, Utkarsh Mujumdar

Large language models (LLMs) have scaled up to unlock a wide range of complex reasoning tasks with the aid of various prompting methods. However, current prompting methods generate natural language intermediate steps to help reasoning,…

Computation and Language · Computer Science 2023-10-10 Yi Hu, Haotong Yang, Zhouchen Lin, Muhan Zhang

The remarkable advancements in large language models (LLMs) have brought about significant improvements in Natural Language Processing (NLP) tasks. This paper presents a comprehensive review of in-context learning techniques, focusing on…

Computation and Language · Computer Science 2023-09-26 Yinheng Li

Large Language Models (LLMs) with vast context windows offer new avenues for in-context learning (ICL), where providing many examples ("many-shot" prompting) is often assumed to enhance performance. We investigate this assumption for the…

Software Engineering · Computer Science 2025-12-10 Amirkia Rafiei Oskooei, Kaan Baturalp Cosdan, Husamettin Isiktas, Mehmet S. Aktas

The advent of Large Language Models (LLMs) has significantly advanced the field of automated code generation. LLMs rely on large and diverse datasets to learn syntax, semantics, and usage patterns of programming languages. For low-resource…

Software Engineering · Computer Science 2025-02-03 Alessandro Giagnorio, Alberto Martin-Lopez, Gabriele Bavota