English
Related papers

Related papers: Error-Driven Prompt Optimization for Arithmetic Re…

200 papers

Prompt engineering plays a critical role in adapting large language models (LLMs) to complex reasoning and labeling tasks without the need for extensive fine-tuning. In this paper, we propose a novel prompt optimization pipeline for frame…

Computation and Language · Computer Science 2025-12-23 Do Minh Duc , Quan Xuan Truong , Nguyen Tat Dat , Nguyen Van Vinh

The rapid advancement of Large Language Models (LLMs) in the realm of mathematical reasoning necessitates comprehensive evaluations to gauge progress and inspire future directions. Existing assessments predominantly focus on problem-solving…

Computation and Language · Computer Science 2024-06-05 Xiaoyuan Li , Wenjie Wang , Moxin Li , Junrong Guo , Yang Zhang , Fuli Feng

Prompt engineering is a challenging and important task due to the high sensitivity of Large Language Models (LLMs) to the given prompt and the inherent ambiguity of a textual task instruction. Automatic prompt engineering is essential to…

Computation and Language · Computer Science 2024-02-06 Elad Levi , Eli Brosh , Matan Friedmann

Large language models (LLMs) are increasingly utilized in various complex reasoning tasks due to their excellent instruction following capability. However, the model's performance is highly dependent on the open-ended characteristics of the…

Computation and Language · Computer Science 2026-04-28 Zhenzhen Huang , Chaoning Zhang , Fachrina Dewi Puspitasari , Jiaquan Zhang , Yitian Zhou , Shuxu Chen , Yang Yang

Evaluating the quality of machine-generated natural language content is a challenging task in Natural Language Processing (NLP). Recently, large language models (LLMs) like GPT-4 have been employed for this purpose, but they are…

Computation and Language · Computer Science 2024-12-23 Daniil Larionov , Steffen Eger

Prompt engineering has proven to be a crucial step in leveraging pretrained large language models (LLMs) in solving various real-world tasks. Numerous solutions have been proposed that seek to automate prompt engineering by using the model…

Recent advances have shown that optimizing prompts for Large Language Models (LLMs) can significantly improve task performance, yet many optimization techniques rely on heuristics or manual exploration. We present LatentPrompt, a…

Computation and Language · Computer Science 2025-08-05 Mateusz Bystroński , Grzegorz Piotrowski , Nitesh V. Chawla , Tomasz Kajdanowicz

Retrieval Augmented Generation (RAG) is a powerful approach for enhancing the factual grounding of language models by integrating external knowledge. While widely studied for large language models, the optimization of RAG for Small Language…

Computation and Language · Computer Science 2026-02-17 Amir Hossein Mohammadi , Ali Moeinian , Zahra Razavizade , Afsaneh Fatemi , Reza Ramezani

Large Language Models (LLMs) have limited performance when solving arithmetic reasoning tasks and often provide incorrect answers. Unlike natural language understanding, math problems typically have a single correct answer, making the task…

Computation and Language · Computer Science 2023-03-10 Shima Imani , Liang Du , Harsh Shrivastava

Synthetic data has become a cornerstone for scaling large language models, yet its multilingual use remains bottlenecked by translation-based prompts. This strategy inherits English-centric framing and style and neglects cultural…

Computation and Language · Computer Science 2025-10-23 David Mora , Viraat Aryabumi , Wei-Yin Ko , Sara Hooker , Julia Kreutzer , Marzieh Fadaee

Large Language Models (LLMs) are increasingly utilized in AI-driven educational instruction and assessment, particularly within mathematics education. The capability of LLMs to generate accurate answers and detailed solutions for math…

Artificial Intelligence · Computer Science 2025-08-15 Liang Zhang , Edith Aurora Graf

Mathematical reasoning and optimization are fundamental to artificial intelligence and computational problem-solving. Recent advancements in Large Language Models (LLMs) have significantly improved AI-driven mathematical reasoning, theorem…

Artificial Intelligence · Computer Science 2025-03-25 Ali Forootani

Recent advances in Large Language Models have led to remarkable achievements across a variety of Natural Language Processing tasks, making prompt engineering increasingly central to guiding model outputs. While manual methods can be…

Computation and Language · Computer Science 2025-07-15 Wendi Cui , Zhuohang Li , Hao Sun , Damien Lopez , Kamalika Das , Bradley A. Malin , Sricharan Kumar , Jiaxin Zhang

Root cause analysis (RCA) for incidents in large-scale cloud systems is a complex, knowledge-intensive task that often requires significant manual effort from on-call engineers (OCEs). Improving RCA is vital for accelerating the incident…

Prompt engineering is an essential technique for enhancing the abilities of large language models (LLMs) by providing explicit and specific instructions. It enables LLMs to excel in various tasks, such as arithmetic reasoning, question…

Computation and Language · Computer Science 2024-03-29 Fobo Shi , Peijun Qing , Dong Yang , Nan Wang , Youbo Lei , Haonan Lu , Xiaodong Lin , Duantengchuan Li

The performance of large language models (LLMs) depends on how they are prompted, with choices spanning both the high-level prompting pattern (e.g., Zero-Shot, CoT, ReAct, ReWOO) and the specific prompt content (instructions and few-shot…

Machine Learning · Computer Science 2025-11-05 Claudio Spiess , Mandana Vaziri , Louis Mandel , Martin Hirzel

We investigate whether internal activations in language models can be used to detect arithmetic errors. Starting with a controlled setting of 3-digit addition, we show that simple probes can accurately decode both the model's predicted…

Computation and Language · Computer Science 2025-07-17 Yucheng Sun , Alessandro Stolfo , Mrinmaya Sachan

This paper introduces a novel Large Language Models (LLMs)-assisted agent that automatically converts natural-language descriptions of power system optimization scenarios into compact, solver-ready formulations and generates corresponding…

Artificial Intelligence · Computer Science 2025-08-12 Yunkai Hu , Tianqiao Zhao , Meng Yue

Evaluating Large Language Model (LLM) applications differs from traditional software testing because outputs are stochastic, high-dimensional, and sensitive to prompt and model changes. We present an evaluation-driven workflow - Define,…

Computation and Language · Computer Science 2026-01-30 Daniel Commey

Generative large language models (LLMs), e.g., ChatGPT, have demonstrated remarkable proficiency across several NLP tasks, such as machine translation, text summarization. Recent research (Kocmi and Federmann, 2023) has shown that utilizing…

Computation and Language · Computer Science 2024-06-06 Qingyu Lu , Baopu Qiu , Liang Ding , Kanjian Zhang , Tom Kocmi , Dacheng Tao
‹ Prev 1 2 3 10 Next ›