Related papers: Proofread: Fixes All Errors with One Tap

Understanding and supporting how developers prompt for LLM-powered code editing in practice

Large Language Models (LLMs) are rapidly transforming software engineering, with coding assistants embedded in an IDE becoming increasingly prevalent. While research has focused on improving the tools and understanding developer…

Software Engineering · Computer Science 2025-12-22 Daye Nam , Ahmed Omran , Ambar Murillo , Saksham Thakur , Abner Araujo , Marcel Blistein , Alexander Frömmgen , Vincent Hellendoorn , Satish Chandra

Unveiling Large Language Models Generated Texts: A Multi-Level Fine-Grained Detection Framework

Large language models (LLMs) have transformed human writing by enhancing grammar correction, content expansion, and stylistic refinement. However, their widespread use raises concerns about authorship, originality, and ethics, even…

Computation and Language · Computer Science 2024-10-21 Zhen Tao , Zhiyu Li , Runyu Chen , Dinghao Xi , Wei Xu

Rethinking Supervised Fine-Tuning: Emphasizing Key Answer Tokens for Improved LLM Accuracy

With the rapid advancement of Large Language Models (LLMs), the Chain-of-Thought (CoT) component has become significant for complex reasoning tasks. However, in conventional Supervised Fine-Tuning (SFT), the model could allocate…

Computation and Language · Computer Science 2025-12-25 Xiaofeng Shi , Qian Kou , Yuduo Li , Hua Zhou

Reflective Prompt Tuning through Language Model Function-Calling

Large language models (LLMs) have become increasingly capable of following instructions and complex reasoning, making prompting a flexible interface for adapting models without parameter updates. Yet prompt design remains labor-intensive…

Computation and Language · Computer Science 2026-05-22 Farima Fatahi Bayat , Moin Aminnaseri , Pouya Pezeshkpour , Estevam Hruschka

Prompt-Efficient Fine-Tuning for GPT-like Deep Models to Reduce Hallucination and to Improve Reproducibility in Scientific Text Generation Using Stochastic Optimisation Techniques

Large Language Models (LLMs) are increasingly adopted for complex scientific text generation tasks, yet they often suffer from limitations in accuracy, consistency, and hallucination control. This thesis introduces a Parameter-Efficient…

Computation and Language · Computer Science 2024-11-12 Daniil Sulimov

TLoRA+: A Low-Rank Parameter-Efficient Fine-Tuning Method for Large Language Models

Fine-tuning large language models (LLMs) aims to adapt pre-trained models to specific tasks using relatively small and domain-specific datasets. Among Parameter-Efficient Fine-Tuning (PEFT) methods, Low-Rank Adaptation (LoRA) stands out by…

Computation and Language · Computer Science 2026-04-16 Yarui Cao , Kai Liu

ProRank: Prompt Warmup via Reinforcement Learning for Small Language Models Reranking

Reranking is fundamental to information retrieval and retrieval-augmented generation, with recent Large Language Models (LLMs) significantly advancing reranking quality. Most current works rely on large-scale LLMs (>7B parameters),…

Information Retrieval · Computer Science 2026-04-17 Xianming Li , Aamir Shakir , Rui Huang , Tsz-fung Andrew Lee , Julius Lipp , Benjamin Clavié , Jing Li

The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models

Improving the reasoning capabilities of large language models (LLMs) typically requires supervised fine-tuning with labeled data or computationally expensive sampling. We introduce Unsupervised Prefix Fine-Tuning (UPFT), which leverages the…

Computation and Language · Computer Science 2025-03-05 Ke Ji , Jiahao Xu , Tian Liang , Qiuzhi Liu , Zhiwei He , Xingyu Chen , Xiaoyuan Liu , Zhijie Wang , Junying Chen , Benyou Wang , Zhaopeng Tu , Haitao Mi , Dong Yu

Stress-Testing the Reasoning Competence of LLMs With Proofs Under Minimal Formalism

We introduce ProofGrid, a benchmark suite for evaluating LLM reasoning through machine-checkable proofs rather than final answers alone. ProofGrid contains 15 tasks spanning proof writing, proof checking, proof masking, and proof…

Logic in Computer Science · Computer Science 2026-05-14 Konstantine Arkoudas , Serafim Batzoglou

Fine-Grained and Multi-Dimensional Metrics for Document-Level Machine Translation

Large language models (LLMs) have excelled in various NLP tasks, including machine translation (MT), yet most studies focus on sentence-level translation. This work investigates the inherent capability of instruction-tuned LLMs for…

Computation and Language · Computer Science 2025-04-22 Yirong Sun , Dawei Zhu , Yanjun Chen , Erjia Xiao , Xinghao Chen , Xiaoyu Shen

Enhancing Large Language Model Reasoning via Selective Critical Token Fine-Tuning

Large language models (LLMs) primarily rely on supervised fine-tuning (SFT) as a key method to adapt pre-trained models to domain-specific tasks such as mathematical reasoning. However, standard SFT uniformly penalizes all tokens,…

Computation and Language · Computer Science 2025-10-14 Zhiwen Ruan , Yixia Li , He Zhu , Yun Chen , Peng Li , Yang Liu , Guanhua Chen

Investigating Large Language Models' Linguistic Abilities for Text Preprocessing

Text preprocessing is a fundamental component of Natural Language Processing, involving techniques such as stopword removal, stemming, and lemmatization to prepare text as input for further processing and analysis. Despite the…

Computation and Language · Computer Science 2025-10-14 Marco Braga , Gian Carlo Milanese , Gabriella Pasi

Fine-Tuning or Fine-Failing? Debunking Performance Myths in Large Language Models

Large Language Models (LLMs) have the unique capability to understand and generate human-like text from input queries. When fine-tuned, these models show enhanced performance on domain-specific queries. OpenAI highlights the process of…

Computation and Language · Computer Science 2024-07-02 Scott Barnett , Zac Brannelly , Stefanus Kurniawan , Sheng Wong

VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning

Recent studies have augmented large language models (LLMs) with speech capabilities, leading to the development of speech language models (SpeechLMs). Earlier SpeechLMs focused on single-turn speech-based question answering (QA), where user…

Computation and Language · Computer Science 2025-02-10 Yifan Peng , Krishna C. Puvvada , Zhehuai Chen , Piotr Zelasko , He Huang , Kunal Dhawan , Ke Hu , Shinji Watanabe , Jagadeesh Balam , Boris Ginsburg

Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance

Large language models (LLMs) have revolutionized zero-shot task performance, mitigating the need for task-specific annotations while enhancing task generalizability. Despite its advancements, current methods using trigger phrases such as…

Computation and Language · Computer Science 2024-06-13 Saurabh Srivastava , Chengyue Huang , Weiguo Fan , Ziyu Yao

Parameter-Efficient Tuning Large Language Models for Graph Representation Learning

Text-rich graphs, which exhibit rich textual information on nodes and edges, are prevalent across a wide range of real-world business applications. Large Language Models (LLMs) have demonstrated remarkable abilities in understanding text,…

Computation and Language · Computer Science 2024-04-30 Qi Zhu , Da Zheng , Xiang Song , Shichang Zhang , Bowen Jin , Yizhou Sun , George Karypis

Efficient Layer-wise LLM Fine-tuning for Revision Intention Prediction

Large Language Models (LLMs) have shown extraordinary success across various text generation tasks; however, their potential for simple yet essential text classification remains underexplored, as LLM pre-training tends to emphasize…

Computation and Language · Computer Science 2025-10-02 Zhexiong Liu , Diane Litman

Rewriting Pre-Training Data Boosts LLM Performance in Math and Code

The performance of large language models (LLMs) in program synthesis and mathematical reasoning is fundamentally limited by the quality of their pre-training corpora. We introduce two openly licensed pre-training datasets, released under…

Machine Learning · Computer Science 2026-03-03 Kazuki Fujii , Yukito Tajima , Sakae Mizuki , Masaki Kawamura , Hinari Shimada , Taihei Shiotani , Koshiro Saito , Masanari Oi , Taishi Nakamura , Takumi Okamoto , Shigeki Ishida , Kakeru Hattori , Youmi Ma , Hiroya Takamura , Rio Yokota , Jun Sakuma , Naoaki Okazaki

Logits-Based Finetuning

In recent years, developing compact and efficient large language models (LLMs) has emerged as a thriving area of research. Traditional Supervised Fine-Tuning (SFT), which relies on singular ground truth labels, often fails to capture…

Machine Learning · Computer Science 2025-06-12 Jingyao Li , Senqiao Yang , Sitong Wu , Han Shi , Chuanyang Zheng , Hong Xu , Jiaya Jia

Bridging the Editing Gap in LLMs: FineEdit for Precise and Targeted Text Modifications

Large Language Models (LLMs) have significantly advanced natural language processing, demonstrating strong capabilities in tasks such as text generation, summarization, and reasoning. Recently, their potential for automating precise text…

Computation and Language · Computer Science 2026-01-27 Yiming Zeng , Wanhao Yu , Zexin Li , Tao Ren , Yu Ma , Jinghan Cao , Xiyan Chen , Tingting Yu