Related papers: Beyond Sample-Level Feedback: Using Reference-Leve…

SEFL: A Framework for Generating Synthetic Educational Assignment Feedback with LLM Agents

Providing high-quality feedback on student assignments is crucial for student success, but it is heavily limited by time and budgetary constraints. In this work, we introduce Synthetic Educational Feedback Loops (SEFL), a synthetic data…

Computation and Language · Computer Science 2026-02-25 Mike Zhang , Amalie Pernille Dilling , Léon Gondelman , Niels Erik Ruan Lyngdorf , Euan D. Lindsay , Johannes Bjerva

Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation

Recent advances in large language model (LLM) training have highlighted the need for diverse, high-quality instruction data. Recently, many works are exploring synthetic data generation using LLMs. However, they primarily focus on prompt…

Computation and Language · Computer Science 2024-12-10 Yifang Chen , David Zhu , Simon Du , Kevin Jamieson , Yang Liu

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Large language models (LLMs) deployed as agents solve user-specified tasks over multiple steps while keeping the required manual engagement to a minimum. Crucially, such LLMs need to ground their generations in any feedback obtained to…

Computation and Language · Computer Science 2025-02-19 Jonas Gehring , Kunhao Zheng , Jade Copet , Vegard Mella , Quentin Carbonneaux , Taco Cohen , Gabriel Synnaeve

Advancing Student Writing Through Automated Syntax Feedback

This study underscores the pivotal role of syntax feedback in augmenting the syntactic proficiency of students. Recognizing the challenges faced by learners in mastering syntactic nuances, we introduce a specialized dataset named…

Computation and Language · Computer Science 2025-01-15 Kamyar Zeinalipour , Mehak Mehak , Fatemeh Parsamotamed , Marco Maggini , Marco Gori

SampleLLM: Optimizing Tabular Data Synthesis in Recommendations

Tabular data synthesis is crucial in machine learning, yet existing general methods-primarily based on statistical or deep learning models-are highly data-dependent and often fall short in recommender systems. This limitation arises from…

Information Retrieval · Computer Science 2025-02-12 Jingtong Gao , Zhaocheng Du , Xiaopeng Li , Yichao Wang , Xiangyang Li , Huifeng Guo , Ruiming Tang , Xiangyu Zhao

Learning to Verify Summary Facts with Fine-Grained LLM Feedback

Training automatic summary fact verifiers often faces the challenge of a lack of human-labeled data. In this paper, we explore alternative way of leveraging Large Language Model (LLM) generated feedback to address the inherent limitation of…

Computation and Language · Computer Science 2024-12-17 Jihwan Oh , Jeonghwan Choi , Nicole Hee-Yeon Kim , Taewon Yun , Hwanjun Song

Synthetic Data Generation in Low-Resource Settings via Fine-Tuning of Large Language Models

The in-context learning ability of large language models (LLMs) enables them to generalize to novel downstream tasks with relatively few labeled examples. However, they require enormous computational resources to be deployed. Alternatively,…

Computation and Language · Computer Science 2024-01-09 Jean Kaddour , Qi Liu

FRACTAL: Fine-Grained Scoring from Aggregate Text Labels

Large language models (LLMs) are being increasingly tuned to power complex generation tasks such as writing, fact-seeking, querying and reasoning. Traditionally, human or model feedback for evaluating and further tuning LLM performance has…

Computation and Language · Computer Science 2024-04-09 Yukti Makhija , Priyanka Agrawal , Rishi Saket , Aravindan Raghuveer

Evaluation of Synthetic Datasets for Conversational Recommender Systems

For researchers leveraging Large-Language Models (LLMs) in the generation of training datasets, especially for conversational recommender systems - the absence of robust evaluation frameworks has been a long-standing problem. The efficiency…

Computation and Language · Computer Science 2022-12-19 Harsh Lara , Manoj Tiwari

Think Less, Label Better: Multi-Stage Domain-Grounded Synthetic Data Generation for Fine-Tuning Large Language Models in Telecommunications

The success of large language models (LLMs) depends heavily on large-scale, high-quality instruction-following and reinforcement datasets. However, generating such data through human annotation is prohibitively time-consuming particularly…

Computation and Language · Computer Science 2026-02-02 Chenhua Shi , Gregor Macdonald , Bhavika Jalli , Wanlu Lei , John Zou , Mridul Jain , Joji Philip

A Teacher Is Worth A Million Instructions

Large Language Models(LLMs) have shown exceptional abilities, yet training these models can be quite challenging. There is a strong dependence on the quality of data and finding the best instruction tuning set. Further, the inherent…

Machine Learning · Computer Science 2024-06-28 Nikhil Kothari , Ravindra Nayak , Shreyas Shetty , Amey Patil , Nikesh Garera

REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

Instruction-based Large Language Models (LLMs) have proven effective in numerous few-shot or zero-shot Natural Language Processing (NLP) tasks. However, creating human-annotated instruction data is time-consuming, expensive, and often…

Computation and Language · Computer Science 2025-05-13 Aniruddha Roy , Pretam Ray , Abhilash Nandy , Somak Aditya , Pawan Goyal

SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning

Large language models (LLMs) hold the promise of solving diverse tasks when provided with appropriate natural language prompts. However, prompting often leads models to make predictions with lower accuracy compared to finetuning a model…

Computation and Language · Computer Science 2024-08-13 Chenyang Zhao , Xueying Jia , Vijay Viswanathan , Tongshuang Wu , Graham Neubig

Forewarned is Forearmed: Leveraging LLMs for Data Synthesis through Failure-Inducing Exploration

Large language models (LLMs) have significantly benefited from training on diverse, high-quality task-specific data, leading to impressive performance across a range of downstream applications. Current methods often rely on human-annotated…

Computation and Language · Computer Science 2024-10-23 Qintong Li , Jiahui Gao , Sheng Wang , Renjie Pi , Xueliang Zhao , Chuan Wu , Xin Jiang , Zhenguo Li , Lingpeng Kong

Principled Synthetic Data Enables the First Scaling Laws for LLMs in Recommendation

Large Language Models (LLMs) represent a promising frontier for recommender systems, yet their development has been impeded by the absence of predictable scaling laws, which are crucial for guiding research and optimizing resource…

Information Retrieval · Computer Science 2026-02-16 Benyu Zhang , Qiang Zhang , Jianpeng Cheng , Hong-You Chen , Qifei Wang , Wei Sun , Shen Li , Jia Li , Jiahao Wu , Xiangjun Fan , Hong Yan

Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning

There is a consensus that instruction fine-tuning of LLMs requires high-quality data, but what are they? LIMA (NeurIPS 2023) and AlpaGasus (ICLR 2024) are state-of-the-art methods for selecting such high-quality examples, either via manual…

Computation and Language · Computer Science 2024-06-05 Hao Zhao , Maksym Andriushchenko , Francesco Croce , Nicolas Flammarion

XL-Suite: Cross-Lingual Synthetic Training and Evaluation Data for Open-Ended Generation

Cross-lingual open-ended generation - responding in a language different from that of the query - is an important yet understudied problem. This work proposes XL-Instruct, a novel technique for generating high-quality synthetic data, and…

Computation and Language · Computer Science 2025-09-30 Vivek Iyer , Pinzhen Chen , Ricardo Rei , Alexandra Birch

LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms

Large Language Models are traditionally finetuned on large instruction datasets. However recent studies suggest that small, high-quality datasets can suffice for general purpose instruction following. This lack of consensus surrounding…

Machine Learning · Computer Science 2023-12-29 Aditi Jha , Sam Havens , Jeremy Dohmann , Alex Trott , Jacob Portes

Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Instruction tuning is critical to large language models (LLMs) for achieving better instruction following and task adaptation capabilities but its success heavily relies on the training data quality. Many recent methods focus on improving…

Computation and Language · Computer Science 2024-06-11 Ming Li , Lichang Chen , Jiuhai Chen , Shwai He , Jiuxiang Gu , Tianyi Zhou

Improving Language Models via Plug-and-Play Retrieval Feedback

Large language models (LLMs) exhibit remarkable performance across various NLP tasks. However, they often generate incorrect or hallucinated information, which hinders their practical applicability in real-world scenarios. Human feedback…

Computation and Language · Computer Science 2023-05-24 Wenhao Yu , Zhihan Zhang , Zhenwen Liang , Meng Jiang , Ashish Sabharwal