Related papers: Reformatted Alignment

LearnAlign: Data Selection for LLM Reinforcement Learning with Improved Gradient Alignment

Reinforcement learning with verifiable rewards (RLVR) has become a key technique for enhancing LLMs' reasoning abilities, yet its data inefficiency remains a major bottleneck. To address this critical yet challenging issue, we present a…

Machine Learning · Computer Science 2026-04-28 Shipeng Li, Zhiqin Yang, Shikun Li, Xiaobo Xia, Hengyu Liu, Xinghua Zhang, Gaode Chen, Dong Fang, Ying Tai, Zhe Peng

Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning

Recent advancements in Large Language Models (LLMs) have expanded the horizons of natural language understanding and generation. Notably, the output control and alignment with the input of LLMs can be refined through instruction tuning.…

Computation and Language · Computer Science 2023-10-19 Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Heng Huang, Jiuxiang Gu, Tianyi Zhou

Beyond Fine-Tuning: Effective Strategies for Mitigating Hallucinations in Large Language Models for Data Analytics

Large Language Models (LLMs) have become increasingly important in natural language processing, enabling advanced data analytics through natural language queries. However, these models often generate "hallucinations" - inaccurate or…

Computation and Language · Computer Science 2024-10-29 Mikhail Rumiantsau, Aliaksei Vertsel, Ilya Hrytsuk, Isaiah Ballah

Using Natural Language Explanations to Rescale Human Judgments

The rise of large language models (LLMs) has brought a critical need for high-quality human-labeled data, particularly for processes like human feedback and evaluation. A common practice is to label data via consensus annotation over human…

Computation and Language · Computer Science 2025-06-23 Manya Wadhwa, Jifan Chen, Junyi Jessy Li, Greg Durrett

FLAME: Factuality-Aware Alignment for Large Language Models

Alignment is a standard procedure to fine-tune pre-trained large language models (LLMs) to follow natural language instructions and serve as helpful AI assistants. We have observed, however, that the conventional alignment process fails to…

Computation and Language · Computer Science 2024-05-03 Sheng-Chieh Lin, Luyu Gao, Barlas Oguz, Wenhan Xiong, Jimmy Lin, Wen-tau Yih, Xilun Chen

Efficient Alignment of Large Language Models via Data Sampling

LLM alignment ensures that large language models behave safely and effectively by aligning their outputs with human values, goals, and intentions. Aligning LLMs employ huge amounts of data, computation, and time. Moreover, curating data…

Machine Learning · Computer Science 2025-02-19 Amrit Khera, Rajat Ghosh, Debojyoti Dutta

Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented Generation

Retrieval-Augmented Generation (RAG) has emerged as a powerful framework to improve factuality in large language models (LLMs) by grounding their outputs in retrieved documents. However, ensuring perfect retrieval of relevant information…

Computation and Language · Computer Science 2025-12-04 Zhan Peng Lee, Andre Lin, Calvin Tan

Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis

The rapid development of large language models (LLMs) has not only provided numerous opportunities but also presented significant challenges. This becomes particularly evident when LLMs inadvertently generate harmful or toxic content,…

Computation and Language · Computer Science 2024-02-20 Kai Chen, Chunwei Wang, Kuo Yang, Jianhua Han, Lanqing Hong, Fei Mi, Hang Xu, Zhengying Liu, Wenyong Huang, Zhenguo Li, Dit-Yan Yeung, Lifeng Shang, Xin Jiang, Qun Liu

GradAlign: Gradient-Aligned Data Selection for LLM Reinforcement Learning

Reinforcement learning (RL) has become a central post-training paradigm for large language models (LLMs), but its performance is highly sensitive to the quality of training problems. This sensitivity stems from the non-stationarity of RL:…

Machine Learning · Computer Science 2026-02-26 Ningyuan Yang, Weihua Du, Weiwei Sun, Sean Welleck, Yiming Yang

Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation

Despite showing increasingly human-like abilities, large language models (LLMs) often struggle with factual inaccuracies, i.e. "hallucinations", even when they hold relevant knowledge. To address these hallucinations, current approaches…

Computation and Language · Computer Science 2024-06-12 Xiaoying Zhang, Baolin Peng, Ye Tian, Jingyan Zhou, Lifeng Jin, Linfeng Song, Haitao Mi, Helen Meng

High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning

Large Language Models (LLMs) currently respond to every prompt. However, they can produce incorrect answers when they lack knowledge or capability -- a problem known as hallucination. We instead propose post-training an LLM to generate…

Computation and Language · Computer Science 2026-02-17 Tim Franzmeyer, Archie Sravankumar, Lijuan Liu, Yuning Mao, Rui Hou, Sinong Wang, Jakob N. Foerster, Luke Zettlemoyer, Madian Khabsa

Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning

There is a consensus that instruction fine-tuning of LLMs requires high-quality data, but what are they? LIMA (NeurIPS 2023) and AlpaGasus (ICLR 2024) are state-of-the-art methods for selecting such high-quality examples, either via manual…

Computation and Language · Computer Science 2024-06-05 Hao Zhao, Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion

InstructAlign: High-and-Low Resource Language Alignment via Continual Crosslingual Instruction Tuning

Large language models (LLMs) that are tuned with instructions have demonstrated remarkable capabilities in various tasks and languages. However, their ability to generalize to underrepresented languages is limited due to the scarcity of…

Computation and Language · Computer Science 2023-10-25 Samuel Cahyawijaya, Holy Lovenia, Tiezheng Yu, Willy Chung, Pascale Fung

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Pretrained large language models (LLMs) are currently state-of-the-art for solving the vast majority of natural language processing tasks. While many real-world applications still require fine-tuning to reach satisfactory levels of…

Computation and Language · Computer Science 2024-07-16 Nicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipalli, Michael W. Mahoney, Kurt Keutzer, Amir Gholami

ReAlign: Generalizable Image Forgery Detection via Reasoning-Aligned Representation

The rise of AI-generated images (AIGIs) poses growing challenges for digital authenticity, prompting the need for efficient, generalizable image forgery detection systems. Existing methods, whether non-LLM-based or LLM-based, exhibit…

Computer Vision and Pattern Recognition · Computer Science 2026-05-18 Qing Huang, Zhipei Xu, Xuanyu Zhang, Xiangyu Yu, Jian Zhang

ReAlign: Optimizing the Visual Document Retriever with Reasoning-Guided Fine-Grained Alignment

Visual document retrieval aims to retrieve a set of document pages relevant to a query from visually rich collections. Existing methods often employ Vision-Language Models (VLMs) to encode queries and visual pages into a shared embedding…

Information Retrieval · Computer Science 2026-04-10 Hao Yang, Yifan Ji, Zhipeng Xu, Zhenghao Liu, Yukun Yan, Zulong Chen, Shuo Wang, Yu Gu, Ge Yu

Towards Efficient and Effective Alignment of Large Language Models

Large language models (LLMs) exhibit remarkable capabilities across diverse tasks, yet aligning them efficiently and effectively with human expectations remains a critical challenge. This thesis advances LLM alignment by introducing novel…

Computation and Language · Computer Science 2025-06-12 Yuxin Jiang

More Human, More Efficient: Aligning Annotations with Quantized SLMs

As Large Language Model (LLM) capabilities advance, the demand for high-quality annotation of exponentially increasing text corpora has outpaced human capacity, leading to the widespread adoption of LLMs in automatic evaluation and…

Computation and Language · Computer Science 2026-04-02 Jiayu Wang, Junyoung Lee

Alleviating the Fear of Losing Alignment in LLM Fine-tuning

Large language models (LLMs) have demonstrated revolutionary capabilities in understanding complex contexts and performing a wide range of tasks. However, LLMs can also answer questions that are unethical or harmful, raising concerns about…

Cryptography and Security · Computer Science 2025-04-15 Kang Yang, Guanhong Tao, Xun Chen, Jun Xu

Refine-n-Judge: Curating High-Quality Preference Chains for LLM-Fine-Tuning

Large Language Models (LLMs) have demonstrated remarkable progress through preference-based fine-tuning, which critically depends on the quality of the underlying training data. While human feedback is essential for improving data quality,…

Artificial Intelligence · Computer Science 2025-10-31 Derin Cayir, Renjie Tao, Rashi Rungta, Kai Sun, Sean Chen, Haidar Khan, Minseok Kim, Julia Reinspach, Yue Liu