Related papers: Self-Taught Evaluators

Self-Improving VLM Judges Without Human Annotations

Effective judges of Vision-Language Models (VLMs) are crucial for model development. Current methods for training VLM judges mainly rely on large-scale human preference annotations. However, such an approach is costly, and the annotations…

Computer Vision and Pattern Recognition · Computer Science 2025-12-08 Inna Wanyin Lin , Yushi Hu , Shuyue Stella Li , Scott Geng , Pang Wei Koh , Luke Zettlemoyer , Tim Althoff , Marjan Ghazvininejad

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Large Language Models (LLMs) are rapidly surpassing human knowledge in many domains. While improving these models traditionally relies on costly human data, recent self-rewarding mechanisms (Yuan et al., 2024) have shown that LLMs can…

Computation and Language · Computer Science 2024-07-31 Tianhao Wu , Weizhe Yuan , Olga Golovneva , Jing Xu , Yuandong Tian , Jiantao Jiao , Jason Weston , Sainbayar Sukhbaatar

Becoming Experienced Judges: Selective Test-Time Learning for Evaluators

Automatic evaluation with large language models, commonly known as LLM-as-a-judge, is now standard across reasoning and alignment tasks. Despite evaluating many samples in deployment, these evaluators typically (i) treat each case…

Computation and Language · Computer Science 2025-12-09 Seungyeon Jwa , Daechul Ahn , Reokyoung Kim , Dongyeop Kang , Jonghyun Choi

Semi-Supervised Reward Modeling via Iterative Self-Training

Reward models (RM) capture the values and preferences of humans and play a central role in Reinforcement Learning with Human Feedback (RLHF) to align pretrained large language models (LLMs). Traditionally, training these models relies on…

Machine Learning · Computer Science 2024-09-12 Yifei He , Haoxiang Wang , Ziyan Jiang , Alexandros Papangelis , Han Zhao

Calibrating LLM-Based Evaluator

Recent advancements in large language models (LLMs) on language modeling and emergent capabilities make them a promising reference-free evaluator of natural language generation quality, and a competent alternative to human evaluation.…

Computation and Language · Computer Science 2023-09-26 Yuxuan Liu , Tianchi Yang , Shaohan Huang , Zihan Zhang , Haizhen Huang , Furu Wei , Weiwei Deng , Feng Sun , Qi Zhang

Self-Rewarding Language Models

We posit that to achieve superhuman agents, future models require superhuman feedback in order to provide an adequate training signal. Current approaches commonly train reward models from human preferences, which may then be bottlenecked by…

Computation and Language · Computer Science 2025-03-31 Weizhe Yuan , Richard Yuanzhe Pang , Kyunghyun Cho , Xian Li , Sainbayar Sukhbaatar , Jing Xu , Jason Weston

A Model Can Help Itself: Reward-Free Self-Training for LLM Reasoning

Can language models improve their reasoning performance without external rewards, using only their own sampled responses for training? We show that they can. We propose Self-evolving Post-Training (SePT), a simple post-training method that…

Machine Learning · Computer Science 2026-05-18 Mengqi Li , Lei Zhao , Anthony Man-Cho So , Ruoyu Sun , Xiao Li

Self-Judge: Selective Instruction Following with Alignment Self-Evaluation

Pre-trained large language models (LLMs) can be tailored to adhere to human instructions through instruction tuning. However, due to shifts in the distribution of test-time data, they may not always execute instructions accurately,…

Computation and Language · Computer Science 2024-09-04 Hai Ye , Hwee Tou Ng

When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning

Recent progress in multimodal large language models has led to strong performance on reasoning tasks, but these improvements largely rely on high-quality annotated data or teacher-model distillation, both of which are costly and difficult…

Computer Vision and Pattern Recognition · Computer Science 2026-03-25 Zhengxian Wu , Kai Shi , Chuanrui Zhang , Zirui Liao , Jun Yang , Ni Yang , Qiuying Peng , Luyuan Zhang , Hangrui Xu , Tianhuang Su , Zhenyu Yang , Haonan Lu , Haoqian Wang

GRAM: A Generative Foundation Reward Model for Reward Generalization

In aligning large language models (LLMs), reward models have played an important role, but are standardly trained as discriminative models and rely only on labeled human preference data. In this paper, we explore methods that train reward…

Computation and Language · Computer Science 2026-01-27 Chenglong Wang , Yang Gan , Yifu Huo , Yongyu Mu , Qiaozhi He , Murun Yang , Bei Li , Tong Xiao , Chunliang Zhang , Tongran Liu , Jingbo Zhu

Rapidly Developing High-quality Instruction Data and Evaluation Benchmark for Large Language Models with Minimal Human Effort: A Case Study on Japanese

The creation of instruction data and evaluation benchmarks for serving Large language models often involves enormous human annotation. This issue becomes particularly pronounced when rapidly developing such resources for a non-English…

Computation and Language · Computer Science 2024-03-07 Yikun Sun , Zhen Wan , Nobuhiro Ueda , Sakiko Yahata , Fei Cheng , Chenhui Chu , Sadao Kurohashi

Large Language Models Can Self-Improve

Large Language Models (LLMs) have achieved excellent performances in various tasks. However, fine-tuning an LLM requires extensive supervision. Human, on the other hand, may improve their reasoning abilities by self-thinking without…

Computation and Language · Computer Science 2022-10-26 Jiaxin Huang , Shixiang Shane Gu , Le Hou , Yuexin Wu , Xuezhi Wang , Hongkun Yu , Jiawei Han

Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions

LLMs-as-a-judge is a recently popularized method which replaces human judgements in task evaluation (Zheng et al. 2024) with automatic evaluation using LLMs. Due to widespread use of RLHF (Reinforcement Learning from Human Feedback),…

Artificial Intelligence · Computer Science 2026-02-27 Bhuvanashree Murugadoss , Christian Poelitz , Ian Drosos , Vu Le , Nick McKenna , Carina Suzana Negreanu , Chris Parnin , Advait Sarkar

LLMs Could Autonomously Learn Without External Supervision

In the quest for super-human performance, Large Language Models (LLMs) have traditionally been tethered to human-annotated datasets and predefined training objectives-a process that is both labor-intensive and inherently limited. This paper…

Computation and Language · Computer Science 2024-06-10 Ke Ji , Junying Chen , Anningzhe Gao , Wenya Xie , Xiang Wan , Benyou Wang

Self-rationalization improves LLM as a fine-grained judge

LLM-as-a-judge models have been used for evaluating both human and AI generated content, specifically by providing scores and rationales. Rationales, in addition to increasing transparency, help models learn to calibrate its judgments.…

Computation and Language · Computer Science 2024-10-10 Prapti Trivedi , Aditya Gulati , Oliver Molenschot , Meghana Arakkal Rajeev , Rajkumar Ramamurthy , Keith Stevens , Tanveesh Singh Chaudhery , Jahnavi Jambholkar , James Zou , Nazneen Rajani

EvoLM: Self-Evolving Language Models through Co-Evolved Discriminative Rubrics

Language models encode substantial evaluative knowledge from pretraining, yet current post-training methods rely on external supervision (human annotations, proprietary models, or scalar reward models) to produce reward signals. Each…

Artificial Intelligence · Computer Science 2026-05-06 Shuyue Stella Li , Rui Xin , Teng Xiao , Yike Wang , Rulin Shao , Zoey Hao , Melanie Sclar , Sewoong Oh , Faeze Brahman , Pang Wei Koh , Yulia Tsvetkov

A Judge-Aware Ranking Framework for Evaluating Large Language Models without Ground Truth

Evaluating large language models (LLMs) on open-ended tasks without ground-truth labels is increasingly done via the LLM-as-a-judge paradigm. A critical but under-modeled issue is that judge LLMs differ substantially in reliability;…

Machine Learning · Statistics 2026-01-30 Mingyuan Xu , Xinzi Tan , Jiawei Wu , Doudou Zhou

Analysis of instruction-based LLMs' capabilities to score and judge text-input problems in an academic setting

Large language models (LLMs) can act as evaluators, a role studied by methods like LLM-as-a-Judge and fine-tuned judging LLMs. In the field of education, LLMs have been studied as assistant tools for students and teachers. Our research…

Computation and Language · Computer Science 2025-09-26 Valeria Ramirez-Garcia , David de-Fitero-Dominguez , Antonio Garcia-Cabot , Eva Garcia-Lopez

Language Model Self-improvement by Reinforcement Learning Contemplation

Large Language Models (LLMs) have exhibited remarkable performance across various natural language processing (NLP) tasks. However, fine-tuning these models often necessitates substantial supervision, which can be expensive and…

Computation and Language · Computer Science 2023-05-25 Jing-Cheng Pang , Pengyuan Wang , Kaiyuan Li , Xiong-Hui Chen , Jiacheng Xu , Zongzhang Zhang , Yang Yu

Play Favorites: A Statistical Method to Measure Self-Bias in LLM-as-a-Judge

Large language models (LLMs) can serve as judges that offer rapid and reliable assessments of other LLM outputs. However, models may systematically assign overly favorable ratings to their own outputs, a phenomenon known as self-bias, which…

Computation and Language · Computer Science 2025-08-12 Evangelia Spiliopoulou , Riccardo Fogliato , Hanna Burnsky , Tamer Soliman , Jie Ma , Graham Horwood , Miguel Ballesteros