Related papers: Self-training Large Language Models through Knowle…

LLMs Could Autonomously Learn Without External Supervision

In the quest for super-human performance, Large Language Models (LLMs) have traditionally been tethered to human-annotated datasets and predefined training objectives-a process that is both labor-intensive and inherently limited. This paper…

Computation and Language · Computer Science 2024-06-10 Ke Ji , Junying Chen , Anningzhe Gao , Wenya Xie , Xiang Wan , Benyou Wang

Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data

In real-world NLP applications, Large Language Models (LLMs) offer promising solutions due to their extensive training on vast datasets. However, the large size and high computation demands of LLMs limit their practicality in many…

Artificial Intelligence · Computer Science 2025-04-01 Juanhui Li , Sreyashi Nag , Hui Liu , Xianfeng Tang , Sheikh Sarwar , Limeng Cui , Hansu Gu , Suhang Wang , Qi He , Jiliang Tang

(Im)possibility of Automated Hallucination Detection in Large Language Models

Is automated hallucination detection possible? In this work, we introduce a theoretical framework to analyze the feasibility of automatically detecting hallucinations produced by large language models (LLMs). Inspired by the classical…

Machine Learning · Computer Science 2025-06-03 Amin Karbasi , Omar Montasser , John Sous , Grigoris Velegkas

A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs

A primary challenge in large language model (LLM) development is their onerous pre-training cost. Typically, such pre-training involves optimizing a self-supervised objective (such as next-token prediction) over a large corpus. This paper…

Machine Learning · Computer Science 2024-10-25 Ankit Singh Rawat , Veeranjaneyulu Sadhanala , Afshin Rostamizadeh , Ayan Chakrabarti , Wittawat Jitkrittum , Vladimir Feinberg , Seungyeon Kim , Hrayr Harutyunyan , Nikunj Saunshi , Zachary Nado , Rakesh Shivanna , Sashank J. Reddi , Aditya Krishna Menon , Rohan Anil , Sanjiv Kumar

The Path of Self-Evolving Large Language Models: Achieving Data-Efficient Learning via Intrinsic Feedback

Reinforcement learning (RL) has demonstrated potential in enhancing the reasoning capabilities of large language models (LLMs), but such training typically demands substantial efforts in creating and annotating data. In this work, we…

Computation and Language · Computer Science 2025-10-06 Hangfan Zhang , Siyuan Xu , Zhimeng Guo , Huaisheng Zhu , Shicheng Liu , Xinrun Wang , Qiaosheng Zhang , Yang Chen , Peng Ye , Lei Bai , Shuyue Hu

Self Iterative Label Refinement via Robust Unlabeled Learning

Recent advances in large language models (LLMs) have yielded impressive performance on various tasks, yet they often depend on high-quality feedback that can be costly. Self-refinement methods attempt to leverage LLMs' internal evaluation…

Computation and Language · Computer Science 2025-12-01 Hikaru Asano , Tadashi Kozuno , Yukino Baba

Exploring a New Competency Modeling Process with Large Language Models

Competency modeling is widely used in human resource management to select, develop, and evaluate talent. However, traditional expert-driven approaches rely heavily on manual analysis of large volumes of interview transcripts, making them…

Computation and Language · Computer Science 2026-02-16 Silin Du , Manqing Xin , Raymond Jia Wang

Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach

Self-improvement in multimodal large language models (MLLMs) is crucial for enhancing their reliability and robustness. However, current methods often rely heavily on MLLMs themselves as judges, leading to high computational costs and…

Computation and Language · Computer Science 2024-11-28 Shijian Deng , Wentian Zhao , Yu-Jhe Li , Kun Wan , Daniel Miranda , Ajinkya Kale , Yapeng Tian

Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-following LLM

The remarkable performance of large language models (LLMs) in zero-shot language understanding has garnered significant attention. However, employing LLMs for large-scale inference or domain-specific fine-tuning requires immense…

Computation and Language · Computer Science 2024-04-16 Ruohong Zhang , Yau-Shian Wang , Yiming Yang

Test-Time Learning for Large Language Models

While Large Language Models (LLMs) have exhibited remarkable emergent capabilities through extensive pre-training, they still face critical limitations in generalizing to specialized domains and handling diverse linguistic variations, known…

Computation and Language · Computer Science 2025-05-28 Jinwu Hu , Zhitian Zhang , Guohao Chen , Xutao Wen , Chao Shuai , Wei Luo , Bin Xiao , Yuanqing Li , Mingkui Tan

Do Large Language Models Know What They Don't Know?

Large language models (LLMs) have a wealth of knowledge that allows them to excel in various Natural Language Processing (NLP) tasks. Current research focuses on enhancing their performance within their existing knowledge. Despite their…

Computation and Language · Computer Science 2023-05-31 Zhangyue Yin , Qiushi Sun , Qipeng Guo , Jiawen Wu , Xipeng Qiu , Xuanjing Huang

Navigating Uncertainty: Optimizing API Dependency for Hallucination Reduction in Closed-Book Question Answering

While Large Language Models (LLM) are able to accumulate and restore knowledge, they are still prone to hallucination. Especially when faced with factual questions, LLM cannot only rely on knowledge stored in parameters to guarantee…

Computation and Language · Computer Science 2024-01-04 Pierre Erbacher , Louis Falissar , Vincent Guigue , Laure Soulier

UNLEARN Efficient Removal of Knowledge in Large Language Models

Given the prevalence of large language models (LLMs) and the prohibitive cost of training these models from scratch, dynamically forgetting specific knowledge e.g., private or proprietary, without retraining the model has become an…

Computation and Language · Computer Science 2024-08-09 Tyler Lizzo , Larry Heck

Fine-Tuning Large Language Models to Appropriately Abstain with Semantic Entropy

Large Language Models (LLMs) are known to hallucinate, whereby they generate plausible but inaccurate text. This phenomenon poses significant risks in critical applications, such as medicine or law, necessitating robust hallucination…

Computation and Language · Computer Science 2024-10-23 Benedict Aaron Tjandra , Muhammed Razzak , Jannik Kossen , Kunal Handa , Yarin Gal

Making Large Language Models Better Data Creators

Although large language models (LLMs) have advanced the state-of-the-art in NLP significantly, deploying them for downstream applications is still challenging due to cost, responsiveness, control, or concerns around privacy and security. As…

Computation and Language · Computer Science 2023-11-01 Dong-Ho Lee , Jay Pujara , Mohit Sewak , Ryen W. White , Sujay Kumar Jauhar

Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

While many capabilities of language models (LMs) improve with increased training budget, the influence of scale on hallucinations is not yet fully understood. Hallucinations come in many forms, and there is no universally accepted…

Computation and Language · Computer Science 2024-08-16 Jiri Hron , Laura Culp , Gamaleldin Elsayed , Rosanne Liu , Ben Adlam , Maxwell Bileschi , Bernd Bohnet , JD Co-Reyes , Noah Fiedel , C. Daniel Freeman , Izzeddin Gur , Kathleen Kenealy , Jaehoon Lee , Peter J. Liu , Gaurav Mishra , Igor Mordatch , Azade Nova , Roman Novak , Aaron Parisi , Jeffrey Pennington , Alex Rizkowsky , Isabelle Simpson , Hanie Sedghi , Jascha Sohl-dickstein , Kevin Swersky , Sharad Vikram , Tris Warkentin , Lechao Xiao , Kelvin Xu , Jasper Snoek , Simon Kornblith

Large Language Models Can Self-Improve

Large Language Models (LLMs) have achieved excellent performances in various tasks. However, fine-tuning an LLM requires extensive supervision. Human, on the other hand, may improve their reasoning abilities by self-thinking without…

Computation and Language · Computer Science 2022-10-26 Jiaxin Huang , Shixiang Shane Gu , Le Hou , Yuexin Wu , Xuezhi Wang , Hongkun Yu , Jiawei Han

Test-Time Self-Adaptive Small Language Models for Question Answering

Recent instruction-finetuned large language models (LMs) have achieved notable performances in various tasks, such as question-answering (QA). However, despite their ability to memorize a vast amount of general knowledge across diverse…

Computation and Language · Computer Science 2023-10-23 Soyeong Jeong , Jinheon Baek , Sukmin Cho , Sung Ju Hwang , Jong C. Park

Predictions For Pre-training Language Models

Language model pre-training has proven to be useful in many language understanding tasks. In this paper, we investigate whether it is still helpful to add the self-training method in the pre-training step and the fine-tuning step. Towards…

Computation and Language · Computer Science 2023-02-17 Tong Guo

Large Language Models Facilitate Vision Reflection in Image Classification

This paper presents several novel findings on the explainability of vision reflection in large multimodal models (LMMs). First, we show that prompting an LMM to verify the prediction of a specialized vision model can improve recognition…

Computer Vision and Pattern Recognition · Computer Science 2025-08-12 Guoyuan An , JaeYoon Kim , SungEui Yoon