Related papers: Benchmarking Large Language Models for Knowledge G…

Systematic Assessment of Factual Knowledge in Large Language Models

Previous studies have relied on existing question-answering benchmarks to evaluate the knowledge stored in large language models (LLMs). However, this approach has limitations regarding factual knowledge coverage, as it mostly focuses on…

Computation and Language · Computer Science 2023-10-31 Linhao Luo , Thuy-Trang Vu , Dinh Phung , Gholamreza Haffari

Enhancing Large Language Models with Reliable Knowledge Graphs

Large Language Models (LLMs) have demonstrated remarkable capabilities in text generation and understanding, yet their reliance on implicit, unstructured knowledge often leads to factual inaccuracies and limited interpretability. Knowledge…

Computation and Language · Computer Science 2025-06-17 Qinggang Zhang

Improving Factuality in LLMs via Inference-Time Knowledge Graph Construction

Large Language Models (LLMs) often struggle with producing factually consistent answers due to limitations in their parametric memory. Retrieval-Augmented Generation (RAG) paradigms mitigate this issue by incorporating external knowledge at…

Computation and Language · Computer Science 2026-05-05 Shanglin Wu , Lihui Liu , Jinho D. Choi , Kai Shu

Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge Graphs

Recent advancements in Large Language Models (LLMs) have showcased their proficiency in answering natural language queries. However, their effectiveness is hindered by limited domain-specific knowledge, raising concerns about the…

Computation and Language · Computer Science 2024-08-07 Daniel Steinigen , Roman Teucher , Timm Heine Ruland , Max Rudat , Nicolas Flores-Herr , Peter Fischer , Nikola Milosevic , Christopher Schymura , Angelo Ziletti

Hybrid Fact-Checking that Integrates Knowledge Graphs, Large Language Models, and Search-Based Retrieval Agents Improves Interpretable Claim Verification

Large language models (LLMs) excel in generating fluent utterances but can lack reliable grounding in verified information. At the same time, knowledge-graph-based fact-checkers deliver precise and interpretable evidence, yet suffer from…

Computation and Language · Computer Science 2025-11-06 Shaghayegh Kolli , Richard Rosenbaum , Timo Cavelius , Lasse Strothe , Andrii Lata , Jana Diesner

Assessing Automated Fact-Checking for Medical LLM Responses with Knowledge Graphs

The recent proliferation of large language models (LLMs) holds the potential to revolutionize healthcare, with strong capabilities in diverse medical tasks. Yet, deploying LLMs in high-stakes healthcare settings requires rigorous…

Machine Learning · Computer Science 2025-12-22 Shasha Zhou , Mingyu Huang , Jack Cole , Charles Britton , Ming Yin , Jan Wolber , Ke Li

KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs

Knowledge graphs have emerged as a popular method for injecting up-to-date, factual knowledge into large language models (LLMs). This is typically achieved by converting the knowledge graph into text that the LLM can process in context.…

Computation and Language · Computer Science 2025-04-10 Elan Markowitz , Krupa Galiya , Greg Ver Steeg , Aram Galstyan

KGLens: Towards Efficient and Effective Knowledge Probing of Large Language Models with Knowledge Graphs

Large Language Models (LLMs) might hallucinate facts, while curated Knowledge Graph (KGs) are typically factually reliable especially with domain-specific knowledge. Measuring the alignment between KGs and LLMs can effectively probe the…

Artificial Intelligence · Computer Science 2024-08-02 Shangshang Zheng , He Bai , Yizhe Zhang , Yi Su , Xiaochuan Niu , Navdeep Jaitly

RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-Checking

Large Language Models (LLMs) hold significant potential for advancing fact-checking by leveraging their capabilities in reasoning, evidence retrieval, and explanation generation. However, existing benchmarks fail to comprehensively evaluate…

Computation and Language · Computer Science 2025-06-17 Shuo Yang , Yuqin Dai , Guoqing Wang , Xinran Zheng , Jinfeng Xu , Jinze Li , Zhenzhe Ying , Weiqiang Wang , Edith C. H. Ngai

FactKG: Fact Verification via Reasoning on Knowledge Graphs

In real world applications, knowledge graphs (KG) are widely used in various domains (e.g. medical applications and dialogue agents). However, for fact verification, KGs have not been adequately utilized as a knowledge source. KGs can be a…

Computation and Language · Computer Science 2023-10-18 Jiho Kim , Sungjin Park , Yeonsu Kwon , Yohan Jo , James Thorne , Edward Choi

Logical Consistency of Large Language Models in Fact-checking

In recent years, large language models (LLMs) have demonstrated significant success in performing varied natural language tasks such as language translation, question-answering, summarizing, fact-checking, etc. Despite LLMs' impressive…

Computation and Language · Computer Science 2025-03-03 Bishwamittra Ghosh , Sarah Hasan , Naheed Anjum Arafat , Arijit Khan

A Human-in-the-Loop, LLM-Centered Architecture for Knowledge-Graph Question Answering

Large Language Models (LLMs) excel at language understanding but remain limited in knowledge-intensive domains due to hallucinations, outdated information, and limited explainability. Text-based retrieval-augmented generation (RAG) helps…

Computation and Language · Computer Science 2026-02-09 Larissa Pusch , Alexandre Courtiol , Tim Conrad

KG-GPT: A General Framework for Reasoning on Knowledge Graphs Using Large Language Models

While large language models (LLMs) have made considerable advancements in understanding and generating unstructured text, their application in structured data remains underexplored. Particularly, using LLMs for complex reasoning tasks on…

Computation and Language · Computer Science 2023-10-18 Jiho Kim , Yeonsu Kwon , Yohan Jo , Edward Choi

Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language Modeling

Recently, ChatGPT, a representative large language model (LLM), has gained considerable attention due to its powerful emergent abilities. Some researchers suggest that LLMs could potentially replace structured knowledge bases like knowledge…

Computation and Language · Computer Science 2024-01-31 Linyao Yang , Hongyang Chen , Zhao Li , Xiao Ding , Xindong Wu

Resolving Conflicting Evidence in Automated Fact-Checking: A Study on Retrieval-Augmented LLMs

Large Language Models (LLMs) augmented with retrieval mechanisms have demonstrated significant potential in fact-checking tasks by integrating external knowledge. However, their reliability decreases when confronted with conflicting…

Computation and Language · Computer Science 2025-05-26 Ziyu Ge , Yuhao Wu , Daniel Wai Kit Chin , Roy Ka-Wei Lee , Rui Cao

GraphCheck: Breaking Long-Term Text Barriers with Extracted Knowledge Graph-Powered Fact-Checking

Large language models (LLMs) are widely used, but they often generate subtle factual errors, especially in long-form text. These errors are fatal in some specialized domains such as medicine. Existing fact-checking with grounding documents…

Computation and Language · Computer Science 2025-05-29 Yingjian Chen , Haoran Liu , Yinhong Liu , Jinxiang Xie , Rui Yang , Han Yuan , Yanran Fu , Peng Yuan Zhou , Qingyu Chen , James Caverlee , Irene Li

MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations

Large Language Models (LLMs) have inherent limitations of faithfulness and factuality, commonly referred to as hallucinations. Several benchmarks have been developed that provide a test bed for factuality evaluation within the context of…

Computation and Language · Computer Science 2025-10-24 Ernests Lavrinovics , Russa Biswas , Katja Hose , Johannes Bjerva

Exploring Knowledge Conflicts for Faithful LLM Reasoning: Benchmark and Method

Large language models (LLMs) have achieved remarkable success across a wide range of applications especially when augmented by external knowledge through retrieval-augmented generation (RAG). Despite their widespread adoption, recent…

Computation and Language · Computer Science 2026-04-14 Tianzhe Zhao , Jiaoyan Chen , Shuxiu Zhang , Haiping Zhu , Qika Lin , Jun Liu

TruthfulRAG: Resolving Factual-level Conflicts in Retrieval-Augmented Generation with Knowledge Graphs

Retrieval-Augmented Generation (RAG) has emerged as a powerful framework for enhancing the capabilities of Large Language Models (LLMs) by integrating retrieval-based methods with generative models. As external knowledge repositories…

Computation and Language · Computer Science 2025-11-14 Shuyi Liu , Yuming Shang , Xi Zhang

Generative Large Language Models in Automated Fact-Checking: A Survey

The dissemination of false information on online platforms presents a serious societal challenge. While manual fact-checking remains crucial, Large Language Models (LLMs) offer promising opportunities to support fact-checkers with their…

Computation and Language · Computer Science 2024-10-31 Ivan Vykopal , Matúš Pikuliak , Simon Ostermann , Marián Šimko