Deepchecks: Evaluating Retrieval-Augmented Generation (RAG)

Assaf Gerner; Netta Madvil; Nadav Barak; Alex Zaikman; Jonatan Liberman; Liron Hamra; Rotem Brazilay; Shay Tsadok; Yaron Friedman; Neal Harow; Noam Bresler; Shir Chorev; Philip Tannor; Lior Rokach

Artificial Intelligence 2026-05-15 v1

Abstract

Large Language Models (LLMs) augmented with Retrieval-Augmented Generation (RAG) techniques are revolutionizing applications across multiple domains, such as healthcare, finance, and customer service. Despite their potential, evaluating RAG systems remains a complex challenge due to the stochastic nature of generated outputs and the intricate interplay between retrieval and generation components. This paper introduces Deepchecks, a comprehensive framework tailored for evaluating RAG applications. The framework addresses RAG evaluation through a multi-faceted approach, root cause analysis, and production monitoring. By ensuring alignment with application-specific requirements, the Deepchecks framework provides a robust foundation for assessing reliability, relevance, and user satisfaction in RAG systems.

Keywords

retrieval-augmented generation; large language model; evaluation; automated reasoning

Cite

@article{arxiv.2605.14488,
  title  = {Deepchecks: Evaluating Retrieval-Augmented Generation (RAG)},
  author = {Assaf Gerner and Netta Madvil and Nadav Barak and Alex Zaikman and Jonatan Liberman and Liron Hamra and Rotem Brazilay and Shay Tsadok and Yaron Friedman and Neal Harow and Noam Bresler and Shir Chorev and Philip Tannor and Lior Rokach},
  journal= {arXiv preprint arXiv:2605.14488},
  year   = {2026}
}
