View a PDF of the paper titled MRAG-Suite: A Diagnostic Evaluation Platform for Visual Retrieval-Augmented Generation, by Yuelyu Ji and 2 other authors
Abstract:Multimodal Retrieval-Augmented Generation (Visual RAG) significantly advances question answering by integrating visual and textual evidence. Yet, current evaluations fail to systematically account for query difficulty and ambiguity. We propose MRAG-Suite, a diagnostic evaluation platform integrating diverse multimodal benchmarks (WebQA, Chart-RAG, Visual-RAG, MRAG-Bench). We introduce difficulty-based and ambiguity-aware filtering strategies, alongside MM-RAGChecker, a claim-level diagnostic tool. Our results demonstrate substantial accuracy reductions under difficult and ambiguous queries, highlighting prevalent hallucinations. MM-RAGChecker effectively diagnoses these issues, guiding future improvements in Visual RAG systems.
Submission history
From: Yuelyu Ji [view email]
[v1]
Mon, 29 Sep 2025 03:55:28 UTC (942 KB)
[v2]
Mon, 15 Dec 2025 01:52:43 UTC (939 KB)
[v3]
Tue, 13 Jan 2026 13:26:27 UTC (939 KB)
Source link
#Diagnostic #Evaluation #Platform #Visual #RetrievalAugmented #Generation

























