SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) combines retrieval-based and generation-based models. A retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, conditions on the retrieved text to generate a response. Grounding generation in retrieved text improves the accuracy and coherence of the output, especially in tasks that require detailed knowledge or long-context handling.
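The retrieve-then-generate flow described above can be sketched in a few lines of Python. This is only an illustration under simple assumptions: the retriever is a toy bag-of-words cosine ranker rather than a production dense or BM25 retriever, and the names `retrieve`, `build_prompt`, and `rag_answer`, as well as the `generator` callable, are hypothetical and stand in for whatever retriever and language model a real system would use.

```python
import math
import re
from collections import Counter
from typing import Callable, List


def tokenize(text: str) -> List[str]:
    # Lowercase and keep only word characters, so "France." matches "france".
    return re.findall(r"\w+", text.lower())


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two bag-of-words count vectors.
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, corpus: List[str], k: int = 2) -> List[str]:
    # Retrieval step: rank passages by similarity to the query, keep the top k.
    q = Counter(tokenize(query))
    ranked = sorted(
        corpus,
        key=lambda doc: cosine(q, Counter(tokenize(doc))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query: str, passages: List[str]) -> str:
    # Put the retrieved passages in front of the question.
    context = "\n".join(f"- {p}" for p in passages)
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"


def rag_answer(
    query: str,
    corpus: List[str],
    generator: Callable[[str], str],  # hypothetical: any prompt -> text function
    k: int = 2,
) -> str:
    # Generation step: the generator sees the query plus retrieved context.
    passages = retrieve(query, corpus, k=k)
    return generator(build_prompt(query, passages))


if __name__ == "__main__":
    corpus = [
        "Paris is the capital of France.",
        "The Nile is a river in Africa.",
        "Python is a programming language.",
    ]
    # An echo "generator" stands in for a real language model here.
    print(rag_answer("What is the capital of France?", corpus, lambda p: p, k=1))
```

Passing the generator in as a callable keeps the sketch independent of any particular model API; swapping in a real model would only change that one argument.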

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.
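Two of the metrics named above, exact match and F1, are commonly computed over normalized answer strings in question answering. The following is a simplified sketch in the style of SQuAD-type evaluation, not the official scoring script of any of the listed datasets; the normalization rules (lowercasing, dropping punctuation and English articles) and the function names are assumptions made for illustration.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    # Lowercase, drop punctuation and English articles, collapse whitespace.
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold: str) -> bool:
    # True only when the normalized strings are identical.
    return normalize(prediction) == normalize(gold)


def token_f1(prediction: str, gold: str) -> float:
    # Harmonic mean of token-level precision and recall.
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    if not pred_tokens or not gold_tokens:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred_tokens == gold_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower.", "eiffel tower"))
    print(token_f1("Eiffel Tower in Paris", "Eiffel Tower"))
```

Exact match rewards only fully correct answers, while token-level F1 gives partial credit when a generated answer contains the gold answer plus extra words.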

Title	Date	Tasks	Status
MHTS: Multi-Hop Tree Structure Framework for Generating Difficulty-Controllable QA Datasets for RAG Evaluation	Mar 29, 2025	Answer Generation, Benchmarking	Unverified
Citegeist: Automated Generation of Related Work Analysis on the arXiv Corpus	Mar 29, 2025	Articles, RAG	Code Available
Memory-Aware and Uncertainty-Guided Retrieval for Multi-Hop Question Answering	Mar 29, 2025	Multi-hop Question Answering, Question Answering	Unverified
DAT: Dynamic Alpha Tuning for Hybrid Retrieval in Retrieval-Augmented Generation	Mar 29, 2025	Information Retrieval, Language Modeling	Unverified
Understanding Inequality of LLM Fact-Checking over Geographic Regions with Agent and Retrieval models	Mar 28, 2025	Fact Checking, General Knowledge	Unverified
Real-Time Evaluation Models for RAG: Who Detects Hallucinations Best?	Mar 27, 2025	Hallucination, Hallucination Evaluation	Unverified
MemInsight: Autonomous Memory Augmentation for LLM Agents	Mar 27, 2025	Conversational Recommendation, Language Modeling	Unverified
Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack	Mar 27, 2025	Hallucination, RAG	Unverified
AutoPsyC: Automatic Recognition of Psychodynamic Conflicts from Semi-structured Interviews with Large Language Models	Mar 27, 2025	Diagnostic, parameter-efficient fine-tuning	Unverified
A Survey of Multimodal Retrieval-Augmented Generation	Mar 26, 2025	Information Retrieval, Question Answering	Unverified
