SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 401425 of 2111 papers

TitleStatusHype
Citekit: A Modular Toolkit for Large Language Model Citation GenerationCode1
mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge GraphsCode1
EgoNormia: Benchmarking Physical Social Norm UnderstandingCode1
PAKTON: A Multi-Agent Framework for Question Answering in Long Legal AgreementsCode1
DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented GenerationCode1
One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language ModelsCode1
Docopilot: Improving Multimodal Models for Document-Level UnderstandingCode1
NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for RetrievalCode1
RAGSynth: Synthetic Data for Robust and Faithful RAG Component OptimizationCode1
Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience ReportCode1
InsQABench: Benchmarking Chinese Insurance Domain Question Answering with Large Language ModelsCode1
InteractiveSurvey: An LLM-based Personalized and Interactive Survey Paper Generation SystemCode1
Contextual Compression in Retrieval-Augmented Generation for Large Language Models: A SurveyCode1
NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question AnsweringCode1
"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented GenerationCode1
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection AttacksCode1
Benchmarking Multimodal Knowledge Conflict for Large Multimodal ModelsCode1
Neuro-Symbolic Query CompilerCode1
Not All Contexts Are Equal: Teaching LLMs Credibility-aware GenerationCode1
Benchmarking LLM Faithfulness in RAG with Evolving LeaderboardsCode1
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point ThinkingCode1
Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM CollaborationCode1
Deep Equilibrium Object DetectionCode1
AlignRAG: Leveraging Critique Learning for Evidence-Sensitive Retrieval-Augmented ReasoningCode1
MRAMG-Bench: A Comprehensive Benchmark for Advancing Multimodal Retrieval-Augmented Multimodal GenerationCode1
Show:102550
← PrevPage 17 of 85Next →

No leaderboard results yet.