SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 491500 of 2111 papers

TitleStatusHype
A System for Comprehensive Assessment of RAG FrameworksCode0
Conversational Gold: Evaluating Personalized Conversational Search System using Gold NuggetsCode0
Mitigating Bias in RAG: Controlling the EmbedderCode0
Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-ReasoningCode0
Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting FrameworkCode0
MindScope: Exploring cognitive biases in large language models through Multi-Agent SystemsCode0
Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented GenerationCode0
A Hybrid Approach to Information Retrieval and Answer Generation for Regulatory TextsCode0
MES-RAG: Bringing Multi-modal, Entity-Storage, and Secure Enhancements to RAGCode0
Continual Stereo Matching of Continuous Driving Scenes With Growing ArchitectureCode0
Show:102550
← PrevPage 50 of 212Next →

No leaderboard results yet.