SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 581590 of 2111 papers

TitleStatusHype
Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering0
ASTRID -- An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems0
Composing Open-domain Vision with RAG for Ocean Monitoring and Conservation0
Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework0
ASTRAL: Automated Safety Testing of Large Language Models0
Comparing the Utility, Preference, and Performance of Course Material Search Functionality and Retrieval-Augmented Generation Large Language Model (RAG-LLM) AI Chatbots in Information-Seeking Tasks0
Comparative Analysis of Retrieval Systems in the Real World0
Assessing the Robustness of Retrieval-Augmented Generation Systems in K-12 Educational Question Answering with Knowledge Discrepancies0
AgentNet: Decentralized Evolutionary Coordination for LLM-based Multi-Agent Systems0
BadJudge: Backdoor Vulnerabilities of LLM-as-a-Judge0
Show:102550
← PrevPage 59 of 212Next →

No leaderboard results yet.