SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 18911900 of 2111 papers

TitleStatusHype
A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance JudgmentCode0
Visual-RAG: Benchmarking Text-to-Image Retrieval Augmented Generation for Visual Knowledge Intensive QueriesCode0
Unipa-GPT: Large Language Models for university-oriented QA in ItalianCode0
Evidence-backed Fact Checking using RAG and Few-Shot In-Context Learning with LLMsCode0
LLM Robustness Against Misinformation in Biomedical Question AnsweringCode0
LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large ContextsCode0
CommunityKG-RAG: Leveraging Community Structures in Knowledge Graphs for Advanced Retrieval-Augmented Generation in Fact-CheckingCode0
Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical DomainCode0
LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and MitigationCode0
Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context UnderstandingCode0
Show:102550
← PrevPage 190 of 212Next →

No leaderboard results yet.