SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 201250 of 2111 papers

TitleStatusHype
Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented GenerationCode2
PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric AgentsCode2
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical QuestionsCode2
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language ModelsCode2
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language ModelsCode2
MeMemo: On-device Retrieval Augmentation for Private and Personalized Text GenerationCode2
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG SystemsCode2
Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented GenerationCode2
Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QACode2
LumberChunker: Long-Form Narrative Document SegmentationCode2
CodeRAG-Bench: Can Retrieval Augment Code Generation?Code2
Evaluating RAG-Fusion with RAGElo: an Automated Elo-based FrameworkCode2
InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized RationalesCode2
PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision MakersCode2
UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance AssessorCode2
CtrlA: Adaptive Retrieval-Augmented Generation via Inherent ControlCode2
Empowering Large Language Models to Set up a Knowledge Retrieval Indexer via Self-LearningCode2
Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam GenerationCode2
Evaluation of Retrieval-Augmented Generation: A SurveyCode2
Telco-RAG: Navigating the Challenges of Retrieval-Augmented Language Models for TelecommunicationsCode2
LongEmbed: Extending Embedding Models for Long Context RetrievalCode2
Superposition Prompting: Improving and Accelerating Retrieval-Augmented GenerationCode2
AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information RetrievalCode2
ARAGOG: Advanced RAG Output GradingCode2
Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based RetrieversCode2
DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language ModelsCode2
RAGGED: Towards Informed Design of Retrieval Augmented Generation SystemsCode2
RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-FeedbackCode2
Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented GenerationCode2
The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)Code2
ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented AgentsCode2
EVOR: Evolving Retrieval for Code GenerationCode2
RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language ModelCode2
CyberMetric: A Benchmark Dataset based on Retrieval-Augmented Generation for Evaluating LLMs in Cybersecurity KnowledgeCode2
LitLLM: A Toolkit for Scientific Literature ReviewCode2
LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and DistillationCode2
Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language ModelsCode2
The Power of Noise: Redefining Retrieval for RAG SystemsCode2
RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language ModelsCode2
Biomedical knowledge graph-optimized prompt generation for large language modelsCode2
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation SystemsCode2
Benchmarking Large Language Models in Retrieval-Augmented GenerationCode2
Huatuo-26M, a Large-scale Chinese Medical QA DatasetCode2
SimpleDoc: Multi-Modal Document Understanding with Dual-Cue Page Retrieval and Iterative RefinementCode1
Constructing and Evaluating Declarative RAG Pipelines in PyTerrierCode1
FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented GenerationCode1
DRAGged into Conflicts: Detecting and Addressing Conflicting Sources in Search-Augmented LLMsCode1
SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from DesignCode1
Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation SystemsCode1
LotusFilter: Fast Diverse Nearest Neighbor Search via a Learned Cutoff TableCode1
Show:102550
← PrevPage 5 of 43Next →

No leaderboard results yet.