RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–250 of 2111 papers

Title	Date	Tasks	Status	Hype
Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation	Jul 15, 2024	Information RetrievalKnowledge Graphs	CodeCode Available	2
PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents	Jul 12, 2024	Information RetrievalQuestion Answering	CodeCode Available	2
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions	Jul 6, 2024	Question AnsweringRAG	CodeCode Available	2
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models	Jul 6, 2024	Medical DiagnosisRAG	CodeCode Available	2
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models	Jul 4, 2024	RAGRetrieval-augmented Generation	CodeCode Available	2
MeMemo: On-device Retrieval Augmentation for Private and Personalized Text Generation	Jul 2, 2024	HallucinationRAG	CodeCode Available	2
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems	Jul 1, 2024	RAG	CodeCode Available	2
Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation	Jun 26, 2024	HallucinationKnowledge Base Question Answering	CodeCode Available	2
Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA	Jun 25, 2024	BenchmarkingLong-Context Understanding	CodeCode Available	2
LumberChunker: Long-Form Narrative Document Segmentation	Jun 25, 2024	ChunkingForm	CodeCode Available	2
CodeRAG-Bench: Can Retrieval Augment Code Generation?	Jun 20, 2024	Code GenerationRAG	CodeCode Available	2
Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework	Jun 20, 2024	HallucinationQuestion Answering	CodeCode Available	2
InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales	Jun 19, 2024	DenoisingIn-Context Learning	CodeCode Available	2
PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers	Jun 18, 2024	Decision MakingRAG	CodeCode Available	2
UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor	Jun 10, 2024	RAGRetrieval	CodeCode Available	2
CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control	May 29, 2024	RAGResponse Generation	CodeCode Available	2
Empowering Large Language Models to Set up a Knowledge Retrieval Indexer via Self-Learning	May 27, 2024	Question AnsweringRAG	CodeCode Available	2
Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation	May 22, 2024	InformativenessLanguage Modeling	CodeCode Available	2
Evaluation of Retrieval-Augmented Generation: A Survey	May 13, 2024	Information RetrievalRAG	CodeCode Available	2
Telco-RAG: Navigating the Challenges of Retrieval-Augmented Language Models for Telecommunications	Apr 24, 2024	RAGRetrieval	CodeCode Available	2
LongEmbed: Extending Embedding Models for Long Context Retrieval	Apr 18, 2024	4k8k	CodeCode Available	2
Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation	Apr 10, 2024	Question AnsweringRAG	CodeCode Available	2
AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval	Apr 9, 2024	AllInformation Retrieval	CodeCode Available	2
ARAGOG: Advanced RAG Output Grading	Apr 1, 2024	Document EmbeddingLanguage Modeling	CodeCode Available	2
Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers	Mar 22, 2024	Information Retrieval	CodeCode Available	2
DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models	Mar 15, 2024	RAGRetrieval	CodeCode Available	2
RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems	Mar 14, 2024	DecoderQuestion Answering	CodeCode Available	2
RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback	Mar 11, 2024	RAGRetrieval	CodeCode Available	2
Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation	Feb 28, 2024	Code GenerationIn-Context Learning	CodeCode Available	2
The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)	Feb 23, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents	Feb 21, 2024	Active LearningPosition	CodeCode Available	2
EVOR: Evolving Retrieval for Code Generation	Feb 19, 2024	Code GenerationRAG	CodeCode Available	2
RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model	Feb 16, 2024	Autonomous DrivingDecision Making	CodeCode Available	2
CyberMetric: A Benchmark Dataset based on Retrieval-Augmented Generation for Evaluating LLMs in Cybersecurity Knowledge	Feb 12, 2024	General KnowledgeMultiple-choice	CodeCode Available	2
LitLLM: A Toolkit for Scientific Literature Review	Feb 2, 2024	RAGRetrieval	CodeCode Available	2
LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation	Jan 30, 2024	HallucinationKnowledge Distillation	CodeCode Available	2
Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models	Jan 27, 2024	Medical Question AnsweringMultiple-choice	CodeCode Available	2
The Power of Noise: Redefining Retrieval for RAG Systems	Jan 26, 2024	Information RetrievalRAG	CodeCode Available	2
RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models	Dec 31, 2023	HallucinationRAG	CodeCode Available	2
Biomedical knowledge graph-optimized prompt generation for large language models	Nov 29, 2023	BenchmarkingKnowledge Graphs	CodeCode Available	2
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems	Nov 16, 2023	RAGRetrieval	CodeCode Available	2
Benchmarking Large Language Models in Retrieval-Augmented Generation	Sep 4, 2023	Benchmarkingcounterfactual	CodeCode Available	2
Huatuo-26M, a Large-scale Chinese Medical QA Dataset	May 2, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
SimpleDoc: Multi-Modal Document Understanding with Dual-Cue Page Retrieval and Iterative Refinement	Jun 16, 2025	document understandingQuestion Answering	CodeCode Available	1
Constructing and Evaluating Declarative RAG Pipelines in PyTerrier	Jun 12, 2025	Natural QuestionsRAG	CodeCode Available	1
FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation	Jun 10, 2025	RAGRetrieval	CodeCode Available	1
DRAGged into Conflicts: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs	Jun 10, 2025	RAGRetrieval-augmented Generation	CodeCode Available	1
SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design	Jun 9, 2025	Code GenerationRAG	CodeCode Available	1
Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems	Jun 6, 2025	RAGRetrieval	CodeCode Available	1
LotusFilter: Fast Diverse Nearest Neighbor Search via a Learned Cutoff Table	Jun 5, 2025	RAG	CodeCode Available	1

Show:10 25 50

← PrevPage 5 of 43Next →

No leaderboard results yet.