RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 26–50 of 2111 papers

Title	Date	Tasks	Status	Hype
RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query Parallelism	Jun 30, 2025	Question AnsweringRAG	CodeCode Available	5
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation	Aug 15, 2024	DiagnosticRAG	CodeCode Available	5
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations	Dec 10, 2024	AttributeBenchmarking	CodeCode Available	5
MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation	Jan 12, 2025	RAGRetrieval	CodeCode Available	5
Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG	Jan 15, 2025	Natural Language UnderstandingRAG	CodeCode Available	5
Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks	Dec 20, 2024	AllRAG	CodeCode Available	5
TrustRAG: An Information Assistant with Retrieval Augmented Generation	Feb 19, 2025	Answer GenerationChunking	CodeCode Available	5
Search-o1: Agentic Search-Enhanced Large Reasoning Models	Jan 9, 2025	Code Generation	CodeCode Available	5
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering	Feb 12, 2024	Common Sense ReasoningGraph Classification	CodeCode Available	4
Benchmarking Retrieval-Augmented Generation for Medicine	Feb 20, 2024	BenchmarkingInformation Retrieval	CodeCode Available	4
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks	May 22, 2020	Fact VerificationQuestion Answering	CodeCode Available	4
Retrieval-Augmented Generation for Large Language Models: A Survey	Dec 18, 2023	HallucinationRAG	CodeCode Available	4
ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding	Jan 14, 2025	RAGRetrieval	CodeCode Available	4
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement	Nov 10, 2024	AttributeImage Generation	CodeCode Available	4
Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation	Feb 4, 2025	BenchmarkingInformation Retrieval	CodeCode Available	4
Retrieval-Augmented Generation with Hierarchical Knowledge	Mar 13, 2025	Multi-hop Question AnsweringQuestion Answering	CodeCode Available	4
A Survey of LLM DATA	May 24, 2025	Large Language ModelManagement	CodeCode Available	4
2D Matryoshka Sentence Embeddings	Feb 22, 2024	RAGRepresentation Learning	CodeCode Available	4
Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization	Apr 2, 2024	RAGRetrieval	CodeCode Available	4
OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit	May 12, 2025	GPUPrivacy Preserving	CodeCode Available	4
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning	May 22, 2025	MemorizationRAG	CodeCode Available	4
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation	Aug 8, 2024	ChunkingFact Checking	CodeCode Available	4
DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments	Apr 4, 2025	NavigatePrompt Engineering	CodeCode Available	4
Data-Prep-Kit: getting your data ready for LLM application development	Sep 26, 2024	CPULanguage Modeling	CodeCode Available	4
Generative Representational Instruction Tuning	Feb 15, 2024	Language ModelingLanguage Modelling	CodeCode Available	4

Show:10 25 50

← PrevPage 2 of 85Next →

No leaderboard results yet.