SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 2650 of 2111 papers

TitleStatusHype
RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query ParallelismCode5
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented GenerationCode5
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive AnnotationsCode5
MiniRAG: Towards Extremely Simple Retrieval-Augmented GenerationCode5
Agentic Retrieval-Augmented Generation: A Survey on Agentic RAGCode5
Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge TasksCode5
TrustRAG: An Information Assistant with Retrieval Augmented GenerationCode5
Search-o1: Agentic Search-Enhanced Large Reasoning ModelsCode5
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question AnsweringCode4
Benchmarking Retrieval-Augmented Generation for MedicineCode4
Retrieval-Augmented Generation for Knowledge-Intensive NLP TasksCode4
Retrieval-Augmented Generation for Large Language Models: A SurveyCode4
ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process RewardingCode4
Region-Aware Text-to-Image Generation via Hard Binding and Soft RefinementCode4
Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented GenerationCode4
Retrieval-Augmented Generation with Hierarchical KnowledgeCode4
A Survey of LLM DATACode4
2D Matryoshka Sentence EmbeddingsCode4
Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt OptimizationCode4
OnPrem.LLM: A Privacy-Conscious Document Intelligence ToolkitCode4
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement LearningCode4
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented GenerationCode4
DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world EnvironmentsCode4
Data-Prep-Kit: getting your data ready for LLM application developmentCode4
Generative Representational Instruction TuningCode4
Show:102550
← PrevPage 2 of 85Next →

No leaderboard results yet.