RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 251–300 of 2111 papers

Title	Date	Tasks	Status	Hype
One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models	May 30, 2024	Question AnsweringRAG	CodeCode Available	1
Neuro-Symbolic Query Compiler	May 17, 2025	RAGResponse Generation	CodeCode Available	1
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection Attacks	Mar 6, 2024	RAGRetrieval	CodeCode Available	1
PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization	Dec 19, 2024	InformativenessRAG	CodeCode Available	1
NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering	May 26, 2025	ChunkingLarge Language Model	CodeCode Available	1
AT-RAG: An Adaptive RAG Model Enhancing Query Efficiency with Topic Filtering and Iterative Reasoning	Oct 16, 2024	Decision MakingInformation Retrieval	CodeCode Available	1
AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning	Nov 25, 2024	HallucinationQuestion Answering	CodeCode Available	1
"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation	Dec 18, 2023	HallucinationLanguage Modelling	CodeCode Available	1
BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression	Oct 20, 2024	In-Context LearningLong-Context Understanding	CodeCode Available	1
MRD-RAG: Enhancing Medical Diagnosis with Multi-Round Retrieval-Augmented Generation	Apr 10, 2025	DiagnosticMedical Diagnosis	CodeCode Available	1
MRAMG-Bench: A Comprehensive Benchmark for Advancing Multimodal Retrieval-Augmented Multimodal Generation	Feb 6, 2025	Answer Generationmultimodal generation	CodeCode Available	1
Multi-Meta-RAG: Improving RAG for Multi-Hop Queries using Database Filtering with LLM-Extracted Metadata	Jun 19, 2024	RAGRetrieval	CodeCode Available	1
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks	Feb 25, 2025	MisinformationQuestion Answering	CodeCode Available	1
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems	Oct 17, 2024	Answer GenerationLanguage Modeling	CodeCode Available	1
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation	Jun 19, 2024	Question AnsweringRAG	CodeCode Available	1
Multi-modal Retrieval Augmented Multi-modal Generation: A Benchmark, Evaluate Metrics and Strong Baselines	Nov 25, 2024	multimodal generationRAG	CodeCode Available	1
Merging-Diverging Hybrid Transformer Networks for Survival Prediction in Head and Neck Cancer	Jul 7, 2023	DecoderPrediction	CodeCode Available	1
MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory	Apr 17, 2024	HallucinationLanguage Modeling	CodeCode Available	1
MedPix 2.0: A Comprehensive Multimodal Biomedical Data set for Advanced AI Applications	Jul 3, 2024	Knowledge GraphsRAG	CodeCode Available	1
Med-R^2: Crafting Trustworthy LLM Physicians via Retrieval and Reasoning of Evidence-Based Medicine	Jan 21, 2025	RAGRetrieval	CodeCode Available	1
GPIoT: Tailoring Small Language Models for IoT Program Synthesis and Development	Mar 2, 2025	Code GenerationProgram Synthesis	CodeCode Available	1
MetaGen Blended RAG: Higher Accuracy for Domain-Specific Q&A Without Fine-Tuning	May 23, 2025	Few-Shot LearningQuestion Answering	CodeCode Available	1
MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity	Dec 2, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
MacRAG: Compress, Slice, and Scale-up for Multi-Scale Adaptive Context RAG	May 10, 2025	RAGRetrieval	CodeCode Available	1
LotusFilter: Fast Diverse Nearest Neighbor Search via a Learned Cutoff Table	Jun 5, 2025	RAG	CodeCode Available	1
AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant	Nov 11, 2024	Decision MakingHallucination	CodeCode Available	1
Long-Context Inference with Retrieval-Augmented Speculative Decoding	Feb 27, 2025	Computational EfficiencyRAG	CodeCode Available	1
Long Context vs. RAG for LLMs: An Evaluation and Revisits	Dec 27, 2024	Question AnsweringRAG	CodeCode Available	1
LLM-Lasso: A Robust Framework for Domain-Informed Feature Selection and Regularization	Feb 15, 2025	feature selectionRAG	CodeCode Available	1
LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics	Apr 30, 2025	In-Context LearningObject	CodeCode Available	1
LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation	Apr 22, 2024	HallucinationRAG	CodeCode Available	1
LexRAG: Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation	Feb 28, 2025	ArticlesBenchmarking	CodeCode Available	1
Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards	Aug 21, 2024	ChunkingComputational Efficiency	CodeCode Available	1
LexDrafter: Terminology Drafting for Legislative Documents using Retrieval Augmented Generation	Mar 24, 2024	ArticlesRAG	CodeCode Available	1
Logic-RAG: Augmenting Large Multimodal Models with Visual-Spatial Knowledge for Road Scene Understanding	Mar 16, 2025	Autonomous DrivingRAG	CodeCode Available	1
L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding?	Oct 3, 2024	8kDocument Summarization	CodeCode Available	1
Less is More: Making Smaller Language Models Competent Subgraph Retrievers for Multi-hop KGQA	Oct 8, 2024	Knowledge GraphsRAG	CodeCode Available	1
Know Or Not: a library for evaluating out-of-knowledge base robustness	May 19, 2025	HallucinationRAG	CodeCode Available	1
KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge Tracing	May 26, 2025	Knowledge TracingMulti-hop Question Answering	CodeCode Available	1
Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization	Feb 27, 2025	Information RetrievalKnowledge Graphs	CodeCode Available	1
SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models	Feb 28, 2025	AttributeAutonomous Driving	CodeCode Available	1
Knowledge graph enhanced retrieval-augmented generation for failure mode and effects analysis	Jun 26, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
LaB-RAG: Label Boosted Retrieval Augmented Generation for Radiology Report Generation	Nov 25, 2024	Image CaptioningRAG	CodeCode Available	1
AgentAda: Skill-Adaptive Data Analytics for Tailored Insight Discovery	Apr 10, 2025	RAGRetrieval-augmented Generation	CodeCode Available	1
KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification	May 8, 2025	Knowledge GraphsRAG	CodeCode Available	1
JuDGE: Benchmarking Judgment Document Generation for Chinese Legal System	Mar 18, 2025	BenchmarkingIn-Context Learning	CodeCode Available	1
JORA: JAX Tensor-Parallel LoRA Library for Retrieval Augmented Fine-Tuning	Mar 17, 2024	GPUManagement	CodeCode Available	1
KG-Retriever: Efficient Knowledge Indexing for Retrieval-Augmented Large Language Models	Dec 7, 2024	Multi-hop Question AnsweringNavigate	CodeCode Available	1
InteractiveSurvey: An LLM-based Personalized and Interactive Survey Paper Generation System	Mar 31, 2025	Paper generationRAG	CodeCode Available	1
Jasper and Stella: distillation of SOTA embedding models	Dec 26, 2024	RAGRepresentation Learning	CodeCode Available	1

Show:10 25 50

← PrevPage 6 of 43Next →

No leaderboard results yet.