RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 401–450 of 2111 papers

Title	Date	Tasks	Status	Hype	Score
Citekit: A Modular Toolkit for Large Language Model Citation Generation	Aug 6, 2024	Language ModelingLanguage Modelling	CodeCode Available	1	5
mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge Graphs	May 16, 2025	Information RetrievalKnowledge Graphs	CodeCode Available	1	5
EgoNormia: Benchmarking Physical Social Norm Understanding	Feb 27, 2025	Answer GenerationBenchmarking	CodeCode Available	1	5
Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases	Mar 15, 2024	RAGRetrieval	CodeCode Available	1	5
Pneuma: Leveraging LLMs for Tabular Data Representation and Retrieval in an End-to-End System	Apr 12, 2025	Information RetrievalRAG	CodeCode Available	1	5
InteractiveSurvey: An LLM-based Personalized and Interactive Survey Paper Generation System	Mar 31, 2025	Paper generationRAG	CodeCode Available	1	5
Dubo-SQL: Diverse Retrieval-Augmented Generation and Fine Tuning for Text-to-SQL	Apr 19, 2024	RAGRetrieval	CodeCode Available	1	5
Dynamic Retrieval Augmented Generation of Ontologies using Artificial Intelligence (DRAGON-AI)	Dec 18, 2023	RAGRetrieval	CodeCode Available	1	5
Jasper and Stella: distillation of SOTA embedding models	Dec 26, 2024	RAGRepresentation Learning	CodeCode Available	1	5
DRAGged into Conflicts: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs	Jun 10, 2025	RAGRetrieval-augmented Generation	CodeCode Available	1	5
Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question Coverage	Oct 20, 2024	Answer GenerationRAG	CodeCode Available	1	5
ShizishanGPT: An Agricultural Large Language Model Integrating Tools and Resources	Sep 20, 2024	Language ModelingLanguage Modelling	CodeCode Available	1	5
RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization	May 16, 2025	RAGSynthetic Data Generation	CodeCode Available	1	5
JuDGE: Benchmarking Judgment Document Generation for Chinese Legal System	Mar 18, 2025	BenchmarkingIn-Context Learning	CodeCode Available	1	5
Contextual Compression in Retrieval-Augmented Generation for Large Language Models: A Survey	Sep 20, 2024	RAGRetrieval	CodeCode Available	1	5
DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation	Jun 9, 2024	Common Sense ReasoningDenoising	CodeCode Available	1	5
PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization	Dec 19, 2024	InformativenessRAG	CodeCode Available	1	5
CoFE-RAG: A Comprehensive Full-chain Evaluation Framework for Retrieval-Augmented Generation with Enhanced Data Diversity	Oct 16, 2024	ChunkingDiversity	CodeCode Available	1	5
Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training	May 31, 2024	HallucinationMulti-Task Learning	CodeCode Available	1	5
Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models	May 26, 2025	BenchmarkingRAG	CodeCode Available	1	5
ECG Semantic Integrator (ESI): A Foundation ECG Model Pretrained with LLM-Enhanced Cardiological Text	May 26, 2024	Arrhythmia DetectionRAG	CodeCode Available	1	5
Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models	Mar 20, 2025	counterfactualRAG	CodeCode Available	1	5
PeerQA: A Scientific Question Answering Dataset from Peer Reviews	Feb 19, 2025	answerability predictionAnswer Generation	CodeCode Available	1	5
Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards	May 7, 2025	BenchmarkingHallucination	CodeCode Available	1	5
Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience Report	Oct 21, 2024	Information RetrievalRAG	CodeCode Available	1	5
KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge Tracing	May 26, 2025	Knowledge TracingMulti-hop Question Answering	CodeCode Available	1	5
Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation Systems	Mar 19, 2025	Question AnsweringRAG	CodeCode Available	1	5
Combining Large Language Models with Static Analyzers for Code Review Generation	Feb 10, 2025	RAGRetrieval-augmented Generation	CodeCode Available	1	5
One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models	May 30, 2024	Question AnsweringRAG	CodeCode Available	1	5
AlignRAG: Leveraging Critique Learning for Evidence-Sensitive Retrieval-Augmented Reasoning	Apr 21, 2025	RAGRetrieval	CodeCode Available	1	5
ORAN-Bench-13K: An Open Source Benchmark for Assessing LLMs in Open Radio Access Networks	Jul 8, 2024	Anomaly DetectionCode Generation	CodeCode Available	1	5
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking	Feb 28, 2025	RAGRetrieval	CodeCode Available	1	5
"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation	Dec 18, 2023	HallucinationLanguage Modelling	CodeCode Available	1	5
Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation	Apr 10, 2024	AllRAG	CodeCode Available	1	5
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs	Dec 10, 2024	Knowledge GraphsRAG	CodeCode Available	1	5
NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering	May 26, 2025	ChunkingLarge Language Model	CodeCode Available	1	5
NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval	Sep 4, 2024	Image RetrievalRAG	CodeCode Available	1	5
Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards	Aug 21, 2024	ChunkingComputational Efficiency	CodeCode Available	1	5
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection Attacks	Mar 6, 2024	RAGRetrieval	CodeCode Available	1	5
Deep Equilibrium Object Detection	Aug 18, 2023	DecoderObject	CodeCode Available	1	5
Docopilot: Improving Multimodal Models for Document-Level Understanding	Jan 1, 2025	document understandingRAG	CodeCode Available	1	5
Neuro-Symbolic Query Compiler	May 17, 2025	RAGResponse Generation	CodeCode Available	1	5
OverThink: Slowdown Attacks on Reasoning LLMs	Feb 4, 2025	RAG	CodeCode Available	1	5
Multi-Meta-RAG: Improving RAG for Multi-Hop Queries using Database Filtering with LLM-Extracted Metadata	Jun 19, 2024	RAGRetrieval	CodeCode Available	1	5
LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics	Apr 30, 2025	In-Context LearningObject	CodeCode Available	1	5
Constructing and Evaluating Declarative RAG Pipelines in PyTerrier	Jun 12, 2025	Natural QuestionsRAG	CodeCode Available	1	5
Multi-modal Retrieval Augmented Multi-modal Generation: A Benchmark, Evaluate Metrics and Strong Baselines	Nov 25, 2024	multimodal generationRAG	CodeCode Available	1	5
CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering	Sep 29, 2024	Graph Question AnsweringQuestion Answering	CodeCode Available	1	5
C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models	Feb 5, 2024	RAGRetrieval	CodeCode Available	1	5
ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence	Apr 16, 2024	Question AnsweringRAG	CodeCode Available	1	5

Show:10 25 50

← PrevPage 9 of 43Next →

No leaderboard results yet.