RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 301–350 of 2111 papers

Title	Date	Tasks	Status	Hype
JuDGE: Benchmarking Judgment Document Generation for Chinese Legal System	Mar 18, 2025	BenchmarkingIn-Context Learning	CodeCode Available	1
Logic-RAG: Augmenting Large Multimodal Models with Visual-Spatial Knowledge for Road Scene Understanding	Mar 16, 2025	Autonomous DrivingRAG	CodeCode Available	1
SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction	Mar 3, 2025	RAGRetrieval	CodeCode Available	1
GPIoT: Tailoring Small Language Models for IoT Program Synthesis and Development	Mar 2, 2025	Code GenerationProgram Synthesis	CodeCode Available	1
LexRAG: Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation	Feb 28, 2025	ArticlesBenchmarking	CodeCode Available	1
SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models	Feb 28, 2025	AttributeAutonomous Driving	CodeCode Available	1
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking	Feb 28, 2025	RAGRetrieval	CodeCode Available	1
ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models	Feb 27, 2025	Question AnsweringRAG	CodeCode Available	1
Long-Context Inference with Retrieval-Augmented Speculative Decoding	Feb 27, 2025	Computational EfficiencyRAG	CodeCode Available	1
Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization	Feb 27, 2025	Information RetrievalKnowledge Graphs	CodeCode Available	1
EgoNormia: Benchmarking Physical Social Norm Understanding	Feb 27, 2025	Answer GenerationBenchmarking	CodeCode Available	1
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks	Feb 25, 2025	MisinformationQuestion Answering	CodeCode Available	1
Code Summarization Beyond Function Level	Feb 23, 2025	Code SummarizationFew-Shot Learning	CodeCode Available	1
PeerQA: A Scientific Question Answering Dataset from Peer Reviews	Feb 19, 2025	answerability predictionAnswer Generation	CodeCode Available	1
LLM-Lasso: A Robust Framework for Domain-Informed Feature Selection and Regularization	Feb 15, 2025	feature selectionRAG	CodeCode Available	1
Graph RAG-Tool Fusion	Feb 11, 2025	RAGRetrieval	CodeCode Available	1
Combining Large Language Models with Static Analyzers for Code Review Generation	Feb 10, 2025	RAGRetrieval-augmented Generation	CodeCode Available	1
RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning	Feb 10, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding	Feb 8, 2025	RAG	CodeCode Available	1
MRAMG-Bench: A Comprehensive Benchmark for Advancing Multimodal Retrieval-Augmented Multimodal Generation	Feb 6, 2025	Answer Generationmultimodal generation	CodeCode Available	1
OverThink: Slowdown Attacks on Reasoning LLMs	Feb 4, 2025	RAG	CodeCode Available	1
Riddle Me This! Stealthy Membership Inference for Retrieval-Augmented Generation	Feb 1, 2025	Membership Inference AttackRAG	CodeCode Available	1
RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects	Jan 30, 2025	counterfactualRAG	CodeCode Available	1
CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter	Jan 25, 2025	Computational EfficiencyRAG	CodeCode Available	1
Med-R^2: Crafting Trustworthy LLM Physicians via Retrieval and Reasoning of Evidence-Based Medicine	Jan 21, 2025	RAGRetrieval	CodeCode Available	1
Chat3GPP: An Open-Source Retrieval-Augmented Generation Framework for 3GPP Documents	Jan 20, 2025	ChunkingRAG	CodeCode Available	1
InsQABench: Benchmarking Chinese Insurance Domain Question Answering with Large Language Models	Jan 19, 2025	BenchmarkingQuestion Answering	CodeCode Available	1
Docopilot: Improving Multimodal Models for Document-Level Understanding	Jan 1, 2025	document understandingRAG	CodeCode Available	1
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search	Dec 30, 2024	RAGRetrieval	CodeCode Available	1
Plancraft: an evaluation dataset for planning with LLM agents	Dec 30, 2024	Decision MakingMinecraft	CodeCode Available	1
Long Context vs. RAG for LLMs: An Evaluation and Revisits	Dec 27, 2024	Question AnsweringRAG	CodeCode Available	1
RAG with Differential Privacy	Dec 26, 2024	General KnowledgeRAG	CodeCode Available	1
Jasper and Stella: distillation of SOTA embedding models	Dec 26, 2024	RAGRepresentation Learning	CodeCode Available	1
Efficient fine-tuning methodology of text embedding models for information retrieval: contrastive learning penalty (clp)	Dec 23, 2024	Contrastive LearningInformation Retrieval	CodeCode Available	1
Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG	Dec 20, 2024	Classificationimage-classification	CodeCode Available	1
PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization	Dec 19, 2024	InformativenessRAG	CodeCode Available	1
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment	Dec 18, 2024	BenchmarkingRAG	CodeCode Available	1
Context-DPO: Aligning Language Models for Context-Faithfulness	Dec 18, 2024	RAGRetrieval-augmented Generation	CodeCode Available	1
EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation	Dec 17, 2024	Question AnsweringRAG	CodeCode Available	1
RAG Playground: A Framework for Systematic Evaluation of Retrieval Strategies and Prompt Engineering in RAG Systems	Dec 16, 2024	Prompt EngineeringRAG	CodeCode Available	1
SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation	Dec 14, 2024	RAGRetrieval-augmented Generation	CodeCode Available	1
CaLoRAify: Calorie Estimation with Visual-Text Pairing and LoRA-Driven Visual Language Models	Dec 13, 2024	RAG	CodeCode Available	1
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs	Dec 10, 2024	Knowledge GraphsRAG	CodeCode Available	1
KG-Retriever: Efficient Knowledge Indexing for Retrieval-Augmented Large Language Models	Dec 7, 2024	Multi-hop Question AnsweringNavigate	CodeCode Available	1
SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot	Dec 6, 2024	Decision MakingRAG	CodeCode Available	1
Retrieval-Augmented Machine Translation with Unstructured Knowledge	Dec 5, 2024	Knowledge GraphsMachine Translation	CodeCode Available	1
HEAL: Hierarchical Embedding Alignment Loss for Improved Retrieval and Representation Learning	Dec 5, 2024	Contrastive LearningDocument Classification	CodeCode Available	1
MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity	Dec 2, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning	Nov 25, 2024	HallucinationQuestion Answering	CodeCode Available	1
Multi-modal Retrieval Augmented Multi-modal Generation: A Benchmark, Evaluate Metrics and Strong Baselines	Nov 25, 2024	multimodal generationRAG	CodeCode Available	1

Show:10 25 50

← PrevPage 7 of 43Next →

No leaderboard results yet.