SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 401450 of 2111 papers

TitleStatusHype
Citekit: A Modular Toolkit for Large Language Model Citation GenerationCode1
mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge GraphsCode1
EgoNormia: Benchmarking Physical Social Norm UnderstandingCode1
Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-BasesCode1
Pneuma: Leveraging LLMs for Tabular Data Representation and Retrieval in an End-to-End SystemCode1
InteractiveSurvey: An LLM-based Personalized and Interactive Survey Paper Generation SystemCode1
Dubo-SQL: Diverse Retrieval-Augmented Generation and Fine Tuning for Text-to-SQLCode1
Dynamic Retrieval Augmented Generation of Ontologies using Artificial Intelligence (DRAGON-AI)Code1
Jasper and Stella: distillation of SOTA embedding modelsCode1
DRAGged into Conflicts: Detecting and Addressing Conflicting Sources in Search-Augmented LLMsCode1
Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question CoverageCode1
ShizishanGPT: An Agricultural Large Language Model Integrating Tools and ResourcesCode1
RAGSynth: Synthetic Data for Robust and Faithful RAG Component OptimizationCode1
JuDGE: Benchmarking Judgment Document Generation for Chinese Legal SystemCode1
Contextual Compression in Retrieval-Augmented Generation for Large Language Models: A SurveyCode1
DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented GenerationCode1
PA-RAG: RAG Alignment via Multi-Perspective Preference OptimizationCode1
CoFE-RAG: A Comprehensive Full-chain Evaluation Framework for Retrieval-Augmented Generation with Enhanced Data DiversityCode1
Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial TrainingCode1
Benchmarking Multimodal Knowledge Conflict for Large Multimodal ModelsCode1
ECG Semantic Integrator (ESI): A Foundation ECG Model Pretrained with LLM-Enhanced Cardiological TextCode1
Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language ModelsCode1
PeerQA: A Scientific Question Answering Dataset from Peer ReviewsCode1
Benchmarking LLM Faithfulness in RAG with Evolving LeaderboardsCode1
Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience ReportCode1
KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge TracingCode1
Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation SystemsCode1
Combining Large Language Models with Static Analyzers for Code Review GenerationCode1
One Token Can Help! Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language ModelsCode1
AlignRAG: Leveraging Critique Learning for Evidence-Sensitive Retrieval-Augmented ReasoningCode1
ORAN-Bench-13K: An Open Source Benchmark for Assessing LLMs in Open Radio Access NetworksCode1
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point ThinkingCode1
"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented GenerationCode1
Not All Contexts Are Equal: Teaching LLMs Credibility-aware GenerationCode1
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge GraphsCode1
NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question AnsweringCode1
NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for RetrievalCode1
Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP StandardsCode1
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection AttacksCode1
Deep Equilibrium Object DetectionCode1
Docopilot: Improving Multimodal Models for Document-Level UnderstandingCode1
Neuro-Symbolic Query CompilerCode1
OverThink: Slowdown Attacks on Reasoning LLMsCode1
Multi-Meta-RAG: Improving RAG for Multi-Hop Queries using Database Filtering with LLM-Extracted MetadataCode1
LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household RoboticsCode1
Constructing and Evaluating Declarative RAG Pipelines in PyTerrierCode1
Multi-modal Retrieval Augmented Multi-modal Generation: A Benchmark, Evaluate Metrics and Strong BaselinesCode1
CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question AnsweringCode1
C-RAG: Certified Generation Risks for Retrieval-Augmented Language ModelsCode1
ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidenceCode1
Show:102550
← PrevPage 9 of 43Next →

No leaderboard results yet.