SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) combines retrieval-based and generation-based models. A retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, conditions on the retrieved text to produce a response. Grounding the output in retrieved evidence improves the accuracy and coherence of generated text, especially in tasks that require detailed knowledge or long-context handling.
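The retrieve-then-generate flow described above can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions: the bag-of-words cosine retriever stands in for whatever retrieval system is actually used, the prompt template is made up, and `echo_generator` is a hypothetical placeholder for a real language model. The names `retrieve`, `build_prompt`, and `answer` are not taken from any particular library.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, corpus, k=2):
    """Return up to k passages ranked by similarity to the query."""
    query_vec = Counter(tokenize(query))
    scored = [(cosine(query_vec, Counter(tokenize(doc))), doc) for doc in corpus]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for score, doc in scored[:k] if score > 0]


def build_prompt(query, passages):
    """Place the retrieved passages ahead of the question (assumed template)."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"


def echo_generator(prompt):
    """Hypothetical stand-in for a language model: returns the prompt unchanged."""
    return prompt


def answer(query, corpus, generator=echo_generator, k=2):
    """Retrieve passages, build a prompt, and hand it to the generator."""
    return generator(build_prompt(query, retrieve(query, corpus, k)))


corpus = [
    "The Eiffel Tower is located in Paris.",
    "Mount Everest is the highest mountain on Earth.",
    "Python is a programming language.",
]
print(answer("Where is the Eiffel Tower located?", corpus, k=1))
```

In practice the retriever would more likely be a dense or sparse index over a large corpus, and `generator` would call an actual model; the structure of the pipeline stays the same.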

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization. The retrieval step lets the model access and incorporate external information, so it relies less on memorized knowledge and is better suited to generating responses from recent or domain-specific information.

RAG systems are usually evaluated with metrics such as precision, recall, F1 score, BLEU score, and exact match. Popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.
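Two of the metrics named above, exact match and F1, can be sketched for a single prediction and reference as follows. This is a minimal sketch, assuming the normalization commonly used in extractive question answering (lowercasing, stripping punctuation and English articles). Individual benchmarks may define these metrics differently, and the function names here are illustrative.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, reference):
    """True when the normalized strings are identical."""
    return normalize(prediction) == normalize(reference)


def token_f1(prediction, reference):
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    if not pred_tokens or not ref_tokens:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


print(exact_match("The Eiffel Tower", "eiffel tower!"))
print(token_f1("in Paris France", "Paris"))
```

A dataset-level score would typically average these per-example values, often taking the maximum over several reference answers when a question has more than one.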

Papers

Showing 301-350 of 2111 papers

Title | Status | Hype
JuDGE: Benchmarking Judgment Document Generation for Chinese Legal System | Code | 1
Logic-RAG: Augmenting Large Multimodal Models with Visual-Spatial Knowledge for Road Scene Understanding | Code | 1
SePer: Measure Retrieval Utility Through The Lens Of Semantic Perplexity Reduction | Code | 1
GPIoT: Tailoring Small Language Models for IoT Program Synthesis and Development | Code | 1
LexRAG: Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation | Code | 1
SafeAuto: Knowledge-Enhanced Safe Autonomous Driving with Multimodal Foundation Models | Code | 1
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking | Code | 1
ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models | Code | 1
Long-Context Inference with Retrieval-Augmented Speculative Decoding | Code | 1
Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization | Code | 1
EgoNormia: Benchmarking Physical Social Norm Understanding | Code | 1
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks | Code | 1
Code Summarization Beyond Function Level | Code | 1
PeerQA: A Scientific Question Answering Dataset from Peer Reviews | Code | 1
LLM-Lasso: A Robust Framework for Domain-Informed Feature Selection and Regularization | Code | 1
Graph RAG-Tool Fusion | Code | 1
Combining Large Language Models with Static Analyzers for Code Review Generation | Code | 1
RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning | Code | 1
APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding | Code | 1
MRAMG-Bench: A Comprehensive Benchmark for Advancing Multimodal Retrieval-Augmented Multimodal Generation | Code | 1
OverThink: Slowdown Attacks on Reasoning LLMs | Code | 1
Riddle Me This! Stealthy Membership Inference for Retrieval-Augmented Generation | Code | 1
RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects | Code | 1
CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter | Code | 1
Med-R^2: Crafting Trustworthy LLM Physicians via Retrieval and Reasoning of Evidence-Based Medicine | Code | 1
Chat3GPP: An Open-Source Retrieval-Augmented Generation Framework for 3GPP Documents | Code | 1
InsQABench: Benchmarking Chinese Insurance Domain Question Answering with Large Language Models | Code | 1
Docopilot: Improving Multimodal Models for Document-Level Understanding | Code | 1
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search | Code | 1
Plancraft: an evaluation dataset for planning with LLM agents | Code | 1
Long Context vs. RAG for LLMs: An Evaluation and Revisits | Code | 1
RAG with Differential Privacy | Code | 1
Jasper and Stella: distillation of SOTA embedding models | Code | 1
Efficient fine-tuning methodology of text embedding models for information retrieval: contrastive learning penalty (clp) | Code | 1
Towards Interpretable Radiology Report Generation via Concept Bottlenecks using a Multi-Agentic RAG | Code | 1
PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization | Code | 1
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment | Code | 1
Context-DPO: Aligning Language Models for Context-Faithfulness | Code | 1
EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation | Code | 1
RAG Playground: A Framework for Systematic Evaluation of Retrieval Strategies and Prompt Engineering in RAG Systems | Code | 1
SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation | Code | 1
CaLoRAify: Calorie Estimation with Visual-Text Pairing and LoRA-Driven Visual Language Models | Code | 1
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs | Code | 1
KG-Retriever: Efficient Knowledge Indexing for Retrieval-Augmented Large Language Models | Code | 1
SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot | Code | 1
Retrieval-Augmented Machine Translation with Unstructured Knowledge | Code | 1
HEAL: Hierarchical Embedding Alignment Loss for Improved Retrieval and Representation Learning | Code | 1
MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity | Code | 1
AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning | Code | 1
Multi-modal Retrieval Augmented Multi-modal Generation: A Benchmark, Evaluate Metrics and Strong Baselines | Code | 1

No leaderboard results yet.