SOTAVerified

RAG

Retrieval-Augmented Generation (RAG) is a task that combines the strengths of both retrieval-based models and generation-based models. In this approach, a retrieval system selects relevant documents or passages from a large corpus, and a generation model, typically a neural language model, uses the retrieved information to generate a response. This method enhances the accuracy and coherence of generated text, especially in tasks requiring detailed knowledge or long context handling.

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization tasks. The retrieval step helps the model to access and incorporate external information, making it less reliant on memorized knowledge and better suited for generating responses based on the latest or domain-specific information.

The performance of RAG systems is usually measured using metrics such as precision, recall, F1 score, BLEU score, and exact match. Some popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.

Papers

Showing 451500 of 2111 papers

TitleStatusHype
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement LearningCode1
Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question CoverageCode1
Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision TraceabilityCode1
Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language ModelCode1
Pneuma: Leveraging LLMs for Tabular Data Representation and Retrieval in an End-to-End SystemCode1
POQD: Performance-Oriented Query Decomposer for Multi-vector retrievalCode1
Plancraft: an evaluation dataset for planning with LLM agentsCode1
Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement LearningCode1
Deep Equilibrium Object DetectionCode1
Efficient and Reproducible Biomedical Question Answering using Retrieval Augmented GenerationCode1
Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer FrameworkCode1
Merging-Diverging Hybrid Transformer Networks for Survival Prediction in Head and Neck CancerCode1
DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point ThinkingCode1
Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience ReportCode1
R^2AG: Incorporating Retrieval Information into Retrieval Augmented GenerationCode1
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation SystemsCode1
Removal of Hallucination on Hallucination: Debate-Augmented RAGCode1
SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report GenerationCode1
Pandora's Box or Aladdin's Lamp: A Comprehensive Analysis Revealing the Role of RAG Noise in Large Language ModelsCode0
QPaug: Question and Passage Augmentation for Open-Domain Question Answering of LLMsCode0
Customized Retrieval Augmented Generation and Benchmarking for EDA Tool Documentation QACode0
Pairing Analogy-Augmented Generation with Procedural Memory for Procedural Q&ACode0
Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based ReasoningCode0
Optimizing and Evaluating Enterprise Retrieval-Augmented Generation (RAG): A Content Design PerspectiveCode0
Out of Style: RAG's Fragility to Linguistic VariationCode0
Cross-modal RAG: Sub-dimensional Retrieval-Augmented Text-to-Image GenerationCode0
Attribute or Abstain: Large Language Models as Long Document AssistantsCode0
On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation SystemsCode0
Attention Instruction: Amplifying Attention in the Middle via PromptingCode0
AIC CTU system at AVeriTeC: Re-framing automated fact-checking as a simple RAG taskCode0
NeoQA: Evidence-based Question Answering with Generated News EventsCode0
Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class ImbalanceCode0
NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question AnsweringCode0
ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented GeneratorCode0
Corpus Poisoning via Approximate Greedy Gradient DescentCode0
CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAGCode0
Multimodal Integrated Knowledge Transfer to Large Language Models through Preference Optimization with Biomedical ApplicationsCode0
A Comparison of Methods for Evaluating Generative IRCode0
MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM HallucinationsCode0
A Tale of Trust and Accuracy: Base vs. Instruct LLMs in RAG SystemsCode0
A System for Comprehensive Assessment of RAG FrameworksCode0
Conversational Gold: Evaluating Personalized Conversational Search System using Gold NuggetsCode0
Mitigating Bias in RAG: Controlling the EmbedderCode0
Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-ReasoningCode0
Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting FrameworkCode0
MindScope: Exploring cognitive biases in large language models through Multi-Agent SystemsCode0
Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented GenerationCode0
A Hybrid Approach to Information Retrieval and Answer Generation for Regulatory TextsCode0
MES-RAG: Bringing Multi-modal, Entity-Storage, and Secure Enhancements to RAGCode0
Continual Stereo Matching of Continuous Driving Scenes With Growing ArchitectureCode0
Show:102550
← PrevPage 10 of 43Next →

No leaderboard results yet.