RAG

Retrieval-Augmented Generation (RAG) is an approach that combines a retrieval system with a generative model. A retriever first selects relevant documents or passages from a large corpus; a generator, typically a neural language model, then conditions on the retrieved text to produce a response. Grounding generation in retrieved evidence can improve the accuracy and coherence of the output, especially in tasks that require detailed knowledge or long-context handling.
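
The retrieve-then-generate flow described above can be sketched in a few lines. This is a minimal illustration, not a reference implementation: the bag-of-words cosine retriever stands in for whatever retriever a real system would use (BM25, dense embeddings, and so on), and the names `retrieve`, `build_prompt`, `rag_answer`, and `echo_generator` are hypothetical. The `generate` argument is assumed to be any callable that maps a prompt string to a response string.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, corpus, k=2):
    """Return the k passages most similar to the query (toy retriever)."""
    q = Counter(tokenize(query))
    ranked = sorted(
        corpus,
        key=lambda doc: cosine(q, Counter(tokenize(doc))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, passages):
    """Place the retrieved passages in front of the question."""
    context = "\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )


def rag_answer(query, corpus, generate, k=2):
    """Retrieve passages, then hand the augmented prompt to the generator."""
    passages = retrieve(query, corpus, k)
    return generate(build_prompt(query, passages))


# Hypothetical toy corpus and a placeholder generator that echoes its prompt.
# A real system would call a language model here instead.
corpus = [
    "The Eiffel Tower is located in Paris.",
    "Mount Everest is the highest mountain on Earth.",
    "Python is a programming language.",
]


def echo_generator(prompt):
    return prompt


if __name__ == "__main__":
    print(rag_answer("Where is the Eiffel Tower?", corpus, echo_generator, k=1))
```

Swapping `echo_generator` for a call to an actual language model, and `retrieve` for a production retriever, leaves the overall structure unchanged: the generator only ever sees the question together with the retrieved context.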

RAG is particularly useful in open-domain question answering, knowledge-grounded dialogue, and summarization. The retrieval step lets the model access and incorporate external information, so it relies less on knowledge memorized during training and is better suited to generating responses from up-to-date or domain-specific sources.

RAG systems are usually evaluated with metrics such as precision, recall, F1 score, BLEU score, and exact match. Popular datasets for evaluating RAG models include Natural Questions, MS MARCO, TriviaQA, and SQuAD.
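
As an illustration of two of the metrics named above, the sketch below computes exact match and token-level F1 between a predicted answer and a reference answer. The normalization steps (lowercasing, stripping punctuation and English articles) follow a convention common in extractive question-answering evaluation, but they are an assumption here; individual benchmarks define their own official scoring scripts, and the function names are hypothetical.

```python
import re
import string
from collections import Counter


def normalize(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, reference):
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def token_f1(prediction, reference):
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    if not pred_tokens or not ref_tokens:
        # Both empty counts as a match; exactly one empty counts as a miss.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower.", "eiffel tower"))
    print(token_f1("in Paris France", "Paris"))
```

Exact match rewards only answers that are identical after normalization, while token-level F1 gives partial credit when the prediction contains the reference plus extra words, which is why the two are often reported together.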

Papers

Showing 1851–1900 of 2111 papers

Title	Date	Tasks	Status	Hype
Tabular Embedding Model (TEM): Finetuning Embedding Models For Tabular RAG Applications	Apr 28, 2024	Code Generation, RAG	Unverified	0
Generative AI for Low-Carbon Artificial Intelligence of Things with Large Language Models	Apr 28, 2024	Language Modelling, Large Language Model	Unverified	0
Tool Calling: Enhancing Medication Consultation via Retrieval-Augmented Large Language Models	Apr 27, 2024	Answer Generation, Question Answering	Unverified	0
Retrieval-Augmented Generation with Knowledge Graphs for Customer Service Question Answering	Apr 26, 2024	Knowledge Graphs, Question Answering	Unverified	0
Human-Imperceptible Retrieval Poisoning Attacks in LLM-Powered Applications	Apr 26, 2024	RAG, Retrieval	Unverified	0
From Local to Global: A Graph RAG Approach to Query-Focused Summarization	Apr 24, 2024	Query-focused Summarization, Question Answering	Code Available	14
Prompt Leakage effect and defense strategies for multi-turn LLM interactions	Apr 24, 2024	RAG	Unverified	0
Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documents	Apr 24, 2024	Language Modeling, Language Modelling	Code Available	1
Telco-RAG: Navigating the Challenges of Retrieval-Augmented Language Models for Telecommunications	Apr 24, 2024	RAG, Retrieval	Code Available	2
IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents	Apr 23, 2024	RAG	Unverified	0
Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations	Apr 22, 2024	RAG, Retrieval-augmented Generation	Code Available	0
LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation	Apr 22, 2024	Hallucination, RAG	Code Available	1
Retrieval-Augmented Audio Deepfake Detection	Apr 22, 2024	Audio Deepfake Detection, DeepFake Detection	Unverified	0
Evaluating Retrieval Quality in Retrieval-Augmented Generation	Apr 21, 2024	GPU, Language Modeling	Code Available	1
Dubo-SQL: Diverse Retrieval-Augmented Generation and Fine Tuning for Text-to-SQL	Apr 19, 2024	RAG, Retrieval	Code Available	1
Unlocking Multi-View Insights in Knowledge-Dense Retrieval-Augmented Generation	Apr 19, 2024	RAG, Retrieval	Unverified	0
RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation	Apr 18, 2024	GPU, RAG	Unverified	0
RAGAR, Your Falsehood Radar: RAG-Augmented Reasoning for Political Fact-Checking using Multimodal Large Language Models	Apr 18, 2024	Fact Checking, Language Modeling	Unverified	0
LongEmbed: Extending Embedding Models for Long Context Retrieval	Apr 18, 2024	4k, 8k	Code Available	2
RAM: Towards an Ever-Improving Memory System by Learning from Communications	Apr 18, 2024	Lifelong learning, RAG	Unverified	0
iRAG: Advancing RAG for Videos with an Incremental Approach	Apr 18, 2024	Information Retrieval, RAG	Unverified	0
MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory	Apr 17, 2024	Hallucination, Language Modeling	Code Available	1
Enhancing Q&A with Domain-Specific Fine-Tuning and Iterative Reasoning: A Comparative Study	Apr 17, 2024	Question Answering, RAG	Unverified	0
A Survey on Retrieval-Augmented Text Generation for Large Language Models	Apr 17, 2024	RAG, Retrieval	Unverified	0
Position Engineering: Boosting Large Language Models through Positional Information Manipulation	Apr 17, 2024	In-Context Learning, Position	Unverified	0
ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence	Apr 16, 2024	Question Answering, RAG	Code Available	1
Spiral of Silence: How is Large Language Model Killing Information Retrieval? -- A Case Study on Open Domain Question Answering	Apr 16, 2024	Information Retrieval, Language Modeling	Code Available	1
Generative AI Agents with Large Language Model for Satellite Networks via a Mixture of Experts Transmission	Apr 14, 2024	Language Modeling, Language Modelling	Unverified	0
Cross-Data Knowledge Graph Construction for LLM-enabled Educational Question-Answering System: A Case Study at HCMUT	Apr 14, 2024	graph construction, Knowledge Graphs	Unverified	0
Generative AI Agent for Next-Generation MIMO Design: Fundamentals, Challenges, and Vision	Apr 13, 2024	AI Agent, Language Modeling	Unverified	0
Reducing hallucination in structured outputs via Retrieval-Augmented Generation	Apr 12, 2024	Hallucination, RAG	Unverified	0
Generative Information Retrieval Evaluation	Apr 11, 2024	Information Retrieval, RAG	Unverified	0
LLMs in Biomedicine: A study on clinical Named Entity Recognition	Apr 10, 2024	few-shot-ner, Few-shot NER	Code Available	0
Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation	Apr 10, 2024	All, RAG	Code Available	1
Towards Robustness of Text-to-Visualization Translation against Lexical and Phrasal Variability	Apr 10, 2024	RAG, Retrieval	Unverified	0
Superposition Prompting: Improving and Accelerating Retrieval-Augmented Generation	Apr 10, 2024	Question Answering, RAG	Code Available	2
Onco-Retriever: Generative Classifier for Retrieval of EHR Records in Oncology	Apr 10, 2024	RAG, Retrieval	Unverified	0
AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval	Apr 9, 2024	All, Information Retrieval	Code Available	2
RAR-b: Reasoning as Retrieval Benchmark	Apr 9, 2024	Information Retrieval, RAG	Code Available	1
Dimensionality Reduction in Sentence Transformer Vector Databases with Fast Fourier Transform	Apr 9, 2024	Computational Efficiency, Dimensionality Reduction	Unverified	0
MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering	Apr 8, 2024	Benchmarking, Medical Question Answering	Unverified	0
Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models	Apr 8, 2024	Descriptive, In-Context Learning	Unverified	0
IITK at SemEval-2024 Task 2: Exploring the Capabilities of LLMs for Safe Biomedical Natural Language Inference for Clinical Trials	Apr 6, 2024	Natural Language Inference, RAG	Code Available	0
A Comparison of Methods for Evaluating Generative IR	Apr 5, 2024	Information Retrieval, Language Modelling	Code Available	0
CBR-RAG: Case-Based Reasoning for Retrieval Augmented Generation in LLMs for Legal Question Answering	Apr 4, 2024	Language Modeling, Language Modelling	Code Available	1
CONFLARE: CONFormal LArge language model REtrieval	Apr 4, 2024	Conformal Prediction, Language Modeling	Code Available	1
uTeBC-NLP at SemEval-2024 Task 9: Can LLMs be Lateral Thinkers?	Apr 3, 2024	In-Context Learning, Prompt Engineering	Code Available	0
Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization	Apr 2, 2024	RAG, Retrieval	Code Available	4
CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems	Apr 2, 2024	Form, Long Form Question Answering	Code Available	1
Octopus v2: On-device language model for super agent	Apr 2, 2024	Language Modeling, Language Modelling	Unverified	0

No leaderboard results yet.