SOTAVerified

Retrieval-augmented Generation

Papers

Showing 501550 of 2196 papers

TitleStatusHype
Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based ReasoningCode0
Out of Style: RAG's Fragility to Linguistic VariationCode0
Optimizing Code Runtime Performance through Context-Aware Retrieval-Augmented GenerationCode0
Consistent Autoformalization for Constructing Mathematical LibrariesCode0
ConQRet: Benchmarking Fine-Grained Evaluation of Retrieval Augmented Argumentation with LLM JudgesCode0
Unipa-GPT: Large Language Models for university-oriented QA in ItalianCode0
Concurrent Brainstorming & Hypothesis Satisfying: An Iterative Framework for Enhanced Retrieval-Augmented Generation (R2CBR3H-SR)Code0
On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation SystemsCode0
Optimizing and Evaluating Enterprise Retrieval-Augmented Generation (RAG): A Content Design PerspectiveCode0
RAD-Bench: Evaluating Large Language Models Capabilities in Retrieval Augmented DialoguesCode0
Not All Languages are Equal: Insights into Multilingual Retrieval-Augmented GenerationCode0
NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question AnsweringCode0
Agentic Search Engine for Real-Time IoT DataCode0
CommunityKG-RAG: Leveraging Community Structures in Knowledge Graphs for Advanced Retrieval-Augmented Generation in Fact-CheckingCode0
Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class ImbalanceCode0
MuseRAG: Idea Originality Scoring At ScaleCode0
Agentic Reasoning: Reasoning LLMs with Tools for the Deep ResearchCode0
NeoQA: Evidence-based Question Answering with Generated News EventsCode0
Ask, Retrieve, Summarize: A Modular Pipeline for Scientific Literature SummarizationCode0
Mitigating Bias in RAG: Controlling the EmbedderCode0
MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail KnowledgeCode0
ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented GenerationCode0
Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented GenerationCode0
Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-ReasoningCode0
MindScope: Exploring cognitive biases in large language models through Multi-Agent SystemsCode0
ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate DisclosuresCode0
Climate Finance BenchCode0
Memory and Knowledge Augmented Language Models for Inferring Salience in Long-Form StoriesCode0
MES-RAG: Bringing Multi-modal, Entity-Storage, and Secure Enhancements to RAGCode0
Memorization and Knowledge Injection in Gated LLMsCode0
MedMobile: A mobile-sized language model with expert-level clinical capabilitiesCode0
ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance LabelingCode0
Citegeist: Automated Generation of Related Work Analysis on the arXiv CorpusCode0
Agent-Enhanced Large Language Models for Researching Political InstitutionsCode0
CiteCheck: Towards Accurate Citation Faithfulness DetectionCode0
Medical large language models are easily distractedCode0
CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-TrainingCode0
MedImageInsight: An Open-Source Embedding Model for General Domain Medical ImagingCode0
MEMERAG: A Multilingual End-to-End Meta-Evaluation Benchmark for Retrieval Augmented GenerationCode0
MCCoder: Streamlining Motion Control with LLM-Assisted Code Generation and Rigorous VerificationCode0
Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials ScienceCode0
LTRR: Learning To Rank Retrievers for LLMsCode0
You Only Use Reactive Attention Slice For Long Context RetrievalCode0
LSRP: A Leader-Subordinate Retrieval Framework for Privacy-Preserving Cloud-Device CollaborationCode0
Mathematical Reasoning for Unmanned Aerial Vehicles: A RAG-Based Approach for Complex Arithmetic ReasoningCode0
Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practiceCode0
LLMs are Biased Evaluators But Not Biased for Retrieval Augmented GenerationCode0
LLM Robustness Against Misinformation in Biomedical Question AnsweringCode0
LLMs in Biomedicine: A study on clinical Named Entity RecognitionCode0
AI-TA: Towards an Intelligent Question-Answer Teaching Assistant using Open-Source LLMsCode0
Show:102550
← PrevPage 11 of 44Next →

No leaderboard results yet.