| Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments | May 2, 2025 | Dataset GenerationLanguage Modeling | —Unverified | 0 |
| EnronQA: Towards Personalized RAG over Private Documents | May 1, 2025 | BenchmarkingMemorization | —Unverified | 0 |
| Empowering Agentic Video Analytics Systems with Video Language Models | May 1, 2025 | Knowledge GraphsRAG | —Unverified | 0 |
| A Multi-Granularity Retrieval Framework for Visually-Rich Documents | May 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Patchwork: A Unified Framework for RAG Serving | May 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection | May 1, 2025 | Extractive Question-AnsweringHallucination | —Unverified | 0 |
| Traceback of Poisoning Attacks to Retrieval-Augmented Generation | Apr 30, 2025 | RAGRetrieval | —Unverified | 0 |
| Homa at SemEval-2025 Task 5: Aligning Librarian Records with OntoAligner for Subject Tagging | Apr 30, 2025 | RAGRetrieval | —Unverified | 0 |
| Optimization of embeddings storage for RAG systems using quantization and dimensionality reduction techniques | Apr 30, 2025 | Dimensionality ReductionMTEB Benchmark | —Unverified | 0 |
| Memorization and Knowledge Injection in Gated LLMs | Apr 30, 2025 | Continual LearningMemorization | CodeCode Available | 0 |
| Talk Before You Retrieve: Agent-Led Discussions for Better RAG in Medical QA | Apr 30, 2025 | Information RetrievalMedical Question Answering | CodeCode Available | 0 |
| ARCS: Agentic Retrieval-Augmented Code Synthesis with Iterative Refinement | Apr 29, 2025 | Code GenerationHumanEval | —Unverified | 0 |
| CBM-RAG: Demonstrating Enhanced Interpretability in Radiology Report Generation with Multi-Agent RAG and Concept Bottleneck Models | Apr 29, 2025 | DiagnosticRAG | CodeCode Available | 0 |
| Information Retrieval in the Age of Generative AI: The RGB Model | Apr 29, 2025 | Information RetrievalRAG | CodeCode Available | 0 |
| Graph RAG for Legal Norms: A Hierarchical and Temporal Approach | Apr 29, 2025 | Knowledge GraphsRAG | —Unverified | 0 |
| AKIBoards: A Structure-Following Multiagent System for Predicting Acute Kidney Injury | Apr 29, 2025 | DiagnosticRAG | —Unverified | 0 |
| Detecting Manipulated Contents Using Knowledge-Grounded Inference | Apr 29, 2025 | Claim VerificationFact Checking | CodeCode Available | 0 |
| Reconstructing Context: Evaluating Advanced Chunking Strategies for Retrieval-Augmented Generation | Apr 28, 2025 | ChunkingRAG | CodeCode Available | 0 |
| Chatbot Arena Meets Nuggets: Towards Explanations and Diagnostics in the Evaluation of LLM Responses | Apr 28, 2025 | ChatbotDiagnostic | —Unverified | 0 |
| OpenTCM: A GraphRAG-Empowered LLM-based System for Traditional Chinese Medicine Knowledge Retrieval and Diagnosis | Apr 28, 2025 | DiagnosticInformation Retrieval | —Unverified | 0 |
| Can LLMs Be Trusted for Evaluating RAG Systems? A Survey of Methods and Datasets | Apr 28, 2025 | ArticlesBenchmarking | —Unverified | 0 |
| LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection | Apr 25, 2025 | Feature EngineeringRAG | —Unverified | 0 |
| RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models | Apr 25, 2025 | RAGRed Teaming | —Unverified | 0 |
| FinDER: Financial Dataset for Question Answering and Evaluating Retrieval-Augmented Generation | Apr 22, 2025 | Question AnsweringRAG | —Unverified | 0 |
| CiteFix: Enhancing RAG Accuracy Through Post-Processing Citation Correction | Apr 22, 2025 | ArticlesInformation Retrieval | —Unverified | 0 |
| The Viability of Crowdsourcing for RAG Evaluation | Apr 22, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 |
| Synergizing RAG and Reasoning: A Systematic Review | Apr 22, 2025 | RAGRetrieval | —Unverified | 0 |
| A LoRA-Based Approach to Fine-Tuning LLMs for Educational Guidance in Resource-Constrained Settings | Apr 22, 2025 | Computational EfficiencyGPU | CodeCode Available | 0 |
| The Great Nugget Recall: Automating Fact Extraction and RAG Evaluation with Large Language Models | Apr 21, 2025 | Question AnsweringRAG | —Unverified | 0 |
| POLYRAG: Integrating Polyviews into Retrieval-Augmented Generation for Medical Applications | Apr 21, 2025 | HallucinationLogical Reasoning | —Unverified | 0 |
| Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges | Apr 21, 2025 | RAGRetrieval-augmented Generation | —Unverified | 0 |
| Transparentize the Internal and External Knowledge Utilization in LLMs with Trustworthy Citation | Apr 21, 2025 | RetrievalRetrieval-augmented Generation | —Unverified | 0 |
| Efficient Document Retrieval with G-Retriever | Apr 21, 2025 | graph constructionQuestion Answering | CodeCode Available | 0 |
| LLMs as Data Annotators: How Close Are We to Human Performance | Apr 21, 2025 | In-Context Learningnamed-entity-recognition | —Unverified | 0 |
| FinSage: A Multi-aspect RAG System for Financial Filings Question Answering | Apr 20, 2025 | Question AnsweringRAG | —Unverified | 0 |
| Towards Optimal Circuit Generation: Multi-Agent Collaboration Meets Collective Intelligence | Apr 20, 2025 | Code GenerationRetrieval-augmented Generation | CodeCode Available | 0 |
| ResNetVLLM-2: Addressing ResNetVLLM's Multi-Modal Hallucinations | Apr 20, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval | Apr 19, 2025 | Information RetrievalQuestion Answering | —Unverified | 0 |
| CoT-RAG: Integrating Chain of Thought and Retrieval-Augmented Generation to Enhance Reasoning in Large Language Models | Apr 18, 2025 | Knowledge GraphsRAG | —Unverified | 0 |
| SCRAG: Social Computing-Based Retrieval Augmented Generation for Community Response Forecasting in Social Media Environments | Apr 18, 2025 | ArticlesPublic Relations | —Unverified | 0 |
| Secure Multifaceted-RAG for Enterprise: Hybrid Knowledge Retrieval with Security Filtering | Apr 18, 2025 | RAGRetrieval | —Unverified | 0 |
| Fashion-RAG: Multimodal Fashion Image Editing via Retrieval-Augmented Generation | Apr 18, 2025 | Multimodal fashion image editingRAG | —Unverified | 0 |
| RAG Without the Lag: Interactive Debugging for Retrieval-Augmented Generation Pipelines | Apr 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild | Apr 17, 2025 | Decision MakingInformation Retrieval | —Unverified | 0 |
| InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning | Apr 17, 2025 | Meta-LearningMeta Reinforcement Learning | —Unverified | 0 |
| Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacks | Apr 17, 2025 | Epistemic ReasoningLarge Language Model | CodeCode Available | 0 |
| Uncertainty-Aware Trajectory Prediction via Rule-Regularized Heteroscedastic Deep Classification | Apr 17, 2025 | DiversityGaussian Processes | CodeCode Available | 0 |
| ACoRN: Noise-Robust Abstractive Compression in Retrieval-Augmented Language Models | Apr 17, 2025 | Data AugmentationRAG | —Unverified | 0 |
| A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance Judgment | Apr 16, 2025 | Information RetrievalRAG | CodeCode Available | 0 |
| Towards Conversational AI for Human-Machine Collaborative MLOps | Apr 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |