| Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning | Jul 9, 2025 | BenchmarkingImage Retrieval | CodeCode Available | 0 | 5 |
| Out of Style: RAG's Fragility to Linguistic Variation | Apr 11, 2025 | Question AnsweringRAG | CodeCode Available | 0 | 5 |
| Optimizing Code Runtime Performance through Context-Aware Retrieval-Augmented Generation | Jan 28, 2025 | In-Context LearningRetrieval | CodeCode Available | 0 | 5 |
| Consistent Autoformalization for Constructing Mathematical Libraries | Oct 5, 2024 | DenoisingRAG | CodeCode Available | 0 | 5 |
| ConQRet: Benchmarking Fine-Grained Evaluation of Retrieval Augmented Argumentation with LLM Judges | Dec 6, 2024 | BenchmarkingRetrieval | CodeCode Available | 0 | 5 |
| Unipa-GPT: Large Language Models for university-oriented QA in Italian | Jul 19, 2024 | ChatbotInformation Retrieval | CodeCode Available | 0 | 5 |
| Concurrent Brainstorming & Hypothesis Satisfying: An Iterative Framework for Enhanced Retrieval-Augmented Generation (R2CBR3H-SR) | Jan 3, 2024 | Decision MakingInformation Retrieval | CodeCode Available | 0 | 5 |
| On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation Systems | Feb 20, 2025 | Long Form Question AnsweringQuestion Answering | CodeCode Available | 0 | 5 |
| Optimizing and Evaluating Enterprise Retrieval-Augmented Generation (RAG): A Content Design Perspective | Oct 1, 2024 | Question AnsweringRAG | CodeCode Available | 0 | 5 |
| RAD-Bench: Evaluating Large Language Models Capabilities in Retrieval Augmented Dialogues | Sep 19, 2024 | RAGRetrieval | CodeCode Available | 0 | 5 |
| Not All Languages are Equal: Insights into Multilingual Retrieval-Augmented Generation | Oct 29, 2024 | AllRetrieval | CodeCode Available | 0 | 5 |
| NitiBench: A Comprehensive Studies of LLM Frameworks Capabilities for Thai Legal Question Answering | Feb 15, 2025 | ChunkingInformation Retrieval | CodeCode Available | 0 | 5 |
| Agentic Search Engine for Real-Time IoT Data | Mar 15, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 | 5 |
| CommunityKG-RAG: Leveraging Community Structures in Knowledge Graphs for Advanced Retrieval-Augmented Generation in Fact-Checking | Aug 16, 2024 | Fact CheckingInformation Retrieval | CodeCode Available | 0 | 5 |
| Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class Imbalance | Jan 21, 2025 | Data AugmentationLanguage Modeling | CodeCode Available | 0 | 5 |
| MuseRAG: Idea Originality Scoring At Scale | May 22, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 | 5 |
| Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research | Feb 7, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 0 | 5 |
| NeoQA: Evidence-based Question Answering with Generated News Events | May 9, 2025 | ArticlesQuestion Answering | CodeCode Available | 0 | 5 |
| Ask, Retrieve, Summarize: A Modular Pipeline for Scientific Literature Summarization | May 22, 2025 | Document SummarizationMulti-Document Summarization | CodeCode Available | 0 | 5 |
| Mitigating Bias in RAG: Controlling the Embedder | Feb 24, 2025 | FairnessRAG | CodeCode Available | 0 | 5 |
| MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge | Dec 22, 2024 | Multi-hop Question AnsweringQuestion Answering | CodeCode Available | 0 | 5 |
| ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented Generation | May 30, 2025 | RAGRetrieval | CodeCode Available | 0 | 5 |
| Mix-of-Granularity: Optimize the Chunking Granularity for Retrieval-Augmented Generation | Jun 1, 2024 | ChunkingRAG | CodeCode Available | 0 | 5 |
| Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning | Jun 5, 2025 | Question AnsweringRAG | CodeCode Available | 0 | 5 |
| MindScope: Exploring cognitive biases in large language models through Multi-Agent Systems | Oct 6, 2024 | RAGRetrieval-augmented Generation | CodeCode Available | 0 | 5 |
| ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures | Jun 14, 2024 | Answer GenerationBenchmarking | CodeCode Available | 0 | 5 |
| Climate Finance Bench | May 28, 2025 | Logical ReasoningQuantization | CodeCode Available | 0 | 5 |
| Memory and Knowledge Augmented Language Models for Inferring Salience in Long-Form Stories | Sep 8, 2021 | FormLanguage Modeling | CodeCode Available | 0 | 5 |
| MES-RAG: Bringing Multi-modal, Entity-Storage, and Secure Enhancements to RAG | Mar 17, 2025 | Information RetrievalQuestion Answering | CodeCode Available | 0 | 5 |
| Memorization and Knowledge Injection in Gated LLMs | Apr 30, 2025 | Continual LearningMemorization | CodeCode Available | 0 | 5 |
| MedMobile: A mobile-sized language model with expert-level clinical capabilities | Oct 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance Labeling | Feb 21, 2024 | MMLURetrieval | CodeCode Available | 0 | 5 |
| Citegeist: Automated Generation of Related Work Analysis on the arXiv Corpus | Mar 29, 2025 | ArticlesRAG | CodeCode Available | 0 | 5 |
| Agent-Enhanced Large Language Models for Researching Political Institutions | Mar 14, 2025 | Document SummarizationInformation Retrieval | CodeCode Available | 0 | 5 |
| CiteCheck: Towards Accurate Citation Faithfulness Detection | Feb 15, 2025 | parameter-efficient fine-tuningRAG | CodeCode Available | 0 | 5 |
| Medical large language models are easily distracted | Apr 1, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 | 5 |
| CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training | Jun 12, 2025 | RAGResponse Generation | CodeCode Available | 0 | 5 |
| MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging | Oct 9, 2024 | Age EstimationFairness | CodeCode Available | 0 | 5 |
| MEMERAG: A Multilingual End-to-End Meta-Evaluation Benchmark for Retrieval Augmented Generation | Feb 24, 2025 | RAGRetrieval | CodeCode Available | 0 | 5 |
| MCCoder: Streamlining Motion Control with LLM-Assisted Code Generation and Rigorous Verification | Oct 19, 2024 | Code GenerationRAG | CodeCode Available | 0 | 5 |
| Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science | Jun 4, 2025 | ArticlesCode Generation | CodeCode Available | 0 | 5 |
| LTRR: Learning To Rank Retrievers for LLMs | Jun 16, 2025 | Learning-To-RankRAG | CodeCode Available | 0 | 5 |
| You Only Use Reactive Attention Slice For Long Context Retrieval | Sep 3, 2024 | RAGRetrieval | CodeCode Available | 0 | 5 |
| LSRP: A Leader-Subordinate Retrieval Framework for Privacy-Preserving Cloud-Device Collaboration | May 8, 2025 | Privacy PreservingRAG | CodeCode Available | 0 | 5 |
| Mathematical Reasoning for Unmanned Aerial Vehicles: A RAG-Based Approach for Complex Arithmetic Reasoning | Jun 5, 2025 | Arithmetic ReasoningMath | CodeCode Available | 0 | 5 |
| Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practice | Jan 2, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 | 5 |
| LLMs are Biased Evaluators But Not Biased for Retrieval Augmented Generation | Oct 28, 2024 | Keyword ExtractionRAG | CodeCode Available | 0 | 5 |
| LLM Robustness Against Misinformation in Biomedical Question Answering | Oct 27, 2024 | MisinformationQuestion Answering | CodeCode Available | 0 | 5 |
| LLMs in Biomedicine: A study on clinical Named Entity Recognition | Apr 10, 2024 | few-shot-nerFew-shot NER | CodeCode Available | 0 | 5 |
| AI-TA: Towards an Intelligent Question-Answer Teaching Assistant using Open-Source LLMs | Nov 5, 2023 | Question AnsweringRAG | CodeCode Available | 0 | 5 |