| Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning | Jul 9, 2025 | Benchmarking, Image Retrieval | Code Available | 0 |
| AiSciVision: A Framework for Specializing Large Multimodal Models in Scientific Image Classification | Oct 28, 2024 | Image Classification | Code Available | 0 |
| U-NIAH: Unified RAG and LLM Evaluation for Long Context Needle-In-A-Haystack | Mar 1, 2025 | Hallucination, RAG | Code Available | 0 |
| Citegeist: Automated Generation of Related Work Analysis on the arXiv Corpus | Mar 29, 2025 | Articles, RAG | Code Available | 0 |
| Experience Retrieval-Augmentation with Electronic Health Records Enables Accurate Discharge QA | Mar 23, 2025 | RAG, Retrieval | Code Available | 0 |
| SyllabusQA: A Course Logistics Question Answering Dataset | Mar 3, 2024 | Language Modeling | Code Available | 0 |
| Out of Style: RAG's Fragility to Linguistic Variation | Apr 11, 2025 | Question Answering, RAG | Code Available | 0 |
| Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework | Feb 20, 2025 | Benchmarking, Question Answering | Code Available | 0 |
| LLM Embedding-based Attribution (LEA): Quantifying Source Contributions to Generative Model's Response for Vulnerability Analysis | Jun 12, 2025 | Attribute, RAG | Code Available | 0 |
| Pandora's Box or Aladdin's Lamp: A Comprehensive Analysis Revealing the Role of RAG Noise in Large Language Models | Aug 24, 2024 | RAG, Retrieval | Code Available | 0 |