| IRSC: A Zero-shot Evaluation Benchmark for Information Retrieval through Semantic Comprehension in Retrieval-Augmented Generation Scenarios | Sep 24, 2024 | Information Retrieval, RAG | Code Available | 0 | 5 |
| DO-RAG: A Domain-Specific QA Framework Using Knowledge Graph-Enhanced Retrieval-Augmented Generation | May 15, 2025 | graph construction, Hallucination | Code Available | 0 | 5 |
| Investigating the performance of Retrieval-Augmented Generation and fine-tuning for the development of AI-driven knowledge-based systems | Mar 12, 2024 | Domain Adaptation, Hallucination | Code Available | 0 | 5 |
| Knowledgeable-r1: Policy Optimization for Knowledge Exploration in Retrieval-Augmented Generation | Jun 5, 2025 | counterfactual, RAG | Code Available | 0 | 5 |
| Benchmarking Multimodal RAG through a Chart-based Document Question-Answering Generation Framework | Feb 20, 2025 | Benchmarking, Question Answering | Code Available | 0 | 5 |
| Do "New Snow Tablets" Contain Snow? Large Language Models Over-Rely on Names to Identify Ingredients of Chinese Drugs | Apr 3, 2025 | RAG, Retrieval-augmented Generation | Code Available | 0 | 5 |
| IntellBot: Retrieval Augmented LLM Chatbot for Cyber Threat Knowledge Delivery | Nov 8, 2024 | Chatbot, Large Language Model | Code Available | 0 | 5 |
| Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts | Apr 2, 2024 | In-Context Learning, Language Modeling | Code Available | 0 | 5 |
| Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair Use | May 4, 2025 | Knowledge Graphs, Legal Reasoning | Code Available | 0 | 5 |
| LLM Robustness Against Misinformation in Biomedical Question Answering | Oct 27, 2024 | Misinformation, Question Answering | Code Available | 0 | 5 |
| BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science | Jun 29, 2024 | AI Agent, Claim Verification | Code Available | 0 | 5 |
| Information Retrieval in the Age of Generative AI: The RGB Model | Apr 29, 2025 | Information Retrieval, RAG | Code Available | 0 | 5 |
| Improving Medical Multi-modal Contrastive Learning with Expert Annotations | Mar 15, 2024 | Contrastive Learning, Cross-Modal Retrieval | Code Available | 0 | 5 |
| Improving In-Context Learning with Small Language Model Ensembles | Oct 29, 2024 | Domain Labelling, In-Context Learning | Code Available | 0 | 5 |
| Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems | Sep 29, 2024 | Fairness, Open-Domain Question Answering | Code Available | 0 | 5 |
| Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings | Mar 19, 2025 | Instruction Following, Large Language Model | Code Available | 0 | 5 |
| AI-University: An LLM-based platform for instructional alignment to scientific classrooms | Apr 11, 2025 | Large Language Model, RAG | Code Available | 0 | 5 |
| Improving RAG for Personalization with Author Features and Contrastive Examples | Mar 24, 2025 | RAG, Retrieval-augmented Generation | Code Available | 0 | 5 |
| Awakening Augmented Generation: Learning to Awaken Internal Knowledge of Large Language Models for Question Answering | Mar 22, 2024 | Open-Domain Question Answering, Out-of-Distribution Generalization | Code Available | 0 | 5 |
| ImageRef-VL: Enabling Contextual Image Referencing in Vision-Language Models | Jan 20, 2025 | RAG, Retrieval | Code Available | 0 | 5 |
| Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents | Nov 23, 2024 | Question Answering, RAG | Code Available | 0 | 5 |
| IITK at SemEval-2024 Task 2: Exploring the Capabilities of LLMs for Safe Biomedical Natural Language Inference for Clinical Trials | Apr 6, 2024 | Natural Language Inference, RAG | Code Available | 0 | 5 |
| AITEE -- Agentic Tutor for Electrical Engineering | May 27, 2025 | Electrical Engineering, Retrieval-augmented Generation | Code Available | 0 | 5 |
| Hybrid Context Retrieval Augmented Generation Pipeline: LLM-Augmented Knowledge Graphs and Vector Database for Accreditation Reporting Assistance | May 24, 2024 | Knowledge Graphs, Retrieval-augmented Generation | Code Available | 0 | 5 |
| HomeBench: Evaluating LLMs in Smart Homes with Valid and Invalid Instructions Across Single and Multiple Devices | May 26, 2025 | In-Context Learning, Retrieval-augmented Generation | Code Available | 0 | 5 |