| ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures | Jun 14, 2024 | Answer GenerationBenchmarking | CodeCode Available | 0 | 5 |
| Climate Finance Bench | May 28, 2025 | Logical ReasoningQuantization | CodeCode Available | 0 | 5 |
| Memory and Knowledge Augmented Language Models for Inferring Salience in Long-Form Stories | Sep 8, 2021 | FormLanguage Modeling | CodeCode Available | 0 | 5 |
| MES-RAG: Bringing Multi-modal, Entity-Storage, and Secure Enhancements to RAG | Mar 17, 2025 | Information RetrievalQuestion Answering | CodeCode Available | 0 | 5 |
| Memorization and Knowledge Injection in Gated LLMs | Apr 30, 2025 | Continual LearningMemorization | CodeCode Available | 0 | 5 |
| MedMobile: A mobile-sized language model with expert-level clinical capabilities | Oct 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance Labeling | Feb 21, 2024 | MMLURetrieval | CodeCode Available | 0 | 5 |
| Citegeist: Automated Generation of Related Work Analysis on the arXiv Corpus | Mar 29, 2025 | ArticlesRAG | CodeCode Available | 0 | 5 |
| Agent-Enhanced Large Language Models for Researching Political Institutions | Mar 14, 2025 | Document SummarizationInformation Retrieval | CodeCode Available | 0 | 5 |
| CiteCheck: Towards Accurate Citation Faithfulness Detection | Feb 15, 2025 | parameter-efficient fine-tuningRAG | CodeCode Available | 0 | 5 |
| Medical large language models are easily distracted | Apr 1, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 | 5 |
| CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training | Jun 12, 2025 | RAGResponse Generation | CodeCode Available | 0 | 5 |
| MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging | Oct 9, 2024 | Age EstimationFairness | CodeCode Available | 0 | 5 |
| MEMERAG: A Multilingual End-to-End Meta-Evaluation Benchmark for Retrieval Augmented Generation | Feb 24, 2025 | RAGRetrieval | CodeCode Available | 0 | 5 |
| MCCoder: Streamlining Motion Control with LLM-Assisted Code Generation and Rigorous Verification | Oct 19, 2024 | Code GenerationRAG | CodeCode Available | 0 | 5 |
| Matter-of-Fact: A Benchmark for Verifying the Feasibility of Literature-Supported Claims in Materials Science | Jun 4, 2025 | ArticlesCode Generation | CodeCode Available | 0 | 5 |
| LTRR: Learning To Rank Retrievers for LLMs | Jun 16, 2025 | Learning-To-RankRAG | CodeCode Available | 0 | 5 |
| You Only Use Reactive Attention Slice For Long Context Retrieval | Sep 3, 2024 | RAGRetrieval | CodeCode Available | 0 | 5 |
| LSRP: A Leader-Subordinate Retrieval Framework for Privacy-Preserving Cloud-Device Collaboration | May 8, 2025 | Privacy PreservingRAG | CodeCode Available | 0 | 5 |
| Mathematical Reasoning for Unmanned Aerial Vehicles: A RAG-Based Approach for Complex Arithmetic Reasoning | Jun 5, 2025 | Arithmetic ReasoningMath | CodeCode Available | 0 | 5 |
| Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practice | Jan 2, 2025 | RAGRetrieval-augmented Generation | CodeCode Available | 0 | 5 |
| LLMs are Biased Evaluators But Not Biased for Retrieval Augmented Generation | Oct 28, 2024 | Keyword ExtractionRAG | CodeCode Available | 0 | 5 |
| LLM Robustness Against Misinformation in Biomedical Question Answering | Oct 27, 2024 | MisinformationQuestion Answering | CodeCode Available | 0 | 5 |
| LLMs in Biomedicine: A study on clinical Named Entity Recognition | Apr 10, 2024 | few-shot-nerFew-shot NER | CodeCode Available | 0 | 5 |
| AI-TA: Towards an Intelligent Question-Answer Teaching Assistant using Open-Source LLMs | Nov 5, 2023 | Question AnsweringRAG | CodeCode Available | 0 | 5 |