| ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures | Jun 14, 2024 | Answer GenerationBenchmarking | CodeCode Available | 0 | 5 |
| Abstracting Concept-Changing Rules for Solving Raven's Progressive Matrix Problems | Jul 15, 2023 | Answer GenerationAnswer Selection | CodeCode Available | 0 | 5 |
| A Hybrid Approach to Information Retrieval and Answer Generation for Regulatory Texts | Feb 24, 2025 | Answer GenerationInformation Retrieval | CodeCode Available | 0 | 5 |
| Answering Naturally: Factoid to Full length Answer Generation | Nov 1, 2019 | Answer GenerationQuestion Answering | CodeCode Available | 0 | 5 |
| MedLogic-AQA: Enhancing Medical Question Answering with Abstractive Models Focusing on Logical Structures | Oct 20, 2024 | Answer GenerationInformativeness | CodeCode Available | 0 | 5 |
| Multilingual State Space Models for Structured Question Answering in Indic Languages | Feb 1, 2025 | Answer GenerationDiversity | CodeCode Available | 0 | 5 |
| Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain | Mar 29, 2024 | Answer GenerationDecision Making | CodeCode Available | 0 | 5 |
| Large Language Models and Multimodal Retrieval for Visual Word Sense Disambiguation | Oct 21, 2023 | Answer GenerationImage Retrieval | CodeCode Available | 0 | 5 |
| HSI: Head-Specific Intervention Can Induce Misaligned AI Coordination in Large Language Models | Feb 9, 2025 | Answer GenerationLanguage Modeling | CodeCode Available | 0 | 5 |
| Localized Definitions and Distributed Reasoning: A Proof-of-Concept Mechanistic Interpretability Study via Activation Patching | Apr 3, 2025 | Answer GenerationEEG | CodeCode Available | 0 | 5 |