| LEAF: Learning and Evaluation Augmented by Fact-Checking to Improve Factualness in Large Language Models | Oct 31, 2024 | Fact CheckingMedical Question Answering | —Unverified | 0 | 0 |
| LLM-MedQA: Enhancing Medical Question Answering through Case Studies in Large Language Models | Dec 31, 2024 | Medical Question AnsweringMedQA | —Unverified | 0 | 0 |
| MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways | Mar 17, 2025 | Decision MakingMedical Question Answering | —Unverified | 0 | 0 |
| MKRAG: Medical Knowledge Retrieval Augmented Generation for Medical Question Answering | Sep 27, 2023 | In-Context LearningMedical Question Answering | —Unverified | 0 | 0 |
| MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering | Apr 8, 2024 | BenchmarkingMedical Question Answering | —Unverified | 0 | 0 |
| MedFuzz: Exploring the Robustness of Large Language Models in Medical Question Answering | Jun 3, 2024 | Medical Question AnsweringMedQA | —Unverified | 0 | 0 |
| MedGo: A Chinese Medical Large Language Model | Oct 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models | Feb 20, 2025 | Decision MakingHallucination | —Unverified | 0 | 0 |
| Medical Knowledge-enriched Textual Entailment Framework | Nov 10, 2020 | Data AugmentationMedical Question Answering | —Unverified | 0 | 0 |
| MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications | Sep 11, 2024 | EthicsHallucination | —Unverified | 0 | 0 |
| MedPAIR: Measuring Physicians and AI Relevance Alignment in Medical Question Answering | May 29, 2025 | Medical Question AnsweringQuestion Answering | —Unverified | 0 | 0 |
| Med-RLVR: Emerging Medical Reasoning from a 3B base model via reinforcement Learning | Feb 27, 2025 | MathMedical Question Answering | —Unverified | 0 | 0 |
| MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models | Jun 12, 2025 | Image SegmentationMedical Diagnosis | —Unverified | 0 | 0 |
| MKG-Rank: Enhancing Large Language Models with Knowledge Graph for Multilingual Medical Question Answering | Mar 20, 2025 | Knowledge GraphsMedical Question Answering | —Unverified | 0 | 0 |