| AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset | Nov 23, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| A Benchmark for Long-Form Medical Question Answering | Nov 14, 2024 | Answer Generation, Form | Code Available | 0 |
| Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering | Nov 14, 2024 | Medical Question Answering, Misinformation | Unverified | 0 |
| The Limited Impact of Medical Adaptation of Large Language and Vision-Language Models | Nov 13, 2024 | Medical Question Answering, Question Answering | Code Available | 0 |
| Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? | Nov 6, 2024 | Medical Question Answering, Question Answering | Code Available | 0 |
| Diagnosing Medical Datasets with Training Dynamics | Nov 3, 2024 | Medical Question Answering, Question Answering | Code Available | 0 |
| Rationale-Guided Retrieval Augmented Generation for Medical Question Answering | Nov 1, 2024 | Medical Question Answering, Question Answering | Code Available | 1 |
| LEAF: Learning and Evaluation Augmented by Fact-Checking to Improve Factualness in Large Language Models | Oct 31, 2024 | Fact Checking, Medical Question Answering | Unverified | 0 |
| Large Language Model Benchmarks in Medical Tasks | Oct 28, 2024 | Image Captioning, Language Modeling | Unverified | 0 |
| MedGo: A Chinese Medical Large Language Model | Oct 27, 2024 | Language Modeling, Language Modelling | Unverified | 0 |