| 70B-parameter large language models in Japanese medical question-answering | Jun 21, 2024 | Continual PretrainingDomain Adaptation | —Unverified | 0 |
| Large language model validity via enhanced conformal prediction methods | Jun 14, 2024 | Conformal PredictionLanguage Modeling | CodeCode Available | 1 |
| MedExQA: Medical Question Answering Benchmark with Multiple Explanations | Jun 10, 2024 | Medical Question AnsweringQuestion Answering | CodeCode Available | 0 |
| TCMD: A Traditional Chinese Medicine QA Dataset for Evaluating Large Language Models | Jun 7, 2024 | Medical Question AnsweringQuestion Answering | —Unverified | 0 |
| MedFuzz: Exploring the Robustness of Large Language Models in Medical Question Answering | Jun 3, 2024 | Medical Question AnsweringMedQA | —Unverified | 0 |
| Superhuman performance in urology board questions by an explainable large language model enabled for context integration of the European Association of Urology guidelines: the UroBot study | Jun 3, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| Two-Layer Retrieval-Augmented Generation Framework for Low-Resource Medical Question Answering Using Reddit Data: Proof-of-Concept Study | May 29, 2024 | Answer GenerationHallucination | —Unverified | 0 |
| Efficient Medical Question Answering with Knowledge-Augmented Question Generation | May 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| MediFact at MEDIQA-M3G 2024: Medical Question Answering in Dermatology with Multimodal Learning | Apr 27, 2024 | Answer GenerationMedical Question Answering | CodeCode Available | 0 |
| WangLab at MEDIQA-CORR 2024: Optimized LLM-based Programs for Medical Error Detection and Correction | Apr 22, 2024 | DiversityLanguage Modeling | —Unverified | 0 |