| Towards Generalist Biomedical AI | Jul 26, 2023 | Medical Question Answering, Question Answering | Unverified | 0 | 0 |
| Towards Reliable Medical Question Answering: Techniques and Challenges in Mitigating Hallucinations in Language Models | Aug 25, 2024 | Decision Making, Hallucination | Unverified | 0 | 0 |
| Two-Layer Retrieval-Augmented Generation Framework for Low-Resource Medical Question Answering Using Reddit Data: Proof-of-Concept Study | May 29, 2024 | Answer Generation, Hallucination | Unverified | 0 | 0 |
| Uncertainty Estimation of Large Language Models in Medical Question Answering | Jul 11, 2024 | Medical Question Answering, Question Answering | Unverified | 0 | 0 |
| Unifying Corroborative and Contributive Attributions in Large Language Models | Nov 20, 2023 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| WangLab at MEDIQA-CORR 2024: Optimized LLM-based Programs for Medical Error Detection and Correction | Apr 22, 2024 | Diversity, Language Modeling | Unverified | 0 | 0 |
| What Does Neuro Mean to Cardio? Investigating the Role of Clinical Specialty Data in Medical LLMs | May 15, 2025 | All, Benchmarking | Unverified | 0 | 0 |
| What Would it Take to get Biomedical QA Systems into Practice? | Sep 21, 2021 | Medical Question Answering, Question Answering | Unverified | 0 | 0 |
| Word-Sequence Entropy: Towards Uncertainty Estimation in Free-Form Medical Question Answering Applications and Beyond | Feb 22, 2024 | Form, Medical Question Answering | Unverified | 0 | 0 |
| 70B-parameter large language models in Japanese medical question-answering | Jun 21, 2024 | Continual Pretraining, Domain Adaptation | Unverified | 0 | 0 |
| A Comprehensive Study on Fine-Tuning Large Language Models for Medical Question Answering Using Classification Models and Comparative Analysis | Jan 27, 2025 | Medical Question Answering, Question Answering | Unverified | 0 | 0 |
| Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge | Feb 18, 2025 | Graph Generation, Knowledge Graphs | Unverified | 0 | 0 |
| AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset | Nov 23, 2024 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| A Grounded Well-being Conversational Agent with Multiple Interaction Modes: Preliminary Results | Nov 28, 2021 | Medical Question Answering, Question Answering | Unverified | 0 | 0 |
| AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow | Sep 27, 2024 | Medical Question Answering, Question Answering | Unverified | 0 | 0 |
| An Empirical Evaluation of Large Language Models on Consumer Health Questions | Dec 31, 2024 | Medical Question Answering, Question Answering | Unverified | 0 | 0 |
| ANU-CSIRO at MEDIQA 2019: Question Answering Using Deep Contextual Knowledge | Aug 1, 2019 | Medical Question Answering, Natural Language Inference | Unverified | 0 | 0 |
| ARS\_NITK at MEDIQA 2019: Analysing Various Methods for Natural Language Inference, Recognising Question Entailment and Medical Question Answering System | Aug 1, 2019 | Information Retrieval, Medical Question Answering | Unverified | 0 | 0 |
| A Survey for Large Language Models in Biomedicine | Aug 29, 2024 | Diagnostic, Drug Discovery | Unverified | 0 | 0 |
| Building a Human-Verified Clinical Reasoning Dataset via a Human LLM Hybrid Pipeline for Trustworthy Medical AI | May 11, 2025 | Medical Question Answering, Question Answering | Unverified | 0 | 0 |
| Calibrating Uncertainty Quantification of Multi-Modal LLMs using Grounding | Apr 30, 2025 | Medical Question Answering, Question Answering | Unverified | 0 | 0 |
| Challenges of GPT-3-based Conversational Agents for Healthcare | Aug 28, 2023 | Medical Question Answering, MedQA | Unverified | 0 | 0 |
| ClinBench-HPB: A Clinical Benchmark for Evaluating LLMs in Hepato-Pancreato-Biliary Diseases | May 30, 2025 | Medical Question Answering, Multiple-choice | Unverified | 0 | 0 |
| Collaboration among Multiple Large Language Models for Medical Question Answering | May 22, 2025 | Medical Question Answering, Multiple-choice | Unverified | 0 | 0 |
| Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering | Nov 14, 2024 | Medical Question Answering, Misinformation | Unverified | 0 | 0 |