| An Empirical Evaluation of Large Language Models on Consumer Health Questions | Dec 31, 2024 | Medical Question Answering, Question Answering | Unverified | 0 |
| ANU-CSIRO at MEDIQA 2019: Question Answering Using Deep Contextual Knowledge | Aug 1, 2019 | Medical Question Answering, Natural Language Inference | Unverified | 0 |
| ARS\_NITK at MEDIQA 2019: Analysing Various Methods for Natural Language Inference, Recognising Question Entailment and Medical Question Answering System | Aug 1, 2019 | Information Retrieval, Medical Question Answering | Unverified | 0 |
| A Survey for Large Language Models in Biomedicine | Aug 29, 2024 | Diagnostic, Drug Discovery | Unverified | 0 |
| Building a Human-Verified Clinical Reasoning Dataset via a Human LLM Hybrid Pipeline for Trustworthy Medical AI | May 11, 2025 | Medical Question Answering, Question Answering | Unverified | 0 |
| Calibrating Uncertainty Quantification of Multi-Modal LLMs using Grounding | Apr 30, 2025 | Medical Question Answering, Question Answering | Unverified | 0 |
| Challenges of GPT-3-based Conversational Agents for Healthcare | Aug 28, 2023 | Medical Question Answering, MedQA | Unverified | 0 |
| ClinBench-HPB: A Clinical Benchmark for Evaluating LLMs in Hepato-Pancreato-Biliary Diseases | May 30, 2025 | Medical Question Answering, Multiple-choice | Unverified | 0 |
| Collaboration among Multiple Large Language Models for Medical Question Answering | May 22, 2025 | Medical Question Answering, Multiple-choice | Unverified | 0 |
| Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering | Nov 14, 2024 | Medical Question Answering, Misinformation | Unverified | 0 |
| Contextual Evaluation of Large Language Models for Classifying Tropical and Infectious Diseases | Sep 13, 2024 | Medical Question Answering, Navigate | Unverified | 0 |
| Continuous Training and Fine-tuning for Domain-Specific Language Models in Medical Question Answering | Nov 1, 2023 | Medical Question Answering, Question Answering | Unverified | 0 |
| Developing ChatGPT for Biology and Medicine: A Complete Review of Biomedical Question Answering | Jan 15, 2024 | Cross-Modal Retrieval, Medical Diagnosis | Unverified | 0 |
| DUT-NLP at MEDIQA 2019: An Adversarial Multi-Task Network to Jointly Model Recognizing Question Entailment and Question Answering | Aug 1, 2019 | Medical Question Answering, Multi-Task Learning | Unverified | 0 |
| emrQA-msquad: A Medical Dataset Structured with the SQuAD V2.0 Framework, Enriched with emrQA Medical Information | Apr 18, 2024 | Decision Making, Machine Reading Comprehension | Unverified | 0 |
| Emulating Human Cognitive Processes for Expert-Level Medical Question-Answering with Large Language Models | Oct 17, 2023 | Decision Making, Language Modeling | Unverified | 0 |
| Mitigating the Risk of Health Inequity Exacerbated by Large Language Models | Oct 7, 2024 | Bias Detection, Medical Question Answering | Unverified | 0 |
| Enhancing Healthcare through Large Language Models: A Study on Medical Question Answering | Aug 8, 2024 | Medical Question Answering, Question Answering | Unverified | 0 |
| ER-REASON: A Benchmark Dataset for LLM-Based Clinical Reasoning in the Emergency Room | May 28, 2025 | Medical Question Answering, Question Answering | Unverified | 0 |
| Bias Evaluation and Mitigation in Retrieval-Augmented Medical Question-Answering Systems | Mar 19, 2025 | counterfactual, Decision Making | Unverified | 0 |
| Exploiting Sentence Embedding for Medical Question Answering | Nov 15, 2018 | Medical Question Answering, Question Answering | Unverified | 0 |
| Do Large Language Models have Shared Weaknesses in Medical Question Answering? | Oct 11, 2023 | Medical Question Answering, Question Answering | Unverified | 0 |
| Exploring the Role of Knowledge Graph-Based RAG in Japanese Medical Question Answering with Small-Scale LLMs | Apr 15, 2025 | Medical Question Answering, Question Answering | Unverified | 0 |
| Finding Similar Medical Questions from Question Answering Websites | Oct 14, 2018 | Diversity, Medical Question Answering | Unverified | 0 |
| Fine-Tuning LLMs for Reliable Medical Question-Answering Services | Oct 21, 2024 | Decision Making, Medical Question Answering | Unverified | 0 |
| MultiMed: Massively Multimodal and Multitask Medical Understanding | Aug 22, 2024 | Benchmarking, Medical Question Answering | Unverified | 0 |
| OpenMedLM: Prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models | Feb 29, 2024 | Medical Question Answering, MedQA | Unverified | 0 |
| Overview of TREC 2024 Biomedical Generative Retrieval (BioGen) Track | Nov 27, 2024 | Medical Question Answering, Question Answering | Unverified | 0 |
| Overview of TREC 2024 Medical Video Question Answering (MedVidQA) Track | Dec 15, 2024 | Image Captioning, Medical Question Answering | Unverified | 0 |
| PEFT-MedAware: Large Language Model for Medical Awareness | Nov 17, 2023 | Computational Efficiency, Language Modeling | Unverified | 0 |
| PR-Attack: Coordinated Prompt-RAG Attacks on Retrieval-Augmented Generation in Large Language Models via Bilevel Optimization | Apr 10, 2025 | Anomaly Detection, Bilevel Optimization | Unverified | 0 |
| Large Language Models Leverage External Knowledge to Extend Clinical Insight Beyond Language Boundaries | May 17, 2023 | Clinical Knowledge, Few-Shot Learning | Unverified | 0 |
| RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering | Feb 19, 2025 | Decision Making, Language Modeling | Unverified | 0 |
| SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering? | Feb 18, 2025 | Medical Question Answering, Question Answering | Unverified | 0 |
| Large Language Models are In-context Teachers for Knowledge Reasoning | Nov 12, 2023 | In-Context Learning, Information Retrieval | Unverified | 0 |
| SemioLLM: Assessing Large Language Models for Semiological Analysis in Epilepsy Research | Jul 3, 2024 | Diagnostic, Medical Question Answering | Unverified | 0 |
| Correctness Coverage Evaluation for Medical Multiple-Choice Question Answering Based on the Enhanced Conformal Prediction Framework | Mar 7, 2025 | Conformal Prediction, Medical Question Answering | Unverified | 0 |
| Structured Outputs Enable General-Purpose LLMs to be Medical Experts | Mar 5, 2025 | Clinical Knowledge, Medical Question Answering | Unverified | 0 |
| Superhuman performance in urology board questions by an explainable large language model enabled for context integration of the European Association of Urology guidelines: the UroBot study | Jun 3, 2024 | Chatbot, Language Modeling | Unverified | 0 |
| Task Specific Pruning with LLM-Sieve: How Many Parameters Does Your Task Really Need? | May 23, 2025 | Medical Question Answering, Quantization | Unverified | 0 |
| TCMD: A Traditional Chinese Medicine QA Dataset for Evaluating Large Language Models | Jun 7, 2024 | Medical Question Answering, Question Answering | Unverified | 0 |
| Towards Generalist Biomedical AI | Jul 26, 2023 | Medical Question Answering, Question Answering | Unverified | 0 |
| Towards Reliable Medical Question Answering: Techniques and Challenges in Mitigating Hallucinations in Language Models | Aug 25, 2024 | Decision Making, Hallucination | Unverified | 0 |
| Two-Layer Retrieval-Augmented Generation Framework for Low-Resource Medical Question Answering Using Reddit Data: Proof-of-Concept Study | May 29, 2024 | Answer Generation, Hallucination | Unverified | 0 |
| Uncertainty Estimation of Large Language Models in Medical Question Answering | Jul 11, 2024 | Medical Question Answering, Question Answering | Unverified | 0 |
| Unifying Corroborative and Contributive Attributions in Large Language Models | Nov 20, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| WangLab at MEDIQA-CORR 2024: Optimized LLM-based Programs for Medical Error Detection and Correction | Apr 22, 2024 | Diversity, Language Modeling | Unverified | 0 |
| What Does Neuro Mean to Cardio? Investigating the Role of Clinical Specialty Data in Medical LLMs | May 15, 2025 | All, Benchmarking | Unverified | 0 |
| What Would it Take to get Biomedical QA Systems into Practice? | Sep 21, 2021 | Medical Question Answering, Question Answering | Unverified | 0 |
| LLM-MedQA: Enhancing Medical Question Answering through Case Studies in Large Language Models | Dec 31, 2024 | Medical Question Answering, MedQA | Unverified | 0 |