| MKG-Rank: Enhancing Large Language Models with Knowledge Graph for Multilingual Medical Question Answering | Mar 20, 2025 | Knowledge GraphsMedical Question Answering | —Unverified | 0 |
| Bias Evaluation and Mitigation in Retrieval-Augmented Medical Question-Answering Systems | Mar 19, 2025 | counterfactualDecision Making | —Unverified | 0 |
| MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways | Mar 17, 2025 | Decision MakingMedical Question Answering | —Unverified | 0 |
| Correctness Coverage Evaluation for Medical Multiple-Choice Question Answering Based on the Enhanced Conformal Prediction Framework | Mar 7, 2025 | Conformal PredictionMedical Question Answering | —Unverified | 0 |
| Structured Outputs Enable General-Purpose LLMs to be Medical Experts | Mar 5, 2025 | Clinical KnowledgeMedical Question Answering | —Unverified | 0 |
| Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation Tasks | Mar 5, 2025 | Medical Question Answeringparameter-efficient fine-tuning | CodeCode Available | 0 |
| Med-RLVR: Emerging Medical Reasoning from a 3B base model via reinforcement Learning | Feb 27, 2025 | MathMedical Question Answering | —Unverified | 0 |
| MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models | Feb 20, 2025 | Decision MakingHallucination | —Unverified | 0 |
| RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering | Feb 19, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge | Feb 18, 2025 | Graph GenerationKnowledge Graphs | —Unverified | 0 |
| SearchRAG: Can Search Engines Be Helpful for LLM-based Medical Question Answering? | Feb 18, 2025 | Medical Question AnsweringQuestion Answering | —Unverified | 0 |
| Improving Clinical Question Answering with Multi-Task Learning: A Joint Approach for Answer Extraction and Medical Categorization | Feb 18, 2025 | Information RetrievalMedical Question Answering | —Unverified | 0 |
| A Comprehensive Study on Fine-Tuning Large Language Models for Medical Question Answering Using Classification Models and Comparative Analysis | Jan 27, 2025 | Medical Question AnsweringQuestion Answering | —Unverified | 0 |
| Causal Graphs Meet Thoughts: Enhancing Complex Reasoning in Graph-Augmented LLMs | Jan 24, 2025 | Knowledge GraphsMedical Question Answering | CodeCode Available | 0 |
| An Empirical Evaluation of Large Language Models on Consumer Health Questions | Dec 31, 2024 | Medical Question AnsweringQuestion Answering | —Unverified | 0 |
| LLM-MedQA: Enhancing Medical Question Answering through Case Studies in Large Language Models | Dec 31, 2024 | Medical Question AnsweringMedQA | —Unverified | 0 |
| Overview of TREC 2024 Medical Video Question Answering (MedVidQA) Track | Dec 15, 2024 | Image CaptioningMedical Question Answering | —Unverified | 0 |
| Overview of TREC 2024 Biomedical Generative Retrieval (BioGen) Track | Nov 27, 2024 | Medical Question AnsweringQuestion Answering | —Unverified | 0 |
| AfriMed-QA: A Pan-African, Multi-Specialty, Medical Question-Answering Benchmark Dataset | Nov 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Benchmark for Long-Form Medical Question Answering | Nov 14, 2024 | Answer GenerationForm | CodeCode Available | 0 |
| Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering | Nov 14, 2024 | Medical Question AnsweringMisinformation | —Unverified | 0 |
| The Limited Impact of Medical Adaptation of Large Language and Vision-Language Models | Nov 13, 2024 | Medical Question AnsweringQuestion Answering | CodeCode Available | 0 |
| Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? | Nov 6, 2024 | Medical Question AnsweringQuestion Answering | CodeCode Available | 0 |
| Diagnosing Medical Datasets with Training Dynamics | Nov 3, 2024 | Medical Question AnsweringQuestion Answering | CodeCode Available | 0 |
| LEAF: Learning and Evaluation Augmented by Fact-Checking to Improve Factualness in Large Language Models | Oct 31, 2024 | Fact CheckingMedical Question Answering | —Unverified | 0 |
| Large Language Model Benchmarks in Medical Tasks | Oct 28, 2024 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| MedGo: A Chinese Medical Large Language Model | Oct 27, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fine-Tuning LLMs for Reliable Medical Question-Answering Services | Oct 21, 2024 | Decision MakingMedical Question Answering | —Unverified | 0 |
| MedLogic-AQA: Enhancing Medical Question Answering with Abstractive Models Focusing on Logical Structures | Oct 20, 2024 | Answer GenerationInformativeness | CodeCode Available | 0 |
| Mitigating the Risk of Health Inequity Exacerbated by Large Language Models | Oct 7, 2024 | Bias DetectionMedical Question Answering | —Unverified | 0 |
| CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with Explanatory Argumentative Structures | Oct 7, 2024 | Argument MiningMedical Question Answering | CodeCode Available | 0 |
| AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow | Sep 27, 2024 | Medical Question AnsweringQuestion Answering | —Unverified | 0 |
| HALO: Hallucination Analysis and Learning Optimization to Empower LLMs with Retrieval-Augmented Context for Guided Clinical Decision Making | Sep 16, 2024 | Answer GenerationDecision Making | CodeCode Available | 0 |
| Contextual Evaluation of Large Language Models for Classifying Tropical and Infectious Diseases | Sep 13, 2024 | Medical Question AnsweringNavigate | —Unverified | 0 |
| MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications | Sep 11, 2024 | EthicsHallucination | —Unverified | 0 |
| Enhancing Healthcare LLM Trust with Atypical Presentations Recalibration | Sep 5, 2024 | Decision MakingMedical Question Answering | CodeCode Available | 0 |
| A Survey for Large Language Models in Biomedicine | Aug 29, 2024 | DiagnosticDrug Discovery | —Unverified | 0 |
| Towards Reliable Medical Question Answering: Techniques and Challenges in Mitigating Hallucinations in Language Models | Aug 25, 2024 | Decision MakingHallucination | —Unverified | 0 |
| MultiMed: Massively Multimodal and Multitask Medical Understanding | Aug 22, 2024 | BenchmarkingMedical Question Answering | —Unverified | 0 |
| Evaluating Fine-Tuning Efficiency of Human-Inspired Learning Strategies in Medical Question Answering | Aug 15, 2024 | Medical Question AnsweringNatural Language Understanding | CodeCode Available | 0 |
| Enhancing Healthcare through Large Language Models: A Study on Medical Question Answering | Aug 8, 2024 | Medical Question AnsweringQuestion Answering | —Unverified | 0 |
| Uncertainty Estimation of Large Language Models in Medical Question Answering | Jul 11, 2024 | Medical Question AnsweringQuestion Answering | —Unverified | 0 |
| SemioLLM: Assessing Large Language Models for Semiological Analysis in Epilepsy Research | Jul 3, 2024 | DiagnosticMedical Question Answering | —Unverified | 0 |
| 70B-parameter large language models in Japanese medical question-answering | Jun 21, 2024 | Continual PretrainingDomain Adaptation | —Unverified | 0 |
| MedExQA: Medical Question Answering Benchmark with Multiple Explanations | Jun 10, 2024 | Medical Question AnsweringQuestion Answering | CodeCode Available | 0 |
| TCMD: A Traditional Chinese Medicine QA Dataset for Evaluating Large Language Models | Jun 7, 2024 | Medical Question AnsweringQuestion Answering | —Unverified | 0 |
| MedFuzz: Exploring the Robustness of Large Language Models in Medical Question Answering | Jun 3, 2024 | Medical Question AnsweringMedQA | —Unverified | 0 |
| Superhuman performance in urology board questions by an explainable large language model enabled for context integration of the European Association of Urology guidelines: the UroBot study | Jun 3, 2024 | ChatbotLanguage Modeling | —Unverified | 0 |
| Two-Layer Retrieval-Augmented Generation Framework for Low-Resource Medical Question Answering Using Reddit Data: Proof-of-Concept Study | May 29, 2024 | Answer GenerationHallucination | —Unverified | 0 |
| Efficient Medical Question Answering with Knowledge-Augmented Question Generation | May 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |