| Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions | Aug 1, 2024 | Medical Question Answering, MedQA | Code Available | 4 |
| Benchmarking Retrieval-Augmented Generation for Medicine | Feb 20, 2024 | Benchmarking, Information Retrieval | Code Available | 4 |
| Huatuo-26M, a Large-scale Chinese Medical QA Dataset | May 2, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning | Jun 11, 2025 | Medical Question Answering, Question Answering | Code Available | 2 |
| Improving Medical Reasoning through Retrieval and Self-Reflection with Retrieval-Augmented Large Language Models | Jan 27, 2024 | Medical Question Answering, Multiple-choice | Code Available | 2 |
| Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models | Apr 16, 2024 | image-classification, Image Classification | Code Available | 2 |
| PMC-LLaMA: Towards Building Open-source Language Models for Medicine | Apr 27, 2023 | Language Modeling, Language Modelling | Code Available | 2 |
| BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains | Feb 15, 2024 | Few-Shot Learning, Medical Question Answering | Code Available | 2 |
| GreaseLM: Graph REASoning Enhanced Language Models for Question Answering | Jan 21, 2022 | Knowledge Graphs, Medical Question Answering | Code Available | 2 |
| MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning | Mar 10, 2025 | Benchmarking, Medical Question Answering | Code Available | 2 |
| AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator | Feb 15, 2024 | Benchmarking, Diagnostic | Code Available | 2 |
| Clinical Temporal Relation Extraction with Probabilistic Soft Logic Regularization and Global Inference | Dec 16, 2020 | Feature Engineering, Medical Question Answering | Code Available | 1 |
| Kformer: Knowledge Injection in Transformer Feed-Forward Layers | Jan 15, 2022 | Language Modelling, Medical Question Answering | Code Available | 1 |
| A Gradually Soft Multi-Task and Data-Augmented Approach to Medical Question Understanding | Aug 1, 2021 | Data Augmentation, Decoder | Code Available | 1 |
| Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model | Oct 13, 2023 | Knowledge Graphs, Language Modeling | Code Available | 1 |
| STLLaVA-Med: Self-Training Large Language and Vision Assistant for Medical Question-Answering | Jun 28, 2024 | Medical Diagnosis, Medical Question Answering | Code Available | 1 |
| Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Sep 18, 2024 | GPU, Language Modeling | Code Available | 1 |
| Relation-Aware Language-Graph Transformer for Question Answering | Dec 2, 2022 | Medical Question Answering, MedQA | Code Available | 1 |
| Question-Driven Summarization of Answers to Consumer Health Questions | May 18, 2020 | Medical Question Answering, Question Answering | Code Available | 1 |
| FiTs: Fine-grained Two-stage Training for Knowledge-aware Question Answering | Feb 23, 2023 | Knowledge Graphs, Medical Question Answering | Code Available | 1 |
| Rationale-Guided Retrieval Augmented Generation for Medical Question Answering | Nov 1, 2024 | Medical Question Answering, Question Answering | Code Available | 1 |
| Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations | Apr 19, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models | Aug 4, 2024 | Diagnostic, Medical Question Answering | Code Available | 1 |
| MedLM: Exploring Language Models for Medical Question Answering Systems | Jan 21, 2024 | Medical Question Answering, Question Answering | Code Available | 1 |
| LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge Graphs | Apr 20, 2022 | Conversational Question Answering, Dialogue Generation | Code Available | 1 |
| Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs | Feb 7, 2025 | Federated Learning, Medical Question Answering | Code Available | 1 |
| KnowTuning: Knowledge-aware Fine-tuning for Large Language Models | Feb 17, 2024 | Medical Question Answering, Question Answering | Code Available | 1 |
| JMedLoRA: Medical Domain Adaptation on Japanese Large Language Models using Instruction-tuning | Oct 16, 2023 | Domain Adaptation, Medical Question Answering | Code Available | 1 |
| Integrating UMLS Knowledge into Large Language Models for Medical Question Answering | Oct 4, 2023 | Medical Question Answering, Question Answering | Code Available | 1 |
| Large language model validity via enhanced conformal prediction methods | Jun 14, 2024 | Conformal Prediction, Language Modeling | Code Available | 1 |
| Benchmarking large language models for biomedical natural language processing applications and recommendations | May 10, 2023 | Benchmarking, Document Classification | Code Available | 1 |
| Towards Expert-Level Medical Question Answering with Large Language Models | May 16, 2023 | Medical Question Answering, MedQA | Code Available | 1 |
| Contextual Evaluation of Large Language Models for Classifying Tropical and Infectious Diseases | Sep 13, 2024 | Medical Question Answering, Navigate | Unverified | 0 |
| Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering | Nov 14, 2024 | Medical Question Answering, Misinformation | Unverified | 0 |
| An Empirical Evaluation of Large Language Models on Consumer Health Questions | Dec 31, 2024 | Medical Question Answering, Question Answering | Unverified | 0 |
| Collaboration among Multiple Large Language Models for Medical Question Answering | May 22, 2025 | Medical Question Answering, Multiple-choice | Unverified | 0 |
| Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge | Feb 18, 2025 | Graph Generation, Knowledge Graphs | Unverified | 0 |
| ClinBench-HPB: A Clinical Benchmark for Evaluating LLMs in Hepato-Pancreato-Biliary Diseases | May 30, 2025 | Medical Question Answering, Multiple-choice | Unverified | 0 |
| Challenges of GPT-3-based Conversational Agents for Healthcare | Aug 28, 2023 | Medical Question Answering, MedQA | Unverified | 0 |
| AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow | Sep 27, 2024 | Medical Question Answering, Question Answering | Unverified | 0 |
| Generating multiple-choice questions for medical question answering with distractors and cue-masking | Mar 13, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Finding Similar Medical Questions from Question Answering Websites | Oct 14, 2018 | Diversity, Medical Question Answering | Unverified | 0 |
| GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records | Feb 2, 2022 | Clinical Concept Extraction, Language Modeling | Unverified | 0 |
| GreaseLM: Graph REASoning Enhanced Language Models | Sep 29, 2021 | Knowledge Graphs, Medical Question Answering | Unverified | 0 |
| IITP at MEDIQA 2019: Systems Report for Natural Language Inference, Question Entailment and Question Answering | Jun 14, 2019 | Medical Question Answering, Natural Language Inference | Unverified | 0 |
| Exploring the Role of Knowledge Graph-Based RAG in Japanese Medical Question Answering with Small-Scale LLMs | Apr 15, 2025 | Medical Question Answering, Question Answering | Unverified | 0 |
| Do Large Language Models have Shared Weaknesses in Medical Question Answering? | Oct 11, 2023 | Medical Question Answering, Question Answering | Unverified | 0 |
| Calibrating Uncertainty Quantification of Multi-Modal LLMs using Grounding | Apr 30, 2025 | Medical Question Answering, Question Answering | Unverified | 0 |
| Generating Explanations in Medical Question-Answering by Expectation Maximization Inference over Evidence | Oct 2, 2023 | Explanation Generation, Medical Question Answering | Unverified | 0 |
| Exploiting Sentence Embedding for Medical Question Answering | Nov 15, 2018 | Medical Question Answering, Question Answering | Unverified | 0 |