| Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine | Nov 28, 2023 | Electrical EngineeringExperimental Design | CodeCode Available | 5 | 5 |
| Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions | Aug 1, 2024 | Medical Question AnsweringMedQA | CodeCode Available | 4 | 5 |
| What Disease does this Patient Have? A Large-scale Open Domain Question Answering Dataset from Medical Exams | Sep 28, 2020 | MedQAMultiple-choice | CodeCode Available | 2 | 5 |
| MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning | Nov 16, 2023 | MedQAMMLU | CodeCode Available | 2 | 5 |
| GreaseLM: Graph REASoning Enhanced Language Models for Question Answering | Jan 21, 2022 | Knowledge GraphsMedical Question Answering | CodeCode Available | 2 | 5 |
| Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support | Feb 25, 2025 | Decision MakingDiagnostic | CodeCode Available | 2 | 5 |
| Synthetic Data RL: Task Definition Is All You Need | May 18, 2025 | AllGSM8K | CodeCode Available | 2 | 5 |
| Kformer: Knowledge Injection in Transformer Feed-Forward Layers | Jan 15, 2022 | Language ModellingMedical Question Answering | CodeCode Available | 1 | 5 |
| FiTs: Fine-grained Two-stage Training for Knowledge-aware Question Answering | Feb 23, 2023 | Knowledge GraphsMedical Question Answering | CodeCode Available | 1 | 5 |
| Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-Intensive Tasks | May 28, 2023 | MedQAMemorization | CodeCode Available | 1 | 5 |